PDF Arranger: [GitHub] | [ubuntuusers.de] (5/5 ⭐)
apt-get install -y poppler-utils
pdftoppm -jpeg -r 300 file.pdf out
Download Scantailor Universal. This version of Scantailor works best on Linux.
Move all the images into one folder and open a new project.
Add a margin of 10mm to the output.
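To get a feel for the numbers above (the -r 300 passed to pdftoppm and the 10mm margin), here is a small sketch that converts millimetres to pixels at a given resolution. The helper name mm_to_px is made up for illustration, and pdftoppm or Scantailor may round slightly differently, so treat the results as approximate.

```shell
# Hypothetical helper: length in mm -> pixels at a given dpi.
# 1 inch = 25.4 mm; the result is rounded to the nearest whole pixel.
mm_to_px() {
    awk -v mm="$1" -v dpi="$2" 'BEGIN { printf "%d\n", mm / 25.4 * dpi + 0.5 }'
}

mm_to_px 210 300   # A4 width at 300 dpi, prints 2480
mm_to_px 297 300   # A4 height at 300 dpi, prints 3508
mm_to_px 10 300    # the 10mm margin at 300 dpi, prints 118
```

So an A4 page scanned at 300 dpi is roughly 2480 x 3508 pixels, and the 10mm margin adds about 118 pixels on each side.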
Download and compile https://github.com/agl/jbig2enc
sudo apt install libleptonica-dev python2
./autogen.sh
./configure
make
sudo make install
Process the images with Scantailor, then change into its output folder, out.
wget https://raw.githubusercontent.com/agl/jbig2enc/master/pdf.py
jbig2 -s -p -v *.tif
python2 pdf.py output > small.pdf
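The two commands above can be wrapped so that a missing tool is reported before anything runs. This is only a sketch: the function names missing_tools and build_small_pdf are invented here, and it assumes pdf.py has already been downloaded into the current folder.

```shell
# Hypothetical helper: print every command from the argument list
# that cannot be found on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Hypothetical wrapper: encode all TIFFs in the current folder and
# wrap them into small.pdf, using the same commands as in the notes.
build_small_pdf() {
    missing=$(missing_tools jbig2 python2)
    if [ -n "$missing" ]; then
        echo "missing tools: $missing" >&2
        return 1
    fi
    jbig2 -s -p -v *.tif && python2 pdf.py output > small.pdf
}
```

Run build_small_pdf from inside the out folder. If jbig2 or python2 is not installed, it stops with a message instead of leaving a half-written small.pdf.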
Install OCRmyPDF https://github.com/jbarlow83/OCRmyPDF
sudo apt install ocrmypdf tesseract-ocr-deu tesseract-ocr-eng tesseract-ocr-rus
ocrmypdf --jbig2-lossy -l eng small.pdf ocr.pdf
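The command above only uses English even though German and Russian language packs were installed as well. OCRmyPDF accepts several languages joined with "+" in the -l option. The sketch below builds that string and prints the resulting command rather than running it. The helper name join_langs is made up for illustration.

```shell
# Hypothetical helper: join Tesseract language codes with "+",
# the form that the -l option of ocrmypdf accepts.
join_langs() {
    (IFS=+; printf '%s\n' "$*")
}

langs=$(join_langs eng deu rus)
# Only print the command; drop the echo to actually run it.
echo "ocrmypdf --jbig2-lossy -l $langs small.pdf ocr.pdf"
```

Listing only the languages that really occur in the document usually keeps OCR faster and the results cleaner.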
With Calibre
Serif: Meta Serif Pro
Sans-serif: Gilroy
Monospace: Source Code Pro
Font size: 16px
Margin: 20px
Heuristic processing: on
Paper size: A4