A command-line tool to turn web pages into beautiful, readable PDF, EPUB, or HTML docs. - danburzo/percollate
Vim+Zathura+Synctex (GitHub Gist)
a small .pdf management tool with a command-line UI - 2mol
How can I merge/join PDF files from the Linux command line?
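A minimal sketch of one way to script the merge, assuming `pdfunite` from poppler-utils is installed. The helper names `build_merge_command` and `merge_pdfs` are hypothetical, and the two-input minimum is a choice made for this sketch, not a documented limit of the tool.

```python
import subprocess
from typing import List, Sequence


def build_merge_command(inputs: Sequence[str], output: str) -> List[str]:
    """Build the argv for `pdfunite in1.pdf in2.pdf ... out.pdf`."""
    # This sketch insists on two or more inputs, since merging one file is a no-op.
    if len(inputs) < 2:
        raise ValueError("need at least two input PDFs to merge")
    return ["pdfunite", *inputs, output]


def merge_pdfs(inputs: Sequence[str], output: str) -> None:
    """Run pdfunite; raises CalledProcessError if the tool exits non-zero."""
    subprocess.run(build_merge_command(inputs, output), check=True)
```

Calling `merge_pdfs(["a.pdf", "b.pdf"], "out.pdf")` would write the pages of `a.pdf` followed by those of `b.pdf` to `out.pdf`. Keeping the command builder separate makes it easy to inspect the argv without touching any files.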
Preprocess (unpaper) and OCR (tesseract) PDF files and 'sandwich' the text behind the image; the output is a PDF with selectable text
seems perfect for a PDF pipeline
PDF viewer for Linux:
Python PDF Parser -- fork with Python 2+3 support using six - pdfminer
Extracts and formats text annotations from a PDF file - 0xabu
Python script to do PDF OCR conversion using Tesseract - virantha
This is also what paperless uses for its OCR process
A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk - hellerbarde
allows differentiating between physical and logical pages, might be useful
I want a python function that takes a pdf and returns a list of the text of the note annotations in the document. I have looked at python-poppler
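A rough sketch of such a function, using pypdf rather than python-poppler (a swapped-in library, not what the question asks about). It assumes note annotations are those with `/Subtype` equal to `/Text` and that their text lives in `/Contents`; the function names are hypothetical.

```python
from typing import Iterable, List, Mapping


def note_texts_from_annotations(annotations: Iterable[Mapping]) -> List[str]:
    """Keep the /Contents of annotations whose /Subtype is /Text (sticky notes)."""
    texts = []
    for annot in annotations:
        subtype = annot["/Subtype"] if "/Subtype" in annot else None
        if subtype == "/Text" and "/Contents" in annot:
            texts.append(str(annot["/Contents"]))
    return texts


def note_texts(pdf_path: str) -> List[str]:
    """Collect note-annotation text from every page of a PDF using pypdf."""
    from pypdf import PdfReader  # third-party; imported lazily so the helper above has no dependency

    reader = PdfReader(pdf_path)
    annotations = []
    for page in reader.pages:
        if "/Annots" in page:
            # Entries may be indirect references; get_object() resolves them.
            annotations.extend(ref.get_object() for ref in page["/Annots"])
    return note_texts_from_annotations(annotations)
```

Highlights, underlines, and other annotation subtypes are skipped here. Widening the subtype check would pick them up if their `/Contents` is also wanted.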
I want to edit the metadata of a scanned PDF to assign custom page numbers to different pages. For example, what are now pages 1-3 I might want to call i, ii and iii, and what are pages 4-10, I wan...
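For context, PDF stores logical page numbers in the catalog's `/PageLabels` number tree, where each entry gives a starting page index, a numbering style (`/r` for lowercase roman, `/D` for decimal), and an optional start value. The sketch below only computes the labels such a scheme would produce; it does not write them into a file. The function names are hypothetical, and decimal numbering restarting at 1 for the second range is an assumption, since the question is cut off before it says what those pages should be called.

```python
from typing import List, Tuple


def to_roman(n: int) -> str:
    """Convert a positive integer to lowercase roman numerals."""
    numerals = [
        (1000, "m"), (900, "cm"), (500, "d"), (400, "cd"),
        (100, "c"), (90, "xc"), (50, "l"), (40, "xl"),
        (10, "x"), (9, "ix"), (5, "v"), (4, "iv"), (1, "i"),
    ]
    out = []
    for value, symbol in numerals:
        while n >= value:
            out.append(symbol)
            n -= value
    return "".join(out)


def page_labels(page_count: int, ranges: List[Tuple[int, str, int]]) -> List[str]:
    """Compute one label per physical page.

    Each range is (first_page_index, style, start_value), sorted by index,
    with the first range starting at index 0. Style "r" means lowercase
    roman; anything else is treated as decimal in this sketch.
    """
    labels = []
    for i, (first, style, start) in enumerate(ranges):
        # A range runs until the next range begins, or to the end of the document.
        end = ranges[i + 1][0] if i + 1 < len(ranges) else page_count
        for offset in range(end - first):
            number = start + offset
            labels.append(to_roman(number) if style == "r" else str(number))
    return labels
```

For a 10-page scan, `page_labels(10, [(0, "r", 1), (3, "D", 1)])` labels the first three physical pages i, ii, iii and the remaining seven 1 through 7.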
MasterPDF Editor seems awesome, and its keybindings can be customized to be almost Vim-like. It's free in the AUR (watermark-removed version)