2597 shaares
90 private links
Preprocesses (unpaper) and OCRs (tesseract) PDF files, then 'sandwiches' the recognized text behind the page image, so the output is a PDF with selectable text.
Seems perfect for a PDF pipeline.
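
A rough sketch of what such a pipeline does, step by step: render each page to an image, clean it with unpaper, let tesseract write a one-page PDF with an invisible text layer, then join the pages. This is not the bookmarked tool's own code. Using `pdftoppm` and `pdfunite` (from poppler) for rendering and merging is an assumption, and `build_commands`, the file naming, and the explicit `page_count` argument are hypothetical choices for illustration.

```python
import subprocess


def build_commands(pdf, workdir, page_count, out_pdf, lang="eng", dpi=300):
    """Return the argv lists for one sandwich-PDF run, in execution order.

    Assumption: pdftoppm/pdfunite (poppler), unpaper and tesseract are on PATH.
    """
    commands = []
    ocr_pages = []
    for n in range(1, page_count + 1):
        raw = f"{workdir}/page-{n}"         # pdftoppm appends ".pgm"
        clean = f"{workdir}/clean-{n}.pgm"  # unpaper output
        ocr = f"{workdir}/ocr-{n}"          # tesseract appends ".pdf"

        # 1. Render page n as a grayscale image.
        commands.append([
            "pdftoppm", "-r", str(dpi), "-gray",
            "-f", str(n), "-l", str(n), "-singlefile", pdf, raw,
        ])
        # 2. Preprocess the scan (deskew, remove borders and noise).
        commands.append(["unpaper", f"{raw}.pgm", clean])
        # 3. OCR; the "pdf" config makes tesseract emit image plus hidden text.
        commands.append(["tesseract", clean, ocr, "-l", lang, "pdf"])
        ocr_pages.append(f"{ocr}.pdf")

    # 4. Join the per-page PDFs into the final selectable PDF.
    commands.append(["pdfunite", *ocr_pages, out_pdf])
    return commands


if __name__ == "__main__":
    # Hypothetical inputs: a 2-page scan and an existing "work" directory.
    for argv in build_commands("scan.pdf", "work", 2, "scan-ocr.pdf"):
        subprocess.run(argv, check=True)
```

Building the command lists separately from running them keeps the sketch easy to inspect or dry-run before any file is touched.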