ocrmypdf
  • Introduction
  • Release notes
  • Installing OCRmyPDF
  • Installing additional language packs
  • Installing the JBIG2 encoder

Usage

  • Cookbook
  • OCRmyPDF Docker image
  • Advanced features
  • Batch processing
  • PDF security issues
  • Common error messages
ocrmypdf
  • Docs »
  • OCRmyPDF documentation
  • Edit on GitHub

OCRmyPDF documentation¶

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched.

PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR to existing PDFs.

  • Introduction
  • Release notes
  • Installing OCRmyPDF
  • Installing additional language packs
  • Installing the JBIG2 encoder

Usage

  • Cookbook
    • Basic examples
    • Image processing
    • Don’t actually OCR my PDF
    • Redo existing OCR
    • Improving OCR quality
    • PDF optimization
  • OCRmyPDF Docker image
    • Installing the Docker image
    • Using the Docker image on the command line
    • Adding languages to the Docker image
    • Executing the test suite
    • Using the OCRmyPDF web service wrapper
    • Legacy Ubuntu Docker images
  • Advanced features
    • Control of unpaper
    • Control of OCR options
    • Changing the PDF renderer
    • Return code policy
    • Debugging the intermediate files
  • Batch processing
    • Batch jobs
    • Directory trees
    • Hot (watched) folders
    • macOS Automator
  • PDF security issues
    • PDFs may contain malware
    • How OCRmyPDF processes PDFs
    • Using OCRmyPDF online or as a service
    • Password protection, digital signatures and certification
  • Common error messages
    • Page already has text
    • Input file ‘filename’ is not a valid PDF

Indices and tables¶

  • Index
  • Module Index
  • Search Page
Next

© Copyright 2019, James R. Barlow. Licensed under Creative Commons Attribution-ShareAlike 4.0. Revision 2cff6ad2.

Built with Sphinx using a theme provided by Read the Docs.
Read the Docs v: v8.3.1
Versions
latest
stable
v8.3.1
v8.3.0
v8.2.4
v8.2.3
v8.2.2
v8.2.0
v8.1.0
v8.0.1
v8.0.0
v7.4.0
v6.2.5
Downloads
On Read the Docs
Project Home
Builds

Free document hosting provided by Read the Docs.