OCRmyPDF documentation

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched.

PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR to existing PDFs.
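As a rough illustration of applying OCR to an existing PDF, the sketch below drives the `ocrmypdf` command-line tool from Python. The helper names `build_ocrmypdf_command` and `run_ocr` and the file names are hypothetical and not part of OCRmyPDF; the sketch assumes `ocrmypdf` is installed and on the `PATH`, and uses only the `-l` (language) and `--deskew` options.

```python
import shutil
import subprocess


def build_ocrmypdf_command(input_pdf, output_pdf, language="eng", deskew=False):
    """Build the argument list for one ocrmypdf run (hypothetical helper)."""
    command = ["ocrmypdf", "-l", language]
    if deskew:
        # Ask ocrmypdf to straighten skewed pages before OCR.
        command.append("--deskew")
    command += [input_pdf, output_pdf]
    return command


def run_ocr(input_pdf, output_pdf, **options):
    """Run ocrmypdf and return its exit code, or None if it is not installed."""
    if shutil.which("ocrmypdf") is None:
        return None
    command = build_ocrmypdf_command(input_pdf, output_pdf, **options)
    return subprocess.run(command).returncode
```

Calling `run_ocr("scan.pdf", "scan_ocr.pdf", deskew=True)` would then invoke the tool on a placeholder file named `scan.pdf`. Building the argument list separately from running it keeps the command easy to inspect or log before anything is executed.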

  • Introduction
  • Release notes
  • Installation
  • Installing additional language packs
  • Installing the JBIG2 encoder

Usage

  • Cookbook
    • Basic examples
    • OCR images, not PDFs
    • Image processing
    • Improving OCR quality
    • PDF optimization
  • Advanced features
    • Control of OCR options
    • Changing the PDF renderer
    • Return code policy
    • Debugging the intermediate files
  • Batch processing
    • Batch jobs
    • Directory trees
    • Hot (watched) folders
  • PDF security issues
    • PDFs may contain malware
    • How OCRmyPDF processes PDFs
    • Using OCRmyPDF online or as a service
    • Password protection, digital signatures and certification
  • Common error messages
    • Page already has text
    • Input file ‘filename’ is not a valid PDF

Indices and tables

  • Index
  • Module Index
  • Search Page

© Copyright 2018, James R. Barlow. Licensed under Creative Commons Attribution-ShareAlike 4.0. Revision b8cd3acd.
