# OCRmyPDF

> software for adding an OCR text layer to PDF files and optimizing PDF files

**Wikidata**: [Q110575428](https://www.wikidata.org/wiki/Q110575428)  
**Source**: https://4ort.xyz/entity/ocrmypdf

## Summary
OCRmyPDF is a free and open-source application that adds an OCR text layer to PDF files, making scanned documents searchable, and can optimize the resulting files. It is written in Python and runs on Linux, Windows, macOS, and FreeBSD.

## Key Facts
- OCRmyPDF is licensed under the Mozilla Public License, version 2.0 and the Expat license
- The software is written in Python and depends on Tesseract, Pillow, and other libraries
- Version 15.1.0 was released on October 4, 2023; newer releases have followed since
- OCRmyPDF takes PDF files as input and produces PDF files as output
- The software is available as a package for Ubuntu, Debian, and other Linux distributions
- OCRmyPDF has a Docker Hub repository for containerized deployment
- The software is classified as a console application and free and open-source software

## FAQs
### Q: What is OCRmyPDF used for?
A: OCRmyPDF is used to add an OCR text layer to PDF files so that their text can be searched and selected. It can also optimize PDF files, for example to reduce file size.

### Q: What operating systems does OCRmyPDF support?
A: OCRmyPDF runs on Linux, Microsoft Windows, macOS, and FreeBSD operating systems.

### Q: Is OCRmyPDF free to use?
A: Yes, OCRmyPDF is free and open-source software licensed under the Mozilla Public License, version 2.0 and the Expat license.

## Why It Matters
Scanned or image-based PDFs contain pictures of text rather than text itself, so their content cannot be searched or copied. OCRmyPDF addresses this by adding an OCR text layer to such files. It can also optimize PDF files, which reduces file size and makes them easier to share and view. Because it is free and open-source, individuals and organizations can process their documents without relying on proprietary software.

## Notable For
- Adding OCR text layer to PDF files for improved searchability and accessibility
- Optimizing PDF files for better performance and quality
- Cross-platform compatibility with Linux, Windows, macOS, and FreeBSD
- Integration with popular libraries like Tesseract and Pillow
- Active development with frequent releases

## Body
### Overview
OCRmyPDF is a Python-based application designed to add an OCR text layer to PDF files and optimize them for better accessibility and performance. It is a free and open-source tool that runs on multiple operating systems, including Linux, Windows, macOS, and FreeBSD.

### Key Features
- OCR text layer addition: OCRmyPDF uses the Tesseract OCR engine to recognize text in scanned or image-based PDF files and adds a hidden text layer, making the content searchable and selectable.
- PDF optimization: The software can optimize PDF files, chiefly by compressing the images they contain, to reduce file size.
- Cross-platform compatibility: OCRmyPDF runs on Linux, Windows, macOS, and FreeBSD.
- Integration with popular libraries: The software relies on Tesseract for OCR, Pillow for image processing, and pikepdf for PDF manipulation.
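
The features above can also be reached from Python. The following is a minimal sketch, assuming the `ocrmypdf` package and a Tesseract installation with English language data are available. The helper names `ocr_options` and `make_searchable` are hypothetical, and the option values are illustrative choices, not recommended defaults.

```python
from pathlib import Path


def ocr_options(language: str = "eng", optimize: int = 1) -> dict:
    """Collect keyword arguments for ocrmypdf.ocr() (illustrative values)."""
    if optimize not in (0, 1, 2, 3):
        raise ValueError("optimize must be an integer from 0 to 3")
    return {
        "language": [language],  # Tesseract language code(s) used for recognition
        "optimize": optimize,    # 0 turns optimization off; higher levels compress more
        "deskew": True,          # straighten crooked scans before recognition
    }


def make_searchable(input_pdf: Path, output_pdf: Path, **overrides) -> None:
    """Add an OCR text layer to input_pdf and write the result to output_pdf."""
    # Imported lazily so that ocr_options() can be used without ocrmypdf installed.
    import ocrmypdf

    options = {**ocr_options(), **overrides}
    ocrmypdf.ocr(input_pdf, output_pdf, **options)
```

Calling `make_searchable(Path("scan.pdf"), Path("scan_ocr.pdf"))` would run OCR on a hypothetical `scan.pdf`; the exact set of accepted keyword arguments may differ between OCRmyPDF versions.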

### Development and Community
OCRmyPDF is actively developed and releases are published frequently. The project is hosted on GitHub, where users can report issues, suggest features, and contribute code. Version 15.1.0 was released on October 4, 2023, and newer releases have followed since.

### Usage and Integration
OCRmyPDF can be used as a command-line tool or integrated into other applications and workflows. It is packaged for Linux distributions such as Ubuntu and Debian, so it can be installed through their package managers. The project also has a Docker Hub repository, which allows it to run in containerized environments.
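
As a rough illustration of both usage modes, the sketch below builds the argument lists for a local command-line call and for a containerized call. It assumes the `ocrmypdf` executable is on the `PATH` for the local case and that the Docker image runs `ocrmypdf` as its entry point; the image name is taken from the Docker Hub link in this page, and the function names and file names are hypothetical.

```python
import shlex
import shutil
import subprocess


def ocrmypdf_command(input_pdf: str, output_pdf: str,
                     language: str = "eng", optimize: int = 1) -> list[str]:
    """Build a local ocrmypdf command line as an argument list."""
    return [
        "ocrmypdf",
        "--language", language,        # Tesseract language code
        "--optimize", str(optimize),   # optimization level, 0 to 3
        input_pdf,
        output_pdf,
    ]


def docker_command(image: str = "jbarlow83/ocrmypdf",
                   extra_args: tuple[str, ...] = ()) -> list[str]:
    """Build a docker command that reads a PDF on stdin and writes one to stdout."""
    # The two "-" arguments tell ocrmypdf to use stdin and stdout.
    return ["docker", "run", "--rm", "-i", image, *extra_args, "-", "-"]


def run_local(input_pdf: str, output_pdf: str) -> None:
    """Run ocrmypdf locally, failing early if the executable is missing."""
    if shutil.which("ocrmypdf") is None:
        raise RuntimeError("ocrmypdf executable not found on PATH")
    subprocess.run(ocrmypdf_command(input_pdf, output_pdf), check=True)


# Show the local command as it would be typed in a shell.
print(shlex.join(ocrmypdf_command("scan.pdf", "scan_ocr.pdf")))
# prints: ocrmypdf --language eng --optimize 1 scan.pdf scan_ocr.pdf
```

Building the command as a list rather than a single string avoids shell quoting problems when file names contain spaces.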

## Schema Markup
```json
{
  "@context": "https://schema.org",
  "@type": "Thing",
  "name": "OCRmyPDF",
  "description": "Software for adding an OCR text layer to PDF files and optimizing PDF files",
  "url": "https://ocrmypdf.readthedocs.io/en/latest/",
  "sameAs": [
    "https://github.com/ocrmypdf/OCRmyPDF",
    "https://pypi.org/project/ocrmypdf/",
    "https://hub.docker.com/r/jbarlow83/ocrmypdf"
  ],
  "additionalType": "SoftwareApplication"
}
```
