Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.
Showing results for Show only | Search instead for Did you mean:/t5/acrobat-discussions/replace-or-repair-ocr-in-scanned-documents/td-p/9581580 Jan 18, 2018 Jan 18, 2018
Copy link to clipboard
We have about 2000 documents (reports) that were scanned to PDF. These have a text (OCR) layer, but the OCR is very bad, with breaks within most words and complete mis-alignment. The text layer is virtually useless. We have an immediate need to remove the OCR layer and re-create it with a better tool. I do not know the tool that was utilized when these were scanned.
Can Acrobat perform this task? If so, can it do a batch process?
I have looked in the past, and struggled with the fact that the pdf's have a text layer already, so no new OCR is performed.
Thanks in advance for any insights.
Scan documents and OCR Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
4 Replies 4 Adobe Employee ,/t5/acrobat-discussions/replace-or-repair-ocr-in-scanned-documents/m-p/9581581#M83420 Mar 06, 2018 Mar 06, 2018
Copy link to clipboard
Please run OCR again on all the documents. It will create a new layer of recognized text. Please use latest Acrobat DC for this as it supports running OCR again on OCRed documents instead of giving an error.
And you can OCR multiple files at a time using "In multiple files" option of Recognize text or by creating an action.
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
/t5/acrobat-discussions/replace-or-repair-ocr-in-scanned-documents/m-p/9581582#M83421 Mar 28, 2018 Mar 28, 2018
Copy link to clipboard
Thank you for the tip, but it doesn't seem to work on the latest Acrobat Pro DC for Mac (2018.011.20038). It still throws up the error about the PDF already having recognized text.
Is there something I missed?
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Adobe Employee ,/t5/acrobat-discussions/replace-or-repair-ocr-in-scanned-documents/m-p/9581583#M83422 Apr 27, 2018 Apr 27, 2018
Copy link to clipboard
Sorry for the delayed response and inconvenience caused. You may try sanitizing the current PDF file and see if that helps.
To sanitize the PDF, you can refer to the Adobe article Removing sensitive content from PDFs in Adobe Acrobat DC
You may also try to print the PDF through Print to Adobe PDF.
Is it possible to share the PDF file with us? To share the file, please use Adobe Send feature, upload the file, share the link to files via private message only, How Do I Send Private Message
Let us know how it goes and share your findings.