I am setting up a new SharePoint 2013 install, and have put a handful files in a doc library to test search. The content has been indexed, and I can find the content inside many files and file types without issue--including "native" PDF files. However, it doesn't seem to index the content of a scanned and OCR'd (text with image overlay) PDF. I have verified that the text is indeed in the OCR text by copying and pasting phrases, and I also confirmed that the crawl log shows the file as successfully crawled. The filename is also indexed.
So... it would seem that the SharePoint 2013 indexer does not index the text in scanned and OCR'd PDF files. Am I missing something? Can anyone else confirm this behavior?
Thanks!
Ryan