Chrome will soon automatically OCR PDFs

ray@lemmy.ml · 2 years ago

Chrome will soon automatically OCR PDFs

bionicjoey@lemmy.ca · 2 years ago

The issue with OCR’ing pdfs is typically that it doesn’t understand the document formatting. So if you’re reading a document which is formatted as two columns per page, the OCR text will be a mess.

anon@lemm.ee · 2 years ago

I’m willing to bet that given that most scientific papers are in that two format column, this ocr will take that into account or it’s dead on arrival.