How to read pdf with tesseract?

slmarcos · January 9, 2019, 11:55pm

Hello, I need to develop a solution to read the contents of a PDF via OCR. I saw tesseract allow me to do this reading, but it only reads images. Does anyone know how I can convert PDF to image and feed the tesseract?

Tnks

Judgewest2000 · January 10, 2019, 12:07am

Text is stored in a PDF as text, unless the text itself is an image of course.

slmarcos · January 11, 2019, 2:34pm

in case the PDF is a scanner, then I need to get the contents through the same OCR

Topic		Replies	Views
Implementing OCR using tesseract.js on a real device Ionic Native	1	1713	September 12, 2018
How To , PDF Text To Speech ionic-v3	3	922	April 17, 2018
Ionic PDF Creator - Scan documents & Convert images to PDF showcase	8	6379	June 2, 2019
How to read a PDF file converted to base64	1	1263	September 12, 2017
Convert pdf as image ionic-v1	2	2181	March 22, 2016

How to read pdf with tesseract?

Related topics