All Questions
8 questions
1 vote, 0 answers, 491 views
Creating a searchable PDF from one already existing PDF and text (with coordinates)
My Situation:
I have an existing PDF with only images
I have preprocessed OCR output with all text identified, along with its respective coordinates
An application running in C#
I can use other programming ...
0 votes, 1 answer, 2k views
Extract Text from Multipage Attachment PDF Using Google Apps Script
I have a Gmail attachment PDF with multiple scanned pages. When I use Google Apps Script to save the blob from the attachment to a Drive file, open the PDF manually from Google Drive, then select ...
9 votes, 1 answer, 13k views
Extract text from a scanned pdf with images?
I've tried to extract text from a pdf created from the computer and it worked but I wasn't able to extract text from a scanned pdf, which you can find here, with images and several pages such as this ...
0 votes, 1 answer, 127 views
Search for pages with no text in a PDF
How would you proceed to search for "empty" pages in a pdf file?
In the next step I want to convert those pages to .tiff using ghostscript. If it is possible to do everything with one command on a ...
0 votes, 1 answer, 128 views
converting pdf to txt
I am trying to convert PDFs of the Flint water crisis emails from Gov. Snyder to text. Basically, they have 20k+ pages of emails printed from Outlook and then scanned in as PDFs. (Obnoxious, I ...
1 vote, 2 answers, 3k views
How to detect if a pdf is text or image
I have an application where users can upload PDFs, which are converted to text for further processing.
The thing is that some of the uploaded files are image PDFs, so the conversion does not work. Instead ...
1 vote, 2 answers, 603 views
Creating "text" in OCRed PDF
I want to programmatically create a PDF of an image I've OCR'ed. I want to make it selectable/searchable.
I know what and where each letter is. My thought was to create an invisible text letter at each ...
2 votes, 2 answers, 1k views
Converting an Image type PDF to an OCR enabled PDF
I'm not sure if my title is overly descriptive of what I'm trying to do, so I will try to elaborate.
I've been asked to develop a small application where someone can upload a PDF to the website. The ...