Skip to main content

All Questions

Tagged with
Filter by
Sorted by
Tagged with
1 vote
0 answers
491 views

Creating a searchable PDF from one already existing PDF and text (with coordinates)

My Situation: I have an existing PDF with only images I have a preprocessed OCR with all text identified and their respective coordinates An application running in C# I can use other programming ...
Vitor Moscolini's user avatar
0 votes
1 answer
2k views

Extract Text from Multipage Attachment PDF Using Google Apps Script

I have a Gmail attachment PDF with multiple scanned pages. When I use Google Apps Script to save the blob from the attachment to a Drive file, open the PDF manually from Google Drive, then select ...
Janine White's user avatar
9 votes
1 answer
13k views

Extract text from a scanned pdf with images?

I've tried to extract text from a pdf created from the computer and it worked but I wasn't able to extract text from a scanned pdf, which you can find here, with images and several pages such as this ...
Revolucion for Monica's user avatar
0 votes
1 answer
127 views

Search for pages with no text in a PDF

How would you proceed to search for "empty" pages in a pdf file? In the next step I want to convert those pages to .tiff using ghostscript. If it is possible to do everything with one command on a ...
Matthias Pitscher's user avatar
0 votes
1 answer
128 views

converting pdf to txt

I am trying to convert pdf's to text of the Flint water crisis emails from Gov. Snyder. Basically they have 20k+ pages of emails printed from Outlook and then scanned in as .pdf's. (Obnoxious, I ...
CJ Sullivan's user avatar
1 vote
2 answers
3k views

How to detect if a pdf is text or image

I have an application where users can upload pdf which are converted to text for further processing. The thing is that some of the uploaded files are image pdf, so conversion it does not work. Instead ...
Andrés Chandía's user avatar
1 vote
2 answers
603 views

Creating "text" in OCRed PDF

I want to programmtically create a PDF of an image I've OCR'ed. I can to make it selectable/searchable. I know what and where each letter is. My thought was to create a invisible text letter at each ...
Paul Draper's user avatar
  • 83.6k
2 votes
2 answers
1k views

Converting an Image type PDF to an OCR enabled PDF

I'm not sure if my title is overly descriptive of what I'm trying to do so I will try and elabarate. I've been asked to develop a small application where someone can upload a PDF to the website. The ...
L2wis's user avatar
  • 294