All Questions

-4 votes
0 answers
84 views

How to improve handwriting recognition in an image [closed]

I'm trying to develop a system to read the handwriting in a chart within a written page, using a multimodal LLM. I'm using Google Apps Script. So far I've experimented: function openRouterApiRequest() {...
user1592380
  • 36.6k
0 votes
0 answers
16 views

Getting proper crop coordinates of a snip

I am building a Chrome extension for OCR. Users can upload a PDF, which gets converted into an image that I render in a new tab. I create an overlay on top of the image container and let the user take ...
Ashwin B
1 vote
0 answers
39 views

Text is positioned incorrectly (Tesseract.js)

I'm creating an image-to-text converter where the text in the image can be selected (like Live Text on iOS). I would like the generated text to be the same size and position as the original text in an ...
That wolphin
1 vote
2 answers
214 views

OCR.space error 99 in JavaScript POST request using base64

I am using the free OCR.space API for my website, using base64Image to upload the image (received from a JPG-only file input in the HTML file), and got this error: {"OCRExitCode":...
Marley Lamparter
2 votes
1 answer
728 views

With Node.js: how to use the OpenAI API as OCR for a local image?

I am trying to use OpenAI's API as OCR in Node with the gpt-4o model on a local image. const api_key = "mykey" import OpenAI from 'openai'; import fs from "fs" const openai = new OpenAI(...
Pipo
  • 5,651
1 vote
0 answers
59 views

Trying to highlight words in an image using OCR

When I try to highlight words in a chosen image, it fails, saying there are no words for the text (Tesseract.js). I tried many things to make it mark words in the image after the OCR, but it still fails to, ...
tabs
  • 11
3 votes
0 answers
884 views

Tesseract.js OCR How do I properly set Page Segmentation Mode (PSM, pageseg) to detect a single number in an image

I've been using tesseract to read various numbers (up to 99,999.9) in the format below: Example of image that OCR failed on: It seems to get a proper read about 80% of the time, but I need 95% ...
justiceorjustus
-1 votes
1 answer
314 views

How to find patterns in an image in JS

I am new to AI and the world of text recognition from images. I am trying to recognize character patterns in an image using JavaScript, but I have absolutely no idea where to even start. My goal is to ...
roee
  • 105
0 votes
1 answer
719 views

Why can't the imgly background remover find the resource metadata?

I am trying to use the imgly/background-removal package in node.js. However, I keep getting this error: Error: Resource metadata not found. Ensure that the config.publicPath is configured correctly. ...
Rikudou
  • 189
0 votes
1 answer
1k views

Integration of Image-To-Text issue with react-native and @react-native-ml-kit/text-recognition

I have a React Native project, and within this project I want to integrate the ability for the user to take a photo or select a photo from their library. Once the image is selected, I want to then ...
ojandali
  • 203
0 votes
1 answer
384 views

How to convert two/three-column images to text with Tesseract.js OCR?

I am working on a React.js project and am almost done, but my problem is that when I convert two/three-column images to text, Tesseract (OCR) does not convert them as I want, because two columns' text ...
Ali hussain
2 votes
0 answers
384 views

Dynamically add OCR predictions to the label-studio setup

I have a Label Studio setup on my own website using JS. The setup looks like this: and this is the label-studio config: var data = JSON.parse(document.getElementById('annotations').textContent); var ...
Krishna
  • 349
1 vote
1 answer
2k views

How to get the coordinates of text on an image with Node.js?

I am trying to get the x and y coordinates of specific text on an image like this. On this image I am trying to detect where X:input Y:input is located which could be anywhere on future images. In ...
1 vote
0 answers
135 views

Use native document picture scanner from the web

I am building a web application that needs document scanner functionality. I did some research on doing that fully with web technologies, but it seems that it's a bit too complicated for what I ...
user1780729
0 votes
3 answers
945 views

Convert SVG path of a glyph to the character the glyph represents

This working CodePen demo shows a number which is represented as a vector. May I ask how to convert the number vector (SVG format), as shown in the demo, to a real number? From my search on the web, ...
Buzz Su
  • 47
