Skip to main content

Questions tagged [ocr]

Optical Character Recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. The following topics, although some being distinct fields of application, are also commonly referred to as OCR: Handwritten Text Recognition (HTR), Optical Word Recognition (OWR), Intelligent Character Recognition (ICR), Intelligent Word Recognition (IWR).

0 votes
0 answers
12 views

How do I use Tesseract to read any Unicode character from an image?

I have the following implementation of Tesseract to identify any specified single English character: class Program { static void Main() { string imgPath = "./image.png"; ...
CareeeBugger's user avatar
0 votes
0 answers
11 views

How can I extract tables from an image into excel using optical character recognition?

As an example, I have this image and will like to convert this to an modifiable excel table. In have tried using the 'pytesseract' library, but it doesn't accurately extract the text from the image ...
UsangR01's user avatar
-3 votes
0 answers
52 views

How to add horizontal lines to the table in image opencv

I want to add horizontal lines to the below image resultant output should be like: Any methods on how to work it out, even if there are diff images which don't have horizontal lines. I've tried ...
Elysian's user avatar
  • 19
1 vote
0 answers
45 views

How to extract a rectangle in an image from identified lines

I have thousands of images that look like this: I need to run an OCR algorithm on the "1930 E.D." column. I find that when I crop the image down to just that column, I get much better ...
Nick K9's user avatar
  • 4,436
0 votes
1 answer
55 views

How to recognize single characters from an image using Tesseract?

This is the original image: This is the processed image: I'm trying to automate a mini-game, in which characters appear on the screen. I did some light reaserch and managed to process the image to ...
Flako's user avatar
  • 1
0 votes
0 answers
20 views

How to convert non-readable PDF into readable PDF with OcrMyPdf: troubles with tesseract and configparser

I'm trying to convert a scanned PDF into a readable one. The original PDF contains text, tables, images/logos. The desired output file should be exactly the same of the original file. I found ...
eljamba's user avatar
  • 319
0 votes
0 answers
43 views

OpenCV contours sorting x-axis and y-axis

I am working on a python program to solve a wordsearch. I am using pytesseract and opencv to process an image of the wordsearch and the solution will be displayed as a text. The script processes the ...
HND's user avatar
  • 1
0 votes
0 answers
22 views

How to make tesseract (pytesseract) recognise '±'?

Plus or minus character I'm trying to detect text (mostly numbers) from an image (technical diagram). Do I need to train (if yes, how) tesstrain? jtessboxeditor? On doing the OCR from a set of ...
Dominic Mukilan's user avatar
-1 votes
1 answer
31 views

Implement text detection/extraction from "live screen capture" on Windows [closed]

Hi what would be the best way to implement a way to detect text from a "live feed or capture" of a pc screen itself, maybe a portion of the screen, so the program can execute based on what ...
Kcharliee's user avatar
0 votes
0 answers
22 views

Adobe OCR limitations for PDF extraction [closed]

I am using Adobe OCR in my Python application to extract the text from PDFs. There is a limitation with Adobe OCR which allows only 400 pages to be extracted. Any features available to extract PDFs ...
Prasath Balu's user avatar
-1 votes
0 answers
26 views

Using OCR to try identify 7 segment numbers

Why my code just identify 3 of the 4 numbers correctly? Here I make an outline in the numbers And here I try to identify correctly the numbers import cv2 import numpy as np import matplotlib.pyplot ...
George's user avatar
  • 21
0 votes
0 answers
11 views

Can I integrate Drupal document_ocr with tesseract?

I would like to use an open source image to text converter to populate text fields with in Drupal. I tried setting the tesseract executable path in settings.php and thought that the document_ocr ...
user282623's user avatar
-2 votes
0 answers
18 views

Enhancing Document Layout Analysis by Adding Positional and Character Information to CNN Inputs

I am working on document layout analysis and have been exploring CNNs and transformer-based networks for this task. Typically, images are passed as 3-channel RGB inputs to these networks. However, my ...
HARSH DEVMURARI's user avatar
0 votes
0 answers
20 views

Scan Card Payment like Visa Card, Union Pay Card using Vision framewok

enter image description here I would like scan card number, placeholder name , and expired date in Card Payment. I see other dependcies and libraries on the internet but some can't able to scan with ...
Yen Pheayuth's user avatar
-2 votes
0 answers
30 views

Detect arithmetic operators in an image [closed]

I created a OCR script using keras_ocr. The input is a flowchart diagram(Grayscale). I want to extract texts and coordinatres of shapes of flowchart image. However, it does not extract arithmetic ...
Kanchana Kariyawasam's user avatar

15 30 50 per page
1
2 3 4 5
421