Questions tagged [ocr]
Optical Character Recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. The following topics, although some being distinct fields of application, are also commonly referred to as OCR: Handwritten Text Recognition (HTR), Optical Word Recognition (OWR), Intelligent Character Recognition (ICR), Intelligent Word Recognition (IWR).
6,309
questions
0
votes
0
answers
12
views
How do I use Tesseract to read any Unicode character from an image?
I have the following implementation of Tesseract to identify any specified single English character:
class Program
{
static void Main()
{
string imgPath = "./image.png";
...
0
votes
0
answers
11
views
How can I extract tables from an image into excel using optical character recognition?
As an example, I have this image and will like to convert this to an modifiable excel table. In have tried using the 'pytesseract' library, but it doesn't accurately extract the text from the image ...
-3
votes
0
answers
52
views
How to add horizontal lines to the table in image opencv
I want to add horizontal lines to the below image
resultant output should be like:
Any methods on how to work it out, even if there are diff images which don't have horizontal lines.
I've tried ...
1
vote
0
answers
45
views
How to extract a rectangle in an image from identified lines
I have thousands of images that look like this:
I need to run an OCR algorithm on the "1930 E.D." column. I find that when I crop the image down to just that column, I get much better ...
0
votes
1
answer
55
views
How to recognize single characters from an image using Tesseract?
This is the original image:
This is the processed image:
I'm trying to automate a mini-game, in which characters appear on the screen. I did some light reaserch and managed to process the image to ...
0
votes
0
answers
20
views
How to convert non-readable PDF into readable PDF with OcrMyPdf: troubles with tesseract and configparser
I'm trying to convert a scanned PDF into a readable one.
The original PDF contains text, tables, images/logos. The desired output file should be exactly the same of the original file.
I found ...
0
votes
0
answers
43
views
OpenCV contours sorting x-axis and y-axis
I am working on a python program to solve a wordsearch. I am using pytesseract and opencv to process an image of the wordsearch and the solution will be displayed as a text. The script processes the ...
0
votes
0
answers
22
views
How to make tesseract (pytesseract) recognise '±'?
Plus or minus character
I'm trying to detect text (mostly numbers) from an image (technical diagram). Do I need to train (if yes, how) tesstrain? jtessboxeditor?
On doing the OCR from a set of ...
-1
votes
1
answer
31
views
Implement text detection/extraction from "live screen capture" on Windows [closed]
Hi what would be the best way to implement a way to detect text from a "live feed or capture" of a pc screen itself, maybe a portion of the screen, so the program can execute based on what ...
0
votes
0
answers
22
views
Adobe OCR limitations for PDF extraction [closed]
I am using Adobe OCR in my Python application to extract the text from PDFs. There is a limitation with Adobe OCR which allows only 400 pages to be extracted. Any features available to extract PDFs ...
-1
votes
0
answers
26
views
Using OCR to try identify 7 segment numbers
Why my code just identify 3 of the 4 numbers correctly?
Here I make an outline in the numbers
And here I try to identify correctly the numbers
import cv2
import numpy as np
import matplotlib.pyplot ...
0
votes
0
answers
11
views
Can I integrate Drupal document_ocr with tesseract?
I would like to use an open source image to text converter to populate text fields with in Drupal.
I tried setting the tesseract executable path in settings.php and thought that the document_ocr ...
-2
votes
0
answers
18
views
Enhancing Document Layout Analysis by Adding Positional and Character Information to CNN Inputs
I am working on document layout analysis and have been exploring CNNs and transformer-based networks for this task. Typically, images are passed as 3-channel RGB inputs to these networks. However, my ...
0
votes
0
answers
20
views
Scan Card Payment like Visa Card, Union Pay Card using Vision framewok
enter image description here
I would like scan card number, placeholder name , and expired date in Card Payment.
I see other dependcies and libraries on the internet but some can't able to scan with ...
-2
votes
0
answers
30
views
Detect arithmetic operators in an image [closed]
I created a OCR script using keras_ocr. The input is a flowchart diagram(Grayscale). I want to extract texts and coordinatres of shapes of flowchart image. However, it does not extract arithmetic ...