Explore tens of thousands of sets crafted by our community.
Optical Character Recognition (OCR) Basics
12
Flashcards
0/12
Preprocessing
Preprocessing involves preparing the raw image for OCR by enhancing it through noise reduction, binarization, and normalization, to improve the accuracy of text recognition.
Character Segmentation
Character Segmentation is the process of separating text into individual characters, which is critical for character recognition, especially in cursive or non-standard fonts.
Pattern Recognition
Pattern recognition is the core of OCR, where the system uses algorithms to identify and classify the shapes of characters within the segmented image.
Optical Character Recognition (OCR)
OCR is a technology that converts different types of documents, such as scanned paper documents, PDFs or images captured by a digital camera, into editable and searchable data.
Feature Extraction
Feature extraction involves identifying and extracting key features from character segments that help the OCR algorithm differentiate between various characters.
Neural Networks
Neural networks, particularly Convolutional Neural Networks (CNNs), are used in modern OCR systems to classify characters based on learned features from large datasets of text.
Binarization
Binarization is the process of converting a grayscale image into a binary image, where each pixel is either black or white, to simplify the analysis for OCR.
Language Model
The language model in OCR is used to predict the likelihood of a sequence of characters or words, helping to resolve ambiguities and improve the accuracy of the text recognition.
Text Line Detection
Text line detection involves identifying and separating lines of text within the image, which is necessary for processing multi-line documents and maintaining the structure of the text.
Deskewing
Deskewing corrects the alignment of an image by detecting and fixing any slant or irregular orientation, which is essential for accurate character recognition.
Noise Reduction
Noise Reduction in OCR is the process of removing irrelevant information and distortions from the image to improve the clarity of text for the OCR engine.
Adaptive Thresholding
Adaptive thresholding is a technique used during binarization that adjusts the threshold for different regions of the image based on local image characteristics, facilitating better foreground and background separation.
© Hypatia.Tech. 2024 All rights reserved.