MichalSimonsen958

Revision as of 09:29, 8 May 2013 (Wed) by MichalSimonsen958 (talk | contribs) (New page: Optical Character Recognition (OCR) identifies a computer software technology and procedures that require the interpretation of printed text in to computer searchable text. Done correct...)


Optical Character Recognition (OCR) refers to the software technology and procedures that convert printed text into computer-searchable text.

Done correctly, OCR lets people search for and retrieve individual words within a single record or page. Furthermore, once a set of files is indexed, users can search for keywords across an entire record library and retrieve the exact pages that contain them. Searches that once took hours or days can be completed in seconds.
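As a rough illustration of how indexed OCR text supports page-level retrieval, the sketch below builds a small inverted index in Python. The file names, page text, and the functions `build_index` and `search` are hypothetical and are not taken from any particular OCR product.

```python
import re
from collections import defaultdict


def build_index(pages):
    """Map each lowercase word to the set of (document, page) pairs containing it."""
    index = defaultdict(set)
    for (doc, page_no), text in pages.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add((doc, page_no))
    return index


def search(index, query):
    """Return sorted (document, page) locations that contain every query word."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return []
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return sorted(hits)


# Hypothetical OCR output keyed by (document name, page number).
pages = {
    ("deed_1984.pdf", 1): "Warranty deed recorded in the county register",
    ("deed_1984.pdf", 2): "Legal description of the parcel",
    ("invoice_17.pdf", 1): "Invoice for parcel survey services",
}

index = build_index(pages)
print(search(index, "parcel"))
```

Because the index stores page locations rather than whole documents, a query returns the exact pages to open instead of a list of files to read through.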

In the past, however, this technology did not work well on older or low-quality paper documents that contained mixed fonts or a mixture of text and graphics. That has now changed.

Thanks to many recent advances in the technology, it is now possible to achieve six-sigma-level character accuracy on these kinds of document libraries.
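To put the six-sigma figure in concrete terms, the short Python sketch below assumes the conventional definition of six sigma as 3.4 defects per million opportunities, with each character counted as one opportunity. The 2,000-characters-per-page figure is an illustrative assumption, not a number from this article.

```python
SIX_SIGMA_DPMO = 3.4  # conventional six-sigma rate: defects per million opportunities


def accuracy_from_dpmo(dpmo):
    """Convert defects per million opportunities into a fractional accuracy."""
    return 1.0 - dpmo / 1_000_000


def expected_errors(char_count, dpmo=SIX_SIGMA_DPMO):
    """Expected number of misread characters in a text of char_count characters."""
    return char_count * dpmo / 1_000_000


chars_per_page = 2_000  # assumed page size, for illustration only
print(f"character accuracy: {accuracy_from_dpmo(SIX_SIGMA_DPMO):.5%}")
print(f"expected errors per page: {expected_errors(chars_per_page):.4f}")
```

Under these assumptions, a six-sigma character accuracy corresponds to well under one misread character per hundred pages.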

The quality and condition of the paper files remain crucial to a successful OCR conversion, but dramatically better results can be obtained by improving the quality of the scanned image before OCR is run.

Removal of border noise and speckles, along with skew correction, is now common on the more advanced document scanners.
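Speckle removal can be illustrated with a very small Python sketch. It assumes a binary image stored as a list of rows (1 for ink, 0 for background) and clears any ink pixel that has too few ink neighbors. The function name `despeckle` and the `min_neighbors` parameter are hypothetical; production scanners use more sophisticated filters than this.

```python
def despeckle(image, min_neighbors=1):
    """Return a copy of a binary image with isolated ink pixels cleared.

    image is a list of rows; 1 means ink and 0 means background.
    An ink pixel is kept only if at least min_neighbors of its
    8 surrounding pixels are also ink.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    cleaned = [row[:] for row in image]
    for y in range(height):
        for x in range(width):
            if image[y][x] != 1:
                continue
            neighbors = 0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    if dy == 0 and dx == 0:
                        continue
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < height and 0 <= nx < width:
                        neighbors += image[ny][nx]
            if neighbors < min_neighbors:
                cleaned[y][x] = 0
    return cleaned


page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 0, 0, 0, 0],  # a lone speckle
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 1, 1, 0],  # part of a character stroke
    [0, 0, 0, 1, 1, 0],
]

for row in despeckle(page):
    print("".join("#" if pixel else "." for pixel in row))
```

The neighbor counts are read from the original image rather than the cleaned copy, so the result does not depend on the order in which pixels are visited.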

Furthermore, advanced color filtering can drop out page background colors, and multi-light image capture can remove shadows cast by page wrinkles, both of which could otherwise degrade image quality or recognition accuracy.
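A simple form of background color dropout can be sketched in Python as follows. The sketch assumes the image is a list of rows of (R, G, B) tuples, treats the most frequent color as the page background, and whitens every pixel within a per-channel tolerance of it. The function name `drop_background` and the tolerance value are illustrative assumptions, not a description of any scanner's actual filter.

```python
from collections import Counter

WHITE = (255, 255, 255)


def drop_background(pixels, tolerance=40):
    """Whiten pixels whose color is close to the most common (background) color."""
    flat = [pixel for row in pixels for pixel in row]
    if not flat:
        return []
    background = Counter(flat).most_common(1)[0][0]

    def is_background(pixel):
        # Close means every channel is within the tolerance of the background.
        return all(abs(a - b) <= tolerance for a, b in zip(pixel, background))

    return [[WHITE if is_background(pixel) else pixel for pixel in row]
            for row in pixels]


YELLOW = (240, 230, 140)  # hypothetical colored paper stock
FADED = (235, 225, 150)   # slightly different shade of the same paper
INK = (20, 20, 20)

scan = [
    [YELLOW, YELLOW, INK],
    [YELLOW, FADED, INK],
    [YELLOW, YELLOW, YELLOW],
]

for row in drop_background(scan):
    print(row)
```

The dark ink pixels are left untouched while both shades of the paper color become white, which gives the recognition step a cleaner contrast to work with.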

Once document scanning and image processing are complete, an OCR text layer can be added and hidden behind each image. An additional orientation filter may be used to ensure that the best possible image is passed to the OCR engines.

To achieve the highest possible conversion accuracy, the characters in the image may be processed using multi-engine OCR voting systems that rank the candidates for each character to find the best match. Once a word has been produced, it is then checked against a custom lexicon to ensure the best-quality results.
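The voting and lexicon steps described above can be sketched in Python. This is a minimal sketch under stated assumptions: the engine outputs are made-up strings that are already aligned character by character, the vote is a plain majority per position, and the lexicon check uses fuzzy matching from the standard `difflib` module. The names `vote_characters`, `lexicon_filter`, and `LEXICON` are hypothetical, and real multi-engine systems typically weight engines by confidence rather than counting votes equally.

```python
import difflib
from collections import Counter


def vote_characters(readings):
    """Pick the most common character at each position across engine outputs."""
    if not readings:
        return ""
    if len({len(reading) for reading in readings}) != 1:
        raise ValueError("this sketch assumes the engine outputs are already aligned")
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*readings)
    )


def lexicon_filter(word, lexicon, cutoff=0.8):
    """Return the word itself if known, else its closest lexicon entry, else the word unchanged."""
    if word in lexicon:
        return word
    matches = difflib.get_close_matches(word, lexicon, n=1, cutoff=cutoff)
    return matches[0] if matches else word


# Hypothetical lexicon and engine outputs, for illustration only.
LEXICON = ["archive", "record", "scan"]

# Three engines each misread a different character of the same word.
engine_outputs = ["arch1ve", "anchive", "archivc"]
voted = vote_characters(engine_outputs)
print(voted)

# A word that survives voting with an error can still be repaired by the lexicon.
print(lexicon_filter("rec0rd", LEXICON))
```

Voting corrects errors that the engines do not share, while the lexicon pass catches errors that a majority of engines made in the same place.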

Finally, the text may be processed using sophisticated layout-retention systems that reproduce the layout of the text in the image, giving the best possible text representation for accurate search and retrieval. After all, isn't that why they call it Optical Character Recognition?

Saxon Archives Palm Beach, LLC, 1601-C Hill Avenue, Mangonia Park, FL 33407. Toll-free: 1-800-747-3334. Local: 561-882-1170.

Saxon Archives Treasure Coast, 6526 South Kanner Highway, Stuart, FL 34997. Toll-free: 866-457-2966.