Search
Recommended Products
Related Links


 

 

Informative Articles

Computer Data Backups: Test Now or Cry Later
If you’re like most small business owners, your computer data backups are one of those things that you rarely pay attention to. Computer data backups are kind of like flossing your teeth and eating low-fat, high-fiber foods... everyone knows what...

Microsoft Business Solutions - Navision Customization: C/SIDE, C/ODBC, C/FRONT, XBRL
Microsoft bought Navision, Denmark based software development company, along with Great Plains Software. Now Microsoft Business Solutions offers following ERP applications: Navision (former Navision Attain), Microsoft Great Plains (former Great...

Protect Your Computer!
Surfing on the Internet can be a major problem if you don't employ a reasonable amount of protective software. Hackers, spyware and viruses all combine to make it essential that you have a firewall, a spyware detector and an anti-virus program -...

Some Computer Forensics Basics
- Computer Forensics in a Nutshell Computer forensics are examinations of computers made during a criminal investigation. When police look into the files and data on a computer during an investigation, they are using computer forensics. It is...

The Benefits of Personnel Recruiting Software
Personnel recruiting software makes the process of finding and hiring qualified employees much quicker and easier. Personnel recruiting software is used by both professional staffing firms and by companies that do their own hiring. These software...

 
Google
OCR Software-- Optical Character Recognition Or Optical Crud Recognition?

Optical Character Recognition (OCR) refers to a software technology and processes that involve the translation of printed text into computer searchable text.

Done correctly, OCR enables users to search for and retrieve individual words contained within a file or page. In addition, when a set of files is indexed, users are able to search for keywords across an entire document library and retrieve each page with exact precision. OCR enables users to execute searches in seconds, searches that once could take several hours or days to complete.

However, this technology did not work well on older or poor quality documents that contained mixed fonts or combinations of texts and graphics. Until now!!

Due to several recent technology advances, it is now possible to obtain six-sigma level character accuracy from these types of document collections.

Although it is important to keep in mind that the quality and condition of the paper documents are still key factors in the successful OCR conversion, dramatically improved results can be obtained by enhancing the quality of the scanned image prior to processing.

Noise removal of borders, speckles and skews are now common on the more advanced document scanners.

Furthermore, advanced color filter technologies may be used to reduce any page background colors, in conjunction with multi-light image capture technologies to remove any shadows cast by page creases that could impact image


quality or recognition accuracy.

Once document scanning and processing are complete, an OCR text layer can actually be added and hidden behind each image. An additional orientation filter can be used to ensure that the best image is presented to the OCR engines.

To achieve the highest conversion accuracy possible, the characters in the image can be processed using multi-engine OCR voting technologies that rank each character to determine the best text recognition fit. Then once a word is generated, it will be filtered through a proprietary lexicon to ensure the highest quality results.

Finally, this text can be processed utilizing sophisticated layout retention technologies to represent the image text layout, to provide the best possible text representation for precise search and retrieval. After all, isn’t that why they call it Optical Character Recognition?


About the Author: Founded in 2001, we have successfully completed over 500 document scanning projects within a variety of industry vertical markets and have provided our services to clients throughout the United States. In addition, we are currently supporting the global document management needs of several clients. http://www.DigitalDocumentLLC.com

Source: www.isnare.com