CVISION Technologies

Document Imaging, Information, and Tech Support

Archive for the 'OCR Verification and Confidence' Category

OCR Free Software Download

July 16th, 2007 by Chris

Question: Does PdfCompressor have the ability to make files text-searchable, even if the files are JPEG or TIFF? Also, what are the advantages of text-searchable documents?

Answer: Yes. PdfCompressor compresses files and applies OCR to make them text-searchable. If you have TIFF or JPEG files, it can convert them into compressed, searchable PDFs. The OCR engine in PdfCompressor is designed for high-volume corporate use.

PdfCompressor provides configuration options for speed, volume, and automation. CVISION automates the OCR process with Watch Folders, which let users leave the process unattended: in Watch Folder mode, files are OCR’d simply by being dropped into a folder. To accommodate large-volume scanning, the Batch OCR feature within PdfCompressor processes scanned documents quickly, at about 1 page per second.
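To illustrate the general idea behind a watch folder, here is a minimal Python sketch of a polling loop. It is not PdfCompressor's implementation, which is not described here. The function names, the list of file extensions, and the `process` callback (a stand-in for whatever OCR step you run) are all assumptions made for the example.

```python
import os
import time

# Hypothetical list of input types; adjust to whatever your scanner produces.
IMAGE_EXTENSIONS = (".tif", ".tiff", ".jpg", ".jpeg")


def find_new_files(folder, seen, extensions=IMAGE_EXTENSIONS):
    """Return image files in `folder` that are not yet in `seen`, and record them."""
    new_files = []
    for name in sorted(os.listdir(folder)):
        path = os.path.join(folder, name)
        if not os.path.isfile(path):
            continue  # skip subfolders
        if not name.lower().endswith(extensions):
            continue  # skip files that are not scanned images
        if path in seen:
            continue  # already handed to the processing step
        seen.add(path)
        new_files.append(path)
    return new_files


def watch(folder, process, poll_seconds=5.0, max_polls=None):
    """Poll `folder` and call `process(path)` once for each new image file.

    `process` is a placeholder for the OCR step. `max_polls=None` runs forever.
    """
    seen = set()
    polls = 0
    while max_polls is None or polls < max_polls:
        for path in find_new_files(folder, seen):
            process(path)
        polls += 1
        if max_polls is None or polls < max_polls:
            time.sleep(poll_seconds)
```

Calling `watch("C:/scans/incoming", my_ocr_step)` would then process each image once as it appears in the folder. The folder path and `my_ocr_step` are placeholders. A production version would also need to wait until a file has finished copying before processing it.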

To try our free OCR software, click the link below:

http://www.cvisiontech.com/pdfpro31_download.html

Category: All, Color OCR, OCR, OCR Accuracy, OCR Download, OCR Software, OCR Verification and Confidence, OCR with Application to the Digital Mailroom, Optical Character Recognition

Zone OCR

March 23rd, 2007 by Chris

Question: I work in a hospital. We are planning to scan very old files into our computer. What we want to do is to get specific data from certain parts of the files, so that we can put this in our database. Is this possible?

Answer: Yes, this is possible. PdfCompressor has a function called zone OCR. Once you define the zones where you want OCR to occur, the recognized data is written to a Rich Text Format (RTF) file, which you can then load into your database. To optimize zone OCR results with very old files, follow these steps:

1. Verify that the existing resolution (dpi) is correct. OCR engines are calibrated based on the dpi value, which is typically stored in the image file header. If this value is incorrect, the OCR results will degrade.

2. Once the dpi is set correctly, upsample to a reasonably high resolution; 300 dpi is typically a good target. The upsampling method does matter - use bicubic spline interpolation.

3. OCR engines usually perform better on correctly thresholded bitonal documents than on the original color files. If the threshold is poorly chosen, however, the OCR engine is better off with the original color or grayscale image. So, if possible, threshold each upsampled image manually so that the text is as readable as possible.
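The arithmetic behind the three steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the 10% tolerance, and the idea of checking the header dpi against a known paper width are ours, not part of PdfCompressor. The sketch computes the target size for upsampling but does not perform the bicubic resampling itself, which would be done with an imaging library.

```python
def dpi_looks_plausible(width_px, header_dpi, page_width_in, tolerance=0.10):
    """Step 1: compare the header dpi with the dpi implied by a known paper width.

    Assumes you know the physical width of the scanned page (for example,
    8.5 inches for US Letter). The 10% tolerance is an arbitrary choice.
    """
    implied_dpi = width_px / page_width_in
    return abs(implied_dpi - header_dpi) / implied_dpi <= tolerance


def upsample_plan(width_px, height_px, source_dpi, target_dpi=300):
    """Step 2: return (scale, new_width, new_height) for reaching target_dpi.

    Never scales down: images already at or above target_dpi keep their size.
    """
    if source_dpi <= 0:
        raise ValueError("source_dpi must be positive")
    scale = max(1.0, target_dpi / source_dpi)
    return scale, round(width_px * scale), round(height_px * scale)


def to_bitonal(gray_rows, cutoff):
    """Step 3: apply a manually chosen global threshold to a grayscale image.

    `gray_rows` is a list of rows of 0-255 values. Pixels darker than
    `cutoff` become 0 (text); all others become 1 (background).
    """
    return [[0 if value < cutoff else 1 for value in row] for row in gray_rows]
```

For example, a US Letter page scanned at 100 dpi is 850 by 1100 pixels, so `upsample_plan(850, 1100, 100)` gives a scale factor of 3.0 and a target size of 2550 by 3300 pixels. The `cutoff` passed to `to_bitonal` is the value you would tune by eye for each file, as step 3 recommends.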

You can try out the PdfCompressor’s zone OCR below:

http://www.cvisiontech.com/pdf_compressor_31.html

Category: All, OCR, OCR Accuracy, OCR Download, OCR Languages, OCR PDF, OCR Software, OCR Verification and Confidence, OCR with Application to the Digital Mailroom, Optical Character Recognition