Verification

The best way to verify the accuracy of a JBIG2 implementation is to run it on a set of files and visually inspect the results. Wherever the JBIG2-compressed files differ from the original images, the difference should be an improvement in image quality. At a minimum, you should insist that the compressed files show no degradation in image quality. There is no substitute for looking at the compressed images and judging whether they appear acceptable, but doing so can be a time-consuming process. As a result, many people are interested in verification methods that can be automated.

For this reason, we recommend using an OCR engine to compare the quality of the image before and after compression. The words in the text file produced from each image can be programmatically looked up in a dictionary to see whether they are valid. This yields an easily measurable score for each document. A good JBIG2 implementation should produce a compressed file that scores about as well as, if not better than, the original image.
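The dictionary-lookup score described above can be sketched in a few lines of Python. This is a minimal illustration, not the validation tool used for the figures in this primer: the function name `ocr_score`, the tiny `DICTIONARY` set, and the two sample strings are all hypothetical, and a real run would take its text from an OCR engine and load a full word list.

```python
import re

def ocr_score(ocr_text, dictionary):
    """Count the words in OCR output that appear in the dictionary.

    Matching is case-insensitive. `dictionary` is a set of lowercase words.
    """
    # Treat runs of letters (with an optional internal apostrophe) as words.
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", ocr_text)
    return sum(1 for word in words if word.lower() in dictionary)

# Hypothetical stand-in for a standard dictionary.
DICTIONARY = {"the", "quick", "brown", "fox", "jumps", "over", "lazy", "dog"}

# Hypothetical OCR output: one misread word ("ovcr") in the first string.
original_text = "The quick brown fox jumps ovcr the lazy dog"
compressed_text = "The quick brown fox jumps over the lazy dog"

print(ocr_score(original_text, DICTIONARY))    # 8
print(ocr_score(compressed_text, DICTIONARY))  # 9
```

A higher count for the compressed file than for the original, as in this toy case, would suggest that compression did not hurt legibility for the OCR engine.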

For example, when measured by an OCR validation tool, the original document shown in the previous section scored 469 (that is, 469 words in the OCRed text file matched an entry in a standard dictionary), while the CVISION-compressed file scored 472. By contrast, the badly mismatched document produced by another vendor scored 449.
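To put these three scores on a common footing, each compressed score can be expressed as a change relative to the original. The short sketch below uses only the numbers quoted above; the helper name `relative_change` is an assumption made for illustration.

```python
def relative_change(score, baseline):
    """Fractional change of a compressed file's score versus the original's."""
    return (score - baseline) / baseline

baseline = 469  # score of the original document

for label, score in [("CVISION", 472), ("other vendor", 449)]:
    diff = score - baseline
    print(f"{label}: {diff:+d} words ({relative_change(score, baseline):+.2%})")

# CVISION: +3 words (+0.64%)
# other vendor: -20 words (-4.26%)
```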

Figure 12. The OCR engine found more valid words in the CVISION compressed image than it did in the original image. The compressed image created by the competition did much worse than the original.


OCR Accuracy

No OCR engine is 100% accurate. They all miss an occasional word that is clear to a human reader. Because there is an element of chance in even the best OCR engines, testing against the originals on just a few files cannot tell you with certainty whether a JBIG2 implementation degrades quality. Over a large set of documents, however, OCR comparison can be a very good measure of image quality. Within a small margin of error, you want the JBIG2 files to have OCR recognition rates about the same as, or even better than, those of the original files.
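One way to apply this over many files is to total the scores across the whole set and check that the compressed total stays within a chosen tolerance of the original total. The sketch below is an assumption-laden illustration: the function name `passes_ocr_check`, the 1% default margin, and all sample scores other than the 469/472/449 figures quoted earlier are hypothetical, since the text does not specify how small the margin should be.

```python
def passes_ocr_check(pairs, margin=0.01):
    """Compare aggregate OCR scores for a set of files.

    pairs  -- list of (original_score, compressed_score), one tuple per file
    margin -- allowed fractional shortfall (assumed value, not from the primer)

    Returns (passed, ratio), where ratio = compressed total / original total.
    """
    total_original = sum(original for original, _ in pairs)
    total_compressed = sum(compressed for _, compressed in pairs)
    if total_original == 0:
        raise ValueError("no valid words found in the original files")
    ratio = total_compressed / total_original
    return ratio >= 1.0 - margin, ratio

# First pair uses the scores quoted earlier; the other two are made up.
good_encoder = [(469, 472), (512, 509), (330, 331)]
passed, ratio = passes_ocr_check(good_encoder)
print(passed, round(ratio, 4))

# A single file with the other vendor's score falls outside the margin.
passed, ratio = passes_ocr_check([(469, 449)])
print(passed, round(ratio, 4))
```

Summing before dividing weights each file by its word count, so a one-word miss on a short page does not dominate the result.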

Shown below is a chart for a 186-page file in which the version compressed with one JBIG2 encoder (CVISION PdfCompressor) had a few more OCR hits than the original. This is a good indication that the compression preserves image quality for this type of document; that is, there is no OCR-based information loss.

Figure 13. Across the entire 186-page document, the OCR engine found more valid words in the CVISION compressed files than it did in the original, thereby indicating that image quality was preserved.


Copyright © 1998-2019 CVISION Technologies, Inc.
CVISION, CVista, CBatch, and the CVISION logo are registered trademarks of CVISION Technologies, Inc.