CVision Tech CVision Tech
English French German Italian Japanese Korean Norwegian Polish Portuguese Spanish Swedish Thai Turkish
  • Download
  • Contact
  • Live Chat
  • Store
Store CVision Tech Contact Info
**
  • Home
  • Products
    • PdfCompressor
    • Maestro Recognition Server
    • PdfCompressor Developer’s SDK
    • OCR Engine
    • PDF Optimization Suite for Captiva
    • PDFOptimizer for OpenText Captiva
    • ImageOptimization for Documentum
    • PdfCompressor for Kofax
    • PdfCompressor
  • Solutions
    • File Compression
    • OCR
    • PDF Conversion
    • PDF Linearization
    • PDF/A Compliance for Archiving
      • DocArchiver
  • Industries
    • Banking and Financial Services
    • Tax and Accounting
    • Legal
      • Legal Document Management
      • Specific Needs for Legal Market
      • Specific Needs for Legal Market
    • Government
    • Education
    • Healthcare
    • Insurance
    • Wireless Telecom
    • Scanning Bureaus
    • Web Repositories
    • ASPs
    • News & Media
  • Resources
    • Resource Library
    • PdfCompressor Overview
    • Document Imaging Blog
    • The Visionary Newsletter
    • Compression
    • White Papers
      • PDF/A Document Archiving Primer
        • Challenges and Complexity of Document Archiving
        • Converting Documents to a Standard Electronic Format
        • PDF Evolves into the Electronic Document Standard
        • PDF as a Records Management Document Solution
        • PDF/A: Document Solution for Archiving and RM
      • Advanced Document Compression Primer
        • Reduced Storage Costs
        • Improved Collaboration Capabilities
        • Fully Searchable PDF Files
        • PdfCompressor’s Adjustable Settings
        • PdfCompressor - Complementing Document Management Workflow
      • OCR Software Primer
        • Thresholding within OCR
        • Texture Patterns and Small Fonts OCR
        • OCR, Neural Networks and other Machine learning Techniques
        • OCR, Crytorithms, Cryptograms and Substitution Ciphers
        • CAPTCHA: Human and Machine Readability & OCR
        • OCR & Novel Fonts, Multidirectional and Undersampled Text
        • Relationship between OCR & JBIG2
        • OCR, MRC & JPEG2000
        • Reverse Video & OCR
        • OCR & How they relate to MFPs (MultiFunctional Peripheral devices)
        • Dictionary Lookup and OCR
        • Rating an OCR System
        • Tweaking the System to Optimize OCR Performance
        • Searchable PDF using OCR
        • Electronic File Conversion & OCR
        • Bar Codes, OCR & ICR
        • OCR & Form Recognition
        • Data Extraction with OCR
        • Business Process Automation and How it Relates to OCR
        • OCR-based ROI
        • Towards the Paperless Office
      • JBIG2 Compression Primer
        • The Business Case for JBIG2 Compression
        • JBIG2 Compression Success Stories
        • JBIG2: A short history
        • Digital file formats: The short definition of JBIG2
        • JBIG2 and TIFF compared
        • JBIG2 and JBIG Comparison
        • Essential compression issues
        • Smart Compression Codecs: JBIG2, JPEG2000, and MPEG4
        • JBIG2: The Compression Connection
        • The JBIG2 Standard
        • Lossless, Lossy, and Perceptually Lossless Compression
        • JBIG2 Technical Advantages for Business Solutions
        • JBIG2 Technical Advantages: File Size
        • Efficient Encoding
        • OCR Support within PDF Format
        • PDF Web Optimization
        • Scanner Distortions Resolved
        • JBIG2-Compressed PDF Documents
        • Pattern Matching & Substitution
        • The Dangers of PM&S: Proceed with Caution
        • Verification
        • Halftoning in JBIG2
        • Utilizing a JBIG2 Encoder with No Information Loss
        • Overview: Benefits of PDF Compression and PDF Conversion
        • JBIG2 Compression Summary
    • Product Video Tutorials
      • PdfCompressor Demo Video
      • Maestro Demo Video
  • News & Events
    • Recent News and Events
      • CVISION Releases PdfCompressor 6.6
      • CVISION Releases PdfCompressor 6.5
      • CVISION Technologies will exhibit at Prophet 21 WWUG Conference in New Orleans
    • Industry News
  • Support
    • Support Login
    • System Requirements
    • Documentation
    • FAQs
      • Automatic Licensing Documentation
    • OCR Languages Supported
    • Submit a Ticket
  • About Us
    • Company Information
    • Partners
    • Success Stories
      • File Compression and Dept. of Homeland Security
      • Legal Industry Enjoys Freedom from Paper
      • University benefits from Improved Document Capture
      • Media Organization enjoys benefits of OCR, compression, conversion
      • Law Firm benefits from Auto-Routing & Filing of Image Documents
      • Improved Efficiency for the Legal Industry
      • New York City based law firm accelerates document efficiency with OCR
      • Leading hospital optimizes documents with compression and OCR
      • Global financial company utilizes digital mailroom
      • Energy Consulting and Construction Company Improves Document Accessibility
      • Manufacturing Company Reduces Accounts Payable Costs with Advanced Solution
      • Frontier Farm Credit Optimizes Accessibility with Distributed Capture Solution
      • Technology Company Reduces Storage Costs
      • CVISION Provides American Radio History a PDF Optimization Solution
      • Top 5 Global Financial Firm Processes 1.25 Billion Pages Yearly with PdfCompressor
      • Global Law Firm Resolves Bottleneck of Scanning and OCR with CVISION
      • Leading Distribution Company Realizes ROI Within 6 Months
      • Non-Profit Leverages Compression for Document Workflow
      • Large Government Agency Uses Compression to Accelerate File Transmission and Retrieval
      • Global Credit Card Company Accelerates Merchant Statement Processing Speed
      • Global Power Industry Leader Increases Document Handling Efficiency by More Than 50% with PdfCompressor
      • Argus der Presse Case Study
      • Healthcare Provider Improves Patient Care with Maestro OCR Software for EHR
      • Government Agency Improves OCR Efficiency with PdfCompressor
    • Client Testimonials
    • Customer Feedback
    • Careers
    • Contact
  • Home
  • Support
  • FAQs
 

CVISION Technologies PDF Compressor FAQ

CVISION's Solutions for the Education Industry

PDF Compressor Frequently Asked Questions

  1. What are the benefits of PdfCompressor?
  2. What documents can be compressed using PdfCompressor?
  3. What product settings are recommended for compression of black and white TIFF files?
  4. What product settings are recommended for compression of color files?
  5. What product settings are recommended for compression of generated documents, such as company brochures that include embedded images?
  6. What PdfCompressor settings are recommended for scanned bitonal files that may include pictures or halftone regions, e.g., newspapers, photos, etc., captured using an image scanner or a digital copier?
  7. Is PdfCompressor useful on files that are completely generated, i.e., contain no image streams?
  8. What should I do if scanned documents in TIFF format, which appear to be black and white, are still very large after being processed with PdfCompressor?
  9. What other image processing options in PdfCompressor can reduce file size while improving image quality?
  10. When should optical character recognition (aka OCR) be turned ON and how does it affect overall file size?
  11. Is the web optimization feature always set ON?
  12. Why is web optimization an important document feature?
  13. I want to run PdfCompressor in the background without setting up a batch run each time files need to be compressed?
  14. I've tried different PdfCompressor options, but the resulting file is still too large. What should I do?
  15. What is Multi-threading and how does it improve document processing speed?

PDF/A Frequently Asked Questions

  1. Why archive with PDF/A?
  2. Why not archive with microfiche?
  3. Is there a special viewer for PDF/A?
  4. Who should be archiving with PDF/A?
  5. What is the difference between PDF and PDF/A?
  6. Why can I rely on PDF/A for long-term archiving?
  7. What is the difference between PDF/A, A-1a, A-1b?

Technical Support Question

  1. What are the support hours?

PDF Compressor

1. What are the benefits of PdfCompressor?

PdfCompressor is designed to minimize the size of your PDF files. PdfCompressor facilitates fast Internet transmission, efficient web-hosting and reduction in storage space requirements. It also enhances the use of scanned documents as email attachments, digital copying of documents to the Web, and remote scanning.

Back to top of page

2. What documents can be compressed using PdfCompressor?

PdfCompressor is most effective on scanned input documents that are in TIFF, JPEG, or PDF format. Compression ratios will depend on the image resolution and compression methods, if any, that are already being used on the input data. PdfCompressor can also be used effectively on PDF files that contain embedded image streams such as corporate earnings reports, company brochures, product user manuals, etc.

Back to top of page

3. What product settings are recommended for compression of black and white TIFF files?

The default settings for PdfCompressor, both in Quick Run and Full Options modes, utilize the JBIG2 compression filter. JBIG2 is a recently-approved ITU black and white (aka bitonal) compression standard that allows for much more efficient coding of black and white scanned documents. More information on JBIG2 is available at http://www.jbig2.org and http://www.jbig.org. JBIG2 compression can be up to 10x smaller than TIFF (G4). We recommend the product default settings of JBIG2 in perceptually lossless mode. The perceptually lossless mode, which differentiates the PdfCompressor product from variations that are either lossless or lossy, obtains very compact JBIG2 image streams with no loss in image quality and no OCR degradation.

Back to top of page

4. What product settings are recommended for compression of color files?

The setting usually required for effective color compression, up to 100x smaller than JPEG, is auto-segmentation, which is located on the Compression Options panel. This setting should already be ON by default, but this should be verified at runtime. The auto-segmentation feature separates each scanned color file into text, picture, graphics, halftones, and other document regions. These image regions are then each encoded using the appropriate compression filter. The auto-segmentation filter is set to quality 5 by default, but this image quality setting can be adjusted up or down to fit your documents. To adjust the PdfCompressor default settings, make sure to set the Job Type (upper left box of GUI panel) to Batch Compression mode. If there is any image quality loss with auto-segmentation ON and Image Quality 5, then a higher Image Quality setting of 8 or higher should be used. If auto-segmentation using a higher image quality settings of 8 or above still results in some image degradation, then the auto-segmentation mode should be set to OFF.

Back to top of page

5. What product settings are recommended for compression of generated documents, such as company brochures that include embedded images?

If the generated PDFs contain embedded bitonal image streams then the default system settings should be satisfactory. If the generated PDF documents contain embedded color images then some adjustments may be required as these documents can be difficult for PdfCompressor to segment. The user can first try auto-segmentation both in the medium quality range (e.g., Image Quality 5) and then in the higher quality range (e.g., Image Quality 8). If in using the auto-segmentation mode there is some element of visual image degradation, then PdfCompressor should be run with auto-segmentation feature set to OFF.

Back to top of page

6. What PdfCompressor settings are recommended for scanned bitonal files that may include pictures or halftone regions, e.g., newspapers, photos, etc., captured using an image scanner or a digital copier ?

For most effective compression results on scanned bitonal documents that include picture regions, the Use halftone algorithm option should be selected. This option is located in the Compression Options panel. This option can also useful if the image background is not white but some shade of grey or if the document contains some shaded regions (e.g., tax form).

Back to top of page

7. Is PdfCompressor useful on files that are completely generated, i.e., contain no image streams?

Even though PdfCompressor is most useful on files that contain image streams, it can sometimes be very useful in compressing files that do not contain image streams. In addition, PDF files are not always as they appear. For example, it may appear that a generated PDF document such as a company invoice does not contain any images when it actually does contain embedded image streams, e.g., company logo. When in doubt, try running the file through PdfCompressor (in default mode) to see if there is any file size reduction. If the file appears to be generated but the file size seems very high, in excess of 100 KB per page, we recommend first trying PdfCompressor using the default settings (e.g., stream compression mode). If this does not reduce the file size, try PdfCompressing the document in raster mode which is shown on the "Image Processing >> PDF Input Processing" page as the Rasterize each page before compressing option. Leave the other rasterization options in their default settings.

Back to top of page

8. What should I do if scanned documents in TIFF format, which appear to be black and white, are still very large after being processed with PdfCompressor?

Sometimes scanned files are saved as color or greyscale files even though the actual document being scanned is black and white. We have seen this occur quite frequently using various capture devices including digital copiers and digital senders. While PdfCompressor in auto-segmentation mode can sometimes correct for this misleading image type, at least partially, it can often result in significantly larger file size. If the input document appears to be bitonal but the file size in TIFF format seems excessively large, e.g., exceeds 100 KB per page, there is a reasonable chance that these files are mapped as color or greyscale. To correct for this, go to the Image Processing page. In the Greyscale Images column, select the Remap images to bitonal option. Similarly, in the Color Images column, select the Remap images to bitonal option.

Back to top of page

9. What other image processing options in PdfCompressor can reduce file size while improving image quality?

The despeckling option, on the Image Processing page, can sometimes reduce the size of scanned black and white documents while improving image quality. This option, however, can occasionally cause image artifacts so it should be used with caution. The deskew option, also available on the Image Processing page as the option to Automatically rotate scanned pages to proper viewing angle, should improve the overall image quality of scanned bitonal files and slightly reduce file size (assuming the files have not already been skew-corrected).

Back to top of page

10. When should optical character recognition (aka OCR) be turned ON and how does it affect overall file size?

OCR is the process of converting an image into corresponding text. It is a necessary step if the input files i. are scanned, ii. have not yet been OCRed, and iii. the intended application requires text searchability. With respect to file size, there is very little increase in file size (about 5%) as a result of running PdfCompressor with OCR set to ON.

Back to top of page

11. Is the web optimization feature always set ON?

Web optimization in PdfCompressor is ON by default. It can be turned ON or OFF from the Document Features page. This feature should normally be set ON since it allows PDF files to view more efficiently over the web. It is a relatively fast operation (unlike OCR) so it does not slow down PdfCompressor's rate of processing.

Back to top of page

12. Why is web optimization an important document feature?

Viewing documents on the Web is different than viewing them locally, e.g., on a LAN. Files viewed over the Web must generally be downloaded in order to display. Without web optimization, which typical image formats (e.g., TIFF) do not support, a file would have to download completely just to view the first page. This can make viewing a large multipage scanned file very inefficient. PDFs that are web optimized can be viewed on a per page basis, without waiting for the rest of the file to transfer. So a 300-page PDF file could have its first page displayed as soon as the page had been downloaded. Web optimized PDFs also support opening to a random page in a multipage file. This is useful for efficient display of search query results.

Back to top of page

13. I want to run PdfCompressor in the background without setting up a batch run each time files need to be compressed?

PdfCompressor Professional supports watched folder mode. This allows any number of folders to be watched, and includes a scheduler to determine exact schedules for monitoring each watched folder (e.g., evening, weekends, etc.).

Back to top of page

14. I've tried different PdfCompressor options, but the resulting file is still too large. What should I do?

Although it is our intent to make PdfCompressor easy to use, sometimes understanding the input data and selecting the appropriate runtime settings is not simple. If you are not able to obtain the results you expected using PdfCompressor, send some sample input documents along with a brief explanation of the problem to This email address is being protected from spambots. You need JavaScript enabled to view it.. This e-mail address is being protected from spam bots, you need JavaScript enabled to view it . Please do not send email file attachments in excess of 10 MB per email. If your files are larger than 10 MB, we can set you up with a password-protected ftp account to upload your documents. Someone from CVISION support should generally get back to you in 24-48 hours.

Back to top of page

15. What is multi-threading and how does it improve document processing speed?

Multi-threading allows concurrent execution of multiple documents by maximizing the application of central processing units (CPU). The main benefit associated with multi-threading is the speed, as unlimited threads can exist to complete the single process. As a result, it enables clients to compress, OCR, and optimize high volumes of documents in a short amount of time. If processing on 2 cores instead of 1, OCR documents in half the time. If processing on 17 cores instead of 1, convert to PDF 17x faster. With its ability to have unlimited cords, it allows clients to meet tight deadlines or handle sudden spikes at peak processing periods.

Back to top of page

PDF/A Frequently Asked Questions

1. Why archive with PDF/A?

PDF/A is the most responsible method to ensure your electronic documents can be viewed tomorrow, or decades later.  PDF/A combines the reliability and security of microfiche with the convenience of electronic documents. Without conversion to PDF/A, companies risk losing valuable information within their electronic documents, or worse have documents that will not open at all.

Back to top of page

2. Why not archive with microfiche?

Microfiche has been a reliable option for archiving for quite some time. However, microfiche does not provide the same convenience, accessibility, and ease of use as PDF/A. PDF/A documents can be opened instantly on a computer and be emailed all around the world.  Recent advances including PDF/A makes archiving with microfiche an outdated method of document preservation.

Back to top of page

3. Is there a special viewer for PDF/A?

Similar to PDF files, PDF/A can be viewed with Adobe’s free reader. The PDF/A format is actually a valid PDF file, with some built-in -limitations.

Back to top of page

4. Who should be archiving with PDF/A?

Any responsible record manager, or company concerned with the integrity of their documents should rely on PDF/A for their archival needs.  Microfiche and paper are  reliable, time-tested methods for archiving documents but both  lack the convenience and accessibility of PDF/A.  PDF/A provides complete security  that your documents will view and open correctly  in the future.

Back to top of page

5. What is the difference between PDF and PDF/A?

The PDF/A format is actually a restricted version of  PDF.  In particular, the PDF/A part I specs as currently adopted are a restricted version of the PDF 1.4 specifications.  These specifications  will guarantee that your documents will view and print correctly now, and forever.

Back to top of page

6. Why can I rely on PDF/A for long-term archiving?

PDF/A is recognized by leading record management organizations for electronic document archiving. Both ARMA (Association of Records Managers and Administrators), and NARA (National Archives and Records Administration) have recommend archiving documents using the PDF/A format, http://www.archives.gov/records-mgmt/pdf/strategic-directions-status-sept2004.pdf

Back to top of page

7. What is the difference between PDF/A, A-1a, A-1b?

PDF/A-1a ensures the preservation of a document’s logical structure and content text stream in natural reading order. The text extraction is especially important when the document must be displayed on a mobile device (for example a PDA) or other devices in accordance with Section 508 of the US Rehabilitation Act. In such cases the text must be reorganized on the limited screen size (re-flow). This feature is also known as “Tagged PDFs”.PDF/A-1b ensures that the text (and additional content) can be correctly displayed (e.g. on a computer monitor), but does not guarantee that extracted text will be legible or comprehensible. It therefore does not guarantee compliance with Section 508. The difference between PDF/A-1a and -1b has no impact for scanned documents, provided the files have not been enhanced by means of OCR for searching.

Back to top of page

Technical Support Questions

1. What are the support hours?

The technical support hours are from 9am to 6pm (Eastern Time) available from Monday to Friday. Please email at This email address is being protected from spambots. You need JavaScript enabled to view it..

Back to top of page
CVISION's Solutions for the Education Industry
  • Privacy
  • Cookies
  • Sitemap
  • Reference
  • Library
  • Contact Us
CVISION Technologies Facebook Page CVISION Technologies LinkedIn Company Page CVISION Technologies Twitter Page Subscribe to The Visionary Newsletter CVISION Technologies Blog CVISION Technologies YouTube Channel
 
Copyright © 1998-2018 CVISION Technologies, Inc.
CVISION, CVista, CBatch, and the CVISION logo are registered trademarks of CVISION Technologies, Inc.