HomeGuides :: Symphony OCRNot Processed List DescriptionToo Big

20.13. Too Big

Too Big (to 8.0.0 and higher)

If a document falls into this list, it does NOT mean the document is contains too many pages.  Symphony OCR processes files one page at a time.  So if a document falls into this list, it means the document contains one or more pages with pixel dimensions larger than a specified value.  In this version of Symphony OCR that value is 32512 x 32512 pixels.  

This is a hard limit and cannot be overwritten.


Too Big (Prior to 8.0.0)

If a document falls into this list, it does NOT mean the document is too big.  Symphony OCR processes files one page at a time.  So if a document falls into this list, it means the document contains one or more pages with pixel dimensions larger than a specified value (ie. The page couldn't be loaded into memory).  We usually see this in documents like blueprints of schematic drawings.  But there are some things we can do to try to get these types of documents processed, if you find that it needs to be processed.  

Clicking on the document in the 'Too Big' list will tell you the size of the offending page. 

1) Click on the 'Too big' list.

2) Click on the individual document in question.

3) The offending size of the document is available in the document details.



If you find you have a series of the same type of documents, it's usually the case where the same size file is exceeding the limit.  You can attempt to process these documents by modifying the value(s) declared in the setting.xml file.  (Defaults differ depending on the version you're running.)

For versions NEWER than 6.5.32

Default: If an individual page contains a total pixel count higher than 36,000,000 pixels the entire document will be filed under the "Too Big" list.

Advanced Configuration Setting

If you wish to attempt to process documents that contain pages with a total pixel count larger than 36,000,000 pixels, you may opt to do so by updating the settings.xml file.  Here's how:

  • Close Symphony OCR (stop Service if installed as Service).
  • Navigate to C:\Program Files\Trumpet\SymphonyOCR\Config\ and open the settings.xml file using notepad.
  • The setting you want to adjust is highlighted in yellow below:
          <documentPreProcessor ..... maxPixels="36000000" ..... />
  • Update the maxPixels variable (within the " ") to whatever you feel is appropriate.
    • Tip: Reference the details on your document, that SOCR reports, to reference the actual size of the page. Set the to equal or exceed that.
  • Save the settings.xml file
  • Launch Symphony OCR (Start Service if installed as Service)

Note:  If Symphony OCR is not able to process these documents they may end up in the Needs Attention list.


General reference guide for page sizes in inches to total pixels:

A Size (8.5x11 inches) = 8415000 pixels

Legal (8.5x14) = 10710000

B size (two A sizes — 17x11) = 16830000

C size (two B sizes — 17x22) = 33660000

Default (20x20) = 36000000

D size (two C sizes — 22×34) = 67320000


For versions OLDER than 6.5.32

Default: If an individual page is larger than 10,000 x 12,000 pixels the entire document will be filed under the "Too Big" list.

Advanced Configuration Setting

If you wish to attempt to process documents that have individual page larger than 10,000 x 12,000 pixels, you may opt to do so by updating the settings.xlm file.  Here's how:

  • Close Symphony OCR (stop Service if installed as Service)
  • Navigate to C:\Program Files\Trumpet\SymphonyOCR\Config\ and open the settings.xml file using notepad
  • The setting you want to adjust is highlighted in yellow below:
          <documentPreProcessor ..... maxHeightPixels="10000" maxWidthPixels="12000" ..... />
  • Update the maxHeight and maxWidth variables (within the "") to whatever you feel is appropriate.
    • Tip: Reference the details that SOCR reports to reference the actual size of the page. Set the max just above that.
  • Save the settings.xml file
  • Launch Symphony OCR (Start Service if installed as Service)

Note:  If Symphony OCR is not able to process these documents they may end up in the Needs Attention list.

Note: If you update your version to 6.5.32 or above then tell SOCR to re-analyze the documents in the 'Too Big' list. Once it re-analyzes them it will now reference their Total Pixel Count, instead of the Height Width ratio.


This page was: Helpful | Not Helpful

© 2012 Trumpet, Inc., All Rights Reserved