OCR Setup


..
Document Pages Output Options (This option can only be set from PDS Admin. program)
This option allows the OCR preference to be set to save the OCR'd document as a "multi-page" or "single-page" text file. 

Output File Naming (This option can only be set from PDS Admin. program)
By default, the OCR File name is the same as the Image File Name. 
If the image file is imported from the Import module, there is a change for the duplicate names. Selecting ‘Create Unique File name’ will cause the OCR module to assign a new name for the OCR text file.  
..

OCR Processing
In this window you must select either All Batches (which will process all batches in the Batches to be Processed window) or Selected Batch (which will process only the batch selected). There is also two options available; Auto Run (which will start the OCR process automatically) and Auto Close Batch (which will close the batch upon completion of the OCR process).
..

Show Process Status Bar
If this option is selected, the OCR-processing window will popup while the OCR function is in progress.
If  a faster OCR speed is desired this process should be turned off.

Activate OCR Verification Mode
If this option is selected, the OCR module will allow you to modify the OCR text before saving each page. After each document is OCR'd a window appears that shows the OCR'd text and if you left-click on any word a display pane at the top of the Verification window shows the section of the scanned document where the text originated. This is good for people who need 100% accuracy in OCR processing. If  a faster speed is desired this process should be turned off. 

Show Image while OCR processing
If this option is selected, OCR module will display the image being processed. This is very useful for people who need to know which page is being OCR'd. 

Perform One Pass OCR
By default, the OCR module will perform a second OCR when the quality of the first one is not satisfactory. Since the OCR engine might bring the OCR program down by trying too hard to perform the  second pass, the
Perform One Pass OCR avoids the possible OCR'ng problem by forcing the module to accept the data received in the first pass.
..


..

Output Format (This option can only be set from PDS Admin. program)
The OCR module is able to write to different formats of text file. The purpose of the OCR module is for building the Full Text Search Engine. We recommend selecting the simple TEXT as the output format, since it takes less space and runs faster. 

Quality Level
This is the OCR satisfying level. The default is 75, which is good enough for the OCR quality.
..

Trace Events for Debugging
This is to help programmers to trace the program logic when OCR module performs unexpected results.

Auto launch next job step
If this option is selected and the next job step is the “RELEASE’, then the Release module will be launched if the Release program is not running. It is recommend that this option be turned off for multiple scanner environments or if the data Release time is scheduled during the night.

Back to PDS - OCR Module menubar descriptions