Image Pre-Processing
The AIForged Image Processor Service pre-processes documents before they are sent to OCR services, in order to improve OCR accuracy.
The following operations are available when pre-processing documents:
| Operation | Description |
| --- | --- |
| Remove Comments | Remove any electronic comments from PDFs. |
| Rotate | Rotate the pages of a document. This is mostly used to deskew pages. |
| Greyscale | Convert the image to greyscale. |
| Invert | Invert the colours of an image. |
| Mirror | Mirror an image. |
| Remove Garbage | Remove small spots, such as pen marks. |
| Remove Color Marks | Remove marks that may affect OCR accuracy, such as bank stamps. |
| Whiten Background | Remove possible watermarks that may reduce OCR accuracy by specifying lower and upper greyscale bounds. Bound values range from 0 (black) to 255 (white). |
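To make the simpler pixel-level operations listed above concrete, the sketch below applies Greyscale, Invert, and Mirror to a tiny image held as nested lists. It only illustrates what each operation means and is not the AIForged Image Processor Service API. The function names, the BT.601 luma weights, and the choice of a horizontal flip for Mirror are all assumptions made for illustration.

```python
# Illustrative only: NOT the AIForged implementation or API.
# An RGB image is a list of rows; each pixel is an (r, g, b) tuple in 0..255.
# A greyscale image is a list of rows of integers in 0..255.

def to_greyscale(rgb_image):
    """Convert RGB pixels to greyscale (assumed ITU-R BT.601 luma weights)."""
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in rgb_image
    ]


def invert(grey_image):
    """Invert a greyscale image: black becomes white and vice versa."""
    return [[255 - value for value in row] for row in grey_image]


def mirror(grey_image):
    """Mirror a greyscale image left-to-right (assumed direction)."""
    return [list(reversed(row)) for row in grey_image]


# Hypothetical 1x2 image: one white pixel, one black pixel.
sample = [[(255, 255, 255), (0, 0, 0)]]
grey = to_greyscale(sample)
inverted = invert(grey)
flipped = mirror(grey)
```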
Consider the document below, with a "DRAFT" watermark.
The AIForged Image Processor Service can be used to remove the watermark and whiten the background.
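The idea behind Whiten Background can be sketched as a simple threshold on greyscale values. The example assumes that every pixel whose value lies between the lower and upper bounds (inclusive) is set to white (255), so a light grey watermark disappears while dark text is left untouched. The function name `whiten_background` and the inclusive-bounds behaviour are assumptions for illustration; the service's actual behaviour may differ.

```python
# Illustrative only: NOT the AIForged implementation or API.
# A greyscale image is a list of rows of integers, 0 (black) to 255 (white).

def whiten_background(grey_image, lower, upper):
    """Set every pixel with lower <= value <= upper to white (255)."""
    if not (0 <= lower <= upper <= 255):
        raise ValueError("bounds must satisfy 0 <= lower <= upper <= 255")
    return [
        [255 if lower <= value <= upper else value for value in row]
        for row in grey_image
    ]


# Hypothetical page: dark text pixels (20, 30) on a light grey watermark.
page = [
    [20, 200, 250],
    [190, 30, 210],
]
cleaned = whiten_background(page, lower=180, upper=255)
```

Choosing the lower bound is the main trade-off in this sketch: too low a value would start erasing faint text, while too high a value would leave parts of the watermark behind.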