The Microsoft OCR Service in AIForged uses Microsoft’s OCR (Azure AI Vision Read) to extract plain text from documents. It returns raw text without preserving layout or structure and stores the output in the document’s Result property for downstream processing, regex, analytics, or integration. Optionally, you can save lines and words for verification.
Info
Use this service when you need reliable plain text extraction from images or PDFs for downstream rules, search indexing, or analytics. For structured field and table extraction, consider Microsoft Document Intelligence.
Very large or long PDF/TIFF files may be processed asynchronously by the underlying provider and can take longer.
Password‑protected PDFs cannot be processed directly.
Image constraints
Low-resolution, skewed, or noisy images reduce OCR accuracy.
Extremely large image dimensions can slow processing without improving quality.
Languages and handwriting
Printed text is supported for many languages.
Handwriting support is available but accuracy varies by script and image quality; test representative samples.
Mixed multi-language pages may have variable results.
Layout
This service returns raw text in the Result property (no preserved layout). Enable IsVerification to store lines and words when you need visual verification.
Info
Tip: Use the AIForged Custom Code utility as a pre-processor to set the password per document, so that AIForged can unlock documents before processing.
Info
Tip: For very large PDFs or TIFFs, consider chunking by page range to avoid hitting provider limits and to keep processing responsive. Use the AIForged Document Splitter for this task.
No. The service returns plain text in the Result property. Enable IsVerification to store lines and words for visual review. For structured extraction, use Microsoft Document Intelligence.
Can I process handwriting?
Handwriting is supported but accuracy varies by script and image quality. Test representative samples first.
Do I need Microsoft cloud credentials or regional setup?
No. AIForged provides a unified access layer and manages provider accounts and regional routing on your behalf.
How do I process password‑protected PDFs?
Use the AIForged Custom Code utility to set the password per document so AIForged can unlock files before processing.
What if my PDFs are very large?
Split large PDFs/TIFFs by page range with the AIForged Document Splitter to keep processing responsive and avoid provider limits.