Data Toolchain for AI

Image – OCR

The OCR node extracts text from image-based documents or raw images using Optical Character Recognition (OCR). This is useful for converting scanned files, screenshots, or image-based documents into machine-readable text for downstream processing.

Inputs

Documents – Accepts structured document-like files (e.g., PDFs or multi-page scans) as input for OCR processing.
Image – Accepts raw image files (e.g., PNG, JPG) that contain embedded text to be recognized.

Output

Text – Emits extracted plain text from either the “Documents” or “Image” input. The output can be used in embedding nodes, LLM prompts, or stored for further analysis

Configuration Steps

Type of OCR – Select the OCR mode based on the language structure:
- Default – for general mono- or multilingual OCR processing.
Language – Choose the language that best matches the input text.
- Example – English