Appearance
Elsai OCR Extractors v1.0.0
INFO
v2.0.0 adds Mistral OCR, Amazon Boto3, and Amazon Textract. See v2.0 docs.
Installation
bash
pip install --extra-index-url https://core-packages.elsai.ai/root/elsai-ocr-extractors/ elsai-ocr-extractors==1.0.0Requirements: Python >= 3.9
Supported services (v1.0.0)
| Service | Class |
|---|---|
| Azure Cognitive Services | AzureCognitiveOCR |
| Azure Document Intelligence | AzureDocumentIntelligence |
| LlamaParse | LlamaParseOCR |
| VisionAI | VisionAIOCR |
Azure Document Intelligence
python
from elsai_ocr_extractors.azure_doc_intelligence import AzureDocumentIntelligence
extractor = AzureDocumentIntelligence(
endpoint="https://your-resource.cognitiveservices.azure.com/",
api_key="your-key",
)
result = extractor.extract(file_path="document.pdf")
print(result.text)VisionAI
python
from elsai_ocr_extractors.visionai import VisionAIOCR
extractor = VisionAIOCR(api_key="your_key")
result = extractor.extract(file_path="scan.pdf")LlamaParse
python
from elsai_ocr_extractors.llama_parse import LlamaParseOCR
extractor = LlamaParseOCR(api_key="your_llama_cloud_key")
result = extractor.extract(file_path="report.pdf")
print(result.markdown)