content-extract/content-processor
Robust PHP library for batch document processing. Extracts content from PDFs/text and generates structured JSON according to user-defined schemas. Now with semantic structuring, OCR support for scanned PDFs, text normalization, and alias-driven field matching. Production-ready, secure, zero unnecess
Time: 2026-04-19 15:27
ges/ocr
Core document processing services for OCR, classification, extraction, and normalization.
Time: 2026-03-30 00:13
kreuzberg/kreuzberg
High-performance document intelligence for PHP. Extract text, metadata, and structured information from PDFs, Office documents, images, and more, across 75 formats. Powered by a Rust core for 10-50x speed improvements.
Time: 2025-12-27 13:40