devsyril/tg-cni-reader
Togolese National ID Card (CNI) OCR processor - extracts and validates data from CNI card images
时间:2026-05-12 14:58
ahmaadkhader/pdf-to-html
Standalone PHP library for extracting semantic HTML from PDF files. Detects headings, lists, tables, links, and inline styles from PDF content.
时间:2026-05-12 13:33
smmahfujurrahman/localization
A Laravel package providing Artisan commands to extract, auto-translate, wrap, and sort Blade translation strings with Google Translate support.
时间:2026-04-25 15:17
libresign/pdf-signature-validator
High-quality PDF signature extraction and validation primitives for LibreSign and external consumers.
时间:2026-04-23 23:33
cable8mm/mma-scrapers
A lightweight, extensible PHP library for scraping MMA data from multiple sources.
时间:2026-04-23 04:23
content-extract/content-processor
Robust PHP library for batch document processing. Extracts content from PDFs/text and generates structured JSON according to user-defined schemas. Now with semantic structuring, OCR support for scanned PDFs, text normalization, and alias-driven field matching. Production-ready, secure, zero unnecess
时间:2026-04-19 15:27
ges/ocr
Core document processing services for OCR, classification, extraction, and normalization.
时间:2026-03-30 00:13
teariot/json-repair
Repair broken, malformed, or non-standard JSON — fix quotes, commas, comments, Python constants, JSONP, NDJSON, HTML entities, and more
时间:2026-03-24 11:40
onstage2426/fuzor
Dependency-free full-text search for PHP. BM25 ranking, fuzzy and boolean modes, search-as-you-type prefix matching, stopword filtering and Snowball stemming for 62 languages, snippet extraction and result highlighting — one SQLite file, zero infrastructure.
时间:2026-03-21 13:20
survos/ai-pipeline-bundle
Symfony bundle for resumable, ordered AI task pipelines — OCR, classify, describe, extract, summarize, and more, out of the box.
时间:2026-02-27 11:16
yii1x/active-record
Yii 1.1 Active Record, extracted and modernized for PHP 8.4+
时间:2026-02-24 15:11
jcfrane/pdf-text-extractor
A Laravel PDF text extraction package with multiple strategies (PdfParser, XObject, AWS Textract, Tesseract OCR). Handles Canva-generated PDFs, scanned documents, and other edge cases with automatic fallback.
时间:2026-02-11 09:00
youri/parser-shacl
SHACL parser extending parser-rdf with SHACL-specific extraction.
时间:2026-02-07 11:44
youri/parser-owl
OWL 2 parser extending parser-rdf with OWL-specific extraction.
时间:2026-02-07 11:44
mimmi20/wurfl-constants
the Constants extracted from Wurfl for PHP 5.3
时间:2026-01-04 19:42