devsyril/tg-cni-reader
Togolese National ID Card (CNI) OCR processor - extracts and validates data from CNI card images
时间:2026-05-12 14:58
ahmaadkhader/pdf-to-html
Standalone PHP library for extracting semantic HTML from PDF files. Detects headings, lists, tables, links, and inline styles from PDF content.
时间:2026-05-12 13:33
smmahfujurrahman/localization
A Laravel package providing Artisan commands to extract, auto-translate, wrap, and sort Blade translation strings with Google Translate support.
时间:2026-04-25 15:17
libresign/pdf-signature-validator
High-quality PDF signature extraction and validation primitives for LibreSign and external consumers.
时间:2026-04-23 23:33
cable8mm/mma-scrapers
A lightweight, extensible PHP library for scraping MMA data from multiple sources.
时间:2026-04-23 04:23
mage2kishan/module-redirects
Redirects and 404 management for Magento 2 (Hyva + Luma). Manual + auto redirects, bulk CSV import/export, scheduled cleanup, 404 logging and cluster analysis for redirect recommendations. Extracted from Panth_AdvancedSEO for independent installation.
时间:2026-04-20 11:13
mage2kishan/module-crosslinks
Automatic internal crosslinks for Magento 2 (Hyva + Luma). Converts configured keywords in product, category, and CMS content into anchor links to boost internal linking and SEO. Extracted from Panth_AdvancedSEO for independent installation.
时间:2026-04-20 11:13
content-extract/content-processor
Robust PHP library for batch document processing. Extracts content from PDFs/text and generates structured JSON according to user-defined schemas. Now with semantic structuring, OCR support for scanned PDFs, text normalization, and alias-driven field matching. Production-ready, secure, zero unnecess
时间:2026-04-19 15:27
ges/ocr
Core document processing services for OCR, classification, extraction, and normalization.
时间:2026-03-30 00:13
teariot/json-repair
Repair broken, malformed, or non-standard JSON — fix quotes, commas, comments, Python constants, JSONP, NDJSON, HTML entities, and more
时间:2026-03-24 11:40
onstage2426/fuzor
Dependency-free full-text search for PHP. BM25 ranking, fuzzy and boolean modes, search-as-you-type prefix matching, stopword filtering and Snowball stemming for 62 languages, snippet extraction and result highlighting — one SQLite file, zero infrastructure.
时间:2026-03-21 13:20
boutdecode/etl-core-bundle
Symfony Bundle providing a configurable ETL (Extract/Transform/Load) pipeline engine with CQS, scheduling and workflow support.
时间:2026-03-17 20:17
survos/ai-pipeline-bundle
Symfony bundle for resumable, ordered AI task pipelines — OCR, classify, describe, extract, summarize, and more, out of the box.
时间:2026-02-27 11:16
yii1x/active-record
Yii 1.1 Active Record, extracted and modernized for PHP 8.4+
时间:2026-02-24 15:11
jcfrane/pdf-text-extractor
A Laravel PDF text extraction package with multiple strategies (PdfParser, XObject, AWS Textract, Tesseract OCR). Handles Canva-generated PDFs, scanned documents, and other edge cases with automatic fallback.
时间:2026-02-11 09:00