survos/ai-dataset-bundle
Latest stable version: 2.8.1
Composer install command:
composer require survos/ai-dataset-bundle
Package description
Dataset-scale AI batch processing for Survos datasets using canonical dataset paths and JSONL artifacts.
README
Dataset-scale AI batch processing for Survos/Museado datasets.
This bundle is intentionally separate from survos/ai-workflow-bundle.
ai-workflow-bundle operates on individual workflow subjects. This bundle
operates on dataset JSONL stages, writes durable batch artifacts, and uses the
canonical workspace paths from survos/dataset-bundle.
Responsibilities
- Read normalized rows from 20_normalize/{core}.jsonl.
- Write provider-ready batch input JSONL and manifests to 40_ai/.
- Submit/check/download OpenAI batch jobs through survos/ai-batch-bundle.
- Convert downloaded batch responses into portable claim JSONL files.
- Leave later enrichment/import stages to consume those claim files.
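The prepare and download responsibilities above can be sketched as a pair of pure functions. This is a minimal illustration, not the bundle's implementation: the row fields (`id`, `title`), the prompt text, the model name, the `core:id` form of `custom_id`, and the shape of the claim row are all assumptions. Only the outer request and response envelopes (`custom_id`, `method`, `url`, `body`, `response.status_code`) follow the OpenAI Batch API line format.

```php
<?php
declare(strict_types=1);

/**
 * Build one OpenAI Batch API request line from a normalized row.
 * Assumed: the row carries an 'id' key; prompt and model are placeholders.
 */
function buildBatchLine(array $row, string $core, string $model = 'gpt-4o-mini'): string
{
    $request = [
        // custom_id is echoed back in the output, so it serves as the join key.
        'custom_id' => $core . ':' . $row['id'],
        'method' => 'POST',
        'url' => '/v1/chat/completions',
        'body' => [
            'model' => $model,
            'messages' => [
                ['role' => 'system', 'content' => 'Write a dense one-paragraph summary of this record.'],
                ['role' => 'user', 'content' => json_encode($row, JSON_UNESCAPED_UNICODE | JSON_THROW_ON_ERROR)],
            ],
        ],
    ];

    return json_encode($request, JSON_UNESCAPED_SLASHES | JSON_UNESCAPED_UNICODE | JSON_THROW_ON_ERROR);
}

/**
 * Turn one raw batch output line into a claim row (hypothetical shape),
 * or return null when the request did not succeed.
 */
function claimFromOutputLine(string $line): ?array
{
    $out = json_decode($line, true, 512, JSON_THROW_ON_ERROR);

    // 'response' is null for failed requests; ?? covers that case.
    if (($out['response']['status_code'] ?? null) !== 200) {
        return null;
    }

    [$core, $id] = explode(':', $out['custom_id'], 2);

    return [
        'core' => $core,
        'id' => $id,
        'predicate' => 'dense_summary',
        'value' => $out['response']['body']['choices'][0]['message']['content'] ?? '',
    ];
}
```

Keeping both directions keyed on `custom_id` means the claim file can be produced from the raw output alone, without re-reading the normalized input.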
Commands
Commands are exposed as methods on Survos\AiDatasetBundle\Service\DatasetAiService.
php bin/console ai:dataset:estimate mus/aust --core=obj
php bin/console ai:dataset:prepare mus/aust --core=obj --force
php bin/console ai:dataset:submit mus/aust --core=obj --force
php bin/console ai:dataset:status mus/aust
php bin/console ai:dataset:download mus/aust --core=obj --force
ai:dataset:submit is the paid provider call; ai:dataset:estimate and ai:dataset:prepare run locally.
Files
For dataset mus/aust and core obj, the bundle uses:
| Path | Purpose |
|---|---|
| 20_normalize/obj.jsonl | Normalized source records |
| 40_ai/obj.dense_summary.batch.input.jsonl | OpenAI batch input |
| 40_ai/obj.dense_summary.batch.json | Local batch manifest |
| 40_ai/obj.dense_summary.batch.output.jsonl | Raw OpenAI batch output |
| 40_ai/obj.jsonl | Portable claim rows for enrichment |
All paths are resolved with Survos\DataBundle\Service\DataPaths.
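The naming pattern in the table can be expressed as a small helper. This is a hypothetical function for illustration only and does not reflect the `DataPaths` API, whose methods are not shown in this README. The paths are relative to the dataset workspace, and the `$task` default of `dense_summary` is simply the one task name that appears in the table.

```php
<?php
declare(strict_types=1);

/**
 * Derive the relative artifact paths for one core, following the table's pattern.
 * Hypothetical helper; the bundle itself resolves paths through DataPaths.
 */
function aiArtifactPaths(string $core, string $task = 'dense_summary'): array
{
    // Shared prefix for the three batch artifacts of one task.
    $prefix = sprintf('40_ai/%s.%s.batch', $core, $task);

    return [
        'normalized' => sprintf('20_normalize/%s.jsonl', $core),
        'batch_input' => $prefix . '.input.jsonl',
        'manifest' => $prefix . '.json',
        'batch_output' => $prefix . '.output.jsonl',
        'claims' => sprintf('40_ai/%s.jsonl', $core),
    ];
}

// List the artifacts for core "obj".
foreach (aiArtifactPaths('obj') as $role => $path) {
    printf("%-13s %s\n", $role, $path);
}
```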
Install
composer require survos/ai-dataset-bundle
Register the bundle:
Survos\AiDatasetBundle\SurvosAiDatasetBundle::class => ['all' => true],
Required runtime bundles:
- survos/dataset-bundle
- survos/jsonl-bundle
- survos/ai-batch-bundle
- survos/claims-bundle
Optional:
- yethee/tiktoken for better token estimates.
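To illustrate why the optional tokenizer gives better estimates, the sketch below counts tokens with yethee/tiktoken when it is installed and otherwise falls back to a rough rule of thumb of about four bytes per token. The README does not document how the bundle estimates tokens without the tokenizer, so the fallback, the function name `estimateTokens`, and the `cl100k_base` encoding choice are all assumptions.

```php
<?php
declare(strict_types=1);

/**
 * Estimate the token count of a text.
 * Hypothetical helper: exact counts with yethee/tiktoken, a heuristic otherwise.
 */
function estimateTokens(string $text): int
{
    if (class_exists(\Yethee\Tiktoken\EncoderProvider::class)) {
        // Encoding name is an assumption; pick the one that matches your model.
        $encoder = (new \Yethee\Tiktoken\EncoderProvider())->get('cl100k_base');

        return count($encoder->encode($text));
    }

    // Heuristic fallback: roughly 4 bytes per token for English text.
    return (int) ceil(strlen($text) / 4);
}
```

The heuristic drifts noticeably for non-English text or JSON-heavy prompts, which is where a real tokenizer matters most for cost estimates.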
Statistics
- Total downloads: 0
- Monthly downloads: 0
- Daily downloads: 0
- Favorites: 0
- Views: 1
- Dependents: 0
- Suggesters: 0
Other information
- License: MIT
- Last updated: 2026-05-29