CLI Commands
Reference for all INDW CLI commands: merge, validate, test, audit, benchmark, and doctor.
INDW provides six CLI commands for pipeline execution, validation, auditing, and diagnostics. All commands are invoked via the indw entry point.
Command summary
| Command | Purpose | Stability |
|---|---|---|
indw merge | Run quality merge on raw corpus | stable |
indw validate | Parity and acceptance validation | stable |
indw test | Run framework test suite | stable |
indw audit | Operational audit reports | stable |
indw benchmark | Production scale benchmark | stable |
indw doctor | Verify install and dependencies | stable |
indw merge
Run the quality merge pipeline on a raw corpus directory.
| Argument / Flag | Type | Default | Description |
|---|---|---|---|
raw_dir | Path | required | Directory with */data.jsonl sources |
out_path | Path | required | Output JSONL path |
--work-dir | Path | out_path.parent | Checkpoints and audit artifacts |
--workers | int | 1 | Parallel worker count |
--chunk-size | int | 500 | Documents per dispatch batch |
--fresh | flag | off | Clear checkpoints, start clean |
--backend | choice | env default | local, thread, multiprocess, dask |
indw validate
Run the parity acceptance suite. Equivalent to indw test --profile parity.
Verifies:
- Local vs multiprocess backend output hash match
- Workers=1 vs workers=N hash match
- Tier admission consistency
indw test
Run the framework test suite with selectable profiles.
| Profile | Markers | Scope |
|---|---|---|
unit | not integration and not slow | Fast unit tests (default) |
critical | critical and not integration | Production-critical subsystem tests |
parity | integration | Backend parity tests |
integration | integration or slow | Full integration suite |
smoke | smoke | End-to-end smoke tests |
indw audit
Run operational audit reports against pipeline artifacts.
| Kind | Script | Purpose |
|---|---|---|
pipeline | pipeline_audit.py | Architecture and throughput from work dir |
dask | dask_integration_report.py | Dask cluster integration health |
production | production_scale_audit.py | Production certification benchmarks |
library | library_migration_report.py | Dependency migration audit |
stage0 | stage0_production_verify.py | Stage0 throughput verification |
indw benchmark
Run production scale benchmarks at multiple worker counts.
Equivalent to indw audit --kind production. Tests throughput at workers 1 and 2 by default.
indw doctor
Verify installation, Python version, and dependency availability.
Output example:
All commands return exit code 0 on success and non-zero on failure. Use in CI pipelines: indw validate && indw audit --kind production.