
CLI Commands

Reference for all INDW CLI commands: merge, validate, test, audit, benchmark, and doctor.

INDW provides six CLI commands for pipeline execution, validation, auditing, and diagnostics. All commands are invoked via the indw entry point.

bash
indw --help

Command summary

| Command | Purpose | Stability |
| --- | --- | --- |
| indw merge | Run quality merge on raw corpus | stable |
| indw validate | Parity and acceptance validation | stable |
| indw test | Run framework test suite | stable |
| indw audit | Operational audit reports | stable |
| indw benchmark | Production scale benchmark | stable |
| indw doctor | Verify install and dependencies | stable |

indw merge

Run the quality merge pipeline on a raw corpus directory.

bash
indw merge <raw_dir> <out_path> [options]

| Argument / Flag | Type | Default | Description |
| --- | --- | --- | --- |
| raw_dir | Path | required | Directory with */data.jsonl sources |
| out_path | Path | required | Output JSONL path |
| --work-dir | Path | out_path.parent | Checkpoints and audit artifacts |
| --workers | int | 1 | Parallel worker count |
| --chunk-size | int | 500 | Documents per dispatch batch |
| --fresh | flag | off | Clear checkpoints, start clean |
| --backend | choice | env default | local, thread, multiprocess, dask |

bash
indw merge ./raw ./out/filtered.jsonl --work-dir ./work --workers 4 --fresh
indw merge ./raw ./out/filtered.jsonl --backend dask --workers 16
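
When merge runs are launched from a script rather than a shell, the documented arguments can be assembled programmatically. The following is a minimal sketch and not part of INDW: `build_merge_argv` is a hypothetical helper that only builds an argument list from the arguments and flags documented for this command, mirroring the documented `--work-dir` default of `out_path.parent`.

```python
from pathlib import Path
from typing import List, Optional

# Backend names as listed in the documented --backend choices.
BACKENDS = {"local", "thread", "multiprocess", "dask"}


def build_merge_argv(
    raw_dir: str,
    out_path: str,
    work_dir: Optional[str] = None,
    workers: int = 1,
    chunk_size: int = 500,
    fresh: bool = False,
    backend: Optional[str] = None,
) -> List[str]:
    """Hypothetical helper: assemble an `indw merge` argument list."""
    if backend is not None and backend not in BACKENDS:
        raise ValueError(f"unknown backend: {backend}")
    # Documented default: --work-dir falls back to the parent of out_path.
    resolved_work_dir = Path(work_dir) if work_dir else Path(out_path).parent
    argv = [
        "indw", "merge", raw_dir, out_path,
        "--work-dir", str(resolved_work_dir),
        "--workers", str(workers),
        "--chunk-size", str(chunk_size),
    ]
    if fresh:
        argv.append("--fresh")
    if backend is not None:
        # Omitting --backend leaves the environment default in effect.
        argv += ["--backend", backend]
    return argv


print(build_merge_argv("./raw", "./out/filtered.jsonl", workers=4, fresh=True))
```

The resulting list can be passed to `subprocess.run(argv, check=True)` on a machine where `indw` is installed.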

indw validate

Run the parity acceptance suite. Equivalent to indw test --profile parity.

bash
indw validate [pytest_args...]

Verifies:

  • Output hashes match between the local and multiprocess backends
  • Output hashes match between workers=1 and workers=N runs
  • Tier admission consistency
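
The hash checks listed above can be illustrated with a small comparison routine. This page does not say how INDW computes its hashes, so the sketch below assumes SHA-256 over the raw bytes of each output file; `file_sha256` and `outputs_match` are hypothetical names, not INDW functions.

```python
import hashlib
import tempfile
from pathlib import Path


def file_sha256(path: Path, block_size: int = 1 << 20) -> str:
    """Hash a file in fixed-size blocks so large JSONL outputs need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for block in iter(lambda: handle.read(block_size), b""):
            digest.update(block)
    return digest.hexdigest()


def outputs_match(first: Path, second: Path) -> bool:
    """Return True when two output files are byte-for-byte identical."""
    return file_sha256(first) == file_sha256(second)


# Demo: two throwaway files stand in for workers=1 and workers=N outputs.
with tempfile.TemporaryDirectory() as tmp:
    single = Path(tmp) / "workers1.jsonl"
    parallel = Path(tmp) / "workersN.jsonl"
    single.write_bytes(b'{"id": 1}\n{"id": 2}\n')
    parallel.write_bytes(b'{"id": 1}\n{"id": 2}\n')
    print(outputs_match(single, parallel))  # True
```

A byte-level comparison like this is sensitive to record order, so it only passes when both runs emit documents in the same sequence. Whether the real suite canonicalizes output before hashing is not stated here.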

indw test

Run the framework test suite with selectable profiles.

bash
indw test [--profile PROFILE] [pytest_args...]

| Profile | Markers | Scope |
| --- | --- | --- |
| unit | not integration and not slow | Fast unit tests (default) |
| critical | critical and not integration | Production-critical subsystem tests |
| parity | integration | Backend parity tests |
| integration | integration or slow | Full integration suite |
| smoke | smoke | End-to-end smoke tests |

bash
indw test --profile critical
indw test --profile unit -k test_stage0
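
The profile table reads naturally as a lookup from profile name to a pytest marker expression. The sketch below assumes, without confirmation from this page, that each profile is translated into pytest's `-m` option; `PROFILE_MARKERS` and `pytest_argv` are hypothetical names used only for illustration.

```python
from typing import List, Sequence

# Marker expressions copied from the documented profile table.
PROFILE_MARKERS = {
    "unit": "not integration and not slow",
    "critical": "critical and not integration",
    "parity": "integration",
    "integration": "integration or slow",
    "smoke": "smoke",
}


def pytest_argv(profile: str = "unit", extra_args: Sequence[str] = ()) -> List[str]:
    """Build a pytest command line for a profile (assumed mapping to `-m`)."""
    try:
        markers = PROFILE_MARKERS[profile]
    except KeyError:
        raise ValueError(f"unknown profile: {profile}") from None
    # Extra arguments such as `-k test_stage0` are forwarded unchanged.
    return ["pytest", "-m", markers, *extra_args]


print(pytest_argv("unit", ["-k", "test_stage0"]))
```

Under that assumption, `indw test --profile unit -k test_stage0` would correspond to `pytest -m "not integration and not slow" -k test_stage0`.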

indw audit

Run operational audit reports against pipeline artifacts.

bash
indw audit [--kind KIND] [--work-dir PATH] [--workers N]

| Kind | Script | Purpose |
| --- | --- | --- |
| pipeline | pipeline_audit.py | Architecture and throughput from work dir |
| dask | dask_integration_report.py | Dask cluster integration health |
| production | production_scale_audit.py | Production certification benchmarks |
| library | library_migration_report.py | Dependency migration audit |
| stage0 | stage0_production_verify.py | Stage0 throughput verification |

bash
indw audit --kind pipeline --work-dir ./work
indw audit --kind production
indw audit --kind stage0 --workers 8
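
To make the option shapes concrete, the documented `indw audit` signature can be mirrored with `argparse`. This is an illustrative sketch rather than INDW's actual parser: `AUDIT_SCRIPTS` and `make_audit_parser` are hypothetical names, and since the defaults for these options are not documented here, the sketch leaves them as `None`.

```python
import argparse
from pathlib import Path

# Kind-to-script mapping copied from the documented audit table.
AUDIT_SCRIPTS = {
    "pipeline": "pipeline_audit.py",
    "dask": "dask_integration_report.py",
    "production": "production_scale_audit.py",
    "library": "library_migration_report.py",
    "stage0": "stage0_production_verify.py",
}


def make_audit_parser() -> argparse.ArgumentParser:
    """Build a parser with the three documented options (defaults are assumptions)."""
    parser = argparse.ArgumentParser(prog="indw audit")
    parser.add_argument("--kind", choices=sorted(AUDIT_SCRIPTS), default=None)
    parser.add_argument("--work-dir", type=Path, default=None)
    parser.add_argument("--workers", type=int, default=None)
    return parser


args = make_audit_parser().parse_args(["--kind", "stage0", "--workers", "8"])
print(args.kind, AUDIT_SCRIPTS[args.kind], args.workers)
# prints: stage0 stage0_production_verify.py 8
```

Restricting `--kind` with `choices` means an unknown kind is rejected at parse time instead of failing later during script lookup.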

indw benchmark

Run production scale benchmarks at multiple worker counts.

bash
indw benchmark

Equivalent to indw audit --kind production. By default, throughput is measured at worker counts 1 and 2.

indw doctor

Verify installation, Python version, and dependency availability.

bash
indw doctor

Output example:

text
indw=1.0 python=3.11.8 platform=Linux-...
backend=multiprocess resolved=multiprocess
orjson=ok
trafilatura=ok
dask=ok
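
For scripted health checks, output of this shape can be parsed into a dictionary. The sketch below assumes the format is exactly what the example shows, whitespace-separated `key=value` tokens, which is inferred from this single sample and may not hold for every release. `parse_doctor_output` is a hypothetical helper.

```python
from typing import Dict


def parse_doctor_output(text: str) -> Dict[str, str]:
    """Split whitespace-separated key=value tokens into a dictionary."""
    fields: Dict[str, str] = {}
    for token in text.split():
        key, separator, value = token.partition("=")
        if separator:  # Skip tokens that carry no "=".
            fields[key] = value
    return fields


# Sample copied from the documented example; the platform value is elided there.
sample = """indw=1.0 python=3.11.8 platform=Linux-...
backend=multiprocess resolved=multiprocess
orjson=ok
trafilatura=ok
dask=ok"""

fields = parse_doctor_output(sample)
missing = [name for name in ("orjson", "trafilatura", "dask") if fields.get(name) != "ok"]
print(fields["backend"], missing)  # multiprocess []
```
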

Exit codes

All commands return exit code 0 on success and a non-zero code on failure, so they can be chained in CI pipelines: indw validate && indw audit --kind production.
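
Where a CI job is driven from Python instead of a shell, the same stop-at-first-failure behavior as `&&` can be reproduced with `subprocess`. This is a minimal sketch: `run_until_failure` and `CI_STEPS` are hypothetical names, and the demo uses the current Python interpreter as a stand-in so that it runs without `indw` installed.

```python
import subprocess
import sys
from typing import Sequence

# The two steps from the CI example; not executed by this sketch.
CI_STEPS = [
    ["indw", "validate"],
    ["indw", "audit", "--kind", "production"],
]


def run_until_failure(commands: Sequence[Sequence[str]]) -> int:
    """Run commands in order; return the first non-zero exit code, else 0."""
    for argv in commands:
        code = subprocess.run(list(argv)).returncode
        if code != 0:
            return code  # Later commands are skipped, as with `&&`.
    return 0


# Stand-in commands: the first succeeds, the second exits with code 3.
demo = [
    [sys.executable, "-c", "pass"],
    [sys.executable, "-c", "import sys; sys.exit(3)"],
]
print(run_until_failure(demo))  # 3
```

In a real job the script would end with `sys.exit(run_until_failure(CI_STEPS))`, so the CI runner sees the failing command's exit code.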

© 2026 Instant Developers. All rights reserved.