kmp-impact evaluateCompare a pipeline result against a ground truth and emit Precision / Recall / F1 for both files and screens.
kmp-impact evaluate \
--results CONSOLIDATED.json \
--ground-truth GROUND_TRUTH.yml \
[--output-dir OUT]
| Flag | Required | Default | Description |
|---|---|---|---|
--results |
yes | — | Path to phase4/consolidated.json from a previous pipeline run. |
--ground-truth |
yes | — | Path to a ground_truth.yml with the expected impacted files and screens. |
--output-dir |
no | output/evaluation |
Directory where the per-scenario evaluation artefacts are written. |
ConsolidatedResult model from --results.--ground-truth.--output-dir and logs the headline metrics.A log line summarises the result:
F1=0.79 Precision=0.83 Recall=0.75
The on-disk artefact is an EvaluationResult JSON with the full true-positive, false-positive, and false-negative file/screen lists for inspection.
Given the analyzer’s set A and the ground-truth set M:
Precision = |A ∩ M| / |A|
Recall = |A ∩ M| / |M|
F1 = 2 · P · R / (P + R)
kmp-impact evaluate \
--results output/phase4/consolidated.json \
--ground-truth scenarios/pokedex_ktor_minor/ground_truth.yml \
--output-dir output/evaluation
ground_truth.yml.ConsolidatedResult — the input contract.