Prompt History
Prompt 1
Variant A: summarize every prompt step
Prompt 2
Variant B: compress and score the deltas
Prompt 3
Variant C: bias toward usable takeaways
Output Compare
Best fit: Variant C
What improved
Compression improved readability without hiding the actual reasoning path.
What regressed
Over-compression removed too much context in low-confidence branches.
Decision Notes
Keep side-by-side comparisons as the default review mode.
Save deltas, not just final outputs.
Useful debriefs preserve prompt changes, output changes, and the decision behind them.
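One way to act on these notes is to store each review step as a small record holding the prompt delta, the output delta, and the decision. The sketch below is only an illustration: `DebriefEntry`, `text_delta`, and `make_entry` are hypothetical names, not part of any existing tool, and the sample outputs are placeholders rather than real results. It assumes line-level unified diffs from Python's standard `difflib` are an adequate way to represent a delta.

```python
import difflib
from dataclasses import dataclass


@dataclass
class DebriefEntry:
    """One review step: prompt change, output change, and the decision behind them."""
    label: str
    prompt_delta: list[str]
    output_delta: list[str]
    decision: str


def text_delta(before: str, after: str) -> list[str]:
    """Return a line-level unified diff; the list is empty when the texts match."""
    return list(
        difflib.unified_diff(
            before.splitlines(),
            after.splitlines(),
            fromfile="before",
            tofile="after",
            lineterm="",
        )
    )


def make_entry(label: str, prev_prompt: str, prompt: str,
               prev_output: str, output: str, decision: str) -> DebriefEntry:
    """Build a debrief record that keeps the deltas, not just the final texts."""
    return DebriefEntry(
        label=label,
        prompt_delta=text_delta(prev_prompt, prompt),
        output_delta=text_delta(prev_output, output),
        decision=decision,
    )


# Prompt texts come from the variants above; the outputs are placeholders.
entry = make_entry(
    label="Variant A -> Variant B",
    prev_prompt="summarize every prompt step",
    prompt="compress and score the deltas",
    prev_output="placeholder output for Variant A",
    output="placeholder output for Variant B",
    decision="placeholder decision note",
)
print("\n".join(entry.prompt_delta))
```

Keeping the diffs as plain lists of lines makes the records easy to print side by side or serialize later, which fits the side-by-side review mode noted above.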
