Prompt History

Prompt 1

Variant A: summarize every prompt step

Prompt 2

Variant B: compress and score the deltas

Prompt 3

Variant C: bias toward usable takeaways

Output Compare

Best fit: Variant C

What improved

Compression improved readability without hiding the actual reasoning path.

What regressed

Over-compression removed too much context in low-confidence branches.

Decision Notes

Keep side-by-side comparisons as the default review mode.
Save deltas, not just final outputs.
Useful debriefs preserve prompt changes, output changes, and the decision behind them.