PCTX Editorial · 4 min
Context engineering: feeding the agent more made it worse
We held one model fixed and changed only how the agent was built around it, and the same model scored anywhere from 67 to 95 percent. The biggest jump came from a setup that cost less than the do-nothing baseline, which gave the agent a short plan and had it check its own output before finishing.
Read the article →