AI News HubLIVE
Original source2 min read

Fable's judgement

During a fireside chat at AIE, the Claude Code team suggested letting Fable and Opus use their own judgment rather than dictating how to work. For example, let Fable decide when to write tests instead of specifying rules. Jesse Vincent further advised using lower-power models for smaller tasks to save tokens. Willison set a prompt for Claude Code to delegate coding to subagents with appropriate models, which has been working well, increasing productivity and slowing token usage.

Fable's judgement

Simon Willison’s Weblog

Subscribe

3rd July 2026

One of the most interesting tips I got from the Fireside Chat I hosted with Cat Wu and Thariq Shihipar from the Claude Code team at AIE on Wednesday was to let Fable (and to a certain extent Opus) use their own judgement rather than dictating how they should work.

The example they gave was testing. You can tell Fable "only use automated testing for larger features, don't update and run tests for small copy or design changes" - but it's better to just tell Fable to use its own judgement when deciding to write tests instead.

Jesse Vincent just gave me a related tip to help avoid burning too many of those valuable Fable tokens in the few days we have left before the prices go up. Tell Fable to use other models for smaller tasks, applying its own judgement about which model to use.

I prompted Claude Code just now with:

For all coding tasks use your judgement to decide an appropriate lower power model and run that in a subagent

Claude saved this memory file in ~/.claude/projects/name-of-project/memory/delegate-coding-to-subagents.md:

--- name: delegate-coding-to-subagents description: Simon wants coding tasks delegated to subagents running an appropriately lower-power model metadata: node_type: memory type: feedback originSessionId: 30068d78-43a9-4fb1-bb29-9799e18c526a ---

Stated by Simon on 2026-07-03: "For all coding tasks use your judgement to decide an appropriate lower power model and run that in a subagent."

Why: cost/efficiency — implementation work rarely needs the top-tier model; judgment, review, and synthesis stay with the main loop.

How to apply: when a task in this project is primarily writing/editing code, spawn an Agent with a model override (sonnet for substantive implementation, haiku for trivial/mechanical edits) and a self-contained prompt; review the result in the main loop before committing. Design, auditing, data synthesis, and anything judgment-heavy stays in the main model. See also [[project-goals]].

So far it seems to be working well. I'm getting a ton of work done and my Fable allowance is shrinking less quickly than before.

Recent articles

Have your agent record video demos of its work with shot-scraper video - 30th June 2026

Porting the Moebius 0.2B image inpainting model to run in the browser with Claude Code - 22nd June 2026

sqlite-utils 4.0rc1 adds migrations and nested transactions - 21st June 2026

This is a note by Simon Willison, posted on 3rd July 2026.

ai 2,099

prompt-engineering 191

generative-ai 1,856

llms 1,823

anthropic 303

claude 287

coding-agents 217

claude-code 120

claude-mythos-fable 21

Monthly briefing

Sponsor me for $10/month and get a curated email digest of the month's most important LLM developments.

Pay me to send you less!

Sponsor & subscribe

Disclosures

Colophon

©

2002

2003

2004

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

2025

2026