When LLMs are unreliable, developers build scaffolding: Retries. Judges. Multi-step pipelines. When...