Blog

How teams use agents to iterate, review, and ship PRs with proof.


GPT-5 Codex Works Best When You Strip Your Harness Down

Roo Cast, 2025-11-05

Learn why GPT-5 Codex underperforms in custom tool harnesses and how stripping down to native tooling like apply patch and ripgrep eliminates the attention tax for better coding agent results.

codex, ai-agents, developer-tools, model-integration

Score Agents Like Employees, Not Like Models

Roo Cast, 2025-11-05

Why code correctness benchmarks miss critical agent failure modes and how to evaluate AI coding agents using work style metrics like proactivity, context management, and communication.

ai-agents, developer-productivity, evaluation, coding-agents

Why Your Model Searches the Same Files Twice

Roo Cast, 2025-11-05

Learn why reasoning models repeat file searches across prompts and how preserving reasoning tokens between API calls can boost coding benchmark performance by 4-5 percentage points.

reasoning-models, api-architecture, context-management, ai-coding

Code-Specific Models Miss the Point

Roo Cast, 2025-10-22

Why code-specific LLMs fail at pair programming: they optimize for syntax prediction but strip out the world understanding needed to build software that actually serves users.

ai-coding, llm-models, pair-programming, developer-workflow

The First Prompt Decides Whether Users Stay

Roo Cast, 2025-10-22

Why AI coding tools live or die on the first response, and how engineering teams can evaluate tools beyond the initial impression.

ai-tools, developer-experience, activation, evaluation

Why Your 2023 LLM Integration Needs a Rewrite

Roo Cast, 2025-10-22

Learn why AI integrations built in 2023 need a complete rewrite, not patches. The scaffolding you built to work around model limitations now prevents you from using current capabilities.

llm-integration, technical-debt, ai-architecture, agentic-workflows

Working Prototypes Beat Specs

Roo Cast, 2025-10-22

Why clickable prototypes eliminate the guesswork and alignment meetings that specs create, and how AI coding agents make prototype-first workflows the new default.

product-development, prototyping, workflow, ai-coding-agents

Async Agents Change the Speed vs Quality Calculus

Roo Cast, 2025-10-16

In many workflows, quality at scale beats speed in series. That sentence sounds wrong until you stop running one agent at a time.

ai development, workflow, performance

Code Review Got Faster, Not Easier

Roo Cast, 2025-10-16

AI is generating over half of Google's production code. The bottleneck didn't disappear; it moved. Here's how teams are adapting review workflows to handle the volume.

code-review, ai-development, engineering-practices, productivity

Vibe Coders Build and Rebuild, They Do Not Migrate

Roo Cast, 2025-10-16

Why non-technical builders hit a migration wall with AI coding tools and how engineering artifacts make the difference between throwaway prototypes and production-ready handoffs.

vibe-coding, ai-coding-agents, developer-workflow, prototyping

Vibe Coders Build and Rebuild, They Do Not Migrate

Roo Cast, 2025-10-16

Throwaway apps aren't a bug. They're the workflow. Understanding when rebuilding beats migrating, and what makes prototypes production-ready.

ai development, workflow, prototyping

Dev Plans Turn Fast Models Into Reliable Builders

After Hours, 2025-10-10

Learn how dev plans transform inexpensive AI models into reliable code generators by providing explicit specifications instead of vague prompts.

dev-plans, orchestrator-mode, agentic-workflows, prompt-engineering
