Blog

How teams use agents to iterate, review, and ship PRs with proof.


Score Agents Like Employees, Not Like Models

Roo Cast · 2025-11-05

Why code-correctness benchmarks miss critical agent failure modes, and how to evaluate AI coding agents using work-style metrics such as proactivity, context management, and communication.

ai-agents, developer-productivity, evaluation, coding-agents

Why Your Model Searches the Same Files Twice

Roo Cast · 2025-11-05

Learn why reasoning models repeat file searches across prompts, and how preserving reasoning tokens between API calls can boost coding benchmark performance by 4 to 5 percentage points.

reasoning-models, api-architecture, context-management, ai-coding

Code-Specific Models Miss the Point

Roo Cast · 2025-10-22

Why code-specific LLMs fail at pair programming: they optimize for syntax prediction but strip out the world understanding needed to build software that actually serves users.

ai-coding, llm-models, pair-programming, developer-workflow

Why Your 2023 LLM Integration Needs a Rewrite

Roo Cast · 2025-10-22

Learn why AI integrations built in 2023 need a complete rewrite, not patches. The scaffolding you built to work around model limitations now prevents you from using current capabilities.

llm-integration, technical-debt, ai-architecture, agentic-workflows

Working Prototypes Beat Specs

Roo Cast · 2025-10-22

Why clickable prototypes eliminate the guesswork and alignment meetings that specs create, and how AI coding agents make prototype-first workflows the new default.

product-development, prototyping, workflow, ai-coding-agents

Code Review Got Faster, Not Easier

Roo Cast · 2025-10-16

AI is generating over half of Google's production code. The bottleneck didn't disappear; it moved. Here's how teams are adapting review workflows to handle the volume.

code-review, ai-development, engineering-practices, productivity

Vibe Coders Build and Rebuild, They Do Not Migrate

Roo Cast · 2025-10-16

Why non-technical builders hit a migration wall with AI coding tools, and how engineering artifacts make the difference between throwaway prototypes and production-ready handoffs.

vibe-coding, ai-coding-agents, developer-workflow, prototyping
