How Stripe built “minions”—AI coding agents that ship 1,300 PRs weekly from Slack reactions | Steve Kaliski (Stripe engineer)

Listen now on YouTubeSpotifyApple Podcasts

Brought to you by:

  • Optimizely—Your AI agent orchestration platform for marketing and digital teams

  • Rippling—Stop wasting time on admin tasks, build your startup faster

Steve Kaliski has spent over six years building developer infrastructure at Stripe. In this conversation with Claire, he breaks down Stripe’s “minions”: AI coding agents that ship about 1,300 pull requests per week, often kicked off with nothing more than a Slack emoji. He explains why the real bottleneck in engineering isn’t coding, how cloud development environments unlock parallel AI workflows, and what it takes to safely review thousands of AI-generated PRs. He also demos AI agents that can spend money, coordinate services, and complete tasks end-to-end without human involvement.

Biggest takeaways:

  1. What’s good for human developers is good for AI agents (and vice versa). Stripe’s years of investment in developer experience—comprehensive documentation, blessed paths for common tasks, robust CI/CD, excellent tooling—directly translates to higher AI agent success rates. When you have clear docs on “how to add a new API field,” the agent can follow those same instructions. This creates a virtuous cycle: investments in DX improve agent performance, and investments in agent infrastructure (like cloud environments) benefit human developers too.

  2. Activation energy is the real bottleneck, not coding speed. Steve hasn’t started work in a text editor in months. Instead, work begins in Slack threads, Google Docs, or support tickets—the natural places where ideas emerge. By allowing engineers to kick off development with a single emoji reaction, Stripe lowered the friction between “good idea” and “code in production.” This is especially powerful in large organizations, where coordination costs typically kill momentum before coding even begins.

  3. Cloud development environments are non-negotiable for multi-threaded AI work. Running multiple AI agents in parallel requires cloud-based dev environments that can spin up in seconds, run isolated workloads, and never fall asleep. This infrastructure investment—which Stripe’s developer productivity team built long before AI agents—now enables engineers to run dozens of agents simultaneously without melting their MacBook Pros.

  4. 1,300 AI-written PRs per week requires shifting review capacity, not eliminating it. Stripe still reviews every AI-generated PR, but the review process relies heavily on automated confidence signals: comprehensive test coverage, synthetic end-to-end tests, and blue-green deployments that enable quick rollbacks. The bottleneck shifts from writing code to reviewing it—and eventually to generating enough good ideas in the first place.

  5. Machine-to-machine payments unlock ephemeral, API-first businesses. In Steve’s birthday party demo, Claude Code autonomously paid Browser Base, Parallel AI, and Postal Form for single-use services—no human signup, no subscription, no dashboard. Businesses can now optimize for agent consumers rather than human users, focusing on “hyper-useful single APIs” instead of landing pages and admin panels. The economics become transparent: tokens and dollars sit side by side, making the true cost of AI work visible.

  6. Treat AI agents like new employees, with progressive trust. Start with limited access, expand permissions as the agent proves reliable, and maintain clear boundaries. Each minion runs in an isolated environment with specific data access—the finance agent can read bank statements but can’t send messages; the scheduling agent can text but has no financial data. This physical partitioning prevents accidental data leakage and creates accountability.

  7. The future of software is disposable and hyper-personalized. Steve builds custom iOS apps for his toddler—music players limited to six specific songs—despite having no iOS development experience. He describes this as “the disposability of software”: when AI can build apps in hours, you can create single-purpose tools for incredibly specific use cases and throw them away when they’re no longer needed.

Detailed workflow walkthroughs from this episode:

How to turn Claude Code into your personal life operating system | Hilary Gridley

Listen now on YouTubeSpotifyApple Podcasts

Brought to you by:

WorkOS—Make your app enterprise-ready today

Lovable—Build apps by simply chatting with AI

Hilary Gridley returns to the podcast to share how her approach to productivity has completely evolved since her last appearance. Now a new mom and entrepreneur, she walks Claire through how she uses Claude Code as a personal operating system, managing everything from daily planning to life admin without complex tools or rigid workflows. Instead of building elaborate systems, Hilary leans into what she calls the “anti-system system”: letting AI observe her behavior, learn her preferences over time, and gradually take work off her plate. Together, Claire and Hilary explore how simple inputs—like capturing tasks with a shortcut or “yapping” to Claude throughout the day—can replace traditional productivity stacks and integrations.

Biggest takeaways:

  1. The 10x impact framework: For any task, ask “If I were 10 times better at it, would it have 10 times the impact?” If no, automate it. If yes, keep it human. This applies to both work tasks and life tasks—including whether baking bread will bring you joy or feel like a chore.

  2. Complexity has to earn its keep. Hilary only connects APIs and builds complex integrations after testing the “janky version” of a workflow for a week. Her hit rate is only 20% on workflows she actually continues using, so starting simple saves massive time.

  3. The yappers API beats OAuth every time. Instead of connecting all your tools in the background, just talk to Claude about what you’re doing throughout the day. Hilary keeps Claude Code open in her terminal and narrates her work, letting Claude observe and take notes without complex integrations.

  4. Let AI learn your preferences by observing, not by your defining them. Hilary never sat down to write out her ideal schedule. Claude just watches what she actually does (not what she says she’ll do) and adjusts preferences automatically. Real behavior beats aspirational planning.

  5. Calendar management is the ultimate to-do list. You can’t say you take something seriously if you’re not putting time into it. But manually adding tasks to your calendar is tedious—so let Claude do it automatically based on what you say you want to accomplish.

  6. Screenshots are your friend for getting started. Don’t wait for API access or permissions at work. Build a janky version with screenshots and voice dictation, prove it’s valuable, and then get the permissions you need. Half-baked ideas don’t deserve full access.

  7. You don’t need coding knowledge to build Claude Code skills. Hilary just describes problems to Claude: “I keep forgetting to return things on time.” Claude asks a few questions, then builds the entire workflow—including checking return policies and drop-off locations automatically.

  8. Test everything before integrating it into working systems. Hilary refuses to add new workflows to her daily routine until she’s tested them separately for a week. If something breaks, you don’t want it taking down systems that were already working.

  9. Build the muscle memory by doing one thing with AI every day. The biggest barrier isn’t technical knowledge—it’s rewiring your brain to think “the alien in my computer could help with this.” Hilary went from “I’ll never use the terminal” to running her life in Claude Code in about a week.

Detailed workflow walkthroughs from this episode:


If you’re enjoying these episodes, reply and let me know what you’d love to learn more about: AI workflows, hiring, growth, product strategy—anything.

Catch you next week,
Lenny

P.S. Want every new episode delivered the moment it drops? Hit “Follow” on your favorite podcast app.

Leave a Reply

Sign Up for TheVCDaily

The best news in VC, delivered every day!