Yesterday, we were discussing when we could actually let AI agents auto-merge and publish code.

My take: probably not anytime soon.

These screenshots are exactly why. They show the output from Claude Code when I asked it to add a feature.

Before you think it is a prompt or workflow issue, let me tell you that I have a comprehensive workflow that spawns 9 subagents to review code against 9 best practices. Two of the subagents review React Native and UX/UI.

Yet, it still keeps making these mistakes:

  1. The UI does not respect the screen's safe area: the navigation bar overlaps the status bar.

  2. Layout elements are misaligned.
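
To make the first mistake concrete, here is a minimal sketch in plain TypeScript. The `EdgeInsets` shape, the function names, and the inset values are my own illustration, not the code from my app. In a React Native project, the insets would typically come from a library such as react-native-safe-area-context (its `useSafeAreaInsets` hook).

```typescript
// Hypothetical shape: the safe-area insets a device reports, in points.
interface EdgeInsets {
  top: number;
  bottom: number;
  left: number;
  right: number;
}

// True when a bar whose top edge sits at `barTop` (measured from the top
// of the screen) starts inside the status bar region.
function overlapsStatusBar(barTop: number, insets: EdgeInsets): boolean {
  return barTop < insets.top;
}

// The fix: start the bar below the top inset instead of at y = 0.
function safeBarTop(insets: EdgeInsets): number {
  return insets.top;
}

// Illustrative values only; real insets differ per device.
const insets: EdgeInsets = { top: 47, bottom: 34, left: 0, right: 0 };

console.log(overlapsStatusBar(0, insets)); // bar pinned to y = 0: overlaps
console.log(overlapsStatusBar(safeBarTop(insets), insets)); // offset by the inset: no overlap
```

The bug is one missing offset. The code still compiles and runs without it, which is why it slips through.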

LLM output right now is like a slot machine. Most spins land fine. But about 10% to 20% of the time, it fails at the simplest layout.
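
A rough back-of-the-envelope sketch of what that failure rate means for auto-merging. It assumes each change fails independently at the same rate, which is my simplification and not something I have measured. The function name is made up for illustration.

```typescript
// Probability that `changes` consecutive changes all render correctly,
// assuming each one fails independently with probability `failureRate`.
function allCleanProbability(failureRate: number, changes: number): number {
  if (failureRate < 0 || failureRate > 1) {
    throw new RangeError("failureRate must be between 0 and 1");
  }
  if (!Number.isInteger(changes) || changes < 0) {
    throw new RangeError("changes must be a non-negative integer");
  }
  return Math.pow(1 - failureRate, changes);
}

// The 10% and 20% rates are the rough range quoted above.
for (const rate of [0.1, 0.2]) {
  const p = allCleanProbability(rate, 10);
  console.log(
    `${Math.round(rate * 100)}% failure rate, 10 changes: ` +
      `${(p * 100).toFixed(1)}% chance of zero visual bugs`
  );
}
```

Under that assumption, ten auto-merged changes come through with no visual bug only about 35% of the time at a 10% failure rate, and about 11% of the time at 20%.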

The code compiles. The app runs. But visually, it is off.

For web apps, there is a test harness. You can now let Claude Code see your browser UI through extensions. It spots visual issues and self-corrects. This helps a lot.

But for mobile apps? I still haven’t come across a way to let Claude Code see the screen.

And that, I think, is the real bottleneck in AI coding now. I spend more than 50% of my time doing manual UI testing.

AI can write code that compiles. But right now, it can’t tell if the screen looks right.

Until an AI agent can see its own work the way a developer does, we probably still need a human UI tester.
