Featured
Recent Posts
- Published:
Agents as Bounty Hunters
I built a benchmark that pits coding agents against each other in a bug-finding treasure hunt.
- Published:
Hiring For Humans (podcast highlights)
What are we actually hiring for when AI can ace your interviews?
- Published:
Claude Don't Code
Non-coding use cases for Claude Code and what to make of it.
- Published:
This is Not Me, Dancing.
Google's video model made me dance.