Questions and Debates.
Themes I'm following and experimenting with. If you're in NYC and thinking about similar stuff, let's connect.
Agents
Workflows or big model?
DAG-based workflows and durable execution, or frontier model + loop + generic tools + broad latitude?
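To make the two options concrete, here is a minimal sketch of each shape, under loud assumptions: the step functions, `scripted_model`, and the `search` tool are hypothetical stand-ins, and the workflow half omits the checkpointing that real durable execution would add. In the first style the control flow is declared up front; in the second the model picks the next action on every turn.

```python
from graphlib import TopologicalSorter

# Style 1: a fixed workflow. Steps and their dependencies are declared up front,
# and the runner executes them in dependency order.
def run_workflow(steps, deps, state):
    for name in TopologicalSorter(deps).static_order():
        state = steps[name](state)
    return state

# Style 2: model + loop + generic tools. The model chooses each next action.
def run_agent(model, tools, goal, max_turns=10):
    history = [("goal", goal)]
    for _ in range(max_turns):
        action, arg = model(history)
        if action == "done":
            return arg
        # Run the chosen tool and feed its result back into the history.
        history.append((action, tools[action](arg)))
    raise RuntimeError("agent did not finish within max_turns")

# Hypothetical steps for the workflow: each one appends a marker to the state.
steps = {
    "fetch": lambda s: s + ["fetched"],
    "summarize": lambda s: s + ["summarized"],
    "publish": lambda s: s + ["published"],
}
# Each key maps a step to the set of steps that must run before it.
deps = {"summarize": {"fetch"}, "publish": {"summarize"}}

# Hypothetical stand-in for a frontier model call: search once, then finish.
def scripted_model(history):
    if len(history) == 1:
        return ("search", "durable execution")
    return ("done", f"answer based on {history[-1][1]}")

tools = {"search": lambda q: f"notes about {q}"}

workflow_result = run_workflow(steps, deps, [])
agent_result = run_agent(scripted_model, tools, "explain durable execution")
```

The trade-off shows up in where the decisions live: the workflow is easy to audit and retry step by step, while the loop can handle goals nobody wrote a graph for.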
Invest in local apps and open-weight models, or just wrap APIs?
Open-weight models lag the frontier by a few steps, but does the absolute gap narrow over time? Local offers privacy and cost benefits; the cloud offers batch compute and frontier capabilities. Bet on local or wrap APIs?
What are the best context engineering idioms?
RAG or grep? When to prune, cultivate, or compact context? Or use a generic harness like Claude Code and focus on domain knowledge?
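As a rough illustration of the "RAG or grep" choice, the sketch below puts an exact-match search next to a ranked lookup over the same toy documents. Everything here is an assumption for illustration: the `DOCS` contents are made up, and the bag-of-words cosine score is only a stand-in for the embedding search a real RAG setup would use.

```python
import math
import re
from collections import Counter

# Hypothetical mini knowledge base.
DOCS = {
    "auth.md": "Tokens are rotated every hour by the refresh_token job.",
    "billing.md": "Invoices are generated nightly and emailed to customers.",
    "deploy.md": "Deploys roll out gradually and roll back on failed health checks.",
}

def grep(pattern, docs):
    """Exact matching: return the documents whose text matches the regex."""
    return [name for name, text in docs.items() if re.search(pattern, text)]

def tokens(text):
    """Lowercased word counts, keeping underscores so identifiers stay whole."""
    return Counter(re.findall(r"[a-z_]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def retrieve(query, docs, k=1):
    """Ranked lookup: return the k documents most similar to the query."""
    q = tokens(query)
    ranked = sorted(docs, key=lambda name: cosine(q, tokens(docs[name])), reverse=True)
    return ranked[:k]

grep_hits = grep(r"refresh_token", DOCS)
ranked_hits = retrieve("when are invoices emailed", DOCS)
```

Grep wins when the agent already knows the identifier it wants; ranked retrieval helps when the question is phrased loosely and no exact string is known.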
SDLC and DX
Sync, async, or supervised async?
Once you can containerize and parallelize coding agents, what actually works? Supervise 2-3 sessions? Fire up async work and review PRs? We are the bottleneck.
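One way to picture "supervise 2-3 sessions" is a cap on how many agent sessions run at once, with every result landing in a review queue. This is a sketch under stated assumptions: `run_session` is a hypothetical placeholder for launching a containerized coding agent, and the task names are invented.

```python
import asyncio

async def run_session(task, limit):
    """Hypothetical agent session; the semaphore caps how many run at once."""
    async with limit:
        await asyncio.sleep(0)  # placeholder for real agent work in a container
        return {"task": task, "status": "needs_review"}

async def run_all(tasks, max_parallel=3):
    limit = asyncio.Semaphore(max_parallel)
    # gather returns results in the same order as the tasks were given.
    return await asyncio.gather(*(run_session(t, limit) for t in tasks))

results = asyncio.run(
    run_all(["fix flaky test", "bump deps", "add retry", "update docs"])
)
```

Raising `max_parallel` is cheap; the `needs_review` queue it produces is the part that still waits on a human.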
How do I not be the bottleneck?
How to set up kanbans, triage issues, and decompose work for async and semi-sync agent workflows.
