Finn — Enterprise Multi-Agent Platform
Problem. Knowledge workers lost hours each week stitching together data across a dozen internal systems and chasing routine approvals. Existing chatbots could answer questions but couldn't take action, so adoption stalled after the novelty wore off.
- Framed the product around 'jobs that finish themselves' — agents that complete a task end-to-end, not just retrieve an answer.
- Designed an orchestrator + specialist-agent architecture with tool-use, human-in-the-loop approval gates, and per-action audit logging.
- Shipped a thin, observable v1 (3 high-frequency workflows) and instrumented every step to learn where agents failed or needed a human.
- Built an evaluation harness so each new agent was graded on task success, not vibes, before release.
I owned product strategy, the agent UX, and the evaluation framework, partnering with an ML platform team of ~6 engineers.
Impact