AI Engineer
Design, build, and ship LLM-powered agents on real production data. You own evals, prompt engineering, and the model-selection calls that make pilots succeed or fail.
- Build production agents (LLM + tools + RAG) for client operations
- Write and own eval suites that outlive every model migration
- Drive vendor selection: Anthropic vs OpenAI vs open-source, by use case
- Pair with consulting on discovery; write the ADR before the code
- 3+ years shipping AI in production (not just notebooks)
- Comfortable working across at least two of: Anthropic, OpenAI, Vertex, local models
- Strong Python or TypeScript; you've written evals from scratch
- Public writing or open source we can read