We are hiring for exceptional research engineers.
Our worldview
Almost no one has priced in what is about to happen. Even startups that say that they believe in transformative AI don’t act that way. As we get closer to a world where the cost of writing software goes to zero, it becomes more important than ever for hackers to be mindful of what they work on.
AI safety is the most impactful thing one could work with at this point in time. If the development of AI goes right, most of humanity’s biggest problems (diseases, poverty, energy, etc.) will be solved. But there are many potential missteps with consequences ranging from “missed opportunity” to “doom”. In our view, AI safety is not opposed to AI progress, it is the key to it.
Our work
The world is not ready for AGI. The public does not know what’s about to happen, and the developers don’t know how their own AI creations work:
"We do not understand how our own AI creations work. […] this lack of understanding is essentially unprecedented in the history of technology."
- Dario Amodei, April 2025
Andon Labs’ mission is to prepare the world for AGI. Concretely, this means we're building:
- Evals: Scientific experiments to evaluate frontier AI capabilities.
- Integrations: AI will soon be everywhere in society, both in our digital and physical world. We put AI in the real world as a canary in the coal mine.
The immense potential of AI means the company that enables its full release in a safe way will ba among the largest in the world. That company will be us.
Your role
We’re looking for research scientists, research engineers, and software engineers to build Evals and Integrations that rigorously probe frontier AI systems for dangerous capabilities, develop principled benchmarks, and rapidly iterate on elicitation techniques. You’ll design and run end-to-end experiments and translate insights into scalable evaluation tools. You will often work closely with our clients at the leading AI labs.
Through this work, you will have a front row seat to the frontiers of AI.
Example work
Fast-moving startups struggle to specify the exact responsibilities of roles because they change all the time. We’re sorry about the vagueness of the above description, but here is something you would have worked on if you joined in February: You would have co-authored our paper Vending-Bench, which now has almost a million views on X. This would have involved experiment design, implementation of the AI agent in inspect-ai, result analytics, and writing the paper. If you joined in April, you would have worked on something 10x bigger ;) (soon to be public)
We’re based in San Francisco (in-person only), offering competitive salary and stock compensation, and most importantly, a mission critical to ensuring humanity’s prosperous future.