Anthropic

Research Scientist, Interpretability

Research Scientist to develop methods for understanding large language models (LLMs) by reverse engineering the algorithms learned in their weights. The role involves designing and running experiments, creating and analyzing features, building infrastructure, and communicating results.
