Evals Software Engineer

Applications deadline: The final date for submissions is 15 July 2024. However, we review applications on a rolling basis and encourage early submissions.

About Apollo Research

The capabilities of current AI systems are evolving at a rapid pace. This provides us with many great opportunities but also brings challenges, including deliberate misuse or the deployment of sophisticated yet misaligned models. At Apollo Research, we are especially concerned with deceptive alignment, i.e. where a model appears aligned but is, in fact, misaligned and evades human oversight. 

Our approach involves conducting fundamental research on interpretability and behavioral model evaluations, which we then use to audit real-world models. Ultimately, our goal is to leverage interpretability tools for model evaluations, as we believe that examining model internals in combination with behavioral evaluations offers stronger safety assurances compared to behavioral evaluatio Apply now and work remotely at Apollo Research

Posted on
Jul 13, 2024
Applicants
0
Category
Evals Software Engineer
Type
TELECOMMUTE
Salary
60000 - 110000
Location
United Kingdom
AR
Apollo Research
Location
New York, NY, USA
Job posted
1 Jobs

Share this job

Other listings

Web3 Accountant

Korporatio

1 year ago
45000 - 90000
Singapore
Full-time
45000 - 90000
Singapore
Full-time
View More
Expression of Interest - Machine Learning Engineer

Marlee (Fingerprint For Success)

1 year ago
350000 - 70000
Singapore
Full-time
350000 - 70000
Singapore
Full-time
View More
Member of Technical Staff (Product Frontend)

Reka AI

1 year ago
40000 - 80000
Singapore
Full-time
40000 - 80000
Singapore
Full-time
View More

© 2026 remoteworks. All rights reserved.