This project won the “Education and community building” prize on our AI Alignment (June 2024) course.
AI Sandbagging: an Interactive Explanation
This project won the “Education and community building” prize on our AI Alignment (June 2024) course.