This project was submitted by Galen Pogoncheff. It was a runner-up for the ‘Interpretability’ prize in our AI Alignment course (Mar 2024). Participants worked on these projects for 4 weeks.
Towards Behavioral-Alignment via…
This project was submitted by Galen Pogoncheff. It was a runner-up for the ‘Interpretability’ prize in our AI Alignment course (Mar 2024). Participants worked on these projects for 4 weeks.