Making LLMs safer is more intuitive than you think

How Common Sense and Diversity Improve AI Alignment

Jan 28, 2025

This project was submitted by Jeba Sania. It was one of the top submissions in the (Dec 2024) Writing Intensive course. Participants worked on these projects for 1 week. The text below is an excerpt from the final project.

AI safety isn’t purely technical; it’s also about applying common sense and human reasoning. By using reasoning techniques from around the world instead of just the Global North, we can better align AI with human values. If you are interested in AI safety but have an untraditional background or skill set, don’t fret. That’s precisely why your ideas are needed.

Full project

View the full project here.

A guest post by

Jeba Sania

I'm starting to write down some thoughts in 2025

BlueDot Impact

Discussion about this post

Ready for more?