Demonstration of AI Safety via Market Making

Jun 19, 2024

This project was submitted by Cameron Holmes. It won the ‘Scalable Oversight’ prize in our AI Alignment course (Mar 2024). Participants worked on these projects for 4 weeks.

Abstract

This notebook provides a practical (toy) implementation that demonstrates the AI Safety via Market Making proposal (AISvMM)

This project is still relatively nascent with some gaps, most notably the RL & backprop steps are not yet implemented and lots of future work is identified.

While this work falls short of evaluating the effectiveness of the proposal it partially demonstrates the viability of the AISvMM proposal by showing that current open weights LLMs are capable of acting as agents in the framework and provides a clear path to further work in the area.

Read the full piece here.

BlueDot Impact

Discussion about this post

Ready for more?