This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.
Accepted at the research sprint on July 17, 2023

DPO vs PPO comparative analysis

We perform a comparative analysis of the Direct Preference Optimization (DPO) and Proximal Policy Optimization (PPO) algorithms, using interpretability techniques to try to understand the differences between the two.

Rauno Arike, Luke Marks, Amir Abdullah, Luna Mendez
