Quality Diversity through AI Feedback
Herbie Bradley1,2,3, Andrew Dai4, Jenny Zhang5,6, Jeff Clune5,6, Kenneth Stanley7, Joel Lehman1,2 1CarperAI, 2Stability AI, 3University of Cambridge, 4Aleph Alpha, 5University of British Columbia,...
Diff Models – A New Way to Edit Code
CarperAI is releasing a series of diff models—models trained to predict a code diff, trained on millions of commits scraped from GitHub. We are releasing 3 models of different sizes, all fine-tuned...
CHEESE Release
We at CarperAI are happy to announce a new release today: CHEESE, a Co-adaptive Harness for Effective Evaluation, Steering, and Enhancement. We hope it will solve human feedback data collection with...