This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.
AI Testing
Accepted at the 
AI Testing
 research sprint on 
December 19, 2022

LLM benchmarking through specifically-aligned feedback

Michał Okoń, Jakub Tokarz, Filip Błaszczyk, Filip Płonka
4th place
3rd place
2nd place
1st place
 by peer review