This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.
AI capabilities and risks demo-jam: Creating visceral interactive demonstrations
August 26, 2024
Accepted at the AI capabilities and risks demo-jam research sprint on August 26, 2024

Web App for Interacting with Refusal-Ablated Language Model Agents

While many people, including policymakers, have interacted with language models, they often hold outdated assumptions about what these models can do. A significant fraction are unaware of agentic capabilities. Furthermore, most models available online have safety guardrails, so users rarely see how a model behaves without them. We want to demonstrate refusal-ablated agents to make people aware of the potential for misuse. Giving people hands-on experience with agentic AI, and perhaps having the AI operate against them, could provide a better intuition about agency in AI systems. We present a simple web app that lets users instruct and experiment with an unrestricted agent.

By Simon Lermen
