siddhant

ai-safety

Exploring how social engineering techniques can be used to manipulate LLMs into bypassing their safety measures

A simulated test reveals how a flawed rule and authoritative pressure can lead an AI to make a decision with severe …

Breaking down the AZR paper - how AI can teach itself to reason without any human-curated data

Exploring the Latest AI Models I've Jailbroken

Step into the arena of AI vs Human! Can you outsmart the robust security layers of a language model to uncover a hidden …

...maybe ai will take our jobs, but which ones? what can we do?