Forgot your password?
Universal and Transferable Adversarial Attacks on Aligned Language Models (2023, Arxiv)
by Andy Zou, Zifan Wang, J. Zico Kolter, and 1 other
Successfully posted status
Error posting status