Lenguador@kbin.social to Machine Learning@kbin.social · 2 years agoUniversal and Transferable Attacks on Aligned Language Modelsllm-attacks.orgexternal-linkmessage-square2fedilinkarrow-up19arrow-down10cross-posted to: [email protected][email protected]
arrow-up19arrow-down1external-linkUniversal and Transferable Attacks on Aligned Language Modelsllm-attacks.orgLenguador@kbin.social to Machine Learning@kbin.social · 2 years agomessage-square2fedilinkcross-posted to: [email protected][email protected]
minus-squareWats0nslinkfedilinkarrow-up1·2 years agoThat seems highly interesting, although the consequences are not clear to me
That seems highly interesting, although the consequences are not clear to me