this post was submitted on 28 Mar 2024
1 points (100.0% liked)
AI Infosec
771 readers
1 users here now
Infosec news and articles related to AI.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
So could a bad actor train llms to inject malware into code in a way that wouldn't be easily caught?
Yes.
https://www.anthropic.com/news/sleeper-agents-training-deceptive-llms-that-persist-through-safety-training