Pro@programming.devM to

AI - Artificial intelligence@programming.devEnglish · 14 days ago

Forcing LLMs to be evil during training can make them nicer in the long run

www.anthropic.com

0

cross-posted to:
hackernews@lemmy.bestiver.se

1

Forcing LLMs to be evil during training can make them nicer in the long run

www.anthropic.com

Pro@programming.devM to

AI - Artificial intelligence@programming.devEnglish · 14 days ago

0

cross-posted to:
hackernews@lemmy.bestiver.se

Persona vectors: Monitoring and controlling character traits in language models

www.anthropic.com

A paper from Anthropic describing persona vectors and their applications to monitoring and controlling model behavior

You must log in or register to comment.

Chat