Lemmy Today
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Pro@programming.devM to AI - Artificial intelligence@programming.devEnglish · 14 days ago

Forcing LLMs to be evil during training can make them nicer in the long run

www.anthropic.com

external-link
message-square
0
link
fedilink
  • cross-posted to:
  • hackernews@lemmy.bestiver.se
1
external-link

Forcing LLMs to be evil during training can make them nicer in the long run

www.anthropic.com

Pro@programming.devM to AI - Artificial intelligence@programming.devEnglish · 14 days ago
message-square
0
link
fedilink
  • cross-posted to:
  • hackernews@lemmy.bestiver.se
Persona vectors: Monitoring and controlling character traits in language models
www.anthropic.com
external-link
A paper from Anthropic describing persona vectors and their applications to monitoring and controlling model behavior
alert-triangle
You must log in or register to comment.

AI - Artificial intelligence@programming.dev

Aii@programming.dev

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !Aii@programming.dev

AI related news and articles.

Rules:

  • No Videos.
  • No self promotion: Don’t post links to your articles.
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 160 users / week
  • 374 users / month
  • 457 users / 6 months
  • 2 local subscribers
  • 81 subscribers
  • 134 Posts
  • 57 Comments
  • Modlog
  • mods:
  • Pro@programming.dev
  • BE: 0.19.11
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org