OpenAI: Learning to Reason with LLMs

openai.com

howrar@lemmy.ca (mod) to Reinforcement Learning@lemmy.ca · English · 8 months ago

OpenAI just put out a blog post about a new model trained via RL (I’m assuming this isn’t the usual RLHF) to perform chain-of-thought reasoning before giving the user its answer. As usual, there’s very little detail about how this is accomplished, so it’s hard for me to get excited about it, but the rest of you might find it interesting.
