Unlock the potential of ChatGPT with this informative video on the key feedback mechanisms for improving its responses. Understand the difference between in-session feedback and the built-in rating system, and learn how to effectively use them to enhance your experience. Enhance your knowledge and improve your results with ChatGPT. Subscribe now to stay updated.
Can’t see anything? Watch it on YouTube here.
Listen to the audio here:
- Got a question for You Ask, I'll Answer? Submit it here!
- Subscribe to my weekly newsletter for more useful marketing tips.
- Subscribe to Inbox Insights, the Trust Insights newsletter for weekly fresh takes and data.
- Find older episodes of You Ask, I Answer on my YouTube channel.
- Need help with your company's data and analytics? Let me know!
- Join my free Slack group for marketers interested in analytics!
What follows is an AI-generated transcript. The transcript may contain errors and is not a substitute for watching the video.
Christopher Penn 0:00
In this episode Carol asks, regarding chat GPT if I provide positive feedback after an answer, will the influence chat GPT-2 Next replies? Yes, but it depends on the kind of positive feedback we’re talking about.
There’s two essential feedback mechanisms to chat GPT.
One is you’ll notice next to each prompt, there’s a thumbs up thumbs down.
That is the training data that we are asked to provide as users of the system as beta users to say this response was good or this response was not good.
Doing that provides training data to OpenAI to essentially take those prompt response pairs, the ones that got thumbs up, when it basically was back into the training model and says, do more of this stuff.
And it wasn’t got thumbs down, it goes into the back of the training while saying do less of this stuff.
And so that feedback, it’s called reinforcement learning, helps AIS get smarter, essentially, get get more clever at what they do, by avoiding things that are that are not appropriate.
That’s one of the reasons why fine tuning, which is a process where you retrain the AI a little bit or give some things additional weights is so important.
That’s one aspect.
The second aspect is if you’re talking about sort of just text interaction, that doesn’t amend the training dataset, not overtly, but what it does do is that it provides guidance for the model within that session to do less or more of something.
And that data may may be used for reinforcement learning as well, if it’s clear enough that the feedback is about that prompt.
But the the mechanism that for sure, we know impacts the reinforcement learning is the thumbs up thumbs down thing.
When you’re working within a session within OpenAI within a specific conversation, providing positive feedback or corrective feedback will help more than anything, refine the results that you get, right.
If you say to him, hey, good answer.
It may say thank you and may do all these things and then say, Do you want to continue to want to do something more that that’s going to be sort of in session textual feedback, but it doesn’t change the model as much as the thumbs up thumbs down ratings.
So if you want to influence chat GPT to overall provide a better experience use that the built in rating system if you want to see how it interacts with you within that session and the feedback that it gives you and the way the prompts and the outputs change.
You can use in conversation feedback as well, but there’s been no indication that OpenAI overtly uses that training data as part of its reinforcement learning mechanisms.
They may they may, we just don’t know that’s not disclosed in the documentation.
Thanks for asking.
If you’d like this video, go ahead and hit that subscribe button.
You might also enjoy:
- B2B Email Marketers: Stop Blocking Personal Emails
- It's Okay to Not Be Okay Right Now
- Transformer les personnes, les processus et la technologie - Christopher S. Penn - Conférencier principal sur la science des données marketing
- How I Think About NFTs
- How To Start Your Public Speaking Career
Want to read more like this from Christopher Penn? Get updates here:
Get your copy of AI For Marketers