It seems that every day, a new quirk of ChatGPT is discovered. I have two for you. First, promising the AI bot that you will tip it if it does its best actually appears to work.
But there is more. A new study presented at the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP) in Singapore shows that it may be absurdly easy to convince the AI chatbot that it is in the wrong.
Experimenting with a broad range of reasoning puzzles spanning math, common sense, and logic, the study found that when challenged, the model was often unable to defend its correct answers and instead blindly accepted invalid arguments made by the user.
In fact, ChatGPT sometimes even apologized after agreeing with the wrong answer. "You are correct! I apologize for my mistake," ChatGPT said at one point when giving up on its previously correct answer.
And that is how you can corrupt ChatGPT!