
To be fair, it also seems like it performs worse. Question: "Who is Scott Alexander?"

chatgpt-turbo:

>As an AI language model, I cannot determine the specific Scott Alexander in question without further contextual information. There may be several people with this name, including writers, politicians, entrepreneurs, athletes, and more.

chatgpt website:

>Scott Alexander is a pseudonym used by a prominent American psychiatrist and blogger who is best known for his writing on the blog "Slate Star Codex." He has also written articles for other publications, including The New York Times, The Atlantic, and The Wall Street Journal.

>Scott Alexander's writing covers a wide range of topics, including medicine, psychiatry, economics, politics, and culture. He is known for his thoughtful, analytical approach to complex issues and for his ability to explain difficult concepts in a clear and accessible way.

>In 2020, Scott Alexander announced that he was deleting his blog after he was doxxed by a reporter for The New York Times who planned to write an article about him. However, he later started a new blog called "Astral Codex Ten" where he continues to write.



Is that necessarily worse performance, though?

One of the main pitfalls/criticisms of ChatGPT has been that it confidently plows forward and gives an answer regardless of whether it's right or wrong.

Here, it seems like it's being more circumspect, which could be a step in the right direction. At least that's one possible explanation for not answering.

On Wikipedia, if I type "Scott Alexander" and hit enter, it takes me directly to the page for a baseball player. So it's not clear that the blogger is the right answer.

I do think there's a better response than either of these, though. It could list the most famous Scott Alexanders and briefly say what each is known for, then ask if you mean one of those.


With enough tries it gives wrong answers to the exact same question too, so I don't see an improvement in that direction.


Perhaps a transitory issue. I just tried it with the API, `gpt-3.5-turbo`. I got:

> Scott Alexander is the pen name of American psychiatrist and blogger, Scott Alexander Siskind. He is known for writing his blog, "Slate Star Codex", which covers a wide range of topics including science, medicine, politics, and culture. He has been praised for his clear and concise writing style and thoughtful analysis of various issues. In addition to his work as a blogger, Scott Alexander has also published a book titled "Unsong", which is a fantasy novel set in an alternate universe where the Bible is a magical text.


Can we really draw any conclusions about LLMs based on one sample? Maybe you've tried multiple times and with different semi-famous people, but in general I see people comparing ML models in this fashion.


Not really; I did try multiple attempts with multiple people, and chatgpt had more issues. I just shared one of them. If someone tests this in a more systematic fashion, that'd be great.


Did you add the default ChatGPT system prompt at the beginning when using the API?


I'm doing it via the openai library, the way shown in their docs.

>completion = openai.ChatCompletion.create(model="gpt-3.5-turbo", messages=[{"role": "user", "content": "Who is Scott Alexander?"}])
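
Roughly the full script, if it helps (the import, key handling, and print are my additions for completeness, not from the docs):

    import os
    import openai

    # The library needs an API key; here it is read from the environment.
    openai.api_key = os.environ["OPENAI_API_KEY"]

    completion = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": "Who is Scott Alexander?"}],
    )

    # The model's reply is in the first choice's message content.
    print(completion.choices[0].message.content)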


Adding ChatGPT's initial prompt as a message with `system` role may make a difference (didn't try): https://platform.openai.com/docs/guides/chat/instructing-cha...

Also, we don't know ChatGPT's parameters (temperature, etc.).
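
A rough sketch of what that could look like. Note the system prompt text below is only an approximation (OpenAI hasn't published ChatGPT's exact prompt), and temperature=0.7 is just a guess at a ChatGPT-like setting:

    import openai

    # Approximation of ChatGPT's default system prompt; the real one isn't documented.
    system_prompt = (
        "You are ChatGPT, a large language model trained by OpenAI. "
        "Answer as concisely as possible."
    )

    completion = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        temperature=0.7,  # guess; ChatGPT's actual sampling parameters aren't public
        messages=[
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": "Who is Scott Alexander?"},
        ],
    )

    print(completion.choices[0].message.content)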



