It was GPT-3 I think. As far as I remember, it's post-training that kills chess ...

Imustaskforhelp · 2025-11-03T03:15:55 1762139755

This is so interesting, I am curious as to why, can you (or anyone) please provide any resources or insightful comments about it, they would really help a ton out here, thanks!

pixelmelt · 2025-11-03T06:57:46 1762153066

Gpt3 was trained on completion data so it likely saw lots of raw chess games layed out in whatever standard format moves are listed in, while 3.5 was post trained on instruct data (talking back and forth) which would have needed to explicitly include those chess games as conversational training data for it to retain as much as it would otherwise