
What's with this how many r's in a strawberry thing I keep seeing?


What's amazing is that, given how LLMs receive input data (as tokenized streams, as other commenters have pointed out), they can ever answer this question correctly.


Models don't really predict the next word, they predict the next token. "Strawberry" is made up of multiple tokens, and the model doesn't truly see the characters inside them... so it tends to struggle.


LLMs are bad at answering that question because their inputs are tokenized.
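To make the tokenization point concrete, here's a toy sketch. The token split and IDs below are hypothetical (real BPE vocabularies differ by model), but they illustrate why character counting is awkward for a model that only ever sees token IDs:

```python
# Hypothetical BPE split of "strawberry" -- real tokenizers vary.
token_split = ["str", "aw", "berry"]
word = "".join(token_split)

# A character-level program answers the riddle trivially:
print(word.count("r"))  # 3

# ...but the model never receives characters, only opaque IDs
# from its vocabulary (the IDs here are made up for illustration):
fake_vocab = {"str": 496, "aw": 675, "berry": 19772}
print([fake_vocab[t] for t in token_split])  # [496, 675, 19772]
```

From the model's side, nothing in `[496, 675, 19772]` directly encodes how many "r" characters the underlying string contains; it has to learn such facts indirectly.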



It’s a common LLM riddle. Apparently many models fail to give the right answer.


Somebody please ask o1 to solve it


The link shows o1 solving it.



