I interpreted it to suggest that the product might include a user-facing “maximu... | Hacker News


alwa on Sept 12, 2024 | on: Learning to Reason with LLMs

I interpreted it to suggest that the product might include a user-facing “maximum test time” knob. Generating problem sets for kids? You might only need or want a basic level of introspection, even though you like the flavor of this model’s personality over that of its predecessors. Problem worth thinking long, hard, and expensively about? Turn that knob up to 11, and you’ll get a better-quality answer with no human-in-the-loop coaching or trial-and-error involved. You’ll just get your answer in timeframes closer to human ones, consuming more (metered) tokens along the way.

mrdmnd on Sept 12, 2024

Yeah, I think this is the goal. Remember, there are some problems that only need to be solved correctly once! Imagine something like a Millennium Prize Problem: you'd be willing to wait a pretty long time for a proof of the Riemann Hypothesis (RH)!
