
> GPT-4 can perform nearly all tasks you throw at it with well above average human performance.

It can't even generate flashcards from a textbook chapter, because it can't load the entire chapter into memory. Heck, it doesn't even know what textbook I'm talking about; I have to provide the content!

It fails constantly at real-world coding problems, and often does so silently. If you tried to replace a software developer with GPT-4, you would be left with a gaping productivity hole where that developer once existed. The improvement GPT-5 would have to provide is multiple orders of magnitude in order for this to be a realistic proposition.

I use it daily and know better than to trust its output.



>It can't even generate flashcards from a textbook chapter, because it can't load the entire chapter into memory. Heck, it doesn't even know what textbook I'm talking about; I have to provide the content!

Okay...? That's a context window problem, and you could manage it if you sent the textbook in chunks.
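
To make "in chunks" concrete, here's a rough sketch, not a tested recipe. It assumes the chapter is plain text with blank lines between paragraphs, and it uses a character budget as a crude stand-in for the model's token limit. The name `chunk_paragraphs` and the 8000 default are made up for illustration.

```python
def chunk_paragraphs(text: str, max_chars: int = 8000) -> list[str]:
    """Greedily pack whole paragraphs into chunks of at most max_chars.

    Assumption: paragraphs are separated by blank lines. A single
    paragraph longer than max_chars is kept whole as an oversize chunk
    rather than being cut mid-sentence.
    """
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks: list[str] = []
    current: list[str] = []
    size = 0
    for p in paragraphs:
        # Account for the "\n\n" separator when the chunk is non-empty.
        added = len(p) + (2 if current else 0)
        if current and size + added > max_chars:
            # Budget exceeded: close the current chunk and start a new one.
            chunks.append("\n\n".join(current))
            current, size = [], 0
            added = len(p)
        current.append(p)
        size += added
    if current:
        chunks.append("\n\n".join(current))
    return chunks
```

Each chunk would then go to the model as its own request. Splitting on paragraph boundaries keeps sentences intact, but nothing here carries context from one chunk to the next.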

>The improvement GPT-5 would have to provide is multiple orders of magnitude in order for this to be a realistic proposition.

No, it wouldn't.

https://arxiv.org/abs/2309.12499


So by your own words, in order to use the LLM usefully, I need to manually manage it? Do you know what I don’t have to manually manage? A person.

I can feed a person a broad, complex, or even half-formed idea and they can actively troubleshoot until the problem is resolved, then keep monitoring and tweaking their solution so the problem stays resolved. LLMs can’t even come close to doing that.

You’re proving my point for me; it’s a tool, not a developer. Zero jobs are at risk.

Also, not for nothing, but no: sending the textbook in chunks doesn’t work, because the LLM can’t then synthesize complex ideas that span the entire chapter. You have to compose a set of notes first, then feed it the notes, and even then the resulting flashcards are meaningfully worse than what I could come up with myself.
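
For reference, the notes-first workflow looks roughly like the sketch below. `ask_model` is a hypothetical stand-in for whatever function sends a prompt to the model and returns its reply; it is not a real library call, and the prompt wording is only illustrative. Here the model writes the per-chunk notes too, which is one possible variant.

```python
from typing import Callable


def flashcards_via_notes(chunks: list[str], ask_model: Callable[[str], str]) -> str:
    """Two passes: condense each chunk to notes, then write cards from all notes.

    ask_model is a caller-supplied (hypothetical) function: prompt in, reply out.
    """
    # Pass 1: one request per chunk, each small enough for the context window.
    notes = [
        ask_model("Summarize the key ideas as brief notes:\n\n" + chunk)
        for chunk in chunks
    ]
    # Pass 2: a single request that sees the notes for the whole chapter at once.
    combined = "\n\n".join(notes)
    return ask_model(
        "Write flashcards (Q: ... / A: ...) from these notes, "
        "linking ideas across sections:\n\n" + combined
    )
```

That's one request per chunk plus one more for the cards, and the second pass only ever sees the notes, so anything lost in pass 1 stays lost.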



