
Aren't LLMs much more limited in the number of output tokens than input tokens? For example, GPT-4o seems to support only up to 16K output tokens. I'm not completely sure what the reason is, but I wonder how that interacts with Chain-of-Thought reasoning.


Not really.

There's no fundamental difference between input and output tokens technically.

The model's internal state after evaluating a given sequence of tokens is exactly the same no matter which of those tokens were produced by the prompter and which by the model.

The 16K output token limit is just an arbitrary limit in the ChatGPT interface.
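
Here's a minimal sketch of the autoregressive loop being described (model is a hypothetical callable returning per-position logits, and sample is plain greedy argmax for simplicity; neither is any real library's API). The prompt and the model's own output live in one flat token sequence, so the network itself sees no distinction between them:

    def sample(logits):
        # Greedy decoding for simplicity; real decoders usually
        # sample from the probability distribution instead.
        return max(range(len(logits)), key=lambda i: logits[i])

    def generate(model, prompt_tokens, max_new_tokens, eos_id):
        # One flat sequence: prompt tokens and generated tokens
        # are stored and processed identically at every step.
        tokens = list(prompt_tokens)
        for _ in range(max_new_tokens):
            logits = model(tokens)        # forward pass over the whole context
            next_id = sample(logits[-1])  # next token comes from the last position
            tokens.append(next_id)        # the new token joins the same context
            if next_id == eos_id:
                break
        return tokens[len(prompt_tokens):]  # the "output tokens" are just the tail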


> The 16K output token limit is just an arbitrary limit in the ChatGPT interface.

It is a hard limit in the API too, although frankly I have never seen an API output go over 700 tokens.
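
For reference, here's a sketch of where the cap surfaces in the API, assuming the official OpenAI Python SDK; the exact number varies by model snapshot (gpt-4o allows up to 16,384 completion tokens), and requests above the model's cap are rejected, while generations that hit it come back truncated with finish_reason == "length":

    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    resp = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": "Write a very long story."}],
        max_tokens=16384,  # at or below the model's output cap; larger values error
    )

    choice = resp.choices[0]
    if choice.finish_reason == "length":
        # The model hit the output-token cap before finishing.
        print("Output was truncated at the limit.")
    print(choice.message.content)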



