Hacker Newsnew | past | comments | ask | show | jobs | submit | throw7381's commentslogin

For data extraction from long documents (100k+ tokens) how does structured outputs via providing a json schema compare vs asking one question per field (in natural language)?

Also I've been hearing good things regarding document retrieval about Gemini 1.5 Pro, 2.0 Flash and gemini-exp-1206 (the new 2.0 Pro?), which is the best Gemini model for data extraction from 100k tokens?

How do they compare against Claude Sonnet 3.5 or the OpenAI models, has anyone done any real world tests?


Anyone has done any benchmarks for RAG yet?


Does anyone know the cost of the commercial licence?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: