Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Doesn't know JSON and has many OCR errors on documents.


Have you tested PaliGemma's OCR abilities? The article says it does well:

"In average accuracy, we saw 85.84%, beating all other OCR models except for Anthropic’s Claude 3 Opus."


It’s very good. And the cool thing is it’s made for fine tuning also. Excited to see how fine-tuned OCR models do.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: