Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Show HN: Byte-Pair Encoding tokenizer for training LLMs on large datasets (github.com/jmaczan)
5 points by yu3zhou4 on Oct 11, 2024 | hide | past | favorite


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: