Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You will need dictionaries with millions of tokens, which will make models much larger. Also, any word that has too low frequency to appear in the dictionary is now completely unknown to your model.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: