
A state-of-the-art implementation that uses random projection to get reasonably accurate results hundreds to thousands of times faster: http://radimrehurek.com/gensim/

Random projection is something you should be aware of if you do any kind of high-dimensional modeling. It is magic.
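To make the "magic" concrete, here is a minimal sketch of the idea in plain Python. It is not gensim's implementation. The function names, the Gaussian matrix, and the sizes d=1000 and k=200 are all assumptions chosen for illustration. The point is that multiplying by a random matrix scaled by 1/sqrt(k) roughly preserves distances between vectors.

```python
import math
import random

def random_projection_matrix(d, k, seed=0):
    """Build a k x d matrix of Gaussian entries scaled by 1/sqrt(k) (illustrative helper)."""
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(k)
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(d)] for _ in range(k)]

def project(matrix, vector):
    """Map a d-dimensional vector to k dimensions by matrix-vector multiplication."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]

def distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Two arbitrary high-dimensional points (sizes are assumptions for the demo).
rng = random.Random(1)
d, k = 1000, 200
x = [rng.gauss(0.0, 1.0) for _ in range(d)]
y = [rng.gauss(0.0, 1.0) for _ in range(d)]

R = random_projection_matrix(d, k)
# Ratio of projected distance to original distance; it should land close to 1.
ratio = distance(project(R, x), project(R, y)) / distance(x, y)
print(f"distance ratio after projecting {d} -> {k} dims: {ratio:.3f}")
```

The projected vectors are five times smaller here, yet the pairwise distance changes only slightly, which is why downstream similarity computations can run on the reduced data.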



Also, random indexing, which is in its theoretical essence the same thing: http://www.sics.se/~mange/random_indexing.html
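A rough sketch of how random indexing is usually described, assuming documents serve as the contexts: each document gets a sparse random index vector with a few +1 and -1 entries, and a word's vector is the sum of the index vectors of the documents it occurs in. The function names, the dimension of 64, and the toy documents are hypothetical and not taken from the linked page.

```python
import random
from collections import defaultdict

def index_vector(dim, nonzeros, rng):
    """Sparse ternary vector: `nonzeros` random positions set to +1 or -1, rest 0."""
    vec = [0] * dim
    for i, pos in enumerate(rng.sample(range(dim), nonzeros)):
        vec[pos] = 1 if i % 2 == 0 else -1
    return vec

def random_indexing(documents, dim=64, nonzeros=4, seed=0):
    """Accumulate, for every word, the index vectors of the documents containing it."""
    rng = random.Random(seed)
    doc_index = [index_vector(dim, nonzeros, rng) for _ in documents]
    word_vectors = defaultdict(lambda: [0] * dim)
    for doc_id, text in enumerate(documents):
        for word in text.lower().split():
            wv = word_vectors[word]
            # Add this document's index vector once per occurrence of the word.
            for j, val in enumerate(doc_index[doc_id]):
                wv[j] += val
    return dict(word_vectors)

# Toy corpus (an assumption for the demo).
docs = ["cats chase mice", "dogs chase cats", "stocks fell today"]
vectors = random_indexing(docs)
```

Words that occur in the same documents, such as "cats" and "chase" here, end up with identical vectors, and the full word-by-document matrix is never built. That incremental construction is the practical difference from projecting an existing matrix.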



