MinHashing
https://moultano.wordpress.com/2018/11/08/minhashing-3kbzhsxyg4467-6/ [moultano.wordpress.com]
2018-12-25 03:05
At any rate, you would like to make clusters out of your data, but you only get to look at each item once in isolation. After looking at it you have to decide what cluster it should go to, at that moment, without looking at any other information, or any other items in your dataset. You only get one shot, do not throw it away! How can we accomplish this?