-
Notifications
You must be signed in to change notification settings - Fork 10
Open
Description
First of all, thanks a lot for your useful, nice and interesting Java implementation of HyperMinHash.
My question is about this: https://github.com/LiveRamp/HyperMinHash-java/blob/master/src/main/java/com/liveramp/hyperminhash/BetaMinHashCombiner.java#L54
As I read arXiv:1710.08436
, while mergeability is trivial, I think it's not trivial that Jaccard Index estimation for multiple (> 2) sets works properly.
- Does the estimation still have same accuracy as of 2-set Jaccard Index ?
- If so, is there any proof ?
Sorry for obscure question. Thanks again.
Metadata
Metadata
Assignees
Labels
No labels