You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was not able to find the minhash sketch size but only filter fraction, what does this mean with respect to sketch size? It was very clear in all other minhash implementations such as Mash, BinDash et.al. and the sketch size is the key parameter determining accuracy and speed.
Thanks,
Jianshu
The text was updated successfully, but these errors were encountered:
Instead of using fixed-sized sketches, Kmer-db selects given fraction of k-mers as this allows more accurate distance estimation for genomes with different sizes. But if you now more or less the size of your genomes, the math is easy: sketch_size = genome_size * fraction. In the paper you can find the error comparison of Kmer-db fractions and Mash sketches on bacterial genomes.
The thing is that Kmer-db is significantly faster than Mash and in many cases you won't need sketching at all (this is the default mode, actually).
Hi kmer-db team,
I was not able to find the minhash sketch size but only filter fraction, what does this mean with respect to sketch size? It was very clear in all other minhash implementations such as Mash, BinDash et.al. and the sketch size is the key parameter determining accuracy and speed.
Thanks,
Jianshu
The text was updated successfully, but these errors were encountered: