-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question: a way to extract read-only k-mers #9
Comments
Hi Ben, Glad to hear that merqury is being useful! 1. Collect all k-mers in your assembly
2. Subtract them from the read set
3. Extract reads containing the read-only mers
Hope this helps! |
Awesome! thanks so much for the quick and thoroughly laid out answer! |
Forgot to add the |
Arang, I realize that this is out of the scope of Merqury issues, but I'm having some memory issues running the read extraction steps. Meryl-lookup was using something like 260G of RAM (out of 400+ available) but crashed with a cannot allocate memory error:
I'm using meryl from here: https://github.com/marbl/meryl Any advice? |
Arang, I've just upped the memory requested to 512G and seems to be running well. |
Hi Ben, Sorry for the delayed reply. You can set Arang |
Not a delayed response at all - Nothing to apologize for. |
I think that's a fair strategy. I'd have done the same thing. You could also limit the high-copy k-mers, with Arang |
Awesome thanks! |
Hello @kneubehl , it seems like your meryl is not updated, or used from the previous path. Could you double check this is the path where v1.3 was installed? |
Alex, this is the help message for meryl-lookup in v1.3:
How did you install meryl? |
Also please use this script for excluding reads having kmers from a given kmer db.
for paired-end (short) reads or
for single-end (long) reads. |
Hi again Dr. Rhie,
I'm having a great time using merqury to asses quality and completeness of different versions of our assembly. It's an awesome tool and it is allowing us to make more informed decisions about haplotig purging and other parameter tweaks. So thanks again!
As I was examining our spectra I noticed our assembly is missing some heterozygosity ie a black (read-only) peak at 1-copy.
I was wondering if there is a way to pull out those kmers specifically and then the reads they came from. My thought is that we could then assemble at least part of the missing heterozygosity using the short reads then add that to our haplotigs to get a more complete assembly.
Can you suggest a way to extract those kmers and their sequence for such a purpose?
Thanks in advance,
Ben
The text was updated successfully, but these errors were encountered: