Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for variable length reads and variable insert sizes #88

Open
jakebiesinger opened this issue Dec 6, 2013 · 3 comments
Open

Comments

@jakebiesinger
Copy link
Contributor

For the scaffolding steps, we need to know how long each read is and its "outer distance", that is, the total distance between the two outside tips of the fragment. We currently don't have any mechanism for recording this.

Reads belong to different "libraries", all of whose reads will have the same length and that will have the same insert size. We currently don't have any way of indicating the library of a given read.

We need to record the library-id in each read and then have lookup tables that map library-id to read length and library-id to outer-distance.

@JavierJia
Copy link
Collaborator

Since we store the whole read sequence inside node, can we just get the read length by .length ?

@jakebiesinger
Copy link
Contributor Author

yep but that means we'd have to iterate over the whole collection when doing queries. But maybe that wouldn't be such a bad approach if you're doing a batch process.

@jakebiesinger
Copy link
Contributor Author

Plus, that doesn't help with the insert size problem, where the user specifies the value and it can be different depending on the library.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants