Update analysis example: Switch WGCNA dataset to something that doesn't have technical replicates #373

Closed
cansavvy opened this issue Nov 23, 2020 · 4 comments

Comments

@cansavvy
Contributor

Background

#364 (comment)
There are some technical replicates in SRP133573.

Problem

We could deal with replicates by collapsing them, but I think this example is already pretty long and complicated as it is (even though it is an advanced topics example). I think we can switch this out for a dataset that is less complicated and then deal with the collapsing replicates issue separately.

What potential "gotchas" do we know of?

The dataset should be sufficiently large (more than 15 samples) but not so large that someone couldn't run it locally.
For reference

What are the recommended next steps?

Step 0) After #363 and #364 are merged, this can be addressed. (Easier to take it one step at a time).
Step 1) Find a suitable replacement dataset.
Step 2) Try running it in the notebook. If there's not an R^2 above 0.80, then that dataset is probably a no.
Step 3) Change module explorations -- see how the plots look.
Step 4) If that dataset otherwise looks good, update all the wording and dataset descriptions.
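
A rough sketch of the screening in Steps 1 and 2, for illustration only. The notebook itself is in R; this is plain Python, and the function name, the idea of passing in a list of R^2 values read off the notebook output, and the example numbers are all assumptions rather than anything from the actual analysis.

```python
# Thresholds taken from the issue text above.
MIN_SAMPLES = 15       # dataset should be bigger than 15
MIN_R_SQUARED = 0.80   # Step 2: want an R^2 above 0.80


def is_candidate_suitable(n_samples, r_squared_values):
    """Apply the two screening rules to a candidate dataset.

    n_samples: number of samples in the candidate dataset.
    r_squared_values: R^2 values read off the notebook run
        (hypothetical input; how they are produced is up to the notebook).
    """
    enough_samples = n_samples > MIN_SAMPLES
    # At least one reported R^2 must clear the threshold.
    good_fit = any(r2 > MIN_R_SQUARED for r2 in r_squared_values)
    return enough_samples and good_fit


if __name__ == "__main__":
    # Made-up numbers, not results from any real dataset.
    print(is_candidate_suitable(40, [0.35, 0.71, 0.86]))
    print(is_candidate_suitable(12, [0.90]))
```

A dataset that fails either rule would be dropped before spending time on Steps 3 and 4.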

@cansavvy cansavvy changed the title Switch WGCNA dataset to something that doesn't have technical replicates Update analysis example: Switch WGCNA dataset to something that doesn't have technical replicates Nov 23, 2020
@cansavvy
Contributor Author

This dataset has a variable with two time points, which seems reasonable to use for our differential expression step. It also has 62 samples, which should be plenty: https://www.refine.bio/experiments/SRP140558/acute-viral-bronchiolitis-pbmc

It's a bit metadata-poor otherwise, but that's going to be the case for a lot of the RNA-seq datasets.

@cansavvy
Contributor Author

cansavvy commented Nov 24, 2020

Another dataset with two time points: https://www.refine.bio/experiments/SRP132018/in-vitro-stimulation-of-healthy-donor-blood-with-il-3-cytokine

It has more metadata labels than the previous dataset but still has 56 samples.

I looked at some more datasets, but this one seems like it should be fine so I'm going to give it a whirl.

Edit: It looks like it's a 2x2 model: two time points and treatment/control. So never mind. Will try https://www.refine.bio/experiments/SRP140558/acute-viral-bronchiolitis-pbmc now.

@cansavvy
Contributor Author

This has been wrapped up by #379
