-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can we use MELD on single replicate #56
Comments
Hi Rohit! You can run MELD on each pair of samples independently (i.e. t1_ctr vs t1_ko; t2_ctr vs tr_ko), then compare the magnitude of the MELD likelihoods if they were calculated on the same graph. To know if you can build a single graph, you can run PHATE on the datasets and then examine the amount of overlap between the pairs of samples. If they are totally overlapping, then you're good to go with one graph. Otherwise, I would run MELD start to finish separately on each pair of samples. adata here is an Annotated Data Matrix (AnnData). https://anndata.readthedocs.io/en/latest/ Please let me know if this is helpful or if you still have questions |
Below is my PHATE scatter plot using default parameters when I ran PHATE on all 4 Samples together as a single matrix. Is there a measure of overlap that we can use to determine if we can build a single graph or does it rely on visual inspection of the PHATE plot? Besides, for AnnData ( Edit 1: It must be mentioned in the documentation that |
This looks pretty clearly like there's minimal overlap between the T1 vs T2 datasets so I would analyze them separately |
Hi Rohit, I am a co-author on MELD. I want to chime in on your question about quantifying the batch effect. However, there are ways to quantify this without visual inspection. A very simple metric will be a multinomial test. First, treat the probability of a sample being in batch t1 or t2 as a multinomial with a single count. For example, let's say you have 400 t1 and 600 t2 cells. Then, for a given cell, if the batches are distributed uniformly, you have the probabilities in the null hypothesis that The precise test you need is this https://en.wikipedia.org/wiki/Multinomial_test. You can use Pearson's chi-squared test for this. I think that this or a very similar approach was demonstrated for biological purposes in "A test metric for assessing single-cell RNA-seq batch correction" by Buttner et. al (2019) in Nature Methods. https://www.nature.com/articles/s41592-018-0254-1 Jay |
@dburkhardt I split up my assay matrix separately for time points T1 and T2 and I was planning to run a benchmark for parameter search separately on them. However, I got an error Edit 1: Okay, I got it. It's |
I don't understand why Likelihood estimates for both Control and Knockout are all 1. Is it because of a lack of replicates? I will reiterate how I performed the analysis.
Is there another metric that I can use to quantify and say the cells at time T1 are more likely to get arrested because of knockout than at time T2?? |
If you are planning to organize a workshop on MELD, I would like to join. I have been watching the tutorials already present on youtube and I have many questions. How can I join the slack channel of old workshop where other MELD users can help me out? |
Hi Developers!!
We have single-cell data for two-time points each containing control and knockout sample (t1_ctr, t1_ko, t2_ctr,tr_ko). We wish to find out how if the knockout cells are arrested in lifecycle more at time T1 or at Time T2. Is this achievable because we have one replicate per time_period?
Also, in the tutorial and the following code chunk:
what is
adata.X
as it is giving an errorThe text was updated successfully, but these errors were encountered: