Name		Name	Last commit message	Last commit date
parent directory ..
UMAP_analysis		UMAP_analysis
class_balanced_well_log_reg_aggregate_probabilities		class_balanced_well_log_reg_aggregate_probabilities
class_balanced_well_log_reg_comparison_results		class_balanced_well_log_reg_comparison_results
nbconverted		nbconverted
results		results
utils		utils
README.md		README.md
analyze_data.sh		analyze_data.sh
compare_raw_mitocheckjump.ipynb		compare_raw_mitocheckjump.ipynb
log_reg_class_balanced_areashape_aggregate_well_predicted_probabilities.ipynb		log_reg_class_balanced_areashape_aggregate_well_predicted_probabilities.ipynb
log_reg_class_balanced_areashape_analyze_well_predicted_probabilities.ipynb		log_reg_class_balanced_areashape_analyze_well_predicted_probabilities.ipynb

README.md

Analyze Predicted Probabilities

In this module, we perform multiple analyses on the predicted probability data to validate the phenotypic predictions for each treatment (e.g., compound, CRISPR, or ORF). To compare treatments and the negative control groups, we perform KS tests.

Analyze Well Probabilities

We compare the phenotype probabilities between each treated well and the remaining negative control wells on the corresponding plate. Each treatment well and corresponding negative control well phenotype probabilities are only compared if the number of cells in these groups is above a given cell count threshold. The group, treatment cells or control cells, are then randomly down-sampled depending on which of these groups has a larger population of cells. Random sampling of the control cells is accomplished through stratification of cells by the plate's wells. After sampling the cell population, the cells from the treated and control groups are compared using the KS test statistic.

We have found that the predicted probabilities generated from non-shuffled and shuffled weighted logistic regression models seem to perform the best from validation. These models were trained exclusively from mitocheck cellprofiler areashape morphology features.

Run the analysis notebooks

To perform the analyses, run the analyze_data.sh file which will convert the notebook into a python file and run it from terminal.

# Make sure you are in the 3.analyze_data directory
cd 3.analyze_data
# Run the notebook as a python script
source analyze_data.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

3.analyze_data

3.analyze_data

README.md

Analyze Predicted Probabilities

Analyze Well Probabilities

Run the analysis notebooks

Files

3.analyze_data

Directory actions

More options

Directory actions

More options

Latest commit

History

3.analyze_data

Folders and files

parent directory

README.md

Analyze Predicted Probabilities

Analyze Well Probabilities

Run the analysis notebooks