Start wilms analysis #681

maud-p · 2024-08-01T12:34:43Z

Purpose/implementation Section

In this first module, I create two metadata tables that compile information from the literature on marker genes and known genetic alterations. They will be used later to validate annotations of the Wilms tumor dataset.

Please link to the GitHub issue that this pull request addresses.

#671
#635 (reply in thread)

What is the goal of this pull request?

Wilms tumor (WT) is the most common pediatric kidney cancer and is characterized by marked intra- and inter-tumor heterogeneity. The genetic landscape of WT is very diverse within each of the histological components. The COG classifies WT patients into two groups: favorable histology and diffuse anaplasia. Each of these groups is composed of blastemal, epithelial, and stromal populations of cancer cells in different proportions, as well as cells from the normal kidney, mostly kidney epithelial cells, endothelial cells, immune cells, and normal stromal cells (fibroblasts).

In this module, we reviewed the literature to compile a table of marker genes for each of the expected cell types in the dataset. Additionally, we provide a table of known genetic alterations in Wilms tumor that can be useful to validate CNV profiles obtained after running inferCNV.

Briefly describe the general approach you took to achieve this goal.

The table CellType_metadata.csv contains the following columns:

gene_symbol contains the symbol of the described gene, using the HUGO Gene Nomenclature
ENSEMBL_ID contains the stable identifier from the ENSEMBL database
cell_class is either "malignant" for marker genes specific to the malignant population, "non-malignant" for marker genes specific to non-malignant tissue, or "both" for marker genes that can be found in malignant as well as non-malignant tissue but are still informative with respect to the cell type
cell_type contains the list of the cell types that are attributed to the marker gene
DOI contains the list of main publication identifiers supporting the choice of the marker gene
comment can be empty or contain any additional information
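
The column description above can be turned into a small sanity check. The sketch below is only an illustration, not part of the pull request: it assumes the file is comma-delimited, that cell_class takes exactly the three values listed above, and that ENSEMBL_ID holds a human gene identifier of the form ENSG followed by 11 digits. The function name `check_cell_type_table` is hypothetical, and the two example rows are copied from the README table quoted later in this thread.

```python
import csv
import io
import re

# Column names as listed in the pull request description.
EXPECTED_COLUMNS = ["gene_symbol", "ENSEMBL_ID", "cell_class", "cell_type", "DOI", "comment"]
# Allowed values taken from the cell_class description (assumed to be exhaustive).
ALLOWED_CELL_CLASSES = {"malignant", "non-malignant", "both"}
# Assumption: human Ensembl gene IDs, i.e. "ENSG" followed by 11 digits.
ENSEMBL_GENE_ID = re.compile(r"^ENSG\d{11}$")


def check_cell_type_table(csv_text):
    """Return a list of human-readable problems found in the table text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if reader.fieldnames != EXPECTED_COLUMNS:
        return [f"unexpected header: {reader.fieldnames}"]
    problems = []
    # Data rows start on line 2 because line 1 is the header.
    for line_number, row in enumerate(reader, start=2):
        if not ENSEMBL_GENE_ID.match(row["ENSEMBL_ID"] or ""):
            problems.append(f"line {line_number}: bad ENSEMBL_ID {row['ENSEMBL_ID']!r}")
        if row["cell_class"] not in ALLOWED_CELL_CLASSES:
            problems.append(f"line {line_number}: bad cell_class {row['cell_class']!r}")
    return problems


example = """gene_symbol,ENSEMBL_ID,cell_class,cell_type,DOI,comment
WT1,ENSG00000184937,malignant,cancer_cell,10.1242/dev.153163,Tumor_suppressor_WT1_is_lost_in_some_WT_cells
PODXL,ENSG00000128567,non-malignant,podocyte,10.1016/j.stem.2019.06.009,NA
"""
print(check_cell_type_table(example))  # prints [] when no problem is found
```

To check the real file, one could pass the result of reading CellType_metadata.csv as text to the same function.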

The table GeneticAlterations_metadata.csv contains the following columns:

alteration contains the number and portion of the affected chromosome
gain_loss indicates whether the corresponding genetic alteration is a gain or a loss
cell_class is "malignant"
cell_type contains the list of the malignant cell types that are associated with the genetic alteration, either blastemal, stromal, epithelial, or NA if none of the three histologies is more prone to the described genetic alteration
DOI contains the list of main publication identifiers supporting the choice of the genetic alteration
comment can be empty or contain any additional information
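
As an illustration of how the alteration and gain_loss columns could be consumed later, here is a minimal sketch. It assumes that alterations are written as chromosome, arm, and optional band (for example "11p13", as in the README rows quoted later in this thread) and that gain_loss is either "gain" or "loss". Both the notation and the function name `parse_alteration` are assumptions, not something the pull request defines.

```python
import re

# Assumed notation: chromosome, arm (p or q), optional band, e.g. "11p13" or "1q".
CYTOBAND = re.compile(r"^(\d{1,2}|X|Y)([pq])(\d+(?:\.\d+)?)?$")
# Assumed vocabulary for the gain_loss column.
ALLOWED_GAIN_LOSS = {"gain", "loss"}


def parse_alteration(alteration, gain_loss):
    """Split one table row into chromosome, arm, band, and direction."""
    match = CYTOBAND.match(alteration)
    if match is None:
        raise ValueError(f"unrecognized alteration: {alteration!r}")
    if gain_loss not in ALLOWED_GAIN_LOSS:
        raise ValueError(f"unrecognized gain_loss value: {gain_loss!r}")
    chromosome, arm, band = match.groups()
    return {"chromosome": chromosome, "arm": arm, "band": band, "gain_loss": gain_loss}


print(parse_alteration("11p13", "loss"))
# {'chromosome': '11', 'arm': 'p', 'band': '13', 'gain_loss': 'loss'}
```

Splitting the region this way would make it easier to compare each literature-derived alteration with chromosome-arm-level CNV calls, if that is how the inferCNV output ends up being summarized.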

If known, do you anticipate filing additional pull requests to complete this analysis module?

This module will be used for later validation of the annotations and results from inferCNV.

What is the name of your results bucket on S3?

Results should be uploaded to your bucket so they are available during review.
See here for instructions on how to upload to your bucket:
https://openscpca.readthedocs.io/en/latest/software-platforms/aws/working-with-s3-buckets/

What types of results does your code produce (e.g., table, figure)?

2 tables

Provide directions for reviewers

This section had 2 aims:

learn how to set up the GitHub repository, file an issue, and open a pull request
gather literature information into a metadata file for later use for validation of the annotations

What are the software and computational requirements needed to be able to run the code in this PR?

Are there particular areas you'd like reviewers to have a close look at?

Is there anything that you want to discuss further?

Author checklists

Check all those that apply.
Note that you may find it easier to check off these items after the pull request is actually filed.

Analysis module and review

This analysis module uses the analysis template and has the expected directory structure.
[x] The analysis module README.md has been updated to reflect code changes in this pull request.
The analytical code is documented and contains comments.
Any results and/or plots this code produces have been added to your S3 bucket for review.

Reproducibility checklist

Code in this pull request has been added to the GitHub Action workflow that runs this module.
The dependencies required to run the code in this pull request have been added to the analysis module Dockerfile.
If applicable, the dependencies required to run the code in this pull request have been added to the analysis module conda environment.yml file.
If applicable, R package dependencies required to run the code in this pull request have been added to the analysis module renv.lock file.

jaclyn-taroni · 2024-08-01T12:39:22Z

This looks great! I'm going to close #672 and start my review here.

jaclyn-taroni

Hi @maud-p - I have to head to some meetings, so I will return the feedback I have so far here. 😄

Some high-level thoughts:

I would move to use the data download script as part of your development process because I think it will make it easier for other people (e.g., reviewers) to run your code over the project's lifecycle.
Were you able to build this Docker image successfully locally? If so, what kind of machine are you on? I ran into a problem locally, and I think your answers might help us narrow down what's going on here.

Please let us know if you have any questions. Thank you!

analyses/cell-type-wilms-tumor-06/Dockerfile

analyses/cell-type-wilms-tumor-06/README.md

jaclyn-taroni · 2024-08-01T13:46:42Z

analyses/cell-type-wilms-tumor-06/README.md

+
+• Provide annotations of normal cells composing the kidney, including normal kidney epithelium, endothelium, stroma and immune cells
+• Provide annotations of tumor cell populations that may be present in the WT samples, including blastemal, epithelial, and stromal populations of cancer cells
+Based on the provided annotation, we would like to additionally provide a reference of marker genes for the three cancer cell populations, which is so far lacking for the WT community.


analyses/cell-type-wilms-tumor-06/README.md

jaclyn-taroni · 2024-08-01T14:02:06Z

analyses/cell-type-wilms-tumor-06/README.md

+Data have been downloaded locally and are found in mnt_data. the mnt_data folder has to be define in the config.yaml file or changed in the notebook accordingly. 
+
+```{r paths}
+path_to_data <- "~/mnt_data/Wilms ALSF/SCPCP000006_2024-06-25"
+```


I think using the download-data.py method here would require fewer manual steps for folks who want to run this module.

From this directory, one could run:

../../download-data.py --projects SCPCP000006

(This would use the defaults = download the _processed.rds SingleCellExperiment objects, which I believe are all you need but please do let me know if I'm wrong!)

And then your future notebooks could develop against the path (relative to the root of the repository) data/current/SCPCP000006. That way, even as new releases get cut, your code would continue to run without modification.
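
To make the path convention concrete, here is a small sketch in Python (the language of download-data.py) that lists the processed objects for the project. Only the data/current/SCPCP000006 path and the _processed.rds suffix come from this comment. The folder layout below the project directory is an assumption, which is why the search is recursive, and the function name `find_processed_objects` is hypothetical.

```python
from pathlib import Path


def find_processed_objects(repo_root, project_id="SCPCP000006"):
    """List processed SingleCellExperiment files for one project, sorted by path."""
    project_dir = Path(repo_root) / "data" / "current" / project_id
    if not project_dir.is_dir():
        raise FileNotFoundError(f"{project_dir} not found; run download-data.py first")
    # Search recursively: the layout below the project folder is assumed, not confirmed.
    return sorted(project_dir.rglob("*_processed.rds"))
```

Called from the module directory, `find_processed_objects("../..")` would return the files under the current release, so the code would not need to change when a new release is downloaded.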

This requires AWS CLI setup to run as intended, so let us know if you have any questions!

AWS CLI setup and download into data worked well 👍

../../download-data.py --projects SCPCP000006

I'll change the README file, config.yaml, and (future) scripts!

jaclyn-taroni · 2024-08-01T14:12:28Z

analyses/cell-type-wilms-tumor-06/run.sh

+# parse config parameters:
+source bash/parse_yaml.sh
+eval $(parse_yaml config.yaml CONF_)


I'm not sure if we expect everyone who might want to run this module to have access to bash/parse_yaml.sh?

This is starting to be a bit too high-level for me, to be honest...
All our templates were made by our expert Christoph Hafemeister, and I have to admit that I ran this one without understanding every line...

I'll ask him once he is back from vacation!

Would this replace the run.sh and config.yaml files in the end?
https://openscpca.readthedocs.io/en/latest/ensuring-repro/docker/docker-images/#r-based-images

jaclyn-taroni · 2024-08-01T14:15:56Z

analyses/cell-type-wilms-tumor-06/Dockerfile

+# pull base image
+FROM bioconductor/tidyverse:3.19


I had a problem building this locally, but it seems that the base image might be the issue. I will need to investigate a little more.

I changed it to this:

# Base image on the Bioconductor 3.19 image
FROM bioconductor/r-ver:3.19

and built it locally like this:

podman buildx build . -t openscpca/cell-type-wilms-tumor-06:latest --platform linux/amd64

and it works :)

I am building locally to make some tested recommendations regarding what to do with run.sh, but it is taking a while, so I'll post an update as soon as possible!

Thank you that would be great!

I am initializing an renv environment from the current R session so that I can then simplify the Dockerfile using the renv.lock file (https://openscpca.readthedocs.io/en/latest/ensuring-repro/docker/docker-images/)

renv::init() is taking a while... I'll continue tomorrow :)

I had trouble accessing RStudio on the image I built locally. I'm attempting to isolate the problem... but that unfortunately requires rebuilding things 😅 ⏳

Thank you for being on it :) I am having trouble implementing the renv environment and switching to the bioconductor/r-ver:3.19 base image... So far I can only open RStudio using the following Dockerfile (just a bit "cleaned" compared to the PR).

# Base image on the Bioconductor 3.19 image
FROM bioconductor/tidyverse:3.19

# Set global R options
RUN echo "options(repos = 'https://cloud.r-project.org')" > $(R --no-echo --no-save -e "cat(Sys.getenv('R_HOME'))")/etc/Rprofile.site
ENV RETICULATE_MINICONDA_ENABLED=FALSE

RUN R --no-echo --no-restore --no-save -e "install.packages('remotes')"
RUN R -e "devtools::install_github('enblacar/SCpubr')"
RUN R -e "remotes::install_github('satijalab/seurat', 'seurat5', quiet = TRUE)" # this also install patchwork (and others)
RUN R -e "remotes::install_github('satijalab/azimuth', quiet = TRUE)" # this also install SingleCellExperiment, DT (and others)
RUN R -e "remotes::install_github('cancerbits/DElegate')"
RUN R -e "install.packages('viridis')"
RUN R -e "install.packages('ggplotify')"
RUN R -e "BiocManager::install('edgeR')"

# make sure all R related binaries are in PATH in case we want to call them directly
ENV PATH ${R_HOME}/bin:$PATH

@jaclyn-taroni FYI, I removed from run.sh and the config file the unnecessary volumes and data paths that are specific to our group. I realized they prevent the Docker image from running if they are not defined!

Just in case it can help.

Last comment of the week ;)

I added the renv.lock file from the RStudio session of the Docker image I am running on my machine.

I am now trying to use it to build the image as described here (https://openscpca.readthedocs.io/en/latest/ensuring-repro/docker/docker-images/), but the build is full of errors because the BiocManager version does not match the Bioconductor version (as described in rstudio/renv#517). It will take some time to fine-tune ⏳

But I think it's worth trying to have the Docker image in this format, then it might be easier to share/reproduce? What do you think @jaclyn-taroni ?

I agree that our end goal should be to use renv as part of the Docker build process, but I don't know if that's necessarily where we need to start. I think it's fine to start installing packages one-by-one in the Dockerfile, and then view renv as a feature enhancement!

I am specifically struggling with what base image should be used right now.

I expect folks working on this project (including all of our staff at the Data Lab) to be on Macs with Apple Silicon.

My understanding is that you want to be able to develop using the RStudio Server IDE from within the running container. This is often part of my workflow, and I expect many other project participants might have this use case! (That is to say, I think we are bumping into a problem that will come up again and again in the project...)

However, I'm not sure we can use bioconductor/tidyverse:3.19 as the base image and have it play nicely with multiple architectures (e.g., if someone using an ARM machine wants to build and use it locally). That's because I don't think rocker/tidyverse, which I understand to be the base of bioconductor/tidyverse, supports ARM: rocker-org/rocker-versioned2#830

So, we might instead want to use the bioconductor_docker as the base:

# pull base image
FROM bioconductor/bioconductor_docker:3.19

And until we implement renv, I expect we can install the Tidyverse packages the same way suggested in that issue ☝🏻

# pull base image
FROM bioconductor/bioconductor_docker:3.19

# Adds tidyverse packages & devtools
RUN /rocker_scripts/install_tidyverse.sh

Then I believe we'd have an image we can build and push to Elastic Container Registry using GitHub Actions that can also be built and used locally on ARM machines. This compatibility seems like a good goal to me.

I would have liked to test this all conclusively, but installing Azimuth is taking a very long time for me 😅 Appreciate your patience, @maud-p!

maud-p · 2024-08-01T18:56:43Z

Hi @maud-p - I have to head to some meetings, so I will return the feedback I have so far here. 😄

Some high-level thoughts:

I would move to use the data download script as part of your development process because I think it will make it easier for other people (e.g., reviewers) to run your code over the project's lifecycle.

Were you able to build this Docker image successfully locally? If so, what kind of machine are you on? I ran into a problem locally, and I think your answers might help us narrow down what's going on here.

Please let us know if you have any questions. Thank you!

Thank you for your feedback @jaclyn-taroni ! I will try to answer.

I will try to move to the data download script, thanks for the advice below!
I built this Docker image on my machine, which is our group's compute server. When I log in, it says:

"Welcome to Ubuntu 22.04.2 LTS (GNU/Linux 5.15.0-101-generic x86_64)"

System load: 0.79736328125 Processes: 1245
Usage of /: 44.1% of 196.30GB Users logged in: 0
Memory usage: 25%
Swap usage: 51%

I used podman to build the image:
podman build -t cancerbits/dockr:maudp_ScPCAOpen_podman -f Dockerfile.Dockerfile .

I will try to work on the improvements you suggested below for the Dockerfile; they might also solve the issue.


jaclyn-taroni

@maud-p, this is very close! 🎉

I left a few comments on the README about the marker sets:

For some of the genes included, I might expect it to be helpful to examine altered expression patterns/programs rather than individual gene expression. You do not need to take any action besides replying with what you think about those ideas—these are just items for scientific discussion and for us to consider moving forward!
I had a question about why THY1 was included vs. other markers based on one publication.
I had a problem with one of the DOIs that’d be good to check out before merging.

Regarding how you need to run the Docker image:

I’d rename the run.sh to be more general and to accept the password as an argument.
I’d write a section in the README that explains how you use the script to run the container on your own system.

We now know that the Docker image can be built because of the workflow we added! ✅ I link to the successful run in one of my comments so you can check it out.

I also wanted to mention that your decision to close #680, as mentioned in maud-p#3 (comment), and re-open a new pull request makes sense to me!

jaclyn-taroni · 2024-08-04T13:46:12Z

analyses/cell-type-wilms-tumor-06/README.md

+
+  |gene_symbol|ENSEMBL_ID|cell_class|cell_type|DOI|comment|
+  |---|---|---|---|---|---|
+  |WT1|ENSG00000184937|malignant|cancer_cell|10.1242/dev.153163|Tumor_suppressor_WT1_is_lost_in_some_WT_cells|


I think of sc/snRNA-seq as better suited to picking up overexpression than loss. I know you've worked on these data before, so I'm just curious if you expect or have observed differences in WT1 expression in the cancer cells. Although, I see that you put:

WT1 (?)

in #635, so maybe we don't know yet!

I added the (?) for the reason you mentioned: since we are looking for a loss of function, I am not sure that we can really use it for annotation.

However, about 20% of Wilms tumors would have an impairment of WT1, and I would expect WT1-mutated Wilms tumors to have a specific transcriptional program. At the final step, when the 40 samples are integrated together, I would expect a WT1-negative cluster. Also, the normal kidney should be WT1 positive. So I think looking at WT1 would make sense in this last step.

jaclyn-taroni · 2024-08-04T13:50:13Z

analyses/cell-type-wilms-tumor-06/README.md

+  |---|---|---|---|---|---|
+  |WT1|ENSG00000184937|malignant|cancer_cell|10.1242/dev.153163|Tumor_suppressor_WT1_is_lost_in_some_WT_cells|
+  |IGF2|ENSG00000167244|malignant|cancer_cell|10.1038/ng1293-408|NA|
+  |TP53|ENSG00000141510|malignant|anaplastic|10.1158/1078-0432.CCR-16-0985|Might_also_be_in_small_non_anaplastic_subset|


From the abstract of this publication, I wonder if looking at TP53 loss/activation at a pathway level would be interesting 🤔

This is a great idea, I will implement this in the next PR:

Differential expression analysis using DElegate (pseudobulk-based) FindAllMarkers2 to find markers for each cluster

Enrichment analysis of the marker genes. I was thinking of using the hallmark gene sets and MSigDB C8, which I found the most informative. The hallmark gene sets might help us define specific cancer clusters (proliferation +++, DNA damage/repair +++, hallmark TP53 in anaplastic Wilms tumor), and MSigDB C8 can add another level of information regarding cell types. Can you think of other gene sets?

https://www.gsea-msigdb.org/gsea/msigdb/human/genesets.jsp?collection=H
https://www.gsea-msigdb.org/gsea/msigdb/human/genesets.jsp?collection=C8 : 50 gene sets kidney related
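
For readers unfamiliar with the enrichment step proposed above, the sketch below shows one common way to score the overlap between a cluster's marker genes and a gene set: a one-sided hypergeometric over-representation test. It is a toy illustration in plain Python, not the method planned for the next pull request and not the DElegate or MSigDB tooling. The function name and all numbers are made up for the example.

```python
from math import comb


def overrepresentation_p_value(overlap, gene_set_size, n_markers, n_background):
    """P(X >= overlap) under a hypergeometric null (one-sided over-representation test).

    overlap       -- number of marker genes that belong to the gene set
    gene_set_size -- number of gene set members present in the background
    n_markers     -- number of marker genes tested for the cluster
    n_background  -- number of genes in the background (e.g., all genes tested)
    """
    upper = min(gene_set_size, n_markers)
    # Count the ways to draw n_markers genes with at least `overlap` gene set members.
    favorable = sum(
        comb(gene_set_size, i) * comb(n_background - gene_set_size, n_markers - i)
        for i in range(overlap, upper + 1)
    )
    return favorable / comb(n_background, n_markers)


# Toy numbers: 12 of 100 cluster markers fall in a 200-gene set, out of 20000 background genes.
print(overrepresentation_p_value(12, 200, 100, 20000))
```

A small p-value means the markers overlap the gene set more than expected by chance. In practice the p-values would also need correction for the number of gene sets tested.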

jaclyn-taroni · 2024-08-04T13:54:12Z

analyses/cell-type-wilms-tumor-06/README.md

+  |SIX1|ENSG00000126778|malignant|blastema|10.1016/j.ccell.2015.01.002|NA|
+  |SIX2|ENSG00000170577|malignant|blastema|10.1016/j.ccell.2015.01.002|NA|


Similar to my TP53 comment – from a quick look at this publication, I wonder if looking at the altered expression patterns rather than the individual genes could be helpful.

The MSigDB C3/MIR collection includes gene sets for DICER and DROSHA, so I could also try running enrichment with those gene sets.

https://www.gsea-msigdb.org/gsea/msigdb/human/genesets.jsp?collection=MIR

jaclyn-taroni · 2024-08-04T14:00:54Z

analyses/cell-type-wilms-tumor-06/README.md

+  |NCAM1|ENSG00000149294|malignant|blastema|10.1016/j.stemcr.2014.05.013|might_also_be_expressed_in_non_malignant|
+  |PODXL|ENSG00000128567|non-malignant|podocyte|10.1016/j.stem.2019.06.009|NA|
+  |COL6A3|ENSG00000163359|malignant|mesenchymal|10.2147/OTT.S256654|might_also_be_expressed_in_non_malignant_stroma|
+  |THY1|ENSG00000154096|malignant|mesenchymal|10.1093/hmg/ddq042|might_also_be_expressed_in_non_malignant_stroma|


I'm curious why this gene differs from some of the ones outlined in the abstract and is included. I suppose I would not expect the ones outlined in the abstract to be specific to malignant cells necessarily.

Unfortunately, I don't know of a gene that is specific to mesenchymal Wilms tumor cells... For some colleagues who wanted to characterize CAFs, the best approach I found was to:

identify stromal cells and

based on the inferred CNV, distinguish normal from cancer stromal cells at the cluster level.

I don't think this is perfect, but at least we should get clusters enriched in the target population.

Stromal cells are really easily identified, based either on a few markers or on label transfer from the fetal kidney reference. They often form a single cluster. This is why I didn't spend too much time adding marker genes for them, but I can add a few more mesenchymal markers and references for correctness :)

And THY1 is generally nicely expressed, if I remember correctly!

I'd defer to you since you've spent more time thinking about this problem 😄

You don't need to add them – we'll have a record of this conversation!

jaclyn-taroni · 2024-08-04T14:02:26Z

analyses/cell-type-wilms-tumor-06/README.md

+|alteration|gain_loss|cell_class|cell_type|DOI|PMID|comment
+|---|---|---|---|---|---|---|
+|11p13|loss|malignant|NA|10.1242/dev.153163|NA|NA|
+|11p15|loss|malignant|NA|10.1128/mcb.9.4.1799|NA|NA|


I get a 404 at: https://doi.org/10.1128/mcb.9.4.1799 – perhaps a typo in the DOI?

Oh sorry, the complete DOI is https://doi.org/10.1128/mcb.9.4.1799-1803.1989

It should be corrected in the README and the CSV file :)

jaclyn-taroni · 2024-08-05T11:04:30Z

analyses/cell-type-wilms-tumor-06/run.sh

Can we rename this to run-podman-internal.sh, please? run.sh might imply that this is how you run all steps in the module, so I think this name is more descriptive.

Did you need to add this file with force, i.e., git add --force analyses/cell-type-wilms-tumor-06/run.sh? I would expect this file to be ignored, which is why I ask.

Generally, you want to avoid using --force. There are many things in the .gitignore in the root of the repository to prevent us from committing things we want to put in S3 instead, for example, and --force allows you to avoid the guardrails.

File renamed :)

Actually, I think it does not need to be in this GitHub folder. I might have been a bit too enthusiastic about getting it to run and quickly saved the few lines of code in a run.sh file. But I could save it somewhere else, for myself only.

To be honest, I quickly added this file through the browser interface, using "Add file". I didn't know about --force; it might be the default via the interface. I'll be more careful now!

jaclyn-taroni · 2024-08-05T11:08:57Z

analyses/cell-type-wilms-tumor-06/run.sh

@@ -0,0 +1,21 @@
+# ids defined in image for the rstudio user, if not define as such, it is not possible to login to RStudio


I have not tested this (and probably could not satisfactorily without access to your system), so please test it on your end. I think it would be better to pass the password as an argument:

Suggested change

# ids defined in image for the rstudio user, if not define as such, it is not possible to login to RStudio

PASSWORD="$1"

# ids defined in image for the rstudio user, if not define as such, it is not possible to login to RStudio

I will add what I think needs to happen in line 18 for this to work.

jaclyn-taroni · 2024-08-05T11:09:09Z

analyses/cell-type-wilms-tumor-06/run.sh

+  --gidmap $gid:0:1 --gidmap 0:1:$gid --gidmap $(($gid+1)):$(($gid+1)):$(($subgidSize-$gid)) \
+  --group-add=keep-groups \
+  --volume=$PWD/:/home/rstudio \
+  -e PASSWORD=wordpass \


Suggested change

-e PASSWORD=wordpass \

-e PASSWORD=$PASSWORD \

jaclyn-taroni · 2024-08-05T11:13:27Z

analyses/cell-type-wilms-tumor-06/README.md

+
+If you are on a Mac with an M series chip, you will not be able to use RStudio Server if you are using a `linux/amd64` or `linux/x86_84` (like the ones available from ECR).
+You must build an ARM image locally to be able to use RStudio Server within the container.
+


Can you add an H4 section here on Halbritter lab internal development please?

I'd expect that would include how to run the script (this is taking into account some review feedback):

./run-podman-internal.sh {PASSWORD}

This hopefully helps you with your own development if, for example, you go on vacation for two weeks and come back to this! 😄 But it also helps others understand that this bash script isn't for their use.

I'll have to ask for some help in the Halbritter lab; I've added it to my to-do list ;)


jaclyn-taroni

@maud-p - I think this is good to go in as is 👍🏻

Congratulations on your first contribution 🥳

If you get more information about running this internally or need to make refinements, it's fine to add that in a later pull request!

I'm going to merge the AlexsLemonade:main branch into this one to make sure it is up-to-date, which is a prerequisite for merging this into AlexsLemonade:main.

Once this goes in, I recommend creating a new branch to add your clustering analyses from AlexsLemonade:main.

Here are some instructions using GitKraken: https://openscpca.readthedocs.io/en/latest/contributing-to-analyses/working-with-git/working-with-branches/#creating-a-feature-branch-in-gitkraken

Side note: If you haven't checked out GitKraken yet, you can use the free version with this project: https://www.gitkraken.com/

I'm not 100% sure how feasible it is to use GitKraken with your lab's setup, but I figured I'd let you know about it.

If you can't use GitKraken, those instructions are less helpful 😅 so if you have any questions, ping me and/or @jashapiro + @sjspielman (Data Lab scientists) in a comment on #679

Thank you, and congratulations again!

maud-p · 2024-08-05T13:46:14Z

Thank you very much!!! I am really happy about it and looking forward to the next step of the analysis :)

Thank you for your help setting up all of this!

maud-p and others added 12 commits July 23, 2024 11:44

Create download-data.py

c1071c3

create module 1

81c1712

Update Dockerfile

caddbbf

Update README.md

c29e1cb

Update README.md

999d2f2

Add files via upload

671d4d7

Update GeneticAlterations_metadata.csv

ef22cb6

Update README.md

ab263d3

Update README.md

201bd53

Update README.md

3f7d64b

Update README.md

81e1478

changes to issue AlexsLemonade#671

b754e5d

maud-p requested a review from jaclyn-taroni as a code owner August 1, 2024 12:34

jaclyn-taroni mentioned this pull request Aug 1, 2024

Metadata file: compilation of a metadata file of marker genes for expected cell types that will be used for validation at a later step #672

Closed

7 tasks

jaclyn-taroni reviewed Aug 1, 2024


maud-p and others added 13 commits August 1, 2024 21:24

Update analyses/cell-type-wilms-tumor-06/Dockerfile

b2a1167

Co-authored-by: Jaclyn Taroni

Update analyses/cell-type-wilms-tumor-06/Dockerfile

b096507

Co-authored-by: Jaclyn Taroni

Update analyses/cell-type-wilms-tumor-06/README.md

89d8299

Co-authored-by: Jaclyn Taroni

Update analyses/cell-type-wilms-tumor-06/README.md

e25c391

Co-authored-by: Jaclyn Taroni

Update analyses/cell-type-wilms-tumor-06/README.md

97799c2

Co-authored-by: Jaclyn Taroni

Update analyses/cell-type-wilms-tumor-06/README.md

85d4a09

Co-authored-by: Jaclyn Taroni

Update Dockerfile

09e102b

change inpu data

09de604

update_data_download_steps

fa2d6a5

add sample metadata information

38c1ae2

remove unused mount directories

cbd6be3

remove unused volumes

835b0c5

adding renv.lock file

3adc01b

jaclyn-taroni added 15 commits August 3, 2024 11:53

Remove lock file (potentially temporarily!)

9a2f1cf

Base image with RStudio that's compatible with ARM + renv

6862fcc

Commit output from renv::init()

83a6fe5

Add an empty dependencies.R file

c4ec00a

Add tidyverse to dependencies

4973f16

Update lockfile after library(tidyverse)

230bf62

Uncomment renv steps from Dockerfile

b4eebec

Snapshot with some single-cell packages

c4e64c1

renv::update()

46c7e8b

Stub in run workflow; don't uncomment on PR

ebd2bf1

Add Docker build and push for Wilms cell typing module

41bf440

Add to list of modules with Docker images

afaff45

Document experience with Docker and renv

d3bce0e

Remove files specific to Halbritter lab setup and stop tracking

dd17e27

Add newline

650df7b

jaclyn-taroni mentioned this pull request Aug 4, 2024

Build and test Docker image using bioconductor/bioconductor_docker:3.19 and renv maud-p/OpenScPCA-analysis#3

Merged

maud-p added 2 commits August 4, 2024 19:26

Merge pull request #3 from jaclyn-taroni/jaclyn-taroni/wilms-06-build…

47a84bc

…-test Build and test Docker image using `bioconductor/bioconductor_docker:3.19` and `renv`

Create run.sh

dca3199

jaclyn-taroni reviewed Aug 5, 2024


maud-p and others added 6 commits August 5, 2024 14:11

DOI Correction

410ff39

DOI Correction

4d6e232

Rename run.sh to run-podman-internal.sh

5559521

Update run-podman-internal.sh

edd0e01

Update analyses/cell-type-wilms-tumor-06/README.md

3f6439b

Co-authored-by: Jaclyn Taroni

Update README.md halbritter specific section to run container

405730a

jaclyn-taroni approved these changes Aug 5, 2024


Merge branch 'main' into start-wilms-analysis

a71de56

jaclyn-taroni merged commit c109d48 into AlexsLemonade:main Aug 5, 2024
4 checks passed

maud-p deleted the start-wilms-analysis branch August 5, 2024 18:24
