Script to run AUCell on all samples using EWS-FLI1 high/low gene signatures #998
+356
−4
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Purpose/implementation Section
Please link to the GitHub issue that this pull request addresses.
Closes #985
What is the goal of this pull request?
Here I am adding two new scripts, one to run
AUCell
on a single SCE object using a set of custom gene signatures for defining tumor cell states (mainly EWS-FLI1 high and EWS-FLI1 low) and a second script to run the first script on all samples in the Ewing project. Ultimately, we want to use the results from AUCell to help label cells based on the specific cell state they are in.I am using the two custom gene sets that we have stored in
references/gene_signatures
that are marker gene lists for EWS-FLI1 high and low along with a set of MSigDB gene sets that we have identified from the literature as being potentially useful. It's pretty quick to runAUCell
so I figured it wouldn't hurt to just use all the gene sets in that list.I did not include genes in our two marker gene lists,
visser-all-marker-genes.tsv
andtumor-cell-state-markers.tsv
since the gene lists for each cell type there are quite small and would probably skew the AUC results.Briefly describe the general approach you took to achieve this goal.
aucMaxRank
, which I set to the default of 0.01.AUCell
. I don't know that we will use it, but I think it could be helpful when we go to plot this data.If known, do you anticipate filing additional pull requests to complete this analysis module?
Yes see #993
Results
What is the name of your results bucket on S3?
s3://researcher-211125375652-us-east-2/cell-type-ewings/aucell-ews-signatures
What types of results does your code produce (e.g., table, figure)?
TSV files with the AUC values
What is your summary of the results?
Coming next!
Author checklists
Check all those that apply.
Note that you may find it easier to check off these items after the pull request is actually filed.
Analysis module and review
README.md
has been updated to reflect code changes in this pull request.Reproducibility checklist
Dockerfile
.environment.yml
file.renv.lock
file.