Question regarding custom gene set #747

Closed
Krithika-Bhuvan opened this issue Dec 13, 2024 · 9 comments

Comments

@Krithika-Bhuvan

Hello, I'm running GSEA with a custom gene set collection that contains 8 gene sets; let's call them geneset1, geneset2, ..., geneset8. I saved them as GMT files and was able to run GSEA using the function call shown at the bottom.

For most of my DEG lists the code worked perfectly. For two of them, the code ran without errors, but the result table contained only 5 of the 8 gene sets.

The investigator would like to see the results for all of the 8 gene sets. I've tried changing the settings of minGSSize, maxGSSize, pvalueCutoff and eps, but I'm not able to get the p-value and other information to show up for the missing gene sets.

Can you give some insight into why the rest are missing? Are they all NA values, and if so, why?

set.seed(1234)
gsea_results <- GSEA(
  geneList = lfc_vector, # Ordered ranked gene list
  minGSSize = 2, # Minimum gene set size
  maxGSSize = 2000, # Maximum gene set size
  pvalueCutoff = 1, # p-value cutoff
  eps = 0, # Boundary for calculating the p value
  seed = TRUE, # Set seed to make results reproducible
  pAdjustMethod = "BH", # Benjamini-Hochberg correction
  TERM2GENE = GeneSets$names, #custom gene set
  by = "fgsea"
)

Logs when running

using 'fgsea' for GSEA analysis, please cite Korotkevich et al (2019).

preparing geneSet collections...
GSEA analysis...
Warning in preparePathwaysAndStats(pathways, stats, minSize, maxSize, gseaParam,  :
  There are ties in the preranked stats (0.06% of the list).
The order of those tied genes will be arbitrary, which may produce unexpected results.
Warning in fgseaMultilevel(pathways = pathways, stats = stats, minSize = minSize,  :
  There were 2 pathways for which P-values were not calculated properly due to unbalanced (positive and negative) gene-level statistic values. For such pathways pval, padj, NES, log2err are set to NA. You can try to increase the value of the argument nPermSimple (for example set it nPermSimple = 100000)
leading edge analysis...
done...
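
The first warning in the log concerns ties in the ranked statistics. A minimal base-R sketch for listing which genes share a value is given here; the `stats` vector, its gene names and its values are made up for illustration, and the fraction computed is not necessarily the percentage that fgsea reports.

```r
# Hypothetical ranked statistics, named by gene (illustrative values only)
stats <- c(GENE_A = 2.1, GENE_B = 0.5, GENE_C = 0.5,
           GENE_D = -1.3, GENE_E = -1.3, GENE_F = -2.0)

# Mark every element whose value occurs more than once
tied <- duplicated(stats) | duplicated(stats, fromLast = TRUE)

# Genes involved in ties, and the share of the list they make up
tied_genes <- names(stats)[tied]
tied_fraction <- mean(tied)

print(tied_genes)
print(tied_fraction)
```

Inspecting the tied genes can help decide whether a secondary ranking criterion is worth adding before running GSEA.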
@guidohooiveld

guidohooiveld commented Dec 16, 2024

Some remarks:

It is not clear to me what you mean by the NA values. Could you post the full, exact code and output?

You mention lists of DEG (differentially expressed genes), but GSEA requires as input all genes measured in an experiment, thus also including the non-regulated genes. If you would like to analyze a subset of genes (your DEG?), then you should perform an over-representation analysis (ORA).

Also note that your issue cannot be reproduced by others, because of the lack of the input files (i.e. reproducible example).
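
To illustrate the remark on ORA above: over-representation analysis is commonly based on a hypergeometric test on counts, whereas GSEA works on the full ranked list. The sketch below uses base R only; all counts are hypothetical, and it is not claimed to reproduce the exact output of any particular package.

```r
# Hypothetical counts (illustrative only)
universe_size <- 13000  # all genes measured in the experiment
set_size      <- 100    # members of one gene set that were measured
deg_count     <- 500    # differentially expressed genes (the subset of interest)
overlap       <- 12     # DEGs that are also members of the gene set

# Probability of seeing at least `overlap` set members among the DEGs
# when DEGs are drawn at random from the universe
p_ora <- phyper(overlap - 1, set_size, universe_size - set_size,
                deg_count, lower.tail = FALSE)
print(p_ora)
```

Note that the test depends on the universe of measured genes, which is why the non-regulated genes matter for ORA as well.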

@Krithika-Bhuvan
Author

Hello. Thank you for responding. I don't see NA values in my data; I'm trying to figure out the underlying reason.

Yes, I am using the all genes measured. I have several GSEA to run, and I'm using the fold change from each group comparison as the ranking criteria. That's what I was referring to.

Let me see how to make a reproducible example

@guidohooiveld

For starters, it would already be helpful if you could show your full code, and the output in which the NA values are present.

@Krithika-Bhuvan
Author

To clarify my question. There is no error or NA in the input or output.
I created a custom gene set collection that contains 8 gene sets; let's call them geneset1, geneset2, ..., geneset8. I saved them as GMT files and was able to run GSEA.

The result table only had 5 of the 8 gene sets. The investigator would like to see the GSEA result metrics for all 8 gene sets. Can you give some insight into why the result metrics for 3 of the gene sets are missing? What criteria are used?

I will get the full code and data in the meantime

@Krithika-Bhuvan
Author

Steps

  • Import the filtered list of about 15,000 genes (after removing the lowly expressed genes), including the log2 fold change
    • Convert Ensembl IDs to gene symbols
    • Check for duplicates: for all duplicate genes, take the largest fold change value
    • Define the ranking criterion: here it is the log fold change
    • Save the log fold change as a vector. Rdata object attached.
  • Save the custom gene sets as a GMT file. A GMT file with a total of 8 custom gene sets is shared here.
  • Run GSEA
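
The de-duplication and ranking steps above can be sketched in base R as follows. The table `de_table`, its column names and its values are hypothetical, and "largest fold change" is assumed here to mean the largest signed value per gene symbol.

```r
# Hypothetical input table (gene symbols and log2 fold changes are made up)
de_table <- data.frame(
  symbol = c("GENE_A", "GENE_B", "GENE_A", "GENE_C", "GENE_B"),
  log2FC = c(1.2, -0.4, 2.5, 0.3, -1.1),
  stringsAsFactors = FALSE
)

# For duplicated symbols keep the largest log2FC
# (assumption: largest signed value, not largest absolute value)
lfc_by_gene <- tapply(de_table$log2FC, de_table$symbol, max)

# Named numeric vector sorted in decreasing order, the shape of a ranked list
lfc_sketch <- sort(setNames(as.numeric(lfc_by_gene), names(lfc_by_gene)),
                   decreasing = TRUE)
print(lfc_sketch)
```

If the largest absolute change is wanted instead, the `max` step would need to be replaced by a function that selects the value with the greatest magnitude.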
#######  load package and set paths
library(clusterProfiler) # provides read.gmt() and GSEA()
in_path <- "enter path"
gmt_path <- "enter path"

# import log fold change vector
load(paste0(in_path, "/lfc_vector_RH302_P10-control.Rdata")) ### CHANGE
head(lfc_vector)

# read gmt file
gmt_files <- list.files(path = gmt_path, pattern = '\\.gmt$', full.names = TRUE)
gmt_files[1] 

GeneSets = vector(mode='list')
GeneSets$pax3foxo <- read.gmt(gmt_files[1] )
x <- read.gmt(gmt_files[1] )

# run gsea
set.seed(1234)
gsea_results <- GSEA(
  geneList = lfc_vector, # Ordered ranked gene list
  minGSSize = 2, # Minimum gene set size
  maxGSSize = 2000, # Maximum gene set size
  pvalueCutoff = 2, # p-value cutoff
  eps = 0, # Boundary for calculating the p value
  seed = TRUE, # Set seed to make results reproducible
  pAdjustMethod = "BH", # Benjamini-Hochberg correction
  TERM2GENE = GeneSets$pax3foxo,
  by = "fgsea"
)

# save as data frame
gsea_result_df <- data.frame(gsea_results@result)

head(gsea_result_df)
  • The result contains only 5 of the 8 gene sets
  • I'm just trying to understand what happened to the missing 3. Did I make a mistake?

Attached zip file
Archive.zip

@guidohooiveld

guidohooiveld commented Dec 17, 2024

Thanks for providing the example files. By using these I could reproduce your 'issue', which IMO is actually the intended/expected behavior.

Let me explain:

Using the provided files I ran your code. I am only showing the output of the last step:

> # run gsea
> set.seed(1234)
> gsea_results <- GSEA(
+   geneList = lfc_vector, # Ordered ranked gene list
+   minGSSize = 2, # Minimum gene set size
+   maxGSSize = 2000, # Maximum gene set size
+   pvalueCutoff = 2, # p-value cutoff
+   eps = 0, # Boundary for calculating the p value
+   seed = TRUE, # Set seed to make results reproducible
+   pAdjustMethod = "BH", # Benjamini-Hochberg correction
+   TERM2GENE = GeneSets$pax3foxo,
+   by = "fgsea"
+ )
using 'fgsea' for GSEA analysis, please cite Korotkevich et al (2019).

preparing geneSet collections...
GSEA analysis...
leading edge analysis...
done...
Warning messages:
1: In preparePathwaysAndStats(pathways, stats, minSize, maxSize, gseaParam,  :
  There are ties in the preranked stats (0.07% of the list).
The order of those tied genes will be arbitrary, which may produce unexpected results.
2: In fgseaMultilevel(pathways = pathways, stats = stats, minSize = minSize,  :
  There were 2 pathways for which P-values were not calculated properly due to unbalanced (positive and negative) gene-level statistic values. For such pathways pval, padj, NES, log2err are set to NA. You can try to increase the value of the argument nPermSimple (for example set it nPermSimple = 10000)
> 
> 
> gsea_results
#
# Gene Set Enrichment Analysis
#
#...@organism    UNKNOWN 
#...@setType     UNKNOWN 
#...@geneList    Named num [1:13028] 6.63 6.27 5.36 5.32 5.31 ...
 - attr(*, "names")= chr [1:13028] "RPS15P2" "RPL35P1" "RPL7AP50" "CGB8" ...
#...nPerm        
#...pvalues adjusted by 'BH' with cutoff <2 
#...5 enriched terms found
'data.frame':   5 obs. of  11 variables:
 $ ID             : chr  "GENESET4_TABLE3_TF" "GENESET6_TABLE4_FP_RMS_SE_MOTIFS" "GENESET8_TABLE6_MOLECSENS" "GENESET1_TABLE1" ...
 $ Description    : chr  "GENESET4_TABLE3_TF" "GENESET6_TABLE4_FP_RMS_SE_MOTIFS" "GENESET8_TABLE6_MOLECSENS" "GENESET1_TABLE1" ...
 $ setSize        : int  138 45 238 58 247
 $ enrichmentScore: num  -0.311 -0.233 0.225 0.221 0.163
 $ NES            : num  -1.417 -0.865 0.734 0.639 0.534
 $ pvalue         : num  0.00652 0.70659 0.93706 0.94651 1
 $ p.adjust       : num  0.0326 1 1 1 1
 $ qvalue         : num  0.0275 0.8421 0.8421 0.8421 0.8421
 $ rank           : num  2780 780 3206 2748 2462
 $ leading_edge   : chr  "tags=33%, list=21%, signal=26%" "tags=13%, list=6%, signal=13%" "tags=25%, list=25%, signal=19%" "tags=26%, list=21%, signal=20%" ...
 $ core_enrichment: chr  "ETV2/SNAPC2/MSC/HIC2/TCF7L1/TRMT1/HDAC2/HES6/ZNF444/ZNF696/TOX3/ETV4/PKNOX2/RUNX2/NR2E3/PCGF2/TCF3/MYOG/JARID2/"| __truncated__ "ZNF143/TEAD4/MYCN/MYOD1/MYC/NRF1" "OPRL1/DRD4/MAPK11/HPD/ADORA2B/ADRB2/TRPV4/TLR4/PTK6/CHRNA7/SERPINE1/ITK/NKAIN1/FGF2/SCN9A/RRM2/CCL2/FAAH/MAP3K8"| __truncated__ "DIRC3/BCL11A/SLC46A3/ACVR1C/MTUS2/ALOX5AP/EPAS1/DCLK1/TNFRSF19/HS1BP3/OXER1/SERTAD2/INO80D/MAP3K2/KLF5" ...
#...Citation
S Xu, E Hu, Y Cai, Z Xie, X Luo, L Zhan, W Tang, Q Wang, B Liu, R Wang, W Xie, T Wu, L Xie, G Yu. Using clusterProfiler to characterize multiomics data. Nature Protocols. 2024, doi:10.1038/s41596-024-01020-z 

> 

Indeed, results for only 5 gene sets (out of 8) are reported...

How come?

Since no significance cutoff is applied (because of the setting pvalueCutoff = 2), a likely cause is the gene sets used as input.
Indeed, when having a closer look at these sets, it becomes clear that one of them (GENESET3_TABLE2_RH4) has a size larger than the cutoff imposed (maxGSSize = 2000), even after filtering the genes that are not present in the ranked input (lfc_vector)!

Some code to show this:

> ## convert gene set data frame into list
> ## one list element corresponds to a gene set
> genesets.list <- split(GeneSets$pax3foxo$gene, GeneSets$pax3foxo$term)
> 
> ## check
> str(genesets.list)
List of 8
 $ GENESET1_TABLE1                 : chr [1:9191] "ATOH8" "ITSN2" "HS1BP3" "DARS" ...
 $ GENESET2_TABLE2_P3F             : chr [1:9191] "PON2" "GNAS" "PEG10" "SFMBT2" ...
 $ GENESET3_TABLE2_RH4             : chr [1:9192] "NMNAT1" "PALMD" "FRRS1" "AGL" ...
 $ GENESET4_TABLE3_TF              : chr [1:9191] "MYOG" "MYOD1" "MYCN" "SOX8" ...
 $ GENESET5_TABLE3_RMS_SE_TF       : chr [1:9191] "MYOD1" "MYOG" "FOXO1" "TGIF1" ...
 $ GENESET6_TABLE4_FP_RMS_SE_MOTIFS: chr [1:9191] "AHR" "AMYB" "AP" "AP-1" ...
 $ GENESET7_TABLE5_P3F_HSMM        : chr [1:9191] "ARID1A" "ASS1" "CITED2" "CREBBP" ...
 $ GENESET8_TABLE6_MOLECSENS       : chr [1:9191] "HDAC1" "BRD4" "HDAC6" "HDAC4" ...
> 
> ## note that all elements (gene sets) have same size (9191).
> ## this may be caused by the use of Excel when making the gmt file, 
> ## since when saved empty values are included, so their length is
> ## equal to the longest element (= largest gene set).
> ## so remove these empty values to get the 'real size'
> 
> ## to illustrate: this is indeed OK...
> head(genesets.list[["GENESET1_TABLE1"]])
[1] "ATOH8"  "ITSN2"  "HS1BP3" "DARS"   "SACS"   "MYCNOS"
> 
> ## ... but note the empty values present at the end of the gene sets
> tail(genesets.list[["GENESET1_TABLE1"]])
[1] "" "" "" "" "" ""
> 
> ## remove empty values
> genesets.list <- lapply(genesets.list, function(z){ z[!is.na(z) & z != ""]})
> 
> ## check
> str(genesets.list)
List of 8
 $ GENESET1_TABLE1                 : chr [1:143] "ATOH8" "ITSN2" "HS1BP3" "DARS" ...
 $ GENESET2_TABLE2_P3F             : chr [1:1010] "PON2" "GNAS" "PEG10" "SFMBT2" ...
 $ GENESET3_TABLE2_RH4             : chr [1:9192] "NMNAT1" "PALMD" "FRRS1" "AGL" ...
 $ GENESET4_TABLE3_TF              : chr [1:170] "MYOG" "MYOD1" "MYCN" "SOX8" ...
 $ GENESET5_TABLE3_RMS_SE_TF       : chr [1:291] "MYOD1" "MYOG" "FOXO1" "TGIF1" ...
 $ GENESET6_TABLE4_FP_RMS_SE_MOTIFS: chr [1:242] "AHR" "AMYB" "AP" "AP-1" ...
 $ GENESET7_TABLE5_P3F_HSMM        : chr [1:592] "ARID1A" "ASS1" "CITED2" "CREBBP" ...
 $ GENESET8_TABLE6_MOLECSENS       : chr [1:406] "HDAC1" "BRD4" "HDAC6" "HDAC4" ...
> 

Thus, only gene set GENESET3_TABLE2_RH4 really consists of 9192 genes; all others are much smaller.

Next check how many unique genes are actually present in the gene sets:

> sapply(genesets.list, function(x) sum( !duplicated(x) ))
                 GENESET1_TABLE1              GENESET2_TABLE2_P3F 
                             109                             1010 
             GENESET3_TABLE2_RH4               GENESET4_TABLE3_TF 
                            9096                              170 
       GENESET5_TABLE3_RMS_SE_TF GENESET6_TABLE4_FP_RMS_SE_MOTIFS 
                             291                              242 
        GENESET7_TABLE5_P3F_HSMM        GENESET8_TABLE6_MOLECSENS 
                             592                              406 
> 

This shows that e.g. GENESET1_TABLE1 consists of 109 unique genes, and 34 genes are thus duplicates!!
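
Since the gene sets are passed to GSEA as a two-column data frame, the empty and duplicated entries found above could also be removed from that data frame directly. A minimal base-R sketch, assuming the columns are named `term` and `gene` as in the object used in this thread; the toy table `t2g` and its contents are hypothetical.

```r
# Hypothetical TERM2GENE table showing both problems: empty and duplicated genes
t2g <- data.frame(
  term = c("SET_X", "SET_X", "SET_X", "SET_X", "SET_Y", "SET_Y"),
  gene = c("GENE_A", "GENE_B", "GENE_A", "", "GENE_C", ""),
  stringsAsFactors = FALSE
)

# Drop rows with a missing or empty gene symbol
t2g_clean <- t2g[!is.na(t2g$gene) & t2g$gene != "", ]

# Drop duplicated term/gene pairs
t2g_clean <- unique(t2g_clean)

# Number of genes left per gene set
print(table(t2g_clean$term))
```

Cleaning the table before the analysis makes the reported set sizes easier to compare with the original gene lists.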

How many of the unique genes in the gene sets are actually also measured in the (filtered) input data:

> ## first de-duplicate genesets.list
> dedupped <- sapply(genesets.list, unique)
> str(dedupped)
List of 8
 $ GENESET1_TABLE1                 : chr [1:109] "ATOH8" "ITSN2" "HS1BP3" "DARS" ...
 $ GENESET2_TABLE2_P3F             : chr [1:1010] "PON2" "GNAS" "PEG10" "SFMBT2" ...
 $ GENESET3_TABLE2_RH4             : chr [1:9096] "NMNAT1" "PALMD" "FRRS1" "AGL" ...
 $ GENESET4_TABLE3_TF              : chr [1:170] "MYOG" "MYOD1" "MYCN" "SOX8" ...
 $ GENESET5_TABLE3_RMS_SE_TF       : chr [1:291] "MYOD1" "MYOG" "FOXO1" "TGIF1" ...
 $ GENESET6_TABLE4_FP_RMS_SE_MOTIFS: chr [1:242] "AHR" "AMYB" "AP" "AP-1" ...
 $ GENESET7_TABLE5_P3F_HSMM        : chr [1:592] "ARID1A" "ASS1" "CITED2" "CREBBP" ...
 $ GENESET8_TABLE6_MOLECSENS       : chr [1:406] "HDAC1" "BRD4" "HDAC6" "HDAC4" ...
> 
>
> sapply(dedupped, function(x) sum(x %in% names(lfc_vector) ))
                 GENESET1_TABLE1              GENESET2_TABLE2_P3F 
                              58                              949 
             GENESET3_TABLE2_RH4               GENESET4_TABLE3_TF 
                            6191                              138 
       GENESET5_TABLE3_RMS_SE_TF GENESET6_TABLE4_FP_RMS_SE_MOTIFS 
                             247                               45 
        GENESET7_TABLE5_P3F_HSMM        GENESET8_TABLE6_MOLECSENS 
                             522                              238 
> 

The table above shows that many genes in the gene sets have not been 'measured' (i.e. are not present in lfc_vector), and these genes are therefore removed from the gene sets. For example, GENESET1_TABLE1 consists of 143 genes (of which 109 are unique), but only 58 of these are present in lfc_vector.
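
The checks above can be combined into one small helper that reports, per gene set, how many unique genes are present in the ranked list and whether that number falls within the size limits. This is only a sketch: the function name `effective_set_sizes`, its arguments and the toy data are hypothetical, and the inclusive comparison against the limits is an assumption about how the filter is applied.

```r
# Hypothetical helper: effective gene set sizes after removing empty,
# duplicated and unmeasured genes, compared against the size limits
effective_set_sizes <- function(term2gene, ranked, min_size, max_size) {
  genes <- term2gene$gene
  keep <- !is.na(genes) & genes != "" & genes %in% names(ranked)
  measured <- unique(term2gene[keep, c("term", "gene")])
  all_terms <- unique(as.character(term2gene$term))
  sizes <- table(factor(measured$term, levels = all_terms))
  data.frame(
    term = names(sizes),
    size = as.integer(sizes),
    # assumption: limits are inclusive
    within_limits = as.integer(sizes) >= min_size & as.integer(sizes) <= max_size,
    stringsAsFactors = FALSE
  )
}

# Toy data (made up): one small set and one set exceeding the maximum size
toy_t2g <- data.frame(
  term = c(rep("SMALL_SET", 3), rep("BIG_SET", 5)),
  gene = c("G1", "G2", "", "G1", "G2", "G3", "G4", "NOT_MEASURED"),
  stringsAsFactors = FALSE
)
toy_ranked <- c(G1 = 2, G2 = 1, G3 = -1, G4 = -2)

res <- effective_set_sizes(toy_t2g, toy_ranked, min_size = 2, max_size = 3)
print(res)
```

With the objects from this thread, a call such as `effective_set_sizes(GeneSets$pax3foxo, lfc_vector, 2, 2000)` would be expected to flag GENESET3_TABLE2_RH4 (6191 measured genes) as exceeding the limit.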

Rerun GSEA, but increase the maximum gene set size to 7000 (from 2000):

> gsea_results <- GSEA(
+   geneList = lfc_vector, # Ordered ranked gene list
+   minGSSize = 2, # Minimum gene set size
+   maxGSSize = 7000, # Maximum gene set size
+   pvalueCutoff = 2, # p-value cutoff
+   eps = 0, # Boundary for calculating the p value
+   seed = TRUE, # Set seed to make results reproducible
+   pAdjustMethod = "BH", # Benjamini-Hochberg correction
+   TERM2GENE = GeneSets$pax3foxo,
+   by = "fgsea"
+ )
using 'fgsea' for GSEA analysis, please cite Korotkevich et al (2019).

preparing geneSet collections...
GSEA analysis...
leading edge analysis...
done...
Warning messages:
1: In preparePathwaysAndStats(pathways, stats, minSize, maxSize, gseaParam,  :
  There are ties in the preranked stats (0.07% of the list).
The order of those tied genes will be arbitrary, which may produce unexpected results.
2: In fgseaMultilevel(pathways = pathways, stats = stats, minSize = minSize,  :
  There were 2 pathways for which P-values were not calculated properly due to unbalanced (positive and negative) gene-level statistic values. For such pathways pval, padj, NES, log2err are set to NA. You can try to increase the value of the argument nPermSimple (for example set it nPermSimple = 10000)
> gsea_results 
#
# Gene Set Enrichment Analysis
#
#...@organism    UNKNOWN 
#...@setType     UNKNOWN 
#...@geneList    Named num [1:13028] 6.63 6.27 5.36 5.32 5.31 ...
 - attr(*, "names")= chr [1:13028] "RPS15P2" "RPL35P1" "RPL7AP50" "CGB8" ...
#...nPerm        
#...pvalues adjusted by 'BH' with cutoff <2 
#...6 enriched terms found
'data.frame':   6 obs. of  11 variables:
 $ ID             : chr  "GENESET4_TABLE3_TF" "GENESET6_TABLE4_FP_RMS_SE_MOTIFS" "GENESET3_TABLE2_RH4" "GENESET8_TABLE6_MOLECSENS" ...
 $ Description    : chr  "GENESET4_TABLE3_TF" "GENESET6_TABLE4_FP_RMS_SE_MOTIFS" "GENESET3_TABLE2_RH4" "GENESET8_TABLE6_MOLECSENS" ...
 $ setSize        : int  138 45 6191 238 58 247
 $ enrichmentScore: num  -0.311 -0.233 0.231 0.225 0.221 ...
 $ NES            : num  -1.413 -0.877 0.786 0.738 0.641 ...
 $ pvalue         : num  0.0064 0.6805 1 0.9339 0.9366 ...
 $ p.adjust       : num  0.0384 1 1 1 1 ...
 $ qvalue         : num  0.0337 0.8772 0.8772 0.8772 0.8772 ...
 $ rank           : num  2780 780 2412 3206 2748 ...
 $ leading_edge   : chr  "tags=33%, list=21%, signal=26%" "tags=13%, list=6%, signal=13%" "tags=18%, list=19%, signal=28%" "tags=25%, list=25%, signal=19%" ...
 $ core_enrichment: chr  "ETV2/SNAPC2/MSC/HIC2/TCF7L1/TRMT1/HDAC2/HES6/ZNF444/ZNF696/TOX3/ETV4/PKNOX2/RUNX2/NR2E3/PCGF2/TCF3/MYOG/JARID2/"| __truncated__ "ZNF143/TEAD4/MYCN/MYOD1/MYC/NRF1" "EDARADD/LOC440311/BCL2L10/CHAC1/CSNK2A3/CXCL8/PABPC3/NECTIN4/TMEM119/PWAR5/PLEKHG4/NACA2/RND1/PADI2/KCNC4/RSC1A"| __truncated__ "OPRL1/DRD4/MAPK11/HPD/ADORA2B/ADRB2/TRPV4/TLR4/PTK6/CHRNA7/SERPINE1/ITK/NKAIN1/FGF2/SCN9A/RRM2/CCL2/FAAH/MAP3K8"| __truncated__ ...
#...Citation
S Xu, E Hu, Y Cai, Z Xie, X Luo, L Zhan, W Tang, Q Wang, B Liu, R Wang, W Xie, T Wu, L Xie, G Yu. Using clusterProfiler to characterize multiomics data. Nature Protocols. 2024, doi:10.1038/s41596-024-01020-z 

> as.data.frame(gsea_results)
                                                               ID
GENESET4_TABLE3_TF                             GENESET4_TABLE3_TF
GENESET6_TABLE4_FP_RMS_SE_MOTIFS GENESET6_TABLE4_FP_RMS_SE_MOTIFS
GENESET3_TABLE2_RH4                           GENESET3_TABLE2_RH4
GENESET8_TABLE6_MOLECSENS               GENESET8_TABLE6_MOLECSENS
GENESET1_TABLE1                                   GENESET1_TABLE1
GENESET5_TABLE3_RMS_SE_TF               GENESET5_TABLE3_RMS_SE_TF
                                                      Description setSize
GENESET4_TABLE3_TF                             GENESET4_TABLE3_TF     138
GENESET6_TABLE4_FP_RMS_SE_MOTIFS GENESET6_TABLE4_FP_RMS_SE_MOTIFS      45
GENESET3_TABLE2_RH4                           GENESET3_TABLE2_RH4    6191
GENESET8_TABLE6_MOLECSENS               GENESET8_TABLE6_MOLECSENS     238
GENESET1_TABLE1                                   GENESET1_TABLE1      58
GENESET5_TABLE3_RMS_SE_TF               GENESET5_TABLE3_RMS_SE_TF     247
                                 enrichmentScore        NES      pvalue
GENESET4_TABLE3_TF                    -0.3108523 -1.4133314 0.006395038
GENESET6_TABLE4_FP_RMS_SE_MOTIFS      -0.2327198 -0.8772425 0.680473373
GENESET3_TABLE2_RH4                    0.2312194  0.7857807 1.000000000
GENESET8_TABLE6_MOLECSENS              0.2247054  0.7376688 0.933875890
GENESET1_TABLE1                        0.2205905  0.6409977 0.936635945
GENESET5_TABLE3_RMS_SE_TF              0.1633441  0.5366699 0.998986829
                                   p.adjust     qvalue rank
GENESET4_TABLE3_TF               0.03837023 0.03365809 2780
GENESET6_TABLE4_FP_RMS_SE_MOTIFS 1.00000000 0.87719298  780
GENESET3_TABLE2_RH4              1.00000000 0.87719298 2412
GENESET8_TABLE6_MOLECSENS        1.00000000 0.87719298 3206
GENESET1_TABLE1                  1.00000000 0.87719298 2748
GENESET5_TABLE3_RMS_SE_TF        1.00000000 0.87719298 2462
                                                   leading_edge
GENESET4_TABLE3_TF               tags=33%, list=21%, signal=26%
GENESET6_TABLE4_FP_RMS_SE_MOTIFS  tags=13%, list=6%, signal=13%
GENESET3_TABLE2_RH4              tags=18%, list=19%, signal=28%
GENESET8_TABLE6_MOLECSENS        tags=25%, list=25%, signal=19%
GENESET1_TABLE1                  tags=26%, list=21%, signal=20%
GENESET5_TABLE3_RMS_SE_TF        tags=15%, list=19%, signal=13%
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      core_enrichment
GENESET4_TABLE3_TF                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                
                                 ETV2/SNAPC2/MSC/HIC2/TCF7L1/TRMT1/HDAC2/HES6/ZNF444/ZNF696/TOX3/ETV4/PKNOX2/RUNX2/NR2E3/PCGF2/TCF3/MYOG/JARID2/EYA1/TFAP4/OLIG2/UHRF1/CNOT3/RARA/ZNF593/SIX4/NCOR2/SOX8/EN2/SMYD5/TEAD4/POU4F1/LBX1/MYCN/HOXC10/TBX1/SIX2/SIX1/PITX3/MYOD1/MXD3/SIM2/OLIG1/FOXD4
GENESET6_TABLE4_FP_RMS_SE_MOTIFS ZNF143/TEAD4/MYCN/MYOD1/MYC/NRF1
GENESET3_TABLE2_RH4              EDARADD/LOC440311/BCL2L10/CHAC1/CSNK2A3/CXCL8/PABPC3/NECTIN4/TMEM119/PWAR5/PLEKHG4/NACA2/RND1/PADI2/KCNC4/RSC1A1/ZNF836/HRC/KCNF1/LMNTD2/NRIP2/IFI30/MPZ/RALGAPA1P1/GPR137C/HAS3/ISLR/DRAXIN/INHBE/ATP2A3/FOSB/LOC105371730/PRR5L/GOLGA8T/VASN/EXOC3L4/FBLIM1/ANGPTL1/INHA/CRISPLD2/MISP/DRD4/VWF/HSD17B6/GSG1/KIF21B/KRT17/SAMD11/FCGR2A/TRPV2/PRR36/UNC13A/CHRM4/SORL1/ZNF487/ADGRB2/SYT7/ARHGEF16/MYADML2/NPIPA2/FAIM2/RIMKLA/LUZP2/ITPRIPL2/TSPAN2/KCNJ12/PLEKHB1/CEMIP/IFIT2/LOC105370792/SSTR2/IFIT3/FSCN2/CACNA1H/FAM24B-CUZD1/TSPAN1/HNRNPA1L2/SHISA4/SESN2/SOCS1/CEP126/CARD8-AS1/CALHM2/DDIT3/IL23A/GPRC5A/C1QL1/EIF5AL1/KCNH3/GDF15/LHX1/GPR132/CCDC88C/CTSH/DDIT4/CX3CL1/TMEM91/DNAH17/SULT1A1/RD3/NXPH3/KCND3/ADGRG1/PTPRJ/CACNG4/STOX1/BMF/ADORA2B/ATF3/NMRAL1/PLEKHS1/PRKG1/RAB3A/NUPR1/NCF2/DIO2/TP53TG3/CASC2/GCH1/MTMR11/POU2F2/P2RY6/ASPHD1/SYT17/OBSCN/NUDT8/MAP4K2/ZNF625/B3GALT2/PKP1/RAP1GAP/MST1L/SSTR5-AS1/CRLF1/ZNF442/ANKRD65/NFASC/GOLGA8R/DUSP8/KAZN/EFNA1/GFI1/RHOU/EFEMP2/SLC48A1/KANK4/LRRC37A2/NPAS1/BBS2/KITLG/LRRC8C/NQO1/TMEM100/ZFP69B/LEFTY1/ZNF730/EPS8L2/CTH/CARMIL2/PGBD2/HSPB6/SERTAD4/BATF2/TMED8/NATD1/PLXNC1/ZNF460/ARG2/RAB27A/SLC16A12/GBP3/SYNPO2L/YPEL2/SBF2-AS1/SPIRE2/ADAMTS15/RAD51D/DOK4/SMAD9/MFSD13A/ZNF570/PMAIP1/MCOLN3/FAR2/PRR19/LRRC75A/ROM1/SLC16A4/ZNF10/DUSP16/ZNF829/RAB11B-AS1/RCAN3/ZNF418/BIRC3/PODNL1/ARHGAP9/ZNF181/MBP/C1QL4/PALM/RORA/CEBPA/MMP15/RBFOX3/SAMD15/CCDC169/MFAP4/FAM161B/ZNF583/ZNF772/ZNF814/CYS1/PINLYP/HSP90B2P/MYEOV/ZNF846/FRAT1/FABP3/APLP1/TRPV4/GOLGA8N/MYO15B/RASSF5/FSTL3/EXOC3L1/MEGF6/KIF17/SMG1P7/UNC13D/ZNF540/CDH24/DNAH10/PTPRR/KRT7/ZNF257/DRC3/VMAC/SLX1B/ZNF709/CDC42BPG/MYPN/CCNE1/MAP1LC3C/C11orf96/RGPD5/MAST3/LOC101928844/S1PR2/ZNF790/TP73/C11orf52/RCBTB2/NRIP3/ZIC5/ZNF805/VSIG10L/SH2D5/FGD6/NANOS1/BDNF/ZNF566/NBR2/PKP2/SRGAP1/ZNF543/MALAT1/SLC8A3/SLC46A3/IL11/MYO5C/FAM111A/TGFB3/CEBPG/ALX4/PARD6A/ZNF365/RGS9/ST20/IPP/EFCAB2/OSCP1/DNAJC9-AS1/PIK3AP1/SRL/ZNF585B/ZKSCAN2/ZNF799/PRRT2/AKR7L/SPTBN4/LRRC56/FICD/PIANP/G
PR158/MPO/ZNF573/SESN3/ADAM11/IPO9-AS1/ST8SIA1/BATF3/ENO2/IL27RA/CNTNAP1/ZDHHC22/DLL4/ZNF563/MAP3K9/ZNF429/ZC3H12C/GNG4/FAM111B/ZNF681/CHRNA7/SLC30A4/BBC3/BEST1/DACT3/ZNF443/SLC25A42/GRHL1/ESR2/SLC26A11/CARMIL3/PRELID3A/CARD14/ZNF878/MAML2/GAS7/ONECUT2/CHST3/SLC16A6/ASGR1/BDKRB2/PSMB10/KATNAL1/N4BP2L2-IT2/ZNF180/ZNF527/VASH1/IQCC/SSTR5/LINC01140/ZNF682/SEC14L1P1/DGKA/CCDC88B/ZNF345/SETDB2/LYST/DACT1/NKAIN1/UNC5B/RNF125/STAT6/PIDD1/TIRAP/RASAL2-AS1/PPP1R15A/SLC29A2/ZNF222/SLC8B1/RNF152/WDR90/SLC37A2/C1orf226/ABCB9/SNURF/CTF1/GOLGA6L4/ATP2B4/ZNF584/RAB39A/PYROXD2/ZNF529/E2F8/ISG15/AGAP2/BOLA1/VSIG10/GPR157/PTGES3L-AARSD1/PIK3R3/ZNF607/CPEB1/RPS6KA5/SLC9A5/SMPD1/ZNF337-AS1/CCDC15/PABPC4L/RPH3AL/ZFP3/MSS51/ZNF184/ZNF697/TGFB2-OT1/ZFP90/MNS1/ABLIM1/IL1R1/CCDC144A/SLC16A3/EPHX1/WNT3/HSD11B2/ZNF486/CDK18/LINC01224/MLLT11/PER3/ZNF808/RRM2/PROCA1/EFNB3/NRP1/ZNF684/NR4A1/SBK1/GDPD5/TCIRG1/ZNF773/MRGPRF/ZNF678/CCL2/IFI44/CEP170B/GCLM/F3/ZNF695/PPP1R13B/SCN3B/ZNF470/ZNF490/ZNF611/DYRK3/ZNF569/RNF19B/TEX14/TMEM182/MAP3K12/S100A4/NAGS/KNDC1/DDN/ZNF436/IL17D/LIN37/C17orf100/TEX9/TRIM5/ELOVL2/TBC1D3B/SCN8A/TMEM221/EXO5/DTL/ALG10B/ZNF530/MAP3K6/ACACB/RHOJ/HOXC6/RASIP1/GADD45A/LDB3/TMEM170A/PDIK1L/BMS1P4/ATP1A2/ASAH2B/TMEM106A/ADAMTS4/IRF7/BRPF3/SRCIN1/CLSPN/AXIN2/FAAH/NDRG4/LITAF/CDC42EP2/POLD3/SYNM/ZNF700/H6PD/PTPN22/ZNF471/AFAP1L2/ZNF610/ZNF260/ADCK1/C12orf76/ARHGDIB/CACNB3/KRBA2/RASSF7/TMEM79/ADPRHL1/ZNF888/SLC25A29/FBXL22/ETNK2/ZNF701/ZNF441/DDX11-AS1/TDRD1/MAP3K8/PRKACB/EXO1/ZNF585A/MAST1/TMEM107/PMEL/LAMB3/NMNAT1/ZNF816/SLC6A16/CBX5/GPT2/ZNF44/ICAM5/ZNF613/MOK/PGPEP1/CYLD/ZNF230/ZNF821/GBP1/MTUS2/RAB3B/ZNF416/TLE2/ALG10/MAP4K1/ZNF23/CCPG1/ZFP82/CLCC1/PIGV/B4GALT6/RARG/PADI3/MMP19/ZNF761/FOXA1/SOWAHC/MYL4/CLMN/FLVCR2/SLC25A45/HOXB9/TMEM86A/C16orf87/DISC1/SCN1B/RNASEK-C17orf49/ZNF287/DNAJC6/CRACR2B/TMEM254/CLEC2D/ATL1/ZMYM5/ZNF90/ATF5/NT5M/AVPI1/LPXN/ZNF264/ADAT1/PUS7L/RGL3/B4GALNT1/NETO2/ZNF101/ZNF382/LINC01583/APH1B/ZNF641/FOXD2/TYRO3/KLHL42/SMAD7/ZSCAN22/C1orf105/LPGAT1/LMBRD
2/SLC16A13/ZFYVE19/SFT2D2/PITPNM2/GDPGP1/ALOX5AP/ZBTB1/CACNA1G/LURAP1/FOXL1/NLRP11/CCP110/TTC39A/DMXL2/LRRC37B/PSMC3IP/GXYLT1/COL1A1/DPF1/SFTA1P/KCNJ14/TICAM1/ADCY6/DCUN1D3/ZNF30/NEIL1/ACSF2/HEXIM2/PALB2/BRCA1/DYNC2H1/MATN1-AS1/MDM1/ZNF324B/ATP8B4/EXOSC6/ANK3/CCDC24/TRAPPC6B/PDLIM1/ZNF837/ADAMTSL3/ZNF606/ZFP30/CACHD1/ABHD4/ZNF200/SVIP/TOB1-AS1/GADD45B/ZBTB41/ZNF211/SECTM1/CEP295/NFAT5/SPOCD1/SOX6/BBOF1/PAX6/TEX30/ATL3/TEX19/PITPNC1/ZNF675/TSSK6/ARNT2/FAAP24/ZNF14/ZNF33A/HELLS/SPHK1/ZNF850/CDR2L/LIN7A/KIF1A/KIAA1549L/CGNL1/ZNF770/ZNF548/IRAK4/ANO8/ABHD3/LCOR/DLGAP1/RAD51AP1/SLC35G1/STK33/PCDH9/LRRC37A/SYNC/ARID3B/PDE4B/PNPO/ENTPD7/ZNF100/MXI1/TIGD7/LRP4/HELB/ZNF93/AKAP11/ZFP28/EMILIN2/CDK5R1/TTC39C/ERN1/SYNE2/PDE4A/SDE2/E2F7/MOB3C/LRP8/FBF1/SCNN1D/GLCE/DGKH/ZNF43/SLC41A2/IGFLR1/MTHFR/FCRLB/ZNF554/GNAL/ZNF232/BTG2/ATF7/LTBP4/BRI3BP/ASB16-AS1/DENND2C/MPPE1/CCDC28B/NBPF1/AGO3/ZNF226/PRDM11/FOXO3B/RTN4RL1/MYRF/SYT8/DNM3/C18orf54/ALDH3B1/IGSF8/SH3D21/SCARNA17/SLFNL1-AS1/RELB/GSKIP/CILP/B3GNT9/ZNF235/DDB2/RDH5/STK38L/KIF7/LTK/STYX/CHRNA5/TXNDC16/MOV10/KLF2/GPR3/IRGQ/IDI1/BRIP1/ZNF28/CRIP2/ZDHHC1/PLCE1/PRKCH/ASB13/LINC00626/TIGD3/ZNF488/HAUS5/OSGIN1/LOC728485/LRRC8E/ACER3/THAP10/SLC41A1/IRF8/PIP4K2C/TNS2/ZNF223/EGR2/ITGB3/EPB41/ZNF85/ACSBG1/FAM177A1/SOCS2/PYGO1/TMEM117/TAF13/PPM1E/ZNF780A/ZNF45/SPATA33/RAVER2/C14orf132/MLH3/LOC105371795/SPTBN2/RGL1/C1orf74/ZNF670/SLC3A2/ANKRD26/CSPG4/CMTM4/PARP16/ANKRD1/DCLK1/APOLD1/PALMD/ZNF525/LPIN2/ZNF26/LIN7C/ZNF671/ZNF667/TMEM106C/NLRP1/GAS6/ZNF383/ZNF577/LINC01572/CNN1/RASSF8/FADS1/SLC35D1/MAF/SLC46A1/LRRC57/DPY19L3/ZNF461/GATA3/ZNF136/SULT1A3/FANCI/CAPS/ZNF195/SH2D7/FBXL8/AGL/CDC42EP4/PLCB2/MOAP1/ASCC1/PTPRQ/MPP2/ZNF419/LOC100505715/EFNA2/CDC6/ZNF274/PTGES3L/SUSD6/PLEKHG2/LOC728392/ZC3H6/DNAJC22/NOS1AP/SPOCK2/ZNF841/FBN1/NUCB2/WNT5B/GNB5/OTUB2/ERCC6/ZSCAN30/PGGHG/TMOD3/MDK/GAREM1/VWCE/PTPRB/ZNF8/ZBTB8A/ZNF561/TP53I11/DNA2/UNKL/GPR137/ZNF468/PPM1D/TRIM47/PEAK1/ZNF324/KBTBD7/SELPLG/DHRS1/RABIF/ZNF426/TRIM66/TCEANC2/LYSMD1/SYT1/TVP23C/P
GM2L1/ZNF430/RRM1/GOLGA6L17P/ZNF550/ZNF254/CPEB3/MYBPHL/JRKL/SLC20A1/A1BG-AS1/ZNF823/HCN3/PMP22/NEXN-AS1/RASD1/TPCN2/TMC7/ZNF420/ZNF567/DENND1B/TNFRSF19/CMIP/TBX2-AS1/NIPA1/HES4/ZNF17/GINS3/ANKRD49/ZNF813/ZNF225/EPC1/MARK1/IFFO2/ZNF473/ZNF667-AS1/ZNF519/SLCO3A1/SMG9/C15orf39/RAD9B/SNTB2/ZNF547/ZNF649/ARL5B/HSPB8/B4GAT1/NFKBID/MYBPC2/INSM2/TTBK2/LOC105370941/ZNF253/ZNF506/U2AF1L4/ZNF790-AS1/PSTK/COTL1/L2HGDH/C16orf95/CLCF1/TMEM263/KLHDC1/RUSC1-AS1/CADM1/ZDHHC13/INTS7/AKAP5/RAB8B/SLC2A13/ASAH2/ZNF362/PLEKHH3/DAAM1/LIPT2/ARID5B/ZNF221/TMEM87B/DNAL1/TMEM105/GRAMD1B/CRY2/NRXN2/ZNF549/CHD9/RAB30/ZNF91/SORBS1/TMEM135/LTBP3/ACBD5/COL16A1/SENP1/ZFPM1/TMEM25/SLC9A1/EMP2/SKIDA1/ZNF844/NEO1/ERBB3/PRSS53/UBR7/DUSP14/STON2/ZNF329/KATNAL2/ARHGEF10L/APLN/DNASE2/SIPA1L1/ZNF18/TMEM62/ZNF350/SHC4/ABCB10/CD27-AS1/COQ10A/TUBG2/MFSD11/ZDHHC6/TMTC3/TSC22D1/ZNF33B/LINC00639/LRTOMT/HOOK2/CDKL1/RAB2B/SLC38A4/ZNF672/SLC12A4/RNF138/ZFP69/PLPPR2/AUNIP/ZNF112/SP110/GCC2/GAN/EVA1B/MARVELD1/NPIPB9/CCDC102A/GIPR/SV2A/ULK3/MTHFS/ANKRD42/C1orf216/EDRF1/GOLT1B/ORC1/HEATR5A/ZNF551/CLMP/SOCS4/TMEM41B/IFI27L2/ZNF546/MEIS2/SNX7/PTCH2/MED18/SLC2A3/SLC12A6/CRYL1/FTL/CKB/ZNF408/ZNF776/CPNE2/LRRC8D/ZNF331/ATP2A1/ENTPD5/ZNF597/ZNF845/ADPRM/ZNF765/GCOM1/APC2/NFKBIA/SYDE2/TBC1D12/HAPLN3/IVNS1ABP/HAUS2/INSR/EDEM3/PPCS/GPR135/KCNMB4/ZNF699/SYNGR3/CDK2/OPTN/EMX2/HCFC2/POU6F1/ZNF599/THNSL1/DHRS13/LAMC2/QPRT/ADD3/MYBPH/ZNF615/ZNF627/RBBP8/PPP1R3E/LRIF1/RCN3
GENESET8_TABLE6_MOLECSENS        OPRL1/DRD4/MAPK11/HPD/ADORA2B/ADRB2/TRPV4/TLR4/PTK6/CHRNA7/SERPINE1/ITK/NKAIN1/FGF2/SCN9A/RRM2/CCL2/FAAH/MAP3K8/PTGS1/RARG/HRH1/ALOX5AP/MGLL/SPHK1/ERN1/PDE4A/SCNN1D/PDGFRB/ADORA2A/CLCN2/RRM1/EDNRA/JAK2/PIM1/SMO/CDK2/CASP3/SIRT1/PDE10A/SRD5A1/MDM2/RPS6KA1/CAMKK1/XDH/TP53/SGK1/PNP/DPYD/CALM1/PIK3CA/CYP51A1/BLM/PTP4A1/BRAF/BCL2/PRKAA1/XIAP/LAP3/CDC7
GENESET1_TABLE1                  DIRC3/BCL11A/SLC46A3/ACVR1C/MTUS2/ALOX5AP/EPAS1/DCLK1/TNFRSF19/HS1BP3/OXER1/SERTAD2/INO80D/MAP3K2/KLF5
GENESET5_TABLE3_RMS_SE_TF                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     DDIT3/TFCP2L1/HES1/BCL11A/RORA/CEBPB/GBX2/ALX4/NR4A1/DLX2/HOXC6/ZNF366/RARG/FOXL1/ARNT2/BHLHE40/HIVEP1/TNFRSF13C/HOXA5/MYF5/NFIB/CREB5/TSC22D3/ZNF420/ZNF473/ZNF649/ARID5B/DNASE2/HIVEP2/TSC22D1/HOXB5/MEIS2/ZNF331/ZNF699/HOXA4/EMX2/HOXA3/ZFP1
> 

Note that results are now reported for 6 gene sets (not 5); the results for set GENESET3_TABLE2_RH4 are now included.
Also note that the reported setSize values are equal to those calculated manually in the previous code chunk.

But why are results still not reported for all 8 input sets?

This has to do with the (2nd) warning that is reported:
2: In fgseaMultilevel(pathways = pathways, stats = stats, minSize = minSize, : There were 2 pathways for which P-values were not calculated properly due to unbalanced (positive and negative) gene-level statistic values. For such pathways pval, padj, NES, log2err are set to NA. You can try to increase the value of the argument nPermSimple (for example set it nPermSimple = 10000).

This warning is thrown by fgsea, which is used under the hood when running the GSEA function from clusterProfiler.
The warning notifies you that for 2 gene sets there is an imbalance in the direction of regulation of their member genes (most of them have a positive logFC, i.e. are up-regulated), which violates an assumption of the GSEA procedure. See e.g. my previous post (#721 (comment)), which also links to a thread in the fgsea GitHub repository (ctlab/fgsea#124).
Because of this imbalance, fgsea sets pval, padj, NES and log2err for these 2 gene sets to NA, and since the GSEA function filters out results with NA (there are no (adjusted) p-values to filter on), these 2 gene sets are absent from the results.

This is why in the end results for only 6 of the 8 sets are reported!
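To see which sets might be affected before running the analysis, one could count the positive and negative statistics per gene set. The snippet below is only a minimal sketch in base R: the data are made up (not the data from this issue), and `sign_balance()` is a hypothetical helper, not a function from fgsea or clusterProfiler.

```r
# Toy ranked statistics and gene sets (hypothetical, for illustration only).
stats <- setNames(c(2.1, 1.7, 1.2, 0.8, 0.4, -0.3, -0.9, -1.5),
                  paste0("G", 1:8))
sets <- list(
  balanced  = c("G1", "G4", "G6", "G8"),
  mostly_up = c("G1", "G2", "G3", "G5")
)

# Hypothetical helper: per gene set, count the genes present in the ranked
# list and how many of them have a positive or a negative statistic.
sign_balance <- function(pathways, stats) {
  t(sapply(pathways, function(genes) {
    s <- stats[intersect(genes, names(stats))]
    c(n = length(s), n_pos = sum(s > 0), n_neg = sum(s < 0))
  }))
}

balance <- sign_balance(sets, stats)
print(balance)
```

With the real data, `sets` would be the gene set list (`genesets.list`) and `stats` the ranked vector (`lfc_vector`). A set in which nearly all genes share the same sign is a candidate for the NA results described above, although the counts alone do not tell you where fgsea draws the line.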

Note: to have all sets analyzed, the fgsea thread suggests changing the argument scoreType from its default "std" to "pos" or "neg". Be aware that doing so changes the interpretation of the results (see ?fgseaMultilevel and the fgsea thread for more on this).
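To illustrate why the interpretation changes, here is a simplified base R sketch of a GSEA-style running sum for a single gene set. It is not fgsea's implementation; `running_es()` and the toy data are hypothetical. The assumption is that "std" reports the deviation from zero with the largest absolute value, "pos" only the maximum (positive) deviation, and "neg" only the minimum (negative) deviation.

```r
# Toy ranked statistics (hypothetical, for illustration only).
stats <- setNames(c(2.1, 1.7, 1.2, 0.8, 0.4, -0.3, -0.9, -1.5),
                  paste0("G", 1:8))
gene_set <- c("G2", "G7", "G8")

# Simplified running-sum enrichment score (a sketch, not fgsea's code).
running_es <- function(stats, gene_set,
                       score_type = c("std", "pos", "neg"), p = 1) {
  score_type <- match.arg(score_type)
  stats <- sort(stats, decreasing = TRUE)
  hit <- names(stats) %in% gene_set
  w <- abs(stats)^p
  # Step up at each hit (weighted by the statistic), step down at each miss.
  step <- ifelse(hit, w / sum(w[hit]), -1 / sum(!hit))
  rs <- cumsum(step)
  es_pos <- max(rs)  # largest positive deviation
  es_neg <- min(rs)  # largest negative deviation
  switch(score_type,
         pos = es_pos,
         neg = es_neg,
         std = if (es_pos >= abs(es_neg)) es_pos else es_neg)
}

es_std <- running_es(stats, gene_set, "std")
es_pos <- running_es(stats, gene_set, "pos")
es_neg <- running_es(stats, gene_set, "neg")
print(c(std = es_std, pos = es_pos, neg = es_neg))
```

For this toy set the "std" score is negative while the "pos" score is a small positive value. That is the same pattern seen for GENESET2_TABLE2_P3F in the fgsea outputs in this post: with "pos" the set is only tested for enrichment at the top of the ranked list.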

Code to show fgsea output (fgsea is used under the hood by clusterProfiler):

> library(fgsea)
> res.fgsea <- fgseaMultilevel(
+              pathways = genesets.list,
+              stats = lfc_vector,
+              minSize = 2,
+              maxSize = 7000,
+              eps = 0,
+              scoreType = c("std"),
+              nPermSimple = 10000)
Warning messages:
1: In preparePathwaysAndStats(pathways, stats, minSize, maxSize, gseaParam,  :
  There are ties in the preranked stats (0.07% of the list).
The order of those tied genes will be arbitrary, which may produce unexpected results.
2: In fgseaMultilevel(pathways = genesets.list, stats = lfc_vector,  :
  There were 2 pathways for which P-values were not calculated properly due to unbalanced (positive and negative) gene-level statistic values. For such pathways pval, padj, NES, log2err are set to NA. You can try to increase the value of the argument nPermSimple (for example set it nPermSimple = 100000)
> res.fgsea
                            pathway       pval       padj     log2err
                             <char>      <num>      <num>       <num>
1:                  GENESET1_TABLE1 0.93508590 1.00000000 0.006966464
2:              GENESET2_TABLE2_P3F         NA         NA          NA
3:              GENESET3_TABLE2_RH4 1.00000000 1.00000000 0.000000000
4:               GENESET4_TABLE3_TF 0.01012094 0.06072565 0.380730401
5:        GENESET5_TABLE3_RMS_SE_TF 0.99979818 1.00000000 0.001397729
6: GENESET6_TABLE4_FP_RMS_SE_MOTIFS 0.69178922 1.00000000 0.040450477
7:         GENESET7_TABLE5_P3F_HSMM         NA         NA          NA
8:        GENESET8_TABLE6_MOLECSENS 0.93898990 1.00000000 0.003973030
           ES        NES  size  leadingEdge
        <num>      <num> <int>       <list>
1:  0.2205905  0.6420197    58 DIRC3, B....
2: -0.2649017         NA   949 GAS1, CA....
3:  0.2312194  0.7843553  6191 EDARADD,....
4: -0.3108523 -1.4454048   138 FOXD4, O....
5:  0.1633441  0.5346445   247 DDIT3, T....
6: -0.2327198 -0.8723426    45 NRF1, MY....
7: -0.2137811         NA   522 MAML3, G....
8:  0.2247054  0.7344090   238 OPRL1, D....
> 

With scoreType set to "pos" (only the 1st warning remains):

> res.fgsea <- fgseaMultilevel(
+              pathways = genesets.list,
+              stats = lfc_vector,
+              minSize = 2,
+              maxSize = 7000,
+              eps = 0,
+              scoreType = c("pos"),
+              nPermSimple = 10000)
Warning message:
In preparePathwaysAndStats(pathways, stats, minSize, maxSize, gseaParam,  :
  There are ties in the preranked stats (0.07% of the list).
The order of those tied genes will be arbitrary, which may produce unexpected results.
> res.fgsea
                            pathway      pval  padj      log2err          ES
                             <char>     <num> <num>        <num>       <num>
1:                  GENESET1_TABLE1 0.8237176     1 0.0066740910 0.220590489
2:              GENESET2_TABLE2_P3F 1.0000000     1 0.0000000000 0.006705853
3:              GENESET3_TABLE2_RH4 1.0000000     1 0.0000000000 0.231219438
4:               GENESET4_TABLE3_TF 0.9991001     1 0.0004329817 0.088599964
5:        GENESET5_TABLE3_RMS_SE_TF 0.9953005     1 0.0009913447 0.163344070
6: GENESET6_TABLE4_FP_RMS_SE_MOTIFS 0.8768123     1 0.0054076227 0.183895057
7:         GENESET7_TABLE5_P3F_HSMM 1.0000000     1 0.0000000000 0.107980000
8:        GENESET8_TABLE6_MOLECSENS 0.9320068     1 0.0038967140 0.224705449
          NES  size  leadingEdge
        <num> <int>       <list>
1: 0.68927996    58 DIRC3, B....
2: 0.02251511   949 SERPINE2....
3: 0.78414977  6191 EDARADD,....
4: 0.28613395   138 ZNF354A,....
5: 0.53653607   247 DDIT3, T....
6: 0.56815628    45 IRF1, ST....
7: 0.35953728   522 RNF144B,....
8: 0.73791523   238 OPRL1, D....
>

@Krithika-Bhuvan
Author

Thank you SO MUCH for looking into this and for the detailed explanation. This has been incredibly helpful.
To avoid unbalanced sets in the future, would you suggest using a different ranking criterion, such as a combination of fold change and p-value, or something else? I'd appreciate your advice.

@guidohooiveld

That is a good question.

I commonly use the (moderated) t-value for ranking genes, and I have never had to deal with unbalanced sets. Yet, I don't want to claim the t-value is 'the best' ranking metric... but it is arguably used often.
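As a rough sketch of how such a ranked vector could be built: the table below is made up, and its column names (`logFC`, `t`, `P.Value`) are assumed to follow the output of limma's `topTable()`. The signed -log10(p-value) is shown only as one possible way to combine fold change and p-value, not as a recommendation.

```r
# Hypothetical differential expression table; values are invented.
de <- data.frame(
  gene    = c("GENE_A", "GENE_B", "GENE_C", "GENE_D"),
  logFC   = c(1.8, -0.6, 0.2, -2.4),
  t       = c(6.1, -2.0, 0.5, -7.3),
  P.Value = c(1e-6, 0.05, 0.62, 1e-8)
)

# Option 1: rank genes on the (moderated) t-value.
rank_t <- sort(setNames(de$t, de$gene), decreasing = TRUE)

# Option 2: combine direction and significance as sign(logFC) * -log10(p).
rank_signed_p <- sort(
  setNames(sign(de$logFC) * -log10(de$P.Value), de$gene),
  decreasing = TRUE
)

print(rank_t)
print(rank_signed_p)
```

Either named, decreasingly sorted vector could then be passed as `geneList` to `GSEA()` or as `stats` to `fgseaMultilevel()`. Neither metric guarantees that unbalanced sets will not occur.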

@Krithika-Bhuvan
Author

Thank you very much
