Molecular Subtyping Resource

INTRODUCTION:

Welcome to the Molecular Subtyping Resource ( MouS^R ):

Number of user(s) currently connected to this App

Please Note: Chrome, Edge and then Firefox are the recommended browsers for the best App experience.

This Molecular Subtyping Resource has been developed within the Molecular Pathology Research Group Dunne-Lab as part of the CRUK-funded ACRCelerate colorectal cancer pre-clinical accelerator programme.

The ACRCelerate project will bring together an european-wide consortium of basic and clinical scientists at the forefront of CRC research to interrogate a suite of state-of-the-art preclinical models. The overarching aim of ACRCelerate is to generate robust and reproducible preclinical data to de-risk future clinical trials via patient stratification. Specifically, the various models will be categorised into subtypes based on their gene activity so that treatments can be aimed at particular subgroups of patients. In doing so, the network will be able to accelerate the next generation of stratified trials for CRC through accurate disease subtype positioning. Using this preclinical testing platform, the consortium aims to accelerate the identification and development of new drugs for CRC.

This portal accepts uploads of either; text, csv files. The format of your Transcriptional Data Matrix RNA-seq-Read Count(Both Decimal and Integer numbers) should be similar to this table, with a size limit of up to 30 MB, length no restriction: If your Transcriptional Data Matrix file is not in the same orientation to the table below, you will need to transpose it. A transposing tool can be accessed via this link: Click here

*Transcriptional Data Matrix Table*
ID	symbol	KPNt1	KPNt2	KPNt3	KPt1	KPt2	KPt3	APNt1	APNt2	APNt3	......
ENSMUSG00000000001	Gnai3	2792	867	3074	2096	4537	5269	5635	6359	9770	......
ENSMUSG00000000028	Cdc45	510	143	131	178	289	316	479	608	1065	......
ENSMUSG00000000031	H19	212	128	796	300	634	598	100495	2082	18134	......
ENSMUSG00000000037	Scml2	8	0	0	11	1	2	27	122	135	......
ENSMUSG00000000049	Apoh	10	26	1	25	27	15	0	23	40	......
ENSMUSG00000000056	Narf	925	176	730	569	850	830	1790	1632	2115	......
ENSMUSG00000000058	Cav2	2195	568	460	1480	2586	1211	303	457	605	...
ENSMUSG00000000078	Klf6	3651	838	2349	4263	5373	5483	3156	5324	6919	......
ENSMUSG00000000085	Scmh1	764	301	491	1175	1099	601	1420	814	1634	......
ENSMUSG00000000088	Cox5a	2127	693	1716	3127	2506	3221	1333	3077	4507	......
ENSMUSG00000000093	Tbx2	127	31	86	250	269	192	106	206	344	......
ENSMUSG00000000094	Tbx4	5	0	24	20	42	14	2028	143	209	......
ENSMUSG00000000120	Ngfr	28	9	108	47	96	45	571	71	76	......
ENSMUSG00000000125	Wnt3	0	0	12	1	2	6	20	99	100	......
ENSMUSG00000000126	Wnt9a	89	5	18	190	171	50	24	25	22	......
......	......	......	......	......	.......	......	......	......	......	......	......

The format of your Sample Information and Labels Table should be similar to this, If your types are two words, such as Early tumor, you should write them as Early_tumor. Also please note the order of your Sample input should be exact same match with your Transcriptional Data Matrix columns.

*Sample Information and Labels Table*
Group	Samples	Group-ID
KPN_tumour	KPNt1	KPNt1
KPN_tumour	KPNt2	KPNt2
KPN_tumour	KPNt3	KPNt3
KP_tumour	KPt1	KPt1
KP_tumour	KPt2	KPt2
KP_tumour	KPt3	KPt3
APN_tumour	APNt1	APNt1
APN_tumour	APNt2	APNt2
APN_tumour	APNt3	APNt3
......	......	......

Source of inspirations:

DEApp

Input Data: Please Upload your Data

Data Input-Video Watch it here

Select one Option based on the Format of your Transcriptional Data Matrix

None selected

Read count (Integer Numbers)

Read count (Decimal Numbers)

Please note In Read count (Decimal Numbers) option, the Entire Differential Gene Expressions section is not available in the current version of the App.

Input 1: Transcriptional Data Matrix

Upload Your Transcriptional Data Matrix File here:

Browse...

Separator(file)

Comma

Semicolon

Tab

The Examplar file of 'Read count (Integer Numbers)' is available here

Input 2: Sample Information and Labels

Upload Your Sample Information and Labels File here:

Browse...

Separator(file)

Comma

Semicolon

Tab

The Examplar file of 'Sample Information and Labels'is available here

Clicking Check Input Files again before the upload is complete will reset the process

Input Details

Input Information Summary

Everything looks correct?

PCA 3D Analysis

PCA 3D-Video Watch it here

Select Plot Downloading Format

png

svg

Plotly Modebar control: From left to right

1)Download plot as a selected format.

2) Zoom on selection. Click and hold your mouse button to select a region to zoom on.

3)Image panning.

4)Rotates the plot around its middle point in three-dimensional space.

5)Rotates the plot around its middle point while constraining one axis slightly.

6)Reset axis to the original axis ranges.

7)Reset to the last saved axis ranges.

8)Show closest data on hover -- this function is always on.

Group color can be changed using PCA2D Pick a color option

PCA 2D Analysis

Different Filtering Options

PCA 2D-Video Watch it here

Select one option

PC1&PC2

PC1&PC3

PC2&PC3

Select label option

OFF

PCA :A dimensionality-reduction method to assess quality and clustering characteristics of data sets. We used the prcomp function in the R stats package to perform PCA analysis.

PCA References

Select Plot Downloading Format

png

svg

Size of the labels

Size of the points

Plot_Height (# pixels):

Plot_Width (# pixels):

Put plot option on PC1&PC2 first:

PCA Analysis 2D Plot

Download

MDS 2D Analysis

Different Filtering Options

MDS 2D-Video Watch it here

Select label option

OFF

MDS: Multidimensional Scaling is a dimension-reduction technique designed to project high dimensional data down to 2 dimensions while preserving relative distances between observations. We used the cmdscale function in the R stats package to perform MDS analysis.

MDS References

Select Plot Downloading Format

png

svg

Size of the labels

Size of the points

Plot_Height (# pixels):

Plot_Width (# pixels):

MDS Analysis 2D Plot

Download

ssGSEA Analysis

Analysis Filtering Criteria

ssGSEA-Video Watch it here

Select one Species

Mouse

Human

Select one option

Hallmark

GeneOntology

USER_Geneset

Upload Your list of Gene's File here (.rdata only):

Browse...

Convert yout .txt file to .rdata here

Please pick a related Species according to your already Uploaded Transcriptional Data Matrix.

If your Uploaded data is Read count (Integer Numbers) and you didn't run the Differential Gene Expression Analysis yet, first you need to run Differential Gene Expression Analysis (Heatmap section) to get your data normalized then come back to run this analysis.

ssGSEA: The modification of standard gene set enrichment analysis (GSEA) specifically for single sample classification (ssGSEA) is performed using the GSVA package (version 1.32.0)

GSVA

The R package msigdbr (version 7.1.1) is used to retrieve mouse Hallmark and biological processes (GO_BP) gene sets and applied to samples.

msigdbr

Enrichment score from ssGSEA demonstrate the degree to which the genes in a particular gene set are co-ordinately up- or down-regulated within a sample

Enrichment score

Analysis Results

Download

ssGSEA Hallmark Plot

Select Plot Downloading Format

png

svg

Plot height (# pixels):

Plot width (# pixels):

Download

Gene Set Enrichment Analysis (GSEA) Filtering

Please Choose Two Groups

Select one Species

Mouse

Human

Please Note for GeneOntology option, Only the first 50 pathways having highest ES will be displayed here.

Download-Genelists

GSEA: Gene set enrichment analysis (GSEA) is a computational method, used to determine whether a priori defined set of genes display significant differences between two biological phenotypes. This is performed using the fgsea package (version 1.16.0) and enrichplot(version 1.10.0)

GSEA

GSEA-Video Watch it here

Gene Set Enrichment Analysis Plot

Select Plot Downloading Format

png

svg

Plot height (# pixels):

Plot width (# pixels):

Download

CRIS-Classifier

Analysis Filtering Criteria

Select one Species

Mouse

Human

If your Uploaded data is a Read count (Integer Numbers) and you didn't run the Differential Gene Expression Analysis yet, first you need to run Differential Gene Expression Analysis (Heatmap section) to get your data normalized then come back to run this analysis.

Warning, you should have at least 20 samples to get reliable result using current classifer method.

CRIS-Classifier for Human option: This method performs CRC intrinsic subtypes (CRIS) classification based on nearest template prediction (NTP) method and CRIS template used in CMScaller R package.

CRIS-Classifier for Mouse option: In order to do CRC intrinsic subtypes (CRIS) classification in mouse, the human CRIS template, embedded in CMScalller, was converted to mouse orthologues using biomaRt, R package, and NTP method was applied to call CRIS subtypes in mouse tissues.

biomaRt / CMScaller / NTP / CRIS

Download

Select Plot Downloading Format

png

svg

Plot height (# pixels):

Plot width (# pixels):

Download

CMS-Classifier

Analysis Filtering Criteria

Select one Species

Mouse

Human

Warning, you should have at least 20 samples to get reliable result using current classifer method.

CMS-Classifier for Human option: This method performs Consensus Molecular Subtypes (CMS) classification in human based on nearest template prediction (NTP) method and CMS template used in CMScaller R package.

CMS-Classifier for Mouse option: In order to do CMS classification in mouse, the human CMS template, embedded in CMScalller, is converted to mouse orthologues using biomaRt, R package, and intersected mouse genes across CMS subtypes were removed. Then, NTP method was applied to call CMS in mouse tissues.

biomaRt / CMScaller / NTP

CMS-Classifier Table and Plot

CMS_Table
CMS_Plot

Download

Select Plot Downloading Format

png

svg

Plot height (# pixels):

Plot width (# pixels):

Download

Gene Expression Levels

Gene Expression Tables

Gene Expression Levels-Video Watch it here

Gene Expression Plot

Pick on Option for Plot

CountPlot

CountPlot: click on a gene of interest from the table above. (The Table is only for this option)

Select Plot Downloading Format

png

svg

Plot_Height (# pixels):

Plot_Width (# pixels):

Download

Differential Gene Expression Analysis

Differential Categories comparison Analysis

Please select Maximum 5 groups for analysis in each Differential Category A and B

Please DO Not compare 2 same groups

Differential gene expression analysis: DESeq2 package (version 1.24.0) was used to identify genes which are differentially expressed between the two groups selected by user.

DESeq2

Heatmap: The heatmaply package (version 1.1.1) is used to visualize the differentially expressed genes.

heatmaply

Heatmap-Video Watch it here

By clicking Create Plot button, the Heatmap plot will be created using default values as below. Then you can change these values and press it again to Update plot.

Padj-value:

log2FoldChange(min):

log2FoldChange(max):

Select Plot Downloading Format

png

svg

Select label option

OFF

Plot_Height (# pixels):

Plot_Width (# pixels):

Output Filename

Top Significant Genes Heatmap

Heatmap Plot
Differential Gene Expression Table

Plotly Modebar control: From left to right

1)Download plot as a selected format.

2) Zoom on selection. Click and hold you mouse button to select a region to zoom on.

3)Image panning.

4)Zoom IN.

5)Zoom OUT.

6)Reset axis to the original axis ranges.

7)Toggle spike lines on and off. Shows lines on a graph indicating the exact x-axis and y-axis.

Download txt-file

Volcano Analysis

Volcano Plot
Differential Gene Expression Table

The Volcano plot is created using default values as below using Plotly. You can change these values and press Update plot.

Afterwards, you can pick any gene names (max=10), choose selected Genes option and then new plot will be created.

Volcano-Video Watch it here

Select Plot type

using-plotly

selectedGenes

Size of the points

Output Filename

Fold Change threshold(min):

log2FoldChange(min):

Significance threshold:

User selected Genes (max= 10):

Download Selected GenePlot

Select Plot Downloading Format

png

svg

Plot_Height (# pixels):

Plot_Width (# pixels):

Plotly Modebar control: From left to right

1)Download plot as a selected format.

2) Zoom on selection. Click and hold you mouse button to select a region to zoom on.

3)Image panning.

4)Zoom IN.

5)Zoom OUT.

6)Reset axis to the original axis ranges.

7)Toggle spike lines on and off. Shows lines on a graph indicating the exact x-axis and y-axis.

Download txt-file

Selected Gene List Heatmap Plot

Heatmap Selected Gene-Video Watch it here

Please Pick one option for your plots Format:

Select one Options

Plot by Samples

Plot by z-score and Groups

Select Plot Options

Without_Thresholds

Default_Thresholds

Please Click on Minimum Two Genes For Clustering:

Without-Thresholds:This Table is created without applying any Thresholds.

Default-Thresholds:This Table is created using Default values in previous section.

Select Plot Downloading Format

png

svg

Select label option

OFF

Plot_Height (# pixels):

Plot_Width (# pixels):

Output Filename

Selected Genes List Heatmap

Plotly Modebar control: From left to right

1)Download plot as a selected format.

2) Zoom on selection. Click and hold you mouse button to select a region to zoom on.

3)Image panning.

4)Zoom IN.

5)Zoom OUT.

6)Reset axis to the original axis ranges.

7)Toggle spike lines on and off. Shows lines on a graph indicating the exact x-axis and y-axis.

Estimate The Abundance of Microenvironment Cell Populations In Mouse/Human Tissue

Select one Species

Mouse

Human

mMcp-Video Watch it here

mMCP-counter: This function estimates the quantity of several immune and stromal cell populations from heterogeneous transcriptomic data, which has been modified for use specifically murine samples. mMCP-counter

MCP-counter: This function estimates the quantity of several immune and stromal cell populations from absolute abundance cell populations in heterogeneous transcriptomic data, which has been proposed for use specifically with human samples.

MCP-counter

Download

Analysis Plot

Select Plot Downloading Format

png

svg

Plot height (# pixels):

Plot width (# pixels):

Download