Glossary Item |
Definition |
Week |
Box Plot |
A standardized way of displaying the distribution of data based on the five number summary: minimum, first quartile, median, third quartile, and maximum. Source: Wasserstein, Ronald L.; Lazar, Nicole A. (7 March 2016). "The ASA's Statement on p-Values: Context, Process, and Purpose". The American Statistician. 70 (2): 129–133. doi:10.1080/00031305.2016.1154108. Retrieved 30 October 2016. |
2 |
Chromosomal Instability |
A type of genomic instability in which chromosomes are unstable, such that either whole chromosomes or parts of chromosomes are duplicated or deleted |
3 |
False Discovery Rate |
A way of conceptualizing the rate of type I errors in null hypothesis testing when conducting multiple comparisons. Source: Benjamini, Yoav; Hochberg, Yosef (1995). "Controlling the false discovery rate: a practical and powerful approach to multiple testing" (PDF). Journal of the Royal Statistical Society, Series B. 57 (1): 289–300. MR 1325392. |
2, 4 |
Family wise error rate |
The probability of making one or more false discoveries, or type I errors, among all the hypotheses when performing multiple hypotheses tests |
2 |
Fold Change |
A measure describing how much a quantity changes going from an initial to a final value. In gene expression, this is usually comparison between the read-count in the control group and experimental group. |
2 |
Fold Enrichment |
A measure describing how much a quantity changes going from an initial to a final value. In gene expression, this is usually comparison between the read-count in the control group and experimental group. |
2, 3, 4, 7 |
Gene Ontology Enrichment (Profiling) |
Enrichment analysis will find which GO terms are over-represented (or under-represented) using annotations for that gene set using a null hypothesis that the representation is random. Results in the demo include the background frequency, the sample frequency, Expected p-value, an indication of over/underrepresentation for each term, and p-value. |
6 |
Heat Map |
A representation of data in the form of a map or diagram in which data values are represented as colors |
3, 4 |
Human Genome Nomenclature Committee |
A committee of the Human Genome Organisation (HUGO) that sets the standards for human gene nomenclature. HGNC also assigns a symbol to each gene (only unique to a species). |
6 |
KM Plot |
A non-parametric statistic used to estimate the survival function from lifetime data. In medical research, it is often used to measure the fraction of patients living for a certain amount of time after treatment. |
2 |
P-Value |
The probability of obtaining a result equal to or "more extreme" than what was actually observed, when the null hypothesis is true. High p-values indicate that data is likely with a true null while low p-values indicate that data is unlikely with a true null. |
2, 3, 4, 6, 7 |
Pathway Enrichment (Profiling) |
Pathway enrichment analysis will find which pathways are over-represented (or under-represented) by the gene/protein set using a null hypothesis that the representation is random. Similar to GO enrichment in part 2. |
6 |
Standard Deviation |
A measure that is used to quantify the amount of variation or dispersion of a set of data values |
2 |
T-Test |
An analysis of two populations means through the use of statistical examination. Refresher for T-Test: http://blog.minitab.com/blog/adventures-in-statistics-2/understanding-t-tests-t-values-and-t-distributions |
2, 3, 4, 7 |
UniProt |
An online resource that provides comprehensive, high-quality and freely accessible tools to access protein sequences and their functional information |
6 |
Wilcoxon Test |
A non-parametric statistical hypothesis test used when comparing two related samples, matched samples, or repeated measurements on a single sample to assess whether their population mean ranks differ. Source: Lowry, Richard. "Concepts & Applications of Inferential Statistics". Retrieved 24 March 2011. |
2 |