Term IRI	Term label	Parent term IRI	Parent term label	Alternative term	Definition
http://purl.obolibrary.org/obo/STATO_0000477	partial least square weight	http://purl.obolibrary.org/obo/IAO_0000027	data item	PLS weight	PLS weight (or Partial Least Square weight) is a information content entity which is generated when performing a Partial Least Square analysis
http://purl.obolibrary.org/obo/STATO_0000690	area under the value-time curve	http://purl.obolibrary.org/obo/STATO_0000209	area under curve	area under the value-by-time curve	An area under the curve where the curve is the repeated measures of a variable over time and the domain of interest is time.
http://purl.obolibrary.org/obo/STATO_0000737	correction factor for multiple testing	http://purl.obolibrary.org/obo/IAO_0000027	data item	correction for multiple testing	a data item used as input to a multiple testing correction method that means the alpha level is adjusted for multiple comparisons.
http://purl.obolibrary.org/obo/STATO_0000713	type I error	http://purl.obolibrary.org/obo/STATO_0000242	statistical error	type 1 error	the event of incorrectly rejecting the null hypothesis
http://purl.obolibrary.org/obo/STATO_0000714	type II error	http://purl.obolibrary.org/obo/STATO_0000242	statistical error	type 2 error	the event of incorrectly failing to reject the null hypothesis.
http://purl.obolibrary.org/obo/STATO_0000712	superiority margin	http://purl.obolibrary.org/obo/STATO_0000709	testing margin	minimal clinically important difference	A hypothesis testing margin of zero used in a study with a superiority goal.
The superiority margin (also sometimes called the minimal clinically important difference) defines how much better one treatment must be to be considered meaningfully superior — not just statistically different.
http://purl.obolibrary.org/obo/IAO_0000005	objective specification	http://purl.obolibrary.org/obo/IAO_0000033	directive information entity	goal specification	A directive information entity that describes an intended process endpoint. When part of a plan specification the concretization is realized in a planned process in which the bearer tries to effect the world so that the process endpoint is achieved.
http://purl.obolibrary.org/obo/IAO_0000027	data item	http://purl.obolibrary.org/obo/IAO_0000030	information content entity	data	a data item is an information content entity that is intended to be a truthful statement about something (modulo, e.g., measurement precision or other systematic errors) and is constructed/acquired by a method which reliably tends to produce (approximately) truthful statements.
http://purl.obolibrary.org/obo/IAO_0000184	scatter plot	http://purl.obolibrary.org/obo/IAO_0000038	graph	scattergraph	A scatterplot is a graph which uses Cartesian coordinates to display values for two variables for a set of data. The data is displayed as a collection of points, each having the value of one variable determining the position on the horizontal axis and the value of the other variable determining the position on the vertical axis.
http://purl.obolibrary.org/obo/IAO_0000300	textual entity	http://purl.obolibrary.org/obo/IAO_0000030	information content entity	text	A textual entity is a part of a manifestation (FRBR sense), a generically dependent continuant whose concretizations are patterns of glyphs intended to be interpreted as words, formulas, etc.
http://purl.obolibrary.org/obo/IAO_0000573	line graph	http://purl.obolibrary.org/obo/IAO_0000038	graph	line chart	A line graph is a type of graph created by connecting a series of data
points together with a line.
http://purl.obolibrary.org/obo/IAO_0000577	centrally registered identifier symbol	http://purl.obolibrary.org/obo/IAO_0000028	symbol	CRID symbol	A symbol that is part of a CRID and that is sufficient to look up a record from the CRID's registry.
http://purl.obolibrary.org/obo/IAO_0000578	centrally registered identifier	http://purl.obolibrary.org/obo/IAO_0000030	information content entity	CRID	An information content entity that consists of a CRID symbol and additional information about the CRID registry to which it belongs.
http://purl.obolibrary.org/obo/IAO_0000579	centrally registered identifier registry	http://purl.obolibrary.org/obo/IAO_0000100	data set	CRID registry	A CRID registry is a dataset of CRID records, each consisting of a CRID symbol and additional information which was recorded in the dataset through a assigning a centrally registered identifier process.
http://purl.obolibrary.org/obo/IAO_0000584	time sampled measurement data set	http://purl.obolibrary.org/obo/IAO_0000100	data set	experimental time series	A data set that is an aggregate of data recording some measurement at a number of time points. The time series data set is an ordered list of pairs of time measurement data and the corresponding measurement data acquired at that time.
http://purl.obolibrary.org/obo/OBI_0000070	assay	http://purl.obolibrary.org/obo/OBI_0000011	planned process	scientific observation	A planned process with the objective to produce information about the material entity that is the evaluant, by physically examining it or its proxies.
http://purl.obolibrary.org/obo/OBI_0000086	reagent role	http://purl.obolibrary.org/obo/BFO_0000023	role	reagent	A role inhering in a biological or chemical entity that is intended to be applied in a scientific technique to participate (or have molecular components that participate) in a chemical reaction that facilitates the generation of data about some entity distinct from the bearer, or the generation of some specified material output distinct from the bearer.
http://purl.obolibrary.org/obo/OBI_0000094	material processing	http://purl.obolibrary.org/obo/OBI_0000011	planned process	material transformation	A planned process which results in physical changes in a specified input material
http://purl.obolibrary.org/obo/OBI_0000250	dye role	http://purl.obolibrary.org/obo/OBI_0002743	molecular label role	dye	A molecular label role which inheres in a material entity and which is realized in the process of detecting a molecular dye that imparts color to some material of interest. 
http://purl.obolibrary.org/obo/OBI_0000423	extract	http://purl.obolibrary.org/obo/OBI_0000047	processed material	extracted material	an extract is a material entity which results from an extraction process
http://purl.obolibrary.org/obo/OBI_0000424	transcription profiling assay	http://purl.obolibrary.org/obo/OBI_0000443	analyte assay	gene expression profiling	An assay which aims to provide information about gene expression and transcription activity using ribonucleic acids collected from a material entity using a range of techniques and instrument such as DNA sequencers, DNA microarrays, Northern Blot
http://purl.obolibrary.org/obo/OBI_0000435	genotyping assay	http://purl.obolibrary.org/obo/OBI_0000443	analyte assay	genotype profiling, SNP genotyping	an assay which generates data about a genotype from a specimen of genomic DNA. A variety of
techniques and instruments can be used to produce information about sequence variation at particular genomic positions.
http://purl.obolibrary.org/obo/OBI_0000456	material transformation objective	http://purl.obolibrary.org/obo/IAO_0000005	objective specification	artifact  creation objective	an objective specifiction that creates an specific output object from input materials.
http://purl.obolibrary.org/obo/OBI_0000626	DNA sequencing assay	http://purl.obolibrary.org/obo/OBI_0600047	sequencing assay	nucleotide sequencing	DNA sequencing is a sequencing process which uses deoxyribonucleic acid as input and results in a the creation of DNA sequence information artifact using a DNA sequencer instrument.
http://purl.obolibrary.org/obo/OBI_0000648	clustered data set	http://purl.obolibrary.org/obo/IAO_0000100	data set	data set with assigned discovered class labels	A data set that is produced as the output of a class discovery data transformation and consists of a data set with assigned discovered class labels.
http://purl.obolibrary.org/obo/OBI_0000659	specimen collection process	http://purl.obolibrary.org/obo/OBI_0000011	planned process	specimen collection	A planned process with the objective of collecting a specimen.
http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective	hypothesis test objective	is a data transformation objective where the aim is to estimate statistical significance with the aim of proving or disproving a hypothesis by means of some data transformation 
http://purl.obolibrary.org/obo/OBI_0000722	paired-end library	http://purl.obolibrary.org/obo/OBI_0000047	processed material	paired-end tag (PET) library	is a collection of short paired tags from the two ends of DNA fragments are extracted and covalently linked as ditag constructs
http://purl.obolibrary.org/obo/OBI_0000727	k-nearest neighbors	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating	k-NN	A k-nearest neighbors is a data transformation which achieves a class discovery or partitioning objective, in which an input data object with vector y is assigned to a class label based upon the k closest training data set points to y; where k is the largest value that class label is assigned.
http://purl.obolibrary.org/obo/OBI_0000736	single fragment library	http://purl.obolibrary.org/obo/OBI_0000047	processed material	fragment library	is a collection of short tags from DNA fragments, are extracted and covalently linked as single tag constructs
http://purl.obolibrary.org/obo/OBI_0000739	Student's t-test	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test	t-test	Studen't t-test is a data transformation with the objective of a statistical hypothesis test in which the test statistic has a Student's t distribution if the null hypothesis is true. It is applied when the population is assumed to be normally distributed but the sample sizes are small enough that the statistic on which inference is based is not normally distributed because it relies on an uncertain estimate of standard deviation rather than on a precisely known value.
http://purl.obolibrary.org/obo/OBI_0000744	material sampling process	http://purl.obolibrary.org/obo/OBI_0000659	specimen collection process	sampling	A specimen gathering process with the objective to obtain a specimen that is representative of the input material entity
http://purl.obolibrary.org/obo/OBI_0000747	material sample	http://purl.obolibrary.org/obo/OBI_0100051	specimen	sample population	A material entity that has the material sample role
http://purl.obolibrary.org/obo/OBI_0000750	study design independent variable	http://purl.obolibrary.org/obo/STATO_0000258	variable	independent variable	a directive information entity that is part of a study design. Independent variables are entities whose values are selected to determine its relationship to an observed phenomenon (the dependent variable). In such an experiment, an attempt is made to find evidence that the values of the independent variable determine the values of the dependent variable (that which is being measured). The independent variable can be changed as required, and its values do not represent a problem requiring explanation in an analysis, but are taken simply as given. The dependent variable on the other hand, usually cannot be directly controlled
http://purl.obolibrary.org/obo/OBI_0000751	study design dependent variable	http://purl.obolibrary.org/obo/STATO_0000258	variable	dependent variable	dependent variable specification is part of a study design. The dependent variable is the event studied and expected to change when the independent variable varies.
http://purl.obolibrary.org/obo/OBI_0000791	multiple testing correction objective	http://purl.obolibrary.org/obo/OBI_0200186	error correction objective	multiple comparison correction objective	A multiple testing correction objectives is a data transformation objective where the aim is to correct for a set of statistical inferences considered simultaneously
http://purl.obolibrary.org/obo/OBI_0000938	categorical measurement datum	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum	nominal mesurement datum	A measurement datum that is reported on a categorical scale
http://purl.obolibrary.org/obo/OBI_0000968	device	http://purl.obolibrary.org/obo/OBI_0000047	processed material	instrument	A material entity that is designed to perform a function in a scientific investigation, but is not a reagent.
http://purl.obolibrary.org/obo/OBI_0001143	labeled nucleic acid extract	http://purl.obolibrary.org/obo/OBI_0000924	labeled specimen	labeled extract	a labeled specimen that is the output of a labeling process and has grain labeled nucleic acid for detection of the nucleic acid in future experiments.
http://purl.obolibrary.org/obo/OBI_0001271	RNA-seq assay	http://purl.obolibrary.org/obo/OBI_0600047	sequencing assay	transcription profiling by high throughput sequencing	An assay in which sequencing technology (e.g. Solexa/454) is used to generate RNA sequence, analyse the transcibed regions of the genome, and or to quantitate transcript abundance
http://purl.obolibrary.org/obo/OBI_0001442	q-value	http://purl.obolibrary.org/obo/OBI_0000662	error corrected data set	FDR adjusted p-value	A quantitative confidence value that measures the minimum false discovery rate that is incurred when calling that test significant. 
To compute q-values, it is necessary to know the p-value produced by a test and possibly set a false discovery rate level.
http://purl.obolibrary.org/obo/OBI_0001479	specimen from organism	http://purl.obolibrary.org/obo/OBI_0100051	specimen	tissue specimen	A specimen that derives from an anatomical part or substance arising from an organism. Examples of tissue specimen include tissue, organ, physiological system, blood, or body location (arm).
http://purl.obolibrary.org/obo/OBI_0001755	selection criterion	http://purl.obolibrary.org/obo/IAO_0000033	directive information entity	selection rule	A directive information entity which defines and states a principle of standard by which selection process may take place.
http://purl.obolibrary.org/obo/OBI_0100060	cultured cell population	http://purl.obolibrary.org/obo/OBI_0000047	processed material	cell culture sample	A processed material comprised of a collection of cultured cells that has been continuously maintained together in culture and shares a common propagation history.
http://purl.obolibrary.org/obo/OBI_0200000	data transformation	http://purl.obolibrary.org/obo/OBI_0000011	planned process	data processing	A planned process that produces output data from input data.
http://purl.obolibrary.org/obo/OBI_0200050	dimensionality reduction	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating	data projection	A dimensionality reduction is data partitioning which transforms each input m-dimensional vector (x_1, x_2, ..., x_m) into an output n-dimensional vector (y_1, y_2, ..., y_n), where n is smaller than m.
http://purl.obolibrary.org/obo/OBI_0200051	principal components analysis dimensionality reduction	http://purl.obolibrary.org/obo/OBI_0200050	dimensionality reduction	pca data reduction	A principal components analysis dimensionality reduction is a dimensionality reduction achieved by applying principal components analysis and by keeping low-order principal components and excluding higher-order ones.
http://purl.obolibrary.org/obo/OBI_0200073	family wise error rate correction method	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating	FWER correction	A family wise error rate correction method is a multiple testing procedure that controls the probability of at least one false positive.
http://purl.obolibrary.org/obo/OBI_0200083	survival analysis objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective	survival analysis	A data transformation objective which has the data transformation aims to model time to event data (where events are e.g. death and or disease recurrence); the purpose of survival analysis is to model the underlying distribution of event times and to assess the dependence of the event time on other explanatory variables
http://purl.obolibrary.org/obo/OBI_0200089	multiple testing correction method	http://purl.obolibrary.org/obo/OBI_0000668	error correction data transformation	multiple testing procedure	A multiple testing correction method is a hypothesis test performed simultaneously on M > 1 hypotheses.  Multiple testing procedures produce a set of rejected hypotheses that is an estimate for the set of false null hypotheses while controlling for a suitably define Type I error rate
http://purl.obolibrary.org/obo/OBI_0200111	data visualization	http://purl.obolibrary.org/obo/OBI_0000011	planned process	data encoding as image	An planned process that creates images, diagrams or animations from the input data.
http://purl.obolibrary.org/obo/OBI_0200154	agglomerative hierarchical clustering	http://purl.obolibrary.org/obo/OBI_0200042	hierarchical clustering	bottom-up hierarchical clustering	An agglomerative hierarchical clustering is a hierarchical clustering which starts with separate clusters and then successively combines these clusters until there is only one cluster remaining.
http://purl.obolibrary.org/obo/OBI_0200155	divisive hierarchical clustering	http://purl.obolibrary.org/obo/OBI_0200042	hierarchical clustering	top-down hierarchical clustering	A divisive hierarchical clustering is a hierarchical clustering which starts with a single cluster and then successively splits resulting clusters until only clusters of individual objects remain.
http://purl.obolibrary.org/obo/OBI_0200163	false discovery rate correction method	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating	FDR correction method	The false discovery rate is a  data transformation used in multiple hypothesis testing to correct for multiple comparisons. It controls the expected proportion of incorrectly rejected null hypotheses (type I errors) in a list of rejected hypotheses. It is a less conservative comparison procedure with greater power than familywise error rate (FWER) control, at a cost of increasing the likelihood of obtaining type I errors. .
http://purl.obolibrary.org/obo/OBI_0200175	class discovery data transformation	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating	unsupervised classification data transformation	A class discovery data transformation (sometimes called unsupervised classification) is a data transformation that has objective class discovery.
http://purl.obolibrary.org/obo/OBI_0200178	class discovery objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective	clustering objective	A class discovery objective (sometimes called unsupervised classification) is a data transformation objective where the aim is to organize input data  (typically vectors of attributes) into classes, where the number of classes and their specifications are not known a priori. Depending on usage, the class assignment can be definite or probabilistic.
http://purl.obolibrary.org/obo/OBI_0200201	ANOVA	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test	Analysis of Variance	ANOVA or analysis of variance is a data transformation in which a statistical test of whether the means of several groups are all equal.
http://purl.obolibrary.org/obo/OBI_0400043	flow cell	http://purl.obolibrary.org/obo/OBI_0000967	container	flow_cell	Aparatus in the fluidic subsystem where the sheath and sample meet. Can be one of several types; jet-in-air, quartz cuvette, or a hybrid of the two. The sample flows through the center of a fluid column of sheath fluid in the flow cell.
http://purl.obolibrary.org/obo/OBI_0400148	DNA microarray	http://purl.obolibrary.org/obo/OBI_0400147	microarray	DNA-array	A DNA-microarray is a microarray that is used as a physical 2D immobilisation matrix for DNA sequences. DNA microarray-bound DNA fragments are used as targets for a hybridising probed sample.
http://purl.obolibrary.org/obo/OBI_0500000	study design	http://purl.obolibrary.org/obo/IAO_0000104	plan specification	experimental design	A plan specification comprised of protocols (which may specify how and what kinds of data will be gathered) that are executed as part of an investigation and is realized during a study design execution.
http://purl.obolibrary.org/obo/OBI_0500006	parallel group design	http://purl.obolibrary.org/obo/OBI_0500000	study design	independent measure design	A parallel group design or independent measure design is a study design which uses unique experimental unit each experimental group, in other word no two individuals are shared between experimental groups, hence also known as parallel group design. Subjects of a treatment group receive a unique combination of independent variable values making up a treatment
http://purl.obolibrary.org/obo/OBI_0600015	group assignment	http://purl.obolibrary.org/obo/OBI_0000011	planned process	study assignment	group assignment is a process which has an organism as specified input and during which a role is assigned
http://purl.obolibrary.org/obo/OBI_0600038	addition of molecular label	http://purl.obolibrary.org/obo/OBI_0000094	material processing	labeling	a material processing technique intended to add a molecular label to some input material entity, to allow detection of the molecular target of this label in a detection of molecular label assay
http://purl.obolibrary.org/obo/OBI_0600064	recombinant vector cloning	http://purl.obolibrary.org/obo/OBI_0000094	material processing	molecular cloning	a planned process with the objective to insert genetic material into a cloning vector for future replication of the inserted material
http://purl.obolibrary.org/obo/OBI_1000029	phage display library	http://purl.obolibrary.org/obo/OBI_0000731	recombinant vector	display library	a phage display library is a collection of materials in which a mixture of genes or gene fragments is expressed and can be individually selected and amplified.
http://purl.obolibrary.org/obo/SO_0000001	region	http://purl.obolibrary.org/obo/BFO_0000031	generically dependent continuant	primary structure of sequence macromolecule	A sequence_feature with an extent greater than zero. A nucleotide region is composed of bases and a polypeptide region is composed of amino acids.
http://purl.obolibrary.org/obo/STATO_0000023	studentized range distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution	q distribution	The studentized range (q) distribution is a probability distribution used by the Tukey Honestly Significant Difference test. 

The distribution of the statistic
[x̄(k)- x̄(1)]/(s/√n)
where random samples of size n have been taken from k independent and identically distributed normal populations, with x̄(1) and x̄(k) being, respectively, the smallest and largest of the k sample means, and s2 being the pooled estimate of the common variance. This statistic is particularly used in multiple comparison tests.
http://purl.obolibrary.org/obo/STATO_0000040	MA plot	http://purl.obolibrary.org/obo/IAO_0000184	scatter plot	M vs A plot	an MA plot is a scatter plot of the log intensity ratios M = log_2(T/R) versus the average log intensities A = log_2(T*T)/2, where T and R represent the signal intensities in the test and reference channels respectively.
http://purl.obolibrary.org/obo/STATO_0000243	box and whisker plot	http://purl.obolibrary.org/obo/IAO_0000038	graph	box plot	a box plot is a graph which plots datasets relying on their quartiles and the interquartile range to create the box and the whiskers.
http://purl.obolibrary.org/obo/STATO_0000274	receiver operational characteristics curve	http://purl.obolibrary.org/obo/IAO_0000038	graph	ROC curve	receiver operational characteristics curve is a graphical plot which illustrates the performance of a binary classifier system as its discrimination threshold (aka cut-off point) is varied by plotting  sensitivity vs (1 − specificity)
http://purl.obolibrary.org/obo/STATO_0000283	negative binomial distribution	http://purl.obolibrary.org/obo/STATO_0000117	discrete probability distribution	Pólya distribution	negative binomial probability distribution is a discrete probability distribution of the number of successes in a sequence of Bernoulli trials before a specified (non-random) number of failures (denoted r) occur. The negative binomial distribution, also known as the Pascal distribution or Pólya distribution, gives the probability of  r-1 successes and x  failures in x+r-1 trials, and success on the (x+r)th trial.
http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating	http://purl.obolibrary.org/obo/OBI_0000011	planned process	dataset creation	A planned process that has a data set as its specified output.
http://purl.obolibrary.org/obo/STATO_0000652	calibration	http://purl.obolibrary.org/obo/OBI_0200000	data transformation	predictive model calibration	calibration in statistics refers to the process of ensuring that the predicted probabilities or scores from a statistical model accurately reflect the true probabilities or outcomes observed in the data. It is an essential aspect of predictive modeling to ensure the reliability and interpretability of model predictions, where the goal is to estimate the likelihood of certain events or outcomes.
http://purl.obolibrary.org/obo/CHEBI_33839	macromolecule	http://purl.obolibrary.org/obo/CHEBI_23367	molecular entity	polymer	A macromolecule is a molecule of high relative molecular mass, the structure of which essentially comprises the multiple repetition of units derived, actually or conceptually, from molecules of low relative molecular mass.
http://purl.obolibrary.org/obo/STATO_0000134	true negative rate	http://purl.obolibrary.org/obo/STATO_0000184	ratio	specificity	specificity is a measurement datum qualifying a binary classification test and is computed by substracting the false positive rate to the integral numeral 1
http://purl.obolibrary.org/obo/STATO_0000190	threshold cycle	http://purl.obolibrary.org/obo/STATO_0000232	number of PCR cycle	Ct	Threshold cycle (or Ct or Cq) is a count which is defined as the fractional PCR cycle number at which the reporter fluorescence is greater than the threshold in the context of the RT-qPCR assay. The Ct is a basic principle of real time PCR and is an essential component in producing accurate and reproducible data.
http://purl.obolibrary.org/obo/STATO_0000221	random variable	http://purl.obolibrary.org/obo/STATO_0000258	variable	stochastic variable	a random variable (or aleatory variable or stochastic variable) in probability and statistics,  is a variable whose value is subject to variations due to chance (i.e. randomness, in a mathematical sense)
http://purl.obolibrary.org/obo/STATO_0000228	ordinal variable	http://purl.obolibrary.org/obo/STATO_0000252	categorical variable	ranked variable	ordinal variable is a categorical variable where  the discrete possible values are ordered or  correspond to an implicit ranking
http://purl.obolibrary.org/obo/STATO_0000233	true positive rate	http://purl.obolibrary.org/obo/STATO_0000184	ratio	sensitivity	sensitivity is a measurement datum qualifying a binary classification test and is computed by substracting the false negative rate to the integral numeral 1
http://purl.obolibrary.org/obo/STATO_0000252	categorical variable	http://purl.obolibrary.org/obo/STATO_0000258	variable	qualitative factor	a categorical variable  is a variable  which that can only assume a finite number  of value and cast observation in a small number of categories
http://purl.obolibrary.org/obo/STATO_0000011	cartesian coordinate axis	http://purl.obolibrary.org/obo/STATO_0000007	axis	rectangular coordinate axis	a cartesian axis is one of 3 the axis in a cartesian coordinate system defining a referential in 3 dimensions. each of the axis is orthogonal to the other 2
http://purl.obolibrary.org/obo/STATO_0000027	test of association between categorical variables	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test	test of association	linkage between 2 categorical variable test is a statistical test which evaluates if there is an association between a predictor variable assuming discrete values and a response variable also assuming discrete values
http://purl.obolibrary.org/obo/STATO_0000037	standard error of the mean	http://purl.obolibrary.org/obo/STATO_0000562	standard error of estimate	SEM	The standard error of the mean (SEM) is data item denoting the standard deviation of the sample-mean's estimate of a population mean.
It is calculated by dividing the sample standard deviation (i.e., the sample-based estimate of the standard deviation of the population) by the square root of n , the size (number of observations) of the sample.

A measure of dispersion applied to means across hypothetical repeated random samples.
http://purl.obolibrary.org/obo/STATO_0000044	one-way ANOVA	http://purl.obolibrary.org/obo/OBI_0200201	ANOVA	one factor ANOVA	one-way anova is an analysis of variance where the different groups being compared are associated with the factor levels of only one independent variable. The null hypothesis is an absence of difference between the means calculated for each of the groups. The test assumes normality and equivariance of the data.
http://purl.obolibrary.org/obo/STATO_0000045	two-way ANOVA	http://purl.obolibrary.org/obo/OBI_0200201	ANOVA	two factor ANOVA	two-way anova is an analysis of variance where the different groups being compared are associated the factor levels of exatly 2 independent variables. The null hypothesis is an absence of difference between the means calculated for each of the groups. The test assumes normality and equivariance of the data.
http://purl.obolibrary.org/obo/STATO_0000054	homoskedasticity hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis	equality of variance	homoskedasticity states that all variances under consideration are homogenous.
http://purl.obolibrary.org/obo/STATO_0000076	Mann-Whitney U-test	http://purl.obolibrary.org/obo/STATO_0000198	non-parametric test	rank-sum test for the comparison of two samples	The Mann-Whitney U-test is a null hypothesis statistical testing procedure which allows two groups (or conditions or treatments) to be compared without making the assumption that values are normally distributed. 
The Mann-Whitney test is the non-parametric equivalent of the t-test for independent samples
http://purl.obolibrary.org/obo/STATO_0000077	Shapiro-Wilk test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test	S-W test	Shapiro-Wilk test is a goodness of fit test which evaluates the null hypothesis that the sample is drawn from a population following a normal distribution
http://purl.obolibrary.org/obo/STATO_0000081	Pearson's Chi square test of independence between categorical variables	http://purl.obolibrary.org/obo/STATO_0000027	test of association between categorical variables	Chi2 test for independence	Pearson's Chi-Squared test is a statistical null hypothesis test which is used to either evaluate goodness of fit of dataset to a Chi-Squared distribution or used to test independence  of 2 categorical variables (ie absence of association between those variables).
http://purl.obolibrary.org/obo/STATO_0000083	Kolmogorov-Smirnov test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test	K-S test	Kolmogorov-Smirnov test is a goodness of fit test which evaluates the null hypothesis that a sample is drawn from a population that follows a specific continuous probability distribution.
http://purl.obolibrary.org/obo/STATO_0000094	Kruskal Wallis test	http://purl.obolibrary.org/obo/STATO_0000198	non-parametric test	rank-sum test for the comparison of multiple (more than 2) samples.	The Kruskal–Wallis test is a null hypothesis statistical testing objective which  allows multiple (n>=2) groups (or conditions or treatments) to be compared, without making the assumption that values are normally distributed. The Kruskal–Wallis test is the non-parametric equivalent of the independent samples ANOVA.
The Kruskal–Wallis test is most commonly used when there is one nominal variable and one measurement variable, and the measurement variable does not meet the normality assumption of an anova.
http://purl.obolibrary.org/obo/STATO_0000099	random effect model	http://purl.obolibrary.org/obo/STATO_0000107	statistical model	variance components model	a random effect(s) model, also called a variance components model, is a kind of hierarchical linear model. It assumes that the dataset being analysed consists of a hierarchy of different populations whose differences relate to that hierarchy.
http://purl.obolibrary.org/obo/STATO_0000100	standardized mean difference	http://purl.obolibrary.org/obo/STATO_0000613	difference	SMD	standardized mean difference is statistic computed by forming the difference between two means, divided by an estimate of the within-group standard deviation.
It is used to provide an estimation of the effect size between two treatments when the predictor (independent variable) is categorical and the response(dependent) variable is continuous.

A standardized mean difference is a statistic that is a difference between two means, divided by a statistical measure of dispersion. 

The term Standardized Mean Difference is a description of the concept without an explicit type of statistical measure of dispersion. If the statistical measure of dispersion is specified, then a type (child term) of Standardized Mean Difference is preferred.
http://purl.obolibrary.org/obo/STATO_0000113	variance	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion	σ2	variance is a data item about a random variable or probability distribution. it is equivalent to the square of the standard deviation.  It is one of several descriptors of a probability distribution, describing how far the numbers lie from the mean (expected value).The variance is the second moment of a distribution.
http://purl.obolibrary.org/obo/STATO_0000123	r2 measure of linkage desequilibrium	http://purl.obolibrary.org/obo/STATO_0000142	correlation coefficient	R squared measure of LD	r2 is a correlation coefficient which is computed over the frequency of 2 dichotomous variable and is used as a measure of Linkage Disequilibrium and as input data item to the creation of an LD plot
http://purl.obolibrary.org/obo/STATO_0000127	99% confidence interval	http://purl.obolibrary.org/obo/STATO_0000196	confidence interval	confidence interval at 1% of type I error rate	a confidence interval which covers 99% of the sampling distribution, meaning that there is a 1% risk of false positive (type I error)
http://purl.obolibrary.org/obo/STATO_0000133	post-hoc analysis	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test	a posteriori test	A post-hoc analysis is a statistical test carried out following an analysis of variance which ruled out the null hypothesis of absence of difference between group which allows identifying which groups differ.
http://purl.obolibrary.org/obo/STATO_0000148	Cochran-Armitage test for trend	http://purl.obolibrary.org/obo/STATO_0000027	test of association between categorical variables	CATT	The Cochran-Armitage test is a statistical test used in categorical data analysis when the aim is to assess for the presence of an association between a dichotomous variable (variable with two categories) and a polychotomous variable  (a variable with k categories).

The two-level variable represents the response, and the other represents an explanatory variable with ordered levels. The null hypothesis is the hypothesis of no trend, which means that the binomial proportion is the same for all levels of the explanatory variable

For example, doses of a treatment can be ordered as 'low', 'medium', and 'high', and we may suspect that the treatment benefit cannot become smaller as the dose increases. The trend test is often used as a genotype-based test for case-control genetic association studies.
http://purl.obolibrary.org/obo/STATO_0000150	minimum observed value	http://purl.obolibrary.org/obo/STATO_0000039	statistic	minimum value	a minimum value is a data item which denotes the smallest value found in a dataset or resulting from a calculation.
http://purl.obolibrary.org/obo/STATO_0000151	maximum observed value	http://purl.obolibrary.org/obo/STATO_0000039	statistic	maximum value	maximum value is a data item which denotes the largest value found in a dataset or resulting from a calculation.
http://purl.obolibrary.org/obo/STATO_0000172	FPKM-fragments per kilobase of transcript per million fragments mapped	http://purl.obolibrary.org/obo/STATO_0000047	count	FPKM	FPKM, which is the expected fragments per kilobase of transcript per million fragments mapped is a metric used to report transcript expression event as generated by RNA-Seq using paired-end library. The calculated value results from 2 types of normalization, one to take into account the difference in reads counts associated with transcript length (at equal abundance, longer transcripts will have more reads than shorter transcripts) , (hence the 'per kilobase of transcript') and the other one to take into account different sequencing depth during distinct sequencing runs (hence the 'per millions mapped fragment'. The metric is specifically produced by cufflink software.

Recommended use:

-FPKM measure may be used for gene count comparisons within a sample. 

-FPKM measure is NOT suited for between sample comparisons.
-FPKM measure is NOT suited for Differential Expression analysis.
http://purl.obolibrary.org/obo/STATO_0000201	Spearman's rank correlation coefficient	http://purl.obolibrary.org/obo/STATO_0000142	correlation coefficient	Spearman's rho	Spearman's rank correlation coefficient is a correlation coefficient which is a nonparametric measure of statistical dependence between two ranked variables. It assesses how well the relationship between two variables can be described using a monotonic function. If there are no repeated data values, a perfect Spearman correlation of +1 or −1 occurs when each of the variables is a perfect monotone function of the other.
Spearman's coefficient may be used when the conditions for computing Pearson's correlation are not met (e.g linearity, normality of the 2 continuous variables) but may require a ranking transformation of the variables
http://purl.obolibrary.org/obo/STATO_0000204	F-distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution	Snedecor Fisher distribution	the F-distribution is a continuous probability distribution which arises in the testing of whether two observed samples have the same variance.
http://purl.obolibrary.org/obo/STATO_0000206	RPKM-reads per kilobase of transcript per million fragments mapped	http://purl.obolibrary.org/obo/STATO_0000047	count	RPKM	RPKM is a kind of count which numbers the sequence reads found per kilobase of transcript reported to million of sequence reads. RPKM is a metric generated by ERANGE software tool as reported by Mortazi et al, in 2008.
The metric has been enhanced and replaced by FPKM to better take into account splice variant. FKPM uses a statistical model to perform the computation.

Recommended use:

-RPKM measure may be used for gene count comparisons within a sample. 

-RPKM measure is NOT suited for between sample comparisons.
-RPKM measure is NOT suited for Differential Expression analysis.
http://purl.obolibrary.org/obo/STATO_0000214	50% confidence interval	http://purl.obolibrary.org/obo/STATO_0000196	confidence interval	confidence interval at 10% of type I error rate	a confidence interval which covers 50% of the sampling distribution, meaning that there is a 50% risk of false positive (type I error)
http://purl.obolibrary.org/obo/STATO_0000222	graeco-latin square design	http://purl.obolibrary.org/obo/OBI_0500007	randomized complete block design	orthogonal latin squares design	graeco-latin square design is_a study design which allows in its simpler form controlling 3 levels of nuisance variables (also known as blocking variables). The 3 nuisance factors are divided into a tabular grid with the property that each row and each column receive each treatment exactly once.
http://purl.obolibrary.org/obo/STATO_0000226	equal variance testing objective	http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective	homoschedacity testing objective	It is a testing objective to ensure the variances of the different groups used in a statistical test are similar (i.e. not too different).
http://purl.obolibrary.org/obo/STATO_0000227	normal distribution	http://purl.obolibrary.org/obo/STATO_0000329	symmetric distribution	Gaussian distribution	a normal distribution is a continuous probability distribution described by a probability distribution function described here:
http://mathworld.wolfram.com/NormalDistribution.html
http://purl.obolibrary.org/obo/STATO_0000230	expected value	http://purl.obolibrary.org/obo/STATO_0000029	measure of central tendency	μ	the expected value (or expectation, mathematical expectation, EV, mean, or the first moment) of a random variable is a data item which corresponds to the weighted average of all possible values that this random variable can take on. The weights used in computing this average correspond to the probabilities in case of a discrete random variable, or densities in case of a continuous random variable. From a rigorous theoretical standpoint, the expected value is the integral of the random variable with respect to its probability measure.
http://purl.obolibrary.org/obo/STATO_0000231	95% confidence interval	http://purl.obolibrary.org/obo/STATO_0000196	confidence interval	confidence interval at 5% of type I error rate	a confidence interval which covers 95% of the sampling distribution, meaning that there is a 5% risk of false positive (type I error). If the number of observations made is large enough, the sampling distribution can be assumed to be normal, which entails that 95% of the sampling distributions falls within roughly2 (1.96) standard deviations from the mean.
http://purl.obolibrary.org/obo/STATO_0000237	standard deviation	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion	σ	The standard deviation of a random variable, statistical population, data set, or probability distribution is a measure of variation which correspond to the average distance from the mean of the data set to any given point of that dataset. It also corresponds to the square root of its variance.
http://purl.obolibrary.org/obo/STATO_0000240	Kendall's correlation coefficient	http://purl.obolibrary.org/obo/STATO_0000142	correlation coefficient	Kendall's tau (τ) coefficient	Kendall's correlation coefficient is a correlation coefficient between 2 ordinal variables (natively or following a ranking procedure) and may be used when the conditions for computing Pearson's correlation are not met (e.g linearity, normality of the 2 continuous variables)
http://purl.obolibrary.org/obo/STATO_0000245	relative risk	http://purl.obolibrary.org/obo/STATO_0000620	risk	risk ratio	Relative risk is a measurement datum which denotes the risk of an 'event' relative to an 'exposure'. Relative risk is calculated by forming the ratio of the probability of the event occurring in the exposed group versus the probability of this event occurring in the non-exposed group.
http://purl.obolibrary.org/obo/STATO_0000248	blocking variable	http://purl.obolibrary.org/obo/STATO_0000258	variable	nuisance variable	a blocking variable is a independent variable which is used in a blocking process part of an experiment with the purpose of maximizing the signal coming from the main variable.
http://purl.obolibrary.org/obo/STATO_0000260	repeated measure ANOVA	http://purl.obolibrary.org/obo/STATO_0000202	within subject comparison statistical test	ANOVA for correlated samples	repeated measure ANOVA is a kind of ANOVA specifically developed for non-independent observations as found when repeated measurements on the sample experimental unit. 
repeated measure ANOVA is sensitive to departure from normality (evaluation using Bartlett's test), more so in the case of unbalanced groups (i.e. different sizes of sample populations).
Departure from sphericity (evaluation  using Mauchly'test)  used to be an issue which is now handled robustly by modern tools such as R's lme4 or nlme, which accommodate dependence assumptions other than sphericity.
http://purl.obolibrary.org/obo/STATO_0000264	factor level combination	http://purl.obolibrary.org/obo/IAO_0000100	data set	treatment combination	a factor level combination is one a possible sets of factor levels resulting from the cartesian product of sets of factor and their levels as defined in a factorial design
http://purl.obolibrary.org/obo/STATO_0000265	factor level	http://purl.obolibrary.org/obo/IAO_0000027	data item	treatment	A factor level is data item which corresponds to one of the value assumed by a factor or independent variable manipulated and set by the experimentalist. In the context of factorial design, a factor level is assumed to be or treated as a category in a categorical variable
http://purl.obolibrary.org/obo/STATO_0000275	transmission disequilibrium test	http://purl.obolibrary.org/obo/STATO_0000027	test of association between categorical variables	TDT	The transmission disequilibrium test is a statistical test for genetic linkage between genetic marker and a trait in families. The test is robust to population structure.
http://purl.obolibrary.org/obo/STATO_0000280	Pearson's correlation coefficient	http://purl.obolibrary.org/obo/STATO_0000142	correlation coefficient	r statistics	The Pearson's correlation coefficient is a correlation coefficient which evaluates two continuous variables for association strength in a data sample. It assumes that both variables are normally distributed and linearity exists. 
The coefficient is calculated by dividing their covariance with the product of their individual standard deviations. It is a normalized measurement of how the two are linearly related.
http://purl.obolibrary.org/obo/STATO_0000286	one tailed test	http://purl.obolibrary.org/obo/OBI_0200000	data transformation	one sided test	a one-tailed test is a statistical test which, assuming an unskewed probability distribution, allocates all of the significance level to evaluate only one hypothesis to explain a difference.
The one-tailed test provides more power to detect an effect in one direction by not testing the effect in the other direction.
one-tailed test should be preceded by two-tailed test in order to avoid missing out on detecting alternate effect explaining an observed difference.
http://purl.obolibrary.org/obo/STATO_0000289	design matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix	model matrix	a design matrix is an information content entity which denotes a study design. The design matrix is a n by m matrix where n the number of rows, corresponds to the number of observations (4 rows if quadruplicates) and where m, the number of columns corresponds to the number of independent variables. Each element in the matrix correspond to a discretized value representing one of the factor levels for a given factor.
A design matrix can be used as input to statistical modeling or statistical analysis.

The design matrix contains data on the independent variables (also called explanatory variables) in statistical models which attempt to explain observed data on a response variable (often called a dependent variable) in terms of the explanatory variables. The theory relating to such models makes substantial use of matrix manipulations involving the design matrix: see for example linear regression. A notable feature of the concept of a design matrix is that it is able to represent a number of different experimental designs and statistical models, e.g., ANOVA, ANCOVA, and linear regression
http://purl.obolibrary.org/obo/STATO_0000296	absence of enrichment hypothesis	http://purl.obolibrary.org/obo/STATO_0000295	absence of positive difference hypothesis	absence of over representation hypothesis	absence of depletion difference hypothesis is a hypothesis which assumes that the representation of an element significantly greater than a threshold does not exist.
http://purl.obolibrary.org/obo/STATO_0000297	absence of depletion hypothesis	http://purl.obolibrary.org/obo/STATO_0000294	absence of negative difference hypothesis	absence of under representation hypothesis	absence of depletion difference hypothesis is a hypothesis which assumes that the representation of an element significantly less than a threshold does not exist.
http://purl.obolibrary.org/obo/STATO_0000303	two sample t-test with equal variance	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test	two sample t-test	two sample t-test is a null hypothesis statistical test which is used to reject or accept the hypothesis of absence of difference between the means over 2 randomly sampled populations.
It uses a t-distribution for the test and assumes that the variables in the population are normally distributed and with equal variances.
http://purl.obolibrary.org/obo/STATO_0000304	two sample t-test with unequal variance	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test	Welsh t-test	Welch t-test is a two sample t-test used when the variances of the 2 populations/samples are thought to be unequal (homoskedasticity hypothesis not verified). In this version of the two-sample t-test, the denominator used to form the t-statistics, does not rely on a 'pooled variance' estimate.
http://purl.obolibrary.org/obo/STATO_0000309	Pearson's Chi square test of goodness of fit	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test	Chi2 test for goodness of fit	Pearson's Chi-Squared test for goodnes of fit is a statistical null hypothesis test which is used to either evaluate goodness of fit of dataset to a Chi-Squared distribution
http://purl.obolibrary.org/obo/STATO_0000316	root-mean-square standardized effect	http://purl.obolibrary.org/obo/STATO_0000085	effect size estimate	Ψ	root-mean-square standardized effect is a statistic which denotes effect size in the context of analysis of variance and corresponds to the square root of the arithmetic average of p  standardized effects  (effects normalized to be expressed in standard deviation units).
http://purl.obolibrary.org/obo/STATO_0000347	Matern function anisotropic covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure	Matern covariance function	Given two sets of locations computes the Matern cross covariance matrix for covariances among all pairings.
http://purl.obolibrary.org/obo/STATO_0000349	spatial linear geometric anisotropic covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure	SP(LINGA)	spatial linear geometric anisotropic covariance structure is a type of covariance structure characterized by its anisotropy, i.e., the variation of properties can be different in directions x and y, which is this case give linear features.
http://purl.obolibrary.org/obo/STATO_0000351	spatial gaussian geometric anisotropic covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure	SP(GAUGA)	spatial gaussian geometric anisotropic covariance structure is a type of covariance structure characterized by its anisotropy, i.e., the variation of properties can be different in directions x and y, which is this case give gaussian features.
http://purl.obolibrary.org/obo/STATO_0000352	spatial exponential geometric anisotropic covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure	SP(EXPGA)	spatial exponential geometric anisotropic covariance structure is a type of covariance structure characterized by its anisotropy, i.e., the variation of properties can be different in directions x and y, which is this case give exponential features.
http://purl.obolibrary.org/obo/STATO_0000353	spatial exponential anisotropic covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure	SP(EXPA)(c-list)	spatial exponential anisotropic covariance structure is a type of covariance structure characterized by its anisotropy, i.e., the variation of properties can be different in directions x and y, which is this case give exponential features.
http://purl.obolibrary.org/obo/STATO_0000355	heterogeneous Toeplitz covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure	TOEPH	This covariance structure has heterogenous variances and heterogenous correlations between elements. The correlation between adjacent elements is homogenous across pairs of adjacent elements. The correlation between elements separated by a third is again homogenous, and so on.
http://purl.obolibrary.org/obo/STATO_0000358	Huynh-Feldt covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure	HF	a form of covariance structure used to provide analysis ground s in the context of repeated measures datasets (longitudinal, time series)
http://purl.obolibrary.org/obo/STATO_0000364	first order autoregressive moving average covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure	ARMA(1,1)	first order autoregressive moving average covariance structure is a type of covariance structure which is used in the context of time series analysis
http://purl.obolibrary.org/obo/STATO_0000399	interquartile mean	http://purl.obolibrary.org/obo/STATO_0000163	trimmed mean	IQM	The interquartile mean (IQM) (or midmean) is a statistical measure of central tendency based on the truncated mean of the interquartile range. 

In the calculation of the IQM, only the data in the second and third quartiles is used (as in the interquartile range), and the lowest 25% and the highest 25% of the scores are discarded. These points are called the first and third quartiles, hence the name of the IQM.
http://purl.obolibrary.org/obo/STATO_0000400	quadratic mean	http://purl.obolibrary.org/obo/STATO_0000029	measure of central tendency	root mean square	The root mean square (abbreviated RMS or rms), also known as the quadratic mean, in statistics is a statistical measure of central tendency defined as the square root of the mean of the squares of a sample.

( To find the root mean square of a set of numbers, square all the numbers in the set and then find the arithmetic mean of the squares. Take the square root of the result. This is the root mean square.)
http://purl.obolibrary.org/obo/STATO_0000402	population mean	http://purl.obolibrary.org/obo/STATO_0000029	measure of central tendency	distribution mean	the population mean or distribution mean is a parameter of a probability distribution or population indicative of the data dispersion. For continous probabibility distribution, the population mean is computed using the probability density function, for discrete probability distributions, a mass density function is used instead. 
A population mean can be estimated by computing a sample mean
http://purl.obolibrary.org/obo/STATO_0000464	linear mixed model	http://purl.obolibrary.org/obo/STATO_0000189	mixed effect model	LMM	A lnear mixed model is a mixed model containing both fixed effects and random effects and in which factors and covariates are assumed to have a linear relationship to the dependent variable. These models are useful in a wide variety of disciplines in the physical, biological and social sciences. They are particularly useful in settings where repeated measurements are made on the same statistical units (longitudinal study), or where measurements are made on clusters of related statistical units. Because of their advantage in dealing with missing values, mixed effects models are often preferred over more traditional approaches such as repeated measures ANOVA.

Fixed-effects factors are generally considered to be the variables whose values of interest are all represented in the data file.

Random-effects factors are variables whose values correspond to unwanted variation. They are useful when trying to understand variability in the dependent variable which was not anticipated and exceeds what was expected.

Linear mixed models also allow to specify specific interactions between factors, and allow the evaluation of the various linear effect that a particular combination of factor levels may have on a response variable.

Finally, linear mixed models allow to specify variance components in order to describe the relation between various random effects levels.
http://purl.obolibrary.org/obo/STATO_0000682	kappa	http://purl.obolibrary.org/obo/STATO_0000623	measure of agreement	kappa statistic	A measure of agreement among categorical assessments, corrected for chance agreement.
http://purl.obolibrary.org/obo/STATO_0000691	area under the precision-recall curve	http://purl.obolibrary.org/obo/STATO_0000209	area under curve	AUPRC	An area under the curve where the curve is the precision and the domain of interest is the recall.
http://purl.obolibrary.org/obo/STATO_0000695	precision-recall curve	http://purl.obolibrary.org/obo/IAO_0000038	graph	PR-curve	The precision-recall curve is a graph which plots the sensitivity or recall on the X-axis and the precision on the Y-axis.
The precision-recall curves focuses on the trade-off between Precision and Recall at various thresholds. It also highlights the performance of the model for the positive class (often the minority or more important class).
It is particularly suited for for imbalanced datasets where the positive class is rare.
The PR-curve is more sensitive to class imbalance. As precision drops significantly if the model predicts too many false positives, it makesthe PR-curve better for evaluating performance on imbalanced datasets.
http://purl.obolibrary.org/obo/STATO_0000699	t-score	http://purl.obolibrary.org/obo/STATO_0000039	statistic	t-value	A t-score generally refers to the value resulting from standardizing a sample statistic (e.g., sample mean) in terms of the t-distribution. It quantifies how far the sample statistic is from the hypothesized population parameter in units of the estimated standard error.

A t-score is a statistic that represents a measure of the divergence of an individual experimental result from a specified mean, taking sample size into account and expressed in terms of the number of standard errors from the mean value.
http://purl.obolibrary.org/obo/STATO_0000700	p-value	http://purl.obolibrary.org/obo/OBI_0000071	quantitative confidence value	p value	A hypothesis testing measure that represents the probability of obtaining a result at least as far from the value actually obtained as the value expected, assuming the null hypothesis is true.
http://purl.obolibrary.org/obo/STATO_0000706	Bayesian inference	http://purl.obolibrary.org/obo/STATO_0000299	statistical inference	Bayesian approach	A statistical inference paradigm in which the assumptions are that parameters are random variables with probability distributions, that prior knowledge can be represented with such distributions, and that one learns from data by updating those prior distributions using Bayes' Rule.
http://purl.obolibrary.org/obo/STATO_0000707	frequentist inference	http://purl.obolibrary.org/obo/STATO_0000299	statistical inference	frequentist statistical inference paradigm	A statistical inference paradigm in which the assumptions are that probabilities reflect relative frequencies of events and that the probability of observing data are governed by fixed yet unknown population parameters.
http://purl.obolibrary.org/obo/stato.owl#http://purl.obolibrary.org/obo/STATO_0000669	sum	http://purl.obolibrary.org/obo/STATO_0000039	statistic	total	A statistic that represents the result of adding all the values in a collection of values.
http://purl.obolibrary.org/obo/STATO_0000287	two tailed test	http://purl.obolibrary.org/obo/OBI_0200000	data transformation	two sided test	a two tailed test is a statistical test which assess the null hypothesis of absence of difference assuming a symmetric (not skewed) underlying probability distribution by allocating half of the significance level selected to each of the direction of change which could explain a difference (for example, a difference can be an excess or a loss).
http://purl.obolibrary.org/obo/STATO_0000628	F1 score	http://purl.obolibrary.org/obo/STATO_0000663	F-beta score	F1-score	A ratio representing the harmonic mean of recall and precision.

The F1-score is used as a measure of quality for classification algorithms and information retrieval strategies, where 1 represents the best precision and recall and 0 represents the worst precision and recall.


A harmonic mean of a set of quantities is the reciprocal of the arithmetic mean of the reciprocals of each quantity.  The F score is thus calculated as 1 / (the arithmetic mean of the reciprocals), or:


F = 1 / ( ( (1/recall) + (1/precision) ) / 2 )


F = 2*( (precision*recall) / (precision+recall) )


Recall is sensitivity STATO:0000233
Precision (PPV) is STATO:0000416
http://purl.obolibrary.org/obo/STATO_0000592	Rand index	http://purl.obolibrary.org/obo/STATO_0000184	ratio	RI	the Rand index is a ratio, related to the notion of accuracy (STATO_0000415), which is used to compare the similarity of two clustering outcomes.
http://purl.obolibrary.org/obo/STATO_0000664	mean of differences	http://purl.obolibrary.org/obo/STATO_0000573	average value	mean of paired differences	A mean of values in which each value is the subtraction of one quantity from another.
http://purl.obolibrary.org/obo/STATO_0000679	diagnostic odd ratio	http://purl.obolibrary.org/obo/STATO_0000184	ratio	DOR	A measure of association that is the ratio of the odds of a positive test in those with disease relative to the odds of a positive test in those without disease.
http://purl.obolibrary.org/obo/STATO_0000692	probability distribution mean	http://purl.obolibrary.org/obo/STATO_0000573	average value	mu	A probability distribution attribute that represents the expected value of a variable that has that distribution.
http://purl.obolibrary.org/obo/GO_0032991	protein-containing complex	http://purl.obolibrary.org/obo/BFO_0000040	material entity	protein complex	A stable assembly of two or more macromolecules, i.e. proteins, nucleic acids, carbohydrates or lipids, in which at least one component is a protein and the constituent parts function together.
http://purl.obolibrary.org/obo/STATO_0000640	log-rank test	http://purl.obolibrary.org/obo/OBI_0200194	survival analysis data transformation	logrank test	The logrank test is a statistical hypothesis test used to compare the survival distributions of two or more groups. It is commonly employed in survival analysis, where the primary interest lies in comparing the survival experiences of different groups over time.
http://purl.obolibrary.org/obo/STATO_0000589	TPM-transcript per million	http://purl.obolibrary.org/obo/STATO_0000047	count	TPM	TPM, which stands for Transcript Per Million,  is a data item produced during a normilization procedure of RNA-Seq data (i.e. transcription profiling reads produced by nucleic acid sequencing) and corresponds to the count per length of transcript (kb) per million reads mapped.

-TPM considers the gene length and sequencing depth for normalization
-TPM proposed as an alternative to RPKM due to inaccuracy in RPKM measurement (Wagner et al., 2012)
-TPM is suitable for sequencing protocols where reads sequencing depends on gene length

-TPM is RPKM scaled by a constant to make sure the sum of all values is 1 million.

-TPM (transcript per million) = 1,000,000*θi/(ℓi*c), where c is a constant value, sum_[j](θj/ℓj), sum_[i]TPM=1,000,000


Recommended use:

-TPM measure is suited for gene count comparisons within a sample or between samples of the same sample group. 

-TPM measure is NOT suited for Differential Expression analysis.
http://purl.obolibrary.org/obo/STATO_0000590	median of the ratios corrected count	http://purl.obolibrary.org/obo/STATO_0000047	count		median of the ratios corrected count is kind of count produced during an  RNA-Seq data normalization procedure which corresponds to dividing  counts  by sample-specific size factors determined by median ratio of gene counts relative to geometric mean per gene.
It was first described by Anders and Huber in 2010 (https://doi.org/10.1186/gb-2010-11-10-r106)


Recommended use:

-"median of the ratios corrected count" is suited for Differential Expression analysis or between samples. 

-"median of the ratios corrected count" is NOT suited for gene count comparisons within a sample.
http://purl.obolibrary.org/obo/STATO_0000591	TMM-trimmed mean of M-values corrected count	http://purl.obolibrary.org/obo/STATO_0000047	count	TMM	TMM or Trimmed Mean of M-Values is a  measurement datum which is the output of the TMM normalization procedure for RNA-Seq data proposed by Robinson and Oshlack (2010). A scaling normalization method for differential expression analysis of RNA-seq data.
The TMM procedure is doubly trimmed, by log-fold-changes  (sample k relative to sample r for gene g) and by absolute intensity (A g ).

It was first release as part of the R package edgeR, 10.1093/bioinformatics/btp616


Recommended use:

-TMM measure is suited for gene count comparisons within a sample or between samples of the same sample group. 

-TMM measure is suited for Differential Expression analysis.
http://purl.obolibrary.org/obo/STATO_0000593	adjusted Rand index	http://purl.obolibrary.org/obo/STATO_0000592	Rand index	ARI	the adjusted Rand index is a measure which rescales the Rand index, taking into account that random chance will cause some objects to occupy the same clusters, so the Rand Index will never actually be zero.
http://purl.obolibrary.org/obo/STATO_0000594	confusion matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix	http://purl.obolibrary.org/obo/IAO_0000122	a confusion matrix is a 2 by 2 contingency table used to evaluate the performance of a classifier, often a machine-learning classifier and  that allows visualization of the performance of an algorithm, typically a supervised learning one. It defines two dimensions ("actual" and "predicted"), and identical sets of "classes" in both dimensions (each combination of dimension and class is a variable in the contingency table).
http://purl.obolibrary.org/obo/STATO_0000608	area under the receiver operator characteristic curve	http://purl.obolibrary.org/obo/STATO_0000209	area under curve	concordance index	An area under the curve where the curve is the true positive rate and the range of interest is the false positive rate.
http://purl.obolibrary.org/obo/STATO_0000611	measure of correlation	http://purl.obolibrary.org/obo/STATO_0000610	measure of association	correlation	measure of correlation is a measure of association between ordinal or continuous variables.

A value of 0 means no association. 
A positive value means a positive association (as one variable increases, the other variable increases). 
A negative value means a negative association (as one variable increases, the other variable decreases).
For correlation coefficients, the possible values range from +1 (perfect positive association) to -1 (perfect negative association)
http://purl.obolibrary.org/obo/STATO_0000612	Goodman and Kruskal's gamma	http://purl.obolibrary.org/obo/STATO_0000142	correlation coefficient	gamma statistic	A measure of correlation, ranging from -1 to 1, that measures the strength and direction of the relationship between ranks by value of two ordinal or continuous variables, and is calculated based on the difference in the number of concordant and discordant pairs of rankings divided by the total number of pairs of rankings, where ties are not counted among the pairs of rankings.
http://purl.obolibrary.org/obo/STATO_0000617	median difference	http://purl.obolibrary.org/obo/STATO_0000614	absolute difference	difference in median	A difference that is a subtraction of one median from another.
http://purl.obolibrary.org/obo/STATO_0000630	Cohen's kappa	http://purl.obolibrary.org/obo/STATO_0000682	kappa	Cohen's kappa statistic	A Kappa statistic in which the expected agreement by chance is based on an assumption that the likelihood of each category for assignment is based on the proportion observed, and the number of raters is 2.
http://purl.obolibrary.org/obo/STATO_0000632	misclassification rate	http://purl.obolibrary.org/obo/STATO_0000623	measure of agreement	classification error	A ratio of the number of incorrect results to the total number tested.
http://purl.obolibrary.org/obo/STATO_0000635	number needed to treat	http://purl.obolibrary.org/obo/STATO_0000634	reciprocal of difference	number needed to treat to benefit	A statistic that represents the number of units that needs to be treated to prevent one additional undesired outcome. The Number Needed to Treat is calculated as the reciprocal of a treatment effect estimate, where the effect estimate is expressed as a risk difference.

The Number Needed to Treat (NNT) value is often rounded up to the next highest whole integer
http://purl.obolibrary.org/obo/STATO_0000637	number needed to screen to detect	http://purl.obolibrary.org/obo/STATO_0000634	reciprocal of difference	NNS	A statistic that represents the number of units that needs to be tested to identify one additional case. The Number Needed to Screen to Detect is calculated as the reciprocal of a difference in rate of detected cases with and without screening.

The Number Needed to Screen (NNS) value is often rounded up to the next highest whole integer.  The Number Needed to Screen to Detect is distinct from the Number Needed to Screen to Prevent as the formulas to calculate are different, even though both may be abbreviated as NNS.
http://purl.obolibrary.org/obo/STATO_0000638	number needed to harm	http://purl.obolibrary.org/obo/STATO_0000634	reciprocal of difference	number needed to treat to harm	A statistic that represents the number of units that, if treated or exposed to the intervention, lead to one additional undesired outcome. The Number Needed to Harm is calculated as the reciprocal of a treatment effect estimate, where the effect estimate is expressed as a risk difference.
http://purl.obolibrary.org/obo/STATO_0000639	percentage	http://purl.obolibrary.org/obo/STATO_0000184	ratio	%	A ratio formed between a numerator and a denominator (both described by the same unit), and that is multiplied by 100.
http://purl.obolibrary.org/obo/STATO_0000643	variance for sample	http://purl.obolibrary.org/obo/STATO_0000113	variance	sample variance	the sample variance is a variance computed over the actual observations made, which correspond to a sample drawn from a population in an experiment. the sample variance can be used to estimate the true variance of the underlying population/distribution.
http://purl.obolibrary.org/obo/STATO_0000646	population variance	http://purl.obolibrary.org/obo/STATO_0000113	variance	distribution variance	the population variance is a variance of the true population from which a sample is derived. 
the population variance describes the variability of a characteristic in the population.
http://purl.obolibrary.org/obo/STATO_0000650	sampling distribution variance	http://purl.obolibrary.org/obo/STATO_0000471	estimate	sampling variance	sampling variance refers to the variability in the estimates of a population parameter that arises from random sampling.
sampling variance is a variance of the sampling distribution of a random variable and estimates the dispersion of sample estimates about their expected value in hypothetical repetitions of the sample.
http://purl.obolibrary.org/obo/STATO_0000660	posterior predictive value	http://purl.obolibrary.org/obo/OBI_0000071	quantitative confidence value	Bayesian p-value	A hypothesis testing measure that is the predictive probability that a statistic is equal to or more extreme than that observed under the assumptions of the model.
http://purl.obolibrary.org/obo/STATO_0000661	p-value for one-sided test	http://purl.obolibrary.org/obo/OBI_0000071	quantitative confidence value	one-tailed test p-value	A p-value which represents the probability of obtaining a result at least as far, in one direction, from the value actually obtained as the value expected, assuming the null hypothesis is true.
http://purl.obolibrary.org/obo/STATO_0000662	p-value for two-sided test	http://purl.obolibrary.org/obo/OBI_0000071	quantitative confidence value	two-tailed test p-value	A p-value which represents the probability of obtaining a result at least as far, in either direction, from the value actually obtained as the value expected, assuming the null hypothesis is true. A p-value for two-sided test interprets 'at least as far from' with both the direction of 'greater than' and the direction of 'less than'. For hypothesis test interpretation, the two-tailed p-value is compared to the alpha setting divided by 2.
http://purl.obolibrary.org/obo/STATO_0000670	incidence rate	http://purl.obolibrary.org/obo/STATO_0000645	rate	incidence density	A rate in which the number of new events per total at risk is divided by an interval of time.
http://purl.obolibrary.org/obo/STATO_0000574	center value	http://purl.obolibrary.org/obo/STATO_0000152	quartile	second quartile	The median is that value of the variate which divides the total frequency into two halves. The median is measure of central tendency of data. It is obtained by arranging the observations in order from smallest to largest value. If there is an odd number of observations, the median is the middle value. If there is an even number of observations, the median is the average of the two middle values.
http://purl.obolibrary.org/obo/STATO_0000406	Yuen t-Test with trimmed means	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test	Yuen-Welch's t-test	The Yuen's t-test is a two sample t-test with populations of unequal variance which provides a more robust t-test procedure under normal distribution and long tailed distributions.
The test computes a t statistics not using 'arithmetic means' but using 'trimmed means' instead as well as winsorized variances.
http://purl.obolibrary.org/obo/STATO_0000410	negative likelihood ratio	http://purl.obolibrary.org/obo/STATO_0000409	likelihood ratio	likelihood ratio for negative results	the likelihood ratio of negative results is a ratio which is formed  by dividing the difference between 1 and sensitivity of the test by the specificity value of a test. This can be expressed also as dividing the probability of a person who has the disease testing negative by the probability of a person who does not have the disease testing negative.
http://purl.obolibrary.org/obo/STATO_0000411	positive likelihood ratio	http://purl.obolibrary.org/obo/STATO_0000409	likelihood ratio	likelihood ratio for positive results	the likelihood ratio of positive results is a ratio which is form by dividing the sensitivity value of a test by the difference between 1 and specificity of the test. This can be expressed also as dividing the probability of the test giving a positive result when testing an affected subject versus the probability of the test giving a positive result when a subject is not affected.
http://purl.obolibrary.org/obo/STATO_0000413	incidence	http://purl.obolibrary.org/obo/STATO_0000607	proportion	incidence proportion	A proportion in which the numerator represents new events.
http://purl.obolibrary.org/obo/STATO_0000416	precision	http://purl.obolibrary.org/obo/STATO_0000607	proportion	positive predictive value	precision or positive predictive value is defined as the proportion of the true positives against all the positive results (both true positives and false positives)

A proportion in which the numerator represents the correctly detected items within the denominator that represents all items detected.
http://purl.obolibrary.org/obo/STATO_0000420	I-squared	http://purl.obolibrary.org/obo/STATO_0000418	measure of heterogeneity	I2	The quantity called I2, describes the percentage of total variation across studies that is due to heterogeneity rather than chance. I2 can be readily calculated from basic results obtained from a typical meta-analysis as I2 = 100%×(Q - df)/Q, where Q is Cochran's heterogeneity statistic and df the degrees of freedom. Negative values of I2 are put equal to zero so that I2 lies between 0% and 100%. A value of 0% indicates no observed heterogeneity, and larger values show increasing heterogeneity.

Unlike Cochran's Q, it does not inherently depend upon the number of studies considered. A confidence interval for I² is constructed using either i) the iterative non-central chi-squared distribution method of Hedges and Piggott (2001); or ii) the test-based method of Higgins and Thompson (2002). The non-central chi-square method is currently the method of choice (Higgins, personal communication, 2006) – it is computed if the 'exact' option is selected.
http://purl.obolibrary.org/obo/STATO_0000424	risk difference	http://purl.obolibrary.org/obo/STATO_0000614	absolute difference	absolute risk difference	The risk difference is the difference between the observed risks (proportions of individuals with the outcome of interest) in the two groups.
The risk difference is straightforward to interpret: it describes the actual difference in the observed risk of events between experimental and control interventions.
http://purl.obolibrary.org/obo/STATO_0000430	meta analysis by Hartung-Knapp-Sidik-Jonkman method	http://purl.obolibrary.org/obo/STATO_0000155	meta analysis	HKSJ method	a random effect meta analysis procedure defined by Hartung and Knapp and by Sidik and Jonkman which performs better than DerSimonian and Laird approach, especially when there is heterogeneity and the number of studies in the meta-analysis is small.
http://purl.obolibrary.org/obo/STATO_0000433	McNemar test	http://purl.obolibrary.org/obo/OBI_0200000	data transformation	test of the marginal homogeneity of a contingency table	McNemar's test is a statistical test used on paired nominal data. It is applied to 2 × 2 contingency tables with a dichotomous trait, with matched pairs of subjects, to determine whether the row and column marginal frequencies are equal (that is, whether there is "marginal homogeneity"). It is named after Quinn McNemar, who introduced it in 1947.
An application of the test in genetics is the transmission disequilibrium test for detecting linkage disequilibrium
http://purl.obolibrary.org/obo/STATO_0000440	Dixon Q test	http://purl.obolibrary.org/obo/STATO_0000443	generalized extreme studentized deviate test	Q test	Dixon test is a statistical test used to detect outliers in a univariate data set assumed to come from a normally distributed population.
http://purl.obolibrary.org/obo/STATO_0000441	Grubbs' test	http://purl.obolibrary.org/obo/STATO_0000443	generalized extreme studentized deviate test	maximum normed residual test	Grubbs' test is a statistical test used to detect one outlier in a univariate data set assumed to come from a normally distributed population.
http://purl.obolibrary.org/obo/STATO_0000442	Tietjen-Moore test for outliers	http://purl.obolibrary.org/obo/STATO_0000443	generalized extreme studentized deviate test	Tietjen-Moore test	Tietjen-Moore test for outlier is a statistical test used to detect outliers and corresponds to a generalization of the Grubb's test, thus allowing detection of more than one outlier  in a univariate data set assumed to come from a normally distributed population.
If testing for a single outlier, the Tietjen-Moore test is equivalent to the Grubbs' test.
http://purl.obolibrary.org/obo/STATO_0000443	generalized extreme studentized deviate test	http://purl.obolibrary.org/obo/OBI_0200000	data transformation	generalized ESD test for outliers	The Extreme Studentized Deviate Test  is a statistical test used to detect outliers in a univariate data set assumed to come from a normally distributed population.
The ESD Test differs from the Grubbs' test and the Tietjen-Moore test in the sense that it contains built-in correction for multiple testing.
http://purl.obolibrary.org/obo/STATO_0000455	credible interval	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion	Bayesian credibility interval	In Bayesian statistics context, a credible interval is an interval of a posterior distribution which is such that the density at any point inside the interval is greater than the density at any point outside and that the area under the curve for that interval is equal to a prespecified probability level. For any probability level there is generally only one such interval, which is also often known as the highest posterior density region. Unlike the usual confidence interval associated with frequentist inference, here the intervals specify the range within which parameters lie with a certain probability.

The Bayesian counterparts of the confidence interval used in Frequentists Statistics.
http://purl.obolibrary.org/obo/STATO_0000456	95% credible interval	http://purl.obolibrary.org/obo/STATO_0000455	credible interval	Bayesian credibility interval at 95%	In Bayesian statistics context, a 95% credible interval is a credible interval which,given the data, includes the true parameter  with probability of 95%.
http://purl.obolibrary.org/obo/STATO_0000458	99% credible interval	http://purl.obolibrary.org/obo/STATO_0000455	credible interval	Bayesian credible interval at 99%	In Bayesian statistics context, a 99% credible interval is a credible interval which, given the data, includes the true parameter  with probability of 99%.
http://purl.obolibrary.org/obo/STATO_0000467	model random effect term	http://purl.obolibrary.org/obo/STATO_0000466	model term	variance component	the model random effect term is model term which aims to account for the unwanted variability in the data associated with a range of independent variables which are not the primary interest in the dataset. It is there also known as the variance component of the model
http://purl.obolibrary.org/obo/STATO_0000474	Satterthwaite degree of freedom approximation	http://purl.obolibrary.org/obo/STATO_0000472	degree of freedom approximation	satterthwaite	Satterthwaite degree of freedom approximation is a type of degree of freedom approximation which is used to estimate an “effective degrees of freedom” for a probability distribution formed from several independent normal distributions where only estimates of the variance are known. It was originally developed by statistician Franklin E. Satterthwaite.
http://purl.obolibrary.org/obo/STATO_0000475	between-within denominator degrees of freedom approximation	http://purl.obolibrary.org/obo/STATO_0000472	degree of freedom approximation	between-within	a data transformation to determine the number of degree of freedom
http://purl.obolibrary.org/obo/STATO_0000501	Weibull probability distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution	Weibull distribution	the Weibull probability distribution is continuous probabibility distribution which is used to model time to fail, time to repair and material strength in material science. In biomedicine, the Weibull probability is used to in determining 'hazard functions'. 

The 'location parameter' of the Weibull probability distribution can be used to define a failure-free zone.

If the quantity X is a "time-to-failure", the Weibull distribution gives a distribution for which the failure rate is proportional to a power of time. The shape parameter, k, is that power plus one, and so this parameter can be interpreted directly as follows:
A value of {\displaystyle k<1\,} {\displaystyle k<1\,} indicates that the failure rate decreases over time. This happens if there is significant "infant mortality", or defective items failing early and the failure rate decreasing over time as the defective items are weeded out of the population. In the context of the diffusion of innovations, this means negative word of mouth: the hazard function is a monotonically decreasing function of the proportion of adopters;
A value of {\displaystyle k=1\,} {\displaystyle k=1\,} indicates that the failure rate is constant over time. This might suggest random external events are causing mortality, or failure. The Weibull distribution reduces to an exponential distribution;
A value of {\displaystyle k>1\,} {\displaystyle k>1\,} indicates that the failure rate increases with time. This happens if there is an "aging" process, or parts that are more likely to fail as time goes on. In the context of the diffusion of innovations, this means positive word of mouth: the hazard function is a monotonically increasing function of the proportion of adopters. The function is first concave, then convex with an inflexion point at {\displaystyle (e^{1/k}-1)/e^{1/k},k>1\,} {\displaystyle (e^{1/k}-1)/e^{1/k},k>1\,}.
http://purl.obolibrary.org/obo/STATO_0000504	line intercept sampling	http://purl.obolibrary.org/obo/STATO_0000509	systematic sampling	LIS sampling	line intercept sampling  is a sampling process by which an element in a spatial region is included in a sample if it is intersected by a line chosen by the operator.
http://purl.obolibrary.org/obo/STATO_0000524	Matthews correlation coefficient	http://purl.obolibrary.org/obo/STATO_0000622	ratio-based measure of association	phi coefficient	Matthews Correlation Coefficient (or MCC) is a correlation coefficient which is a measure of the quality of binary (two-class) classifications, introduced by biochemist Brian W. Matthews in 1975.
http://purl.obolibrary.org/obo/STATO_0000526	numerator relationship matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix	A matrix	the numerator relationship matrix is the matrix of *expected* additive genetic relationships between individuals. This matrix was originally used by Henderson (Henderson, C.R. 1976. A simple method for computing the inverse of a numerator relationship matrix used in prediction of breeding values. Biometrics 32:69-83.) to account for covariances between random effects, and therefore to use information from relatives in estimation of breeding value. Among the properties of the NRM matrix (also known as the A matrix), it is symmetric, the diagonal value correspond to 1+ the inbreeding coefficient for an individual.
http://purl.obolibrary.org/obo/STATO_0000560	Rao's score	http://purl.obolibrary.org/obo/STATO_0000039	statistic	Rao-Scott test	the Rao-Scott score is a statistic which is used to test the hypothesis that all coefficients associated with a particular regression term are zero (or have some other specified values). the LRT uses a linear combination of chi-squared distributions
http://purl.obolibrary.org/obo/STATO_0000564	coefficient of determination	http://purl.obolibrary.org/obo/STATO_0000039	statistic	r2	The coefficient of determination is a data item measuring the proportion of the variance in the dependent variable that is predictable from the independent variable(s).
In the case of a linear regression mode, the coefficient of determination r2 is the quotient of the variances of the fitted values and observed values of the dependent variable.
http://purl.obolibrary.org/obo/STATO_0000571	Partial Least Square regression	http://purl.obolibrary.org/obo/OBI_0200175	class discovery data transformation	PLS	Partial least squares regression (PLS regression) is a data transformation that bears some relation to principal components regression; instead of finding hyperplanes of maximum variance between the response and independent variables, it finds a linear regression model by projecting the predicted variables and the observable variables to a new space. Because both the X and Y data are projected to new spaces, the PLS family of methods are known as bilinear factor models. Partial least squares Discriminant Analysis (PLS-DA) is a variant used when the Y is categorical.

PLS is used to find the fundamental relations between two matrices (X and Y), i.e. a latent variable approach to modeling the covariance structures in these two spaces. A PLS model will try to find the multidimensional direction in the X space that explains the maximum multidimensional variance direction in the Y space. PLS regression is particularly suited when the matrix of predictors has more variables than observations, and when there is multicollinearity among X values. By contrast, standard regression will fail in these cases (unless it is regularized).

Partial least squares was introduced by the Swedish statistician Herman O. A. Wold, who then developed it with his son, Svante Wold. An alternative term for PLS (and more correct according to Svante Wold[1]) is projection to latent structures, but the term partial least squares is still dominant in many areas. Although the original applications were in the social sciences, PLS regression is today most widely used in chemometrics and related areas. It is also used in bioinformatics, sensometrics, neuroscience and anthropology.
http://purl.obolibrary.org/obo/STATO_0000572	Partial Least Square Discriminant Analysis	http://purl.obolibrary.org/obo/OBI_0200175	class discovery data transformation	PLS-DA	a version of PLS used for classification, where the input y-block are group labels (categorical variable) rather than a continuous variable
http://purl.obolibrary.org/obo/STATO_0000575	non-iterative Partial Least Squares	http://purl.obolibrary.org/obo/OBI_0200000	data transformation	NIPALS	a data transformation which finds principal component  by applying non-linear iterative partial least squares algorithm
http://purl.obolibrary.org/obo/STATO_0000580	variable importance in projection	http://purl.obolibrary.org/obo/STATO_0000039	statistic	VIP	variable importance in projection is a measure computed as part of a partial least square regression  to accumulate the importance of each variable j being reflected by w from each component.
http://purl.obolibrary.org/obo/OBI_0002743	molecular label role	http://purl.obolibrary.org/obo/OBI_0000086	reagent role	molecular tracer role	A reagent role inhering in a molecular entity intended to associate with some molecular target to serve as a proxy for the presence, abundance, or location of this target in a detection of molecular label assay.
http://purl.obolibrary.org/obo/OBI_0002744	molecular label	http://purl.obolibrary.org/obo/OBI_0001879	reagent	molecular tracer	A molecular reagent intended to associate with some molecular target to serve as a proxy for the presence, abundance, or location of this target in a detection of molecular label assay
http://purl.obolibrary.org/obo/STATO_0000671	hazard rate	http://purl.obolibrary.org/obo/STATO_0000645	rate	instantaneous hazard rate	A conditional instantaneous rate in which the numerator represents an incidence conditioned on survival to a specified time, and the denominator represents a time interval with a duration approaching zero.
http://purl.obolibrary.org/obo/STATO_0000633	cutoff	http://purl.obolibrary.org/obo/IAO_0000030	information content entity	limit	a cutoff is an information content entity that represents or sets the boundary at which something changes.
http://purl.obolibrary.org/obo/STATO_0000680	incidence rate ratio	http://purl.obolibrary.org/obo/STATO_0000622	ratio-based measure of association	IRR	A measure of association that is the ratio of two incidence rates.
http://purl.obolibrary.org/obo/STATO_0000681	standardized incidence ratio	http://purl.obolibrary.org/obo/STATO_0000680	incidence rate ratio	SIR	An incidence rate ratio in which the numerator is the incidence rate in a group and the denominator is the incidence rate for a reference population.
http://purl.obolibrary.org/obo/STATO_0000687	calibration slope	http://purl.obolibrary.org/obo/STATO_0000685	mesure of calibration	calibration in the small	A measure of calibration that is the rate of change in the appropriately transformed value per unit change of the correspondingly transformed predicted value.
http://purl.obolibrary.org/obo/STATO_0000565	regression coefficient	http://purl.obolibrary.org/obo/STATO_0000610	measure of association		a regression coefficient is a measure of association that is used as the coefficient of an independent variable in a regression model, of the dependent variable, which is linear in its parameters.

A value of zero means no association. The sign (positive or negative) reflects the direction of association.

a regression coefficient is a measure of association  generated by a type of data transformation called a regression, which aims to model a response variable by expression the predictor variables as part of a function where variable terms are modified by a number. A regression coefficient is one such number.
http://purl.obolibrary.org/obo/STATO_0000636	number needed to screen to prevent	http://purl.obolibrary.org/obo/STATO_0000634	reciprocal of difference	NNS	A statistic that represents the number of units that needs to be tested to prevent one additional adverse outcome, assuming that positive testing will lead to preventive intervention. The Number Needed to Screen to Prevent is calculated as the Number Needed to Treat divided by the prevalence.

Comment for application: 
The Number Needed to Screen (NNS) value is often rounded up to the next highest whole integer.  The Number Needed to Screen to Detect is distinct from the Number Needed to Screen to Prevent as the formulas to calculate are different, even though both may be abbreviated as NNS.

The formula may be adjusted for test performance characteristics (e.g. dividing by the sensitivity) or assumptions regarding acceptance or adherence of interventions.
http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling	http://purl.obolibrary.org/obo/OBI_0000011	planned process		statistical sampling is a planned process which aims at assembling a population of observation units (samples) in as an unbiaised manner as possible in order to obtain or infer information about the actual population these samples have been drawn.
http://purl.obolibrary.org/obo/CHEBI_23367	molecular entity	http://purl.obolibrary.org/obo/BFO_0000040	material entity		Any constitutionally or isotopically distinct atom, molecule, ion, ion pair, radical, radical ion, complex, conformer etc., identifiable as a separately distinguishable entity.
http://purl.obolibrary.org/obo/CHEBI_33250	atom	http://purl.obolibrary.org/obo/BFO_0000040	material entity		A chemical entity constituting the smallest component of an element having the chemical properties of the element.
http://purl.obolibrary.org/obo/STATO_0000588	scaled identity covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		the scaled identity covariance structure is a type of covariance structure which has constant variance. The assumption is that there is no correlation between any elements.
http://purl.obolibrary.org/obo/PATO_0001018	physical quality	http://purl.obolibrary.org/obo/PATO_0001241	physical object quality		A quality of a physical entity that exists through action of continuants at the physical level of organisation in relation to other entities.
http://purl.obolibrary.org/obo/PATO_0001241	physical object quality	http://purl.obolibrary.org/obo/BFO_0000019	quality		A quality which inheres in a continuant.
http://purl.obolibrary.org/obo/PATO_0001291	electromagnetic (EM) radiation quality	http://purl.obolibrary.org/obo/PATO_0001739	radiation quality		A physical quality that inheres in an bearer by virtue of how that bearer interacts with electromagnetic radiation.
http://purl.obolibrary.org/obo/PATO_0001296	luminous flux	http://purl.obolibrary.org/obo/PATO_0001300	optical quality		A scalar optical quality which obtains by the magnitude of the light emitted by the bearer.
http://purl.obolibrary.org/obo/PATO_0001300	optical quality	http://purl.obolibrary.org/obo/PATO_0001291	electromagnetic (EM) radiation quality		An EM radiation quality in which the EM radiation is within the fiat range of the spectrum visible deemed to be light.
http://purl.obolibrary.org/obo/PATO_0001739	radiation quality	http://purl.obolibrary.org/obo/PATO_0001018	physical quality		A quality that inheres in an bearer by virtue of how that bearer interacts with radiation.
http://purl.obolibrary.org/obo/STATO_0000481	genomic best linear unbiased prediction	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		a data transformation which calculate predictions of breeding values using an animal model and a relationship matrix calculated from the genomic/genetic markers (G Matrix), in constrast to using Pedigree information as in BLUP, also known as ABLUP
http://purl.obolibrary.org/obo/STATO_0000336	best linear unbiased predictor	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		best linear unbiased prediction is a data transformation which predicts <TDB> under the assumption that the variable(s) under consideration have a random effect
http://purl.obolibrary.org/obo/STATO_0000704	concentration	http://purl.obolibrary.org/obo/IAO_0000032	scalar measurement datum		A concentration is the measure of the amount of solute dissolved in a unit quantity of solvent or solution. It describes the strength or intensity of a solution in terms of its solute content.

It is therefore a ratio in which the numerator is a measure of the solute and the denominator is a measure of the solvent.
http://purl.obolibrary.org/obo/STATO_0000736	effect size for power calculation	http://purl.obolibrary.org/obo/IAO_0000027	data item		a data item used as input to a statistical test power calculation that represents the smallest difference from the value assumed under the null hypothesis that is used to conclude in favor of an alternative hypothesis.
http://purl.obolibrary.org/obo/STATO_0000722	quantitative variable	http://purl.obolibrary.org/obo/STATO_0000258	variable		A variable that can assume numerical values, with or without units of measure.
http://purl.obolibrary.org/obo/STATO_0000742	probability distribution parameter	http://purl.obolibrary.org/obo/IAO_0000027	data item		A member of a set of quantities that unambiguously defines a probability distribution function.
http://purl.obolibrary.org/obo/STATO_0000709	testing margin	http://purl.obolibrary.org/obo/STATO_0000633	cutoff		a prespecified threshold that distinguishes between a null and alternative hypothesis.
http://purl.obolibrary.org/obo/STATO_0000710	non-inferiority margin	http://purl.obolibrary.org/obo/STATO_0000709	testing margin		hypothesis testing margin used in a study with a non-inferiority goal.
http://purl.obolibrary.org/obo/STATO_0000711	equivalence margin	http://purl.obolibrary.org/obo/STATO_0000709	testing margin		A hypothesis testing margin used in a study with an equivalence goal.
The equivalence margin is a predefined threshold that specifies how large a difference between two treatments (or groups) can be and still be considered practically the same. It defines the range of differences that are not clinically or practically meaningful.
http://purl.obolibrary.org/obo/STATO_0000715	A/B testing	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test		a type of statistical testing comparing 2 types of treatments or interventions.
http://purl.obolibrary.org/obo/STATO_0000716	non-inferiority testing	http://purl.obolibrary.org/obo/STATO_0000715	A/B testing		a type of between group comparison where the intent is to compare two or more interventions or exposures and to determine that any difference in effects is below a prespecified value representing a threshold between a meaningful difference and absence of a meaningful difference
http://purl.obolibrary.org/obo/STATO_0000717	equivalence testing	http://purl.obolibrary.org/obo/STATO_0000715	A/B testing		a type of between group comparison test with the intent to compare two or more interventions or exposures and to determine that the absolute value of any difference in effects is smaller than a prespecified value representing a meaningful difference.
http://purl.obolibrary.org/obo/STATO_0000718	superiority testing	http://purl.obolibrary.org/obo/STATO_0000715	A/B testing		a type of between group comparison test with the intent to compare two or more interventions or exposures and to determine that the absolute value of any difference in effects is larger than a prespecified value representing a meaningful difference.
http://purl.obolibrary.org/obo/STATO_0000719	experimental unit	http://purl.obolibrary.org/obo/BFO_0000004	independent continuant		The smallest unit of an experiment to which a  treatment is *independently* applied. It is therefore the entity that receives a treatment (or condition) and whose response is assumed to be independent of other units.
The experimental unit defines how many independent pieces of information will be available for inference.
It determines the sample size (n) for hypothesis testing and confidence intervals.
Mistaking smaller components (like individual plants or students) for independent units can lead to pseudoreplication — overestimating the true sample size and producing misleadingly small p-values.
http://purl.obolibrary.org/obo/STATO_0000720	observation unit	http://purl.obolibrary.org/obo/BFO_0000004	independent continuant		An observation unit (also known as 'measurement unit') is the entity on which data is actually collected or recorded. It is the entity from which a measurement or observation is taken.
http://purl.obolibrary.org/obo/STATO_0000721	nominal variable	http://purl.obolibrary.org/obo/STATO_0000252	categorical variable		A categorical variable for which the meanings of the categories do not have an ordered relationship.
http://purl.obolibrary.org/obo/STATO_0000723	discrete variable	http://purl.obolibrary.org/obo/STATO_0000722	quantitative variable		A quantitative variable for which only integer values are allowed.
http://purl.obolibrary.org/obo/STATO_0000724	boolean variable	http://purl.obolibrary.org/obo/STATO_0000090	dichotomous variable		A variable which is defined to have only 2 possible values, True or False.
http://purl.obolibrary.org/obo/STATO_0000725	dichotomous ordinal variable	http://purl.obolibrary.org/obo/STATO_0000252	categorical variable		An ordinal variable which is defined to have only 2 categories or possible values.
http://purl.obolibrary.org/obo/STATO_0000726	polychotomous ordinal variable	http://purl.obolibrary.org/obo/STATO_0000252	categorical variable		An ordinal variable which is defined to have minimally 2 categories or possible values.
http://purl.obolibrary.org/obo/STATO_0000727	interval variable	http://purl.obolibrary.org/obo/STATO_0000251	continuous variable		A continuous variable with ordered, equally spaced units where the difference between values is meaningful, but no true zero point exists.
http://purl.obolibrary.org/obo/STATO_0000728	date variable	http://purl.obolibrary.org/obo/STATO_0000727	interval variable		An interval variable for which allowable values represent a point in time on a calendar with a year, with or without a month, and with or without a day.
http://purl.obolibrary.org/obo/STATO_0000729	date Time variable	http://purl.obolibrary.org/obo/STATO_0000727	interval variable		An interval variable for which allowable values represent a point in time on a calendar, with a year, month, and day, and on a clock, with hours, minutes, and seconds.
http://purl.obolibrary.org/obo/STATO_0000730	ratio variable	http://purl.obolibrary.org/obo/STATO_0000251	continuous variable		A continuous variable with a true zero point and ordered, equally spaced units where the difference between values is meaningful.
http://purl.obolibrary.org/obo/STATO_0000731	time-to-event variable	http://purl.obolibrary.org/obo/STATO_0000730	ratio variable		A ratio variable representing the duration of observation until the event occurred, conditioned on the event occurring.
http://purl.obolibrary.org/obo/STATO_0000732	multiclass categorical variable	http://purl.obolibrary.org/obo/STATO_0000721	nominal variable		A nominal variable that can assume two or more values simultaneously.
http://purl.obolibrary.org/obo/STATO_0000734	statistically significant	http://purl.obolibrary.org/obo/OBI_0001909	conclusion based on data		a conclusion based on the outcome of statistical test that means the result of a hypothesis test supports rejection of the null hypothesis at the specified alpha level.
http://purl.obolibrary.org/obo/STATO_0000735	not statistically significant	http://purl.obolibrary.org/obo/OBI_0001909	conclusion based on data		a conclusion based on the outcome of statistical test means the result of a hypothesis test does not support rejection of the null hypothesis at the specified alpha level.
http://purl.obolibrary.org/obo/STATO_0000738	alpha level as statistical test input parameter	http://purl.obolibrary.org/obo/IAO_0000027	data item		A hypothesis test parameter input that represents the probability of incorrectly rejecting the null hypothesis.
http://purl.obolibrary.org/obo/STATO_0000739	nominal alpha level	http://purl.obolibrary.org/obo/STATO_0000738	alpha level as statistical test input parameter		An alpha level that expresses the intended overall probability cutoff without adjustment for multiple comparisons
http://purl.obolibrary.org/obo/STATO_0000740	adjusted alpha level	http://purl.obolibrary.org/obo/STATO_0000738	alpha level as statistical test input parameter		An alpha level that is adjusted for multiple comparisons to preserve the intended overall probability cutoff.
http://purl.obolibrary.org/obo/BFO_0000002	continuant	http://purl.obolibrary.org/obo/BFO_0000001	entity		An entity that exists in full at any time in which it exists at all, persists through time while maintaining its identity and has no temporal parts.
http://purl.obolibrary.org/obo/BFO_0000003	occurrent	http://purl.obolibrary.org/obo/BFO_0000001	entity		An entity that has temporal parts and that happens, unfolds or develops through time.
http://purl.obolibrary.org/obo/BFO_0000004	independent continuant	http://purl.obolibrary.org/obo/BFO_0000002	continuant		A continuant that is a bearer of quality and realizable entity entities, in which other entities inhere and which itself cannot inhere in anything.
http://purl.obolibrary.org/obo/BFO_0000015	process	http://purl.obolibrary.org/obo/BFO_0000003	occurrent		An occurrent that has temporal proper parts and for some time t, p s-depends_on some material entity at t.
http://purl.obolibrary.org/obo/BFO_0000017	realizable entity	http://purl.obolibrary.org/obo/BFO_0000020	specifically dependent continuant		A specifically dependent continuant  that inheres in continuant  entities and are not exhibited in full at every time in which it inheres in an entity or group of entities. The exhibition or actualization of a realizable entity is a particular manifestation, functioning or process that occurs under certain circumstances.
http://purl.obolibrary.org/obo/BFO_0000020	specifically dependent continuant	http://purl.obolibrary.org/obo/BFO_0000002	continuant		A continuant that inheres in or is borne by other entities. Every instance of A requires some specific instance of B which must always be the same.
http://purl.obolibrary.org/obo/BFO_0000023	role	http://purl.obolibrary.org/obo/BFO_0000017	realizable entity		A realizable entity  the manifestation of which brings about some result or end that is not essential to a continuant  in virtue of the kind of thing that it is but that can be served or participated in by that kind of continuant  in some kinds of natural, social or institutional contexts.
http://purl.obolibrary.org/obo/BFO_0000031	generically dependent continuant	http://purl.obolibrary.org/obo/BFO_0000002	continuant		A continuant that is dependent on one or other independent continuant bearers. For every instance of A requires some instance of (an independent continuant type) B but which instance of B serves can change from time to time.
http://purl.obolibrary.org/obo/BFO_0000040	material entity	http://purl.obolibrary.org/obo/BFO_0000004	independent continuant		An independent continuant that is spatially extended whose identity is independent of that of other entities and can be maintained through time.
http://purl.obolibrary.org/obo/IAO_0000001	conditional specification	http://purl.obolibrary.org/obo/IAO_0000033	directive information entity		a directive information entity that specifies what should happen if the trigger condition is fulfilled
http://purl.obolibrary.org/obo/IAO_0000003	measurement unit label	http://purl.obolibrary.org/obo/IAO_0000009	datum label		A measurement unit label is as a label that is part of a scalar measurement datum and denotes a unit of measure.
http://purl.obolibrary.org/obo/IAO_0000007	action specification	http://purl.obolibrary.org/obo/IAO_0000033	directive information entity		a directive information entity that describes an action the bearer will take
http://purl.obolibrary.org/obo/IAO_0000009	datum label	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		A label is a symbol that is part of some other datum and is used to either partially define  the denotation of that datum or to provide a means for identifying the datum as a member of the set of data with the same label
http://purl.obolibrary.org/obo/IAO_0000015	information carrier	http://purl.obolibrary.org/obo/BFO_0000019	quality		A quality of an information bearer that imparts the information content
http://purl.obolibrary.org/obo/IAO_0000028	symbol	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		An information content entity that is a mark(s) or character(s) used as a conventional representation of another entity.
http://purl.obolibrary.org/obo/IAO_0000030	information content entity	http://purl.obolibrary.org/obo/BFO_0000031	generically dependent continuant		A generically dependent continuant that is about some thing.
http://purl.obolibrary.org/obo/IAO_0000032	scalar measurement datum	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum		A scalar measurement datum is a measurement datum that is composed of two parts, numerals and a unit label.
http://purl.obolibrary.org/obo/IAO_0000033	directive information entity	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		An information content entity whose concretizations indicate to their bearer how to realize them in a process.
http://purl.obolibrary.org/obo/IAO_0000037	dot plot	http://purl.obolibrary.org/obo/IAO_0000038	graph		A dot plot is a report graph which is a graphical representation of data where each data point is represented by a single dot placed on coordinates corresponding to data point values in particular dimensions.
http://purl.obolibrary.org/obo/IAO_0000038	graph	http://purl.obolibrary.org/obo/IAO_0000309	diagram		A diagram that presents one or more tuples of information by mapping those tuples in to a two dimensional space in a non arbitrary way.
http://purl.obolibrary.org/obo/IAO_0000055	rule	http://purl.obolibrary.org/obo/IAO_0000001	conditional specification		a rule is an executable which guides, defines, restricts actions
http://purl.obolibrary.org/obo/IAO_0000064	algorithm	http://purl.obolibrary.org/obo/IAO_0000104	plan specification		A plan specification which describes the inputs and output of mathematical functions as well as workflow of execution for achieving an predefined objective. Algorithms are realized usually by means of implementation as computer programs for execution by automata.
http://purl.obolibrary.org/obo/IAO_0000078	curation status specification	http://purl.obolibrary.org/obo/IAO_0000102	data about an ontology part		The curation status of the term. The allowed values come from an enumerated list of predefined terms. See the specification of these instances for more detailed definitions of each enumerated value.
http://purl.obolibrary.org/obo/IAO_0000100	data set	http://purl.obolibrary.org/obo/IAO_0000027	data item		A data item that is an aggregate of other data items of the same type that have something in common. Averages and distributions can be determined for data sets.
http://purl.obolibrary.org/obo/IAO_0000101	image	http://purl.obolibrary.org/obo/IAO_0000308	figure		An image is an affine projection to a two dimensional surface, of measurements of some quality of an entity or entities repeated at regular intervals across a spatial range, where the measurements are represented as color and luminosity on the projected on surface.
http://purl.obolibrary.org/obo/IAO_0000102	data about an ontology part	http://purl.obolibrary.org/obo/IAO_0000027	data item		Data about an ontology part is a data item about a part of an ontology, for example a term
http://purl.obolibrary.org/obo/IAO_0000104	plan specification	http://purl.obolibrary.org/obo/IAO_0000033	directive information entity		A directive information entity with action specifications and objective specifications as parts that, when concretized, is realized in a process in which the bearer tries to achieve the objectives by taking the actions specified.
http://purl.obolibrary.org/obo/IAO_0000109	measurement datum	http://purl.obolibrary.org/obo/IAO_0000027	data item		A measurement datum is an information content entity that is a recording of the output of a measurement such as produced by a device.
http://purl.obolibrary.org/obo/IAO_0000129	version number	http://purl.obolibrary.org/obo/IAO_0000028	symbol		A version number is an information content entity which is a sequence of characters borne by part of each of a class of manufactured products or its packaging and indicates its order within a set of other products having the same name.
http://purl.obolibrary.org/obo/IAO_0000144	conclusion textual entity	http://purl.obolibrary.org/obo/OBI_0001909	conclusion based on data		A textual entity that expresses the results of reasoning about a problem, for instance as typically found towards the end of scientific papers.
http://purl.obolibrary.org/obo/IAO_0000306	table	http://purl.obolibrary.org/obo/IAO_0000300	textual entity		A textual entity that contains a two-dimensional arrangement of texts repeated at regular intervals across a spatial range, such that the spatial relationships among the constituent texts expresses propositions
http://purl.obolibrary.org/obo/IAO_0000308	figure	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		An information content entity consisting of a two dimensional arrangement of information content entities such that the arrangement itself is about something.
http://purl.obolibrary.org/obo/IAO_0000309	diagram	http://purl.obolibrary.org/obo/IAO_0000308	figure		A figure that expresses one or more propositions
http://purl.obolibrary.org/obo/IAO_0000310	document	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		A collection of information content entities intended to be understood together as a whole
http://purl.obolibrary.org/obo/IAO_0000400	cartesian spatial coordinate datum	http://purl.obolibrary.org/obo/IAO_0000027	data item		A cartesian spatial coordinate datum is a representation of a point in a spatial region, in which equal changes in the magnitude of a coordinate value denote length qualities with the same magnitude
http://purl.obolibrary.org/obo/IAO_0000401	one dimensional cartesian spatial coordinate datum	http://purl.obolibrary.org/obo/IAO_0000400	cartesian spatial coordinate datum		A cartesion spatial coordinate datum that  uses one value to specify a position along a one dimensional spatial region
http://purl.obolibrary.org/obo/IAO_0000402	two dimensional cartesian spatial coordinate datum	http://purl.obolibrary.org/obo/IAO_0000400	cartesian spatial coordinate datum		A cartesion spatial coordinate datum that  uses two values to specify a position within a two dimensional spatial region
http://purl.obolibrary.org/obo/IAO_0000414	mass measurement datum	http://purl.obolibrary.org/obo/IAO_0000032	scalar measurement datum		A scalar measurement datum that is the result of measurement of mass quality
http://purl.obolibrary.org/obo/IAO_0000416	time measurement datum	http://purl.obolibrary.org/obo/IAO_0000032	scalar measurement datum		A scalar measurement datum that is the result of measuring a temporal interval
http://purl.obolibrary.org/obo/IAO_0000572	documenting	http://purl.obolibrary.org/obo/OBI_0000011	planned process		A planned process in which a document is created or added to by including the specified input in it.
http://purl.obolibrary.org/obo/OBI_0000010	fluorescent reporter intensity	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum		A measurement datum that represents the output of a scanner measuring the intensity value for each fluorescent reporter.
http://purl.obolibrary.org/obo/OBI_0000011	planned process	http://purl.obolibrary.org/obo/BFO_0000015	process		A processual entity that realizes a plan which is the concretization of a plan specification.
http://purl.obolibrary.org/obo/OBI_0000015	biological feature identification objective	http://purl.obolibrary.org/obo/IAO_0000005	objective specification		Biological_feature_identification_objective is an objective role carried out by the proposition defining the aim of a study designed to examine or characterize a particular biological feature.
http://purl.obolibrary.org/obo/OBI_0000047	processed material	http://purl.obolibrary.org/obo/BFO_0000040	material entity		Is a material entity that is created or changed during material processing.
http://purl.obolibrary.org/obo/OBI_0000066	investigation	http://purl.obolibrary.org/obo/OBI_0000011	planned process		a planned process that consists of parts: planning, study design execution, documentation and which produce conclusion(s).
http://purl.obolibrary.org/obo/OBI_0000067	evaluant role	http://purl.obolibrary.org/obo/BFO_0000023	role		a role that inheres in a material entity that is realized in an assay in which data is generated about the bearer of the evaluant role
http://purl.obolibrary.org/obo/OBI_0000071	quantitative confidence value	http://purl.obolibrary.org/obo/STATO_0000609	measure of hypothesis testing		A data item which is used to indicate the degree of uncertainty about a measurement.
http://purl.obolibrary.org/obo/OBI_0000079	culture medium	http://purl.obolibrary.org/obo/OBI_0000047	processed material		a processed material that provides the needed nourishment for microorganisms or cells grown in vitro.
http://purl.obolibrary.org/obo/OBI_0000097	participant under investigation role	http://purl.obolibrary.org/obo/BFO_0000023	role		A role that is realized through the execution of a study design in which the bearer of the role participates and in which data about that bearer is collected.
http://purl.obolibrary.org/obo/OBI_0000112	specimen role	http://purl.obolibrary.org/obo/BFO_0000023	role		a role borne by a material entity that is gained during a specimen collection process and that can be realized by use of the specimen in an investigation
http://purl.obolibrary.org/obo/OBI_0000113	sequence feature identification objective	http://purl.obolibrary.org/obo/OBI_0000015	biological feature identification objective		Sequence_feature_identification_objective is a biological_feature_identification_objective role describing a study designed to examine or characterize molecular features exhibited at the level of a macromolecular sequence, e.g. nucleic acid, protein, polysaccharide.
http://purl.obolibrary.org/obo/OBI_0000115	intervention design	http://purl.obolibrary.org/obo/OBI_0500000	study design		An intervention design is a study design in which a controlled process applied to the subjects (the intervention) serves as the independent variable manipulated by the experimentalist. The treatment (perturbation or intervention) defined can be defined as a combination of values taken by independent variable manipulated by the experimentalists are applied to the recruited subjects assigned (possibly by applying specific methods) to treatment groups. The specificity of intervention design is the fact that independent variables are being manipulated and a response of the biological system is evaluated via response variables as monitored by possibly a series of assays.
http://purl.obolibrary.org/obo/OBI_0000118	gene list	http://purl.obolibrary.org/obo/IAO_0000100	data set		A data set of the names or identifiers of genes that are the outcome of an analysis or have been put together for the purpose of an analysis.
http://purl.obolibrary.org/obo/OBI_0000131	molecular feature identification objective	http://purl.obolibrary.org/obo/OBI_0000015	biological feature identification objective		Molecular_feature_identification_objective is a biological_feature_identification_objective role describing a study designed to examine or characterize molecular features of a biological system, e.g. expression profiling, copy number of molecular components, epigenetic modifications.
http://purl.obolibrary.org/obo/OBI_0000164	clonal cDNA library	http://purl.obolibrary.org/obo/OBI_0000731	recombinant vector		Mixed population of cDNAs (complementaryDNA) made from mRNA from a defined source, usually a specific cell type. This term should be associated only to nucleic acid interactors not to their proteins product. For instance in 2h screening use living cells (MI:0349) as sample process.

ALT DEF (PRS):: a cDNA library is a collection of host cells, typically E.Coli cells but not exclusively. modified by transfer of plasmid DNA molecule used as vector containing a fragment or totality of cDNA molecule (the insert) .  cDNA library may have an array of role and applications.
http://purl.obolibrary.org/obo/OBI_0000181	population	http://purl.obolibrary.org/obo/BFO_0000040	material entity		a population is a collection of individuals from the same taxonomic class living, counted or sampled at a particular site or in a particular area
http://purl.obolibrary.org/obo/OBI_0000185	imaging assay	http://purl.obolibrary.org/obo/OBI_0000070	assay		An imaging assay is an assay to produce a picture of an entity. definition_source: OBI.
http://purl.obolibrary.org/obo/OBI_0000245	organization	http://purl.obolibrary.org/obo/BFO_0000040	material entity		An entity that can bear roles, has members, and has a set of organization rules. Members of organizations are either organizations themselves or individual people. Members can bear specific organization member roles that are determined in the organization rules. The organization rules also determine how decisions are made on behalf of the organization by the organization members.
http://purl.obolibrary.org/obo/OBI_0000257	DNA extraction	http://purl.obolibrary.org/obo/OBI_0666667	nucleic acid extraction		A DNA extraction is a nucleic acid extraction where the desired output material is DNA.
http://purl.obolibrary.org/obo/OBI_0000272	protocol	http://purl.obolibrary.org/obo/IAO_0000104	plan specification		A plan specification which has sufficient level of detail and quantitative information to communicate it between investigation agents, so that different investigation agents will reliably be able to independently reproduce the process.
http://purl.obolibrary.org/obo/OBI_0000274	adding a material entity into a target	http://purl.obolibrary.org/obo/OBI_0000652	material combination		is a process with the objective to place a material entity bearing the 'material to be added role' into a material bearing the 'target of material addition role'.
http://purl.obolibrary.org/obo/OBI_0000275	analyte role	http://purl.obolibrary.org/obo/OBI_0002444	measurand role		A measurand role borne by a molecular entity or an atom and realized in an analyte assay which achieves the objective to measure the magnitude/concentration/amount of the analyte in the entity bearing evaluant role.
http://purl.obolibrary.org/obo/OBI_0000319	material to be added role	http://purl.obolibrary.org/obo/BFO_0000023	role		material to be added role is a protocol participant role realized by a material which is added into a material bearing the target of material addition role in a material addition process
http://purl.obolibrary.org/obo/OBI_0000338	drawing a conclusion based on data	http://purl.obolibrary.org/obo/OBI_0000011	planned process		A planned process in which data gathered in an investigation is evaluated in the context of existing knowledge with the objective to generate more general conclusions or to conclude that the data does not allow one to draw general conclusion
http://purl.obolibrary.org/obo/OBI_0000339	planning	http://purl.obolibrary.org/obo/BFO_0000015	process		a process of creating or modifying a plan specification
http://purl.obolibrary.org/obo/OBI_0000367	light emission function	http://purl.obolibrary.org/obo/OBI_0000374	excitation function		A light emission function is an excitation function to excite a material to a specific excitation state that it emits light.
http://purl.obolibrary.org/obo/OBI_0000370	contain function	http://purl.obolibrary.org/obo/BFO_0000034	function		A contain function is a function to constrain a material entities location in space
http://purl.obolibrary.org/obo/OBI_0000371	heat function	http://purl.obolibrary.org/obo/BFO_0000034	function		A heat function is a function that increases the internal kinetic energy of a material
http://purl.obolibrary.org/obo/OBI_0000372	material separation function	http://purl.obolibrary.org/obo/BFO_0000034	function		A material separation function is a function that increases the resolution between two or more material entities. The  to distinction between the entities is usually based on some associated physical quality.
http://purl.obolibrary.org/obo/OBI_0000374	excitation function	http://purl.obolibrary.org/obo/BFO_0000034	function		A excitation function is a function  to inject energy by bombarding a material with energetic particles (e.g., photons) thereby imbuing internal material components such as electrons with additional energy.  These internal, 'excited' particles may lead to the rupturing of covalent chemical bonds or may quickly relax back to there unexcited state with an exponential time course thereby locally emitting energy in the form of photons.
http://purl.obolibrary.org/obo/OBI_0000378	filter function	http://purl.obolibrary.org/obo/OBI_0000372	material separation function		A filter function is a function to prevent the flow of certain entities based on a quality or qualities of the entity while allowing entities which have different qualities to pass through
http://purl.obolibrary.org/obo/OBI_0000387	cool function	http://purl.obolibrary.org/obo/BFO_0000034	function		A cool function is a function to decrease the internal kinetic energy of a material below the initial kinetic energy of that type of material.
http://purl.obolibrary.org/obo/OBI_0000399	solid support function	http://purl.obolibrary.org/obo/BFO_0000034	function		A solid support function is a function of a device on which an entity is kept in a defined position and prevented in its movement
http://purl.obolibrary.org/obo/OBI_0000401	environment control function	http://purl.obolibrary.org/obo/BFO_0000034	function		An environmental control function is a function that regulates a contained environment within specified parameter ranges. For example the control of light exposure, humidity and temperature.
http://purl.obolibrary.org/obo/OBI_0000403	sort function	http://purl.obolibrary.org/obo/OBI_0000372	material separation function		A sort function is a function to distinguish material components based on some associated physical quality or entity and to partition the separate components into distinct fractions according to a defined order.
http://purl.obolibrary.org/obo/OBI_0000411	cloning vector role	http://purl.obolibrary.org/obo/OBI_0000319	material to be added role		A material to be added role played by a small, self-replicating DNA or RNA molecule - usually a plasmid or chromosome - and realized in a process whereby foreign DNA or RNA is inserted into the vector during the process of cloning.
http://purl.obolibrary.org/obo/OBI_0000416	cloning insert role	http://purl.obolibrary.org/obo/BFO_0000023	role		cloning insert role is a role which inheres in DNA or RNA and is realized by the process of being inserted into a cloning vector in a cloning process.
http://purl.obolibrary.org/obo/OBI_0000419	reverse transcriptase	http://purl.obolibrary.org/obo/OBI_0000427	enzyme		enzyme and has_function some GO:0003964 (RNA-directed DNA polymerase
activity)
http://purl.obolibrary.org/obo/OBI_0000425	averaging objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective		An averaging objective is a data transformation objective where the aim is to perform mean calculations on the input of the data transformation.
http://purl.obolibrary.org/obo/OBI_0000427	enzyme	http://purl.obolibrary.org/obo/BFO_0000040	material entity		(protein or rna) or has_part (protein or rna) and
has_function some GO:0003824 (catalytic activity)
http://purl.obolibrary.org/obo/OBI_0000434	adding material objective	http://purl.obolibrary.org/obo/OBI_0000686	material combination objective		is the specification of an objective to add a material into a target material. The adding is asymmetric in the sense that the target material largely retains its identity
http://purl.obolibrary.org/obo/OBI_0000437	analyte measurement objective	http://purl.obolibrary.org/obo/OBI_0000441	assay objective		an assay objective to determine the presence or concentration of an analyte in the evaluant
http://purl.obolibrary.org/obo/OBI_0000441	assay objective	http://purl.obolibrary.org/obo/IAO_0000005	objective specification		an objective specification to determine a specified type of information about an evaluated entity (the material entity bearing evaluant role)
http://purl.obolibrary.org/obo/OBI_0000443	analyte assay	http://purl.obolibrary.org/obo/OBI_0000070	assay		An assay with the objective to capture information about the presence, concentration, or amount of an analyte in an evaluant.
http://purl.obolibrary.org/obo/OBI_0000444	target of material addition role	http://purl.obolibrary.org/obo/BFO_0000023	role		target of material addition role is a role realized by an entity into which a material is added in a material addition process
http://purl.obolibrary.org/obo/OBI_0000451	normalized data set	http://purl.obolibrary.org/obo/IAO_0000100	data set		A data set that is produced as the output of a normalization data transformation.
http://purl.obolibrary.org/obo/OBI_0000453	measure function	http://purl.obolibrary.org/obo/BFO_0000034	function		Measure function is a function that is borne by a processed material  and realized in a process in which information about some entity is expressed relative to some reference.
http://purl.obolibrary.org/obo/OBI_0000471	study design execution	http://purl.obolibrary.org/obo/OBI_0000011	planned process		a planned process that carries out a study design
http://purl.obolibrary.org/obo/OBI_0000639	material separation objective	http://purl.obolibrary.org/obo/OBI_0000456	material transformation objective		is an objective to transform a material entity into spatially separated components.
http://purl.obolibrary.org/obo/OBI_0000649	data set of features	http://purl.obolibrary.org/obo/IAO_0000100	data set		A data set that is produced as the output of a descriptive statistical calculation data transformation and consists of producing a data set that represents one or more features of interest about the input data set.
http://purl.obolibrary.org/obo/OBI_0000650	differential expression analysis data transformation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		A differential expression analysis data transformation is a data transformation that has an objective of differential expression analysis.  Frequently this data transformation involves summarizing or otherwise aggregating signals from various samples categorized into groups, and calculating measures of group differences from these aggregated signals.
http://purl.obolibrary.org/obo/OBI_0000652	material combination	http://purl.obolibrary.org/obo/OBI_0000094	material processing		is a material processing with the objective to combine two or more material entities as input into a single material entity as output.
http://purl.obolibrary.org/obo/OBI_0000662	error corrected data set	http://purl.obolibrary.org/obo/IAO_0000100	data set		A data set that is produced as the output of an error correction data transformation and consists of producing a data set which has had erroneous contributions from the input to the data transformation removed (corrected for).
http://purl.obolibrary.org/obo/OBI_0000668	error correction data transformation	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating		An error correction data transformation is a data transformation that has the objective of error correction, where the aim is to remove (correct for) erroneous contributions from the input to the data transformation.
http://purl.obolibrary.org/obo/OBI_0000671	sample from organism	http://purl.obolibrary.org/obo/OBI_0001479	specimen from organism		a material obtained from an organism in order to be a representative of the whole
http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		A statistical hypothesis test data transformation is a data transformation that has objective statistical hypothesis test.
http://purl.obolibrary.org/obo/OBI_0000678	portioning objective	http://purl.obolibrary.org/obo/OBI_0000639	material separation objective		A material separation objective aiming to separate material into multiple portions, each of which contains a similar composition of the input material.
http://purl.obolibrary.org/obo/OBI_0000681	separation into different composition objective	http://purl.obolibrary.org/obo/OBI_0000639	material separation objective		A material separation objective aiming to separate a material entity that has parts of different types, and end with at least one output that is a material with parts of fewer types (modulo impurities).
http://purl.obolibrary.org/obo/OBI_0000684	specimen collection objective	http://purl.obolibrary.org/obo/IAO_0000005	objective specification		A objective specification to obtain a material entity for potential use as an input during an investigation.
http://purl.obolibrary.org/obo/OBI_0000686	material combination objective	http://purl.obolibrary.org/obo/OBI_0000456	material transformation objective		is an objective to obtain an output material that contains several input materials.
http://purl.obolibrary.org/obo/OBI_0000731	recombinant vector	http://purl.obolibrary.org/obo/OBI_0000047	processed material		A recombinant vector is created by a recombinant vector cloning process, and contains nucleic acids that can be amplified. It retains functions of the original cloning vector.
http://purl.obolibrary.org/obo/OBI_0000737	cloning vector	http://purl.obolibrary.org/obo/OBI_1110108	material to be added		A cloning vector is an engineered material that is used as an input material for a recombinant vector cloning process to carry inserted nucleic acids. It contains an origin of replication for a specific destination host organism, encodes for a selectable gene product and contains a cloning site.
http://purl.obolibrary.org/obo/OBI_0000740	material sample role	http://purl.obolibrary.org/obo/OBI_0000112	specimen role		A material sample role is a specimen role borne by a material entity that is the output of a material sampling process.
http://purl.obolibrary.org/obo/OBI_0000789	survival rate	http://purl.obolibrary.org/obo/STATO_0000645	rate		A rate that represents the percentage of people or animals in a study or treatment group who are alive for a given period of time after diagnosis or initiation of monitoring.
http://purl.obolibrary.org/obo/OBI_0000806	material maintenance objective	http://purl.obolibrary.org/obo/IAO_0000005	objective specification		An objective specification maintains some or all of the qualities of a material over time.
http://purl.obolibrary.org/obo/OBI_0000811	primary structure of DNA macromolecule	http://purl.obolibrary.org/obo/SO_0000001	region		a quality of a DNA molecule that inheres in its bearer due to the order of its DNA nucleotide residues.
http://purl.obolibrary.org/obo/OBI_0000832	measurement device	http://purl.obolibrary.org/obo/OBI_0000968	device		A device in which a measure function inheres.
http://purl.obolibrary.org/obo/OBI_0000838	material maintenance	http://purl.obolibrary.org/obo/OBI_0000011	planned process		a process with that achieves the objective to maintain some or all of the characteristics of an input material over time
http://purl.obolibrary.org/obo/OBI_0000845	primary structure of RNA molecule	http://purl.obolibrary.org/obo/SO_0000001	region		The primary structure of an RNA molecule that is completely defined by the set of its nucleic residue parts and the linear order induced by the peptide bonds that hold them together
http://purl.obolibrary.org/obo/OBI_0000848	polyA RNA extraction	http://purl.obolibrary.org/obo/OBI_0666666	RNA extraction		A RNA extraction process typically involving the use of poly dT oligomers in which the desired output material is polyA RNA.
http://purl.obolibrary.org/obo/OBI_0000861	Likelihood-ratio test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test		Likelihood-ratio is a data transformation which tests whether there is evidence of the need to move from a simple model to a more complicated one (where the simple model is nested within the complicated one); tests of the goodness-of-fit between two models.
http://purl.obolibrary.org/obo/OBI_0000880	RNA extract	http://purl.obolibrary.org/obo/OBI_0001010	nucleic acid extract		an extract which is the output of an extraction process in which RNA molecules are isolated from a specimen.
http://purl.obolibrary.org/obo/OBI_0000889	survival curve	http://purl.obolibrary.org/obo/IAO_0000573	line graph		A survival curve is a report graph which is a graphical representation of data where the percentage of survival is plotted as a function of time.
http://purl.obolibrary.org/obo/OBI_0000916	flow cytometry assay	http://purl.obolibrary.org/obo/OBI_0001977	cytometry assay		A cytometry assay in which an input cell population is put in solution, is passed by a laser, and optical sensors are used to detect scattering of the laser light and/or fluorescence of specific markers to count and characterize the particles in solution.
http://purl.obolibrary.org/obo/OBI_0000924	labeled specimen	http://purl.obolibrary.org/obo/OBI_0001936	molecular-labeled material		A specimen  that has been modified in order to be able to detect it in future experiments
http://purl.obolibrary.org/obo/OBI_0000931	study intervention	http://purl.obolibrary.org/obo/OBI_0000011	planned process		the part of the execution of an intervention design study which is varied between two or more subjects in the study
http://purl.obolibrary.org/obo/OBI_0000932	material separation device	http://purl.obolibrary.org/obo/OBI_0000968	device		A device with a separation function realized in a planed process
http://purl.obolibrary.org/obo/OBI_0000953	processed specimen	http://purl.obolibrary.org/obo/OBI_0100051	specimen		A specimen that has been intentionally physically modified.
http://purl.obolibrary.org/obo/OBI_0000963	categorical label	http://purl.obolibrary.org/obo/IAO_0000009	datum label		A label that is part of a categorical datum and that indicates the value of the data item on the categorical scale.
http://purl.obolibrary.org/obo/OBI_0000964	in live cell assay	http://purl.obolibrary.org/obo/OBI_0000070	assay		An assay in which a measurement is made by observing entities located in a live cell.
http://purl.obolibrary.org/obo/OBI_0000967	container	http://purl.obolibrary.org/obo/OBI_0000968	device		A device that can be used to restrict the location of material entities over time
http://purl.obolibrary.org/obo/OBI_0000973	sequence data	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum		A measurement datum that representing the primary structure of a macromolecule(it's sequence) sometimes associated with an indicator of confidence of that measurement.

http://purl.obolibrary.org/obo/OBI_0000984	dose	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum		A measurement datum that measures the quantity of something that may be administered to an organism or that an organism may be exposed to. Quantities of nutrients, drugs, vaccines and toxins are referred to as doses.
http://purl.obolibrary.org/obo/OBI_0001010	nucleic acid extract	http://purl.obolibrary.org/obo/OBI_0000953	processed specimen		An extract that is the output of an extraction process in which nucleic acid molecules are isolated from a specimen.
http://purl.obolibrary.org/obo/OBI_0001032	light emission device	http://purl.obolibrary.org/obo/OBI_0000968	device		A device which has a function to emit light.
http://purl.obolibrary.org/obo/OBI_0001034	environmental control device	http://purl.obolibrary.org/obo/OBI_0000968	device		An environmental control device is a device which has the function to control some aspect of the environment such as temperature, or humidity.
http://purl.obolibrary.org/obo/OBI_0001051	DNA extract	http://purl.obolibrary.org/obo/OBI_0001010	nucleic acid extract		The output of an extraction process in which DNA molecules are purified in order to exclude DNA from organellas.
http://purl.obolibrary.org/obo/OBI_0001108	nucleic acid sequencer	http://purl.obolibrary.org/obo/OBI_0000832	measurement device		An device that is used to determine the order of nucleotides in nucleic acid sequences.
http://purl.obolibrary.org/obo/OBI_0001137	protein sequencer	http://purl.obolibrary.org/obo/OBI_0000832	measurement device		An device that is used to determine the order of amino acids in protein sequences.
http://purl.obolibrary.org/obo/OBI_0001172	dose response curve	http://purl.obolibrary.org/obo/IAO_0000573	line graph		A data item of paired values, one indicating the dose of a material, the other quantitating a measured effect at that dose. The dosing intervals are chosen so that effect values be interpolated by a plotting a curve. 
http://purl.obolibrary.org/obo/OBI_0001177	RNA sequencing assay	http://purl.obolibrary.org/obo/OBI_0600047	sequencing assay		An assay that determines the sequence of an RNA molecule.
http://purl.obolibrary.org/obo/OBI_0001225	genetic population background information	http://purl.obolibrary.org/obo/OBI_0001404	genetic characteristics information		a genetic characteristics information which is a part of genotype information that identifies the population of organisms
http://purl.obolibrary.org/obo/OBI_0001265	FWER adjusted p-value	http://purl.obolibrary.org/obo/OBI_0000662	error corrected data set		A quantitative confidence value resulting from a multiple testing error correction method which adjusts the p-value used as input to control for Type I error in the context of multiple pairwise tests
http://purl.obolibrary.org/obo/OBI_0001305	genotype information	http://purl.obolibrary.org/obo/OBI_0001404	genetic characteristics information		a genetic characteristics information that is about the genetic material of an organism and minimally includes information about the genetic background and can in addition contain information about specific alleles, genetic modifications, etc.
http://purl.obolibrary.org/obo/OBI_0001331	transcription profiling identification objective	http://purl.obolibrary.org/obo/OBI_0000131	molecular feature identification objective		A molecular feature identification objective that aims to characterize the abundance of transcripts
http://purl.obolibrary.org/obo/OBI_0001352	allele information	http://purl.obolibrary.org/obo/OBI_0001364	genetic alteration information		a genetic alteration information that about one of two or more alternative forms of a gene or marker sequence and differing from other alleles at one or more mutational sites based on sequence. Polymorphisms are included in this definition.
http://purl.obolibrary.org/obo/OBI_0001364	genetic alteration information	http://purl.obolibrary.org/obo/OBI_0001404	genetic characteristics information		a genetic characteristics information that is about known changes or the lack thereof from the genetic background, including allele information, duplication, insertion, deletion, etc.
http://purl.obolibrary.org/obo/OBI_0001404	genetic characteristics information	http://purl.obolibrary.org/obo/IAO_0000027	data item		a data item that is about genetic material including polymorphisms, disease alleles, and haplotypes.
http://purl.obolibrary.org/obo/OBI_0001444	genotyping design	http://purl.obolibrary.org/obo/OBI_0500000	study design		A study design that classifies an individual or group of individuals on the basis of alleles, haplotypes, SNPs.
http://purl.obolibrary.org/obo/OBI_0001501	fluorescence detection assay	http://purl.obolibrary.org/obo/OBI_0000070	assay		An assay in which a material's fluorescence is determined.
http://purl.obolibrary.org/obo/OBI_0001554	rate measurement datum	http://purl.obolibrary.org/obo/IAO_0000032	scalar measurement datum		A scalar measurement datum that represents the number of events occuring over a time interval
http://purl.obolibrary.org/obo/OBI_0001573	DNA sequence data	http://purl.obolibrary.org/obo/OBI_0000973	sequence data		A sequence data item that is about the primary structure of DNA
http://purl.obolibrary.org/obo/OBI_0001834	drawing a conclusion	http://purl.obolibrary.org/obo/OBI_0000011	planned process		A planned process in which new information is inferred from existing information.
http://purl.obolibrary.org/obo/OBI_0001865	assay array	http://purl.obolibrary.org/obo/OBI_0000968	device		A device made to be used in an analyte assay for immobilization of substances that bind the analyte at regular spatial positions on a surface.
http://purl.obolibrary.org/obo/OBI_0001879	reagent	http://purl.obolibrary.org/obo/OBI_0000047	processed material		A biological or chemical entity that bears a reagent role in virtue of it being intended for application in a scientific technique  to participate in (or have molecular parts that participate in) a chemical reaction that facilitates the generation of data about some distinct entity, or the generation of some distinct material specified output.
http://purl.obolibrary.org/obo/OBI_0001909	conclusion based on data	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		An information content entity that is inferred from data.
http://purl.obolibrary.org/obo/OBI_0001912	cell freezing medium	http://purl.obolibrary.org/obo/OBI_0000047	processed material		A processed material that serves as a liquid vehicle for freezing cells for long term quiescent stroage, which contains chemicls needed to sustain cell viability across freeze-thaw cycles. 
http://purl.obolibrary.org/obo/OBI_0001930	categorical value specification	http://purl.obolibrary.org/obo/OBI_0001933	value specification		A value specification that is specifies one category out of a fixed number of nominal categories
http://purl.obolibrary.org/obo/OBI_0001931	scalar value specification	http://purl.obolibrary.org/obo/OBI_0001933	value specification		A value specification that consists of two parts: a numeral and a unit label
http://purl.obolibrary.org/obo/OBI_0001933	value specification	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		An information content entity that specifies a value within a classification scheme or on a quantitative scale.
http://purl.obolibrary.org/obo/OBI_0001936	molecular-labeled material	http://purl.obolibrary.org/obo/OBI_0000047	processed material		a material entity that is the specified output of an addition of molecular label process that aims to label some molecular target to allow for its detection in a detection of molecular label assay
http://purl.obolibrary.org/obo/OBI_0001977	cytometry assay	http://purl.obolibrary.org/obo/OBI_0000070	assay		An assay that counts and/or measures characteristics of cells.
http://purl.obolibrary.org/obo/OBI_0002089	physical store	http://purl.obolibrary.org/obo/OBI_0001034	environmental control device		A container with an environmental control function.
http://purl.obolibrary.org/obo/OBI_0100026	organism	http://purl.obolibrary.org/obo/BFO_0000040	material entity		A material entity that is an individual living system, such as animal, plant, bacteria or virus, that is capable of replicating or reproducing, growth and maintenance in the right environment. An organism may be unicellular or made up, like humans, of many billions of cells divided into specialized tissues and organs.
http://purl.obolibrary.org/obo/OBI_0100051	specimen	http://purl.obolibrary.org/obo/BFO_0000040	material entity		A material entity that has the specimen role.
http://purl.obolibrary.org/obo/OBI_0100064	screening library	http://purl.obolibrary.org/obo/OBI_0000047	processed material		a screening library is a collection of materials engineered to identify qualities of a subset of its members during a screening process?
http://purl.obolibrary.org/obo/OBI_0200031	differential expression analysis objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective		A differential expression analysis objective is a data transformation objective whose input consists of expression levels of entities (such as transcripts or proteins), or of sets of such expression levels, under two or more conditions and whose output reflects which of these are likely to have different expression across such conditions.
http://purl.obolibrary.org/obo/OBI_0200036	Benjamini and Hochberg false discovery rate correction method	http://purl.obolibrary.org/obo/OBI_0200163	false discovery rate correction method		A data transformation process in which the Benjamini and Hochberg method sequential p-value procedure  is applied with the aim of correcting false discovery rate
http://purl.obolibrary.org/obo/OBI_0200041	k-means clustering	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating		A k-means clustering is a data transformation which achieves a class discovery or partitioning objective, which takes as input a collection of objects (represented as points in multidimensional space) and which partitions them into a specified number k of clusters. The algorithm attempts to find the centers of natural clusters in the data. The most common form of the algorithm starts by partitioning the input points into k initial sets, either at random or using some heuristic data. It then calculates the mean point, or centroid, of each set. It constructs a new partition by associating each point with the closest centroid. Then the centroids are recalculated for the new clusters, and the algorithm repeated by alternate applications of these two steps until convergence, which is obtained when the points no longer switch clusters (or alternatively centroids are no longer changed).
http://purl.obolibrary.org/obo/OBI_0200042	hierarchical clustering	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating		A hierarchical clustering is a data transformation which achieves a class discovery objective, which takes as input data item and builds a hierarchy of clusters. The traditional representation of this hierarchy is a tree (visualized by a dendrogram), with the individual input objects at one end (leaves) and a single cluster containing every object at the other (root).
http://purl.obolibrary.org/obo/OBI_0200043	average linkage hierarchical clustering	http://purl.obolibrary.org/obo/OBI_0200154	agglomerative hierarchical clustering		An average linkage hierarchical clustering is an agglomerative hierarchical clustering which generates successive clusters based on a distance measure, where the distance between two clusters is calculated as the average distance between objects from the first cluster and objects from the second cluster.
http://purl.obolibrary.org/obo/OBI_0200044	complete linkage hierarchical clustering	http://purl.obolibrary.org/obo/OBI_0200154	agglomerative hierarchical clustering		an agglomerative hierarchical clustering which generates successive clusters based on a distance measure, where the distance between two clusters is calculated as the maximum distance between objects from the first cluster and objects from the second cluster.
http://purl.obolibrary.org/obo/OBI_0200045	single linkage hierarchical clustering	http://purl.obolibrary.org/obo/OBI_0200154	agglomerative hierarchical clustering		A single linkage hierarchical clustering is an agglomerative hierarchical clustering which generates successive clusters based on a distance measure, where the distance between two clusters is calculated as the minimum distance between objects from the first cluster and objects from the second cluster.
http://purl.obolibrary.org/obo/OBI_0200049	Benjamini and Yekutieli false discovery rate correction method	http://purl.obolibrary.org/obo/OBI_0200163	false discovery rate correction method		A data transformation in which the Benjamini and Yekutieli method is applied with the aim of correcting false discovery rate
http://purl.obolibrary.org/obo/OBI_0200066	Holm-Bonferroni family-wise error rate correction method	http://purl.obolibrary.org/obo/OBI_0200073	family wise error rate correction method		a data transformation that performs more than one hypothesis test simultaneously, a closed-test procedure, that  controls the familywise error rate for all the k hypotheses at level α in the strong sense. Objective: multiple testing correction
http://purl.obolibrary.org/obo/OBI_0200078	descriptive statistical calculation objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective		A descriptive statistical calculation objective is a data transformation objective which concerns any calculation intended to describe a feature of a data set, for example, its center or its variability.
http://purl.obolibrary.org/obo/OBI_0200094	logarithmic transformation	http://purl.obolibrary.org/obo/OBI_0200169	normalization data transformation		A logarithmic transformation is a data transformation consisting in the application of the logarithm function with a given base a (where a>0 and a is not equal to 1) to a (one dimensional) positive real number input. The logarithm function with base a can be defined as the inverse of the exponential function with the same base. See e.g. http://en.wikipedia.org/wiki/Logarithm.
http://purl.obolibrary.org/obo/OBI_0200102	regression analysis method	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		Regression analysis is a descriptive statistics technique that examines the relation of a dependent variable (response variable) to specified independent variables (explanatory variables). Regression analysis can be used as a descriptive method of data analysis (such as curve fitting) without relying on any assumptions about underlying processes generating the data.
http://purl.obolibrary.org/obo/OBI_0200104	principal component regression	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating		The Principal Component Regression method is a regression analysis method that combines the Principal Component Analysis (PCA)spectral decomposition with an Inverse Least Squares (ILS) regression method to create a quantitative model for complex samples. Unlike quantitation methods based directly on Beer's Law which attempt to calculate the absorbtivity coefficients for the constituents of interest from a direct regression of the constituent concentrations onto the spectroscopic responses, the PCR method regresses the concentrations on the PCA scores.
http://purl.obolibrary.org/obo/OBI_0200117	mode calculation	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating		A mode calculation is a descriptive statistics calculation in which the mode is calculated which is the most common value in a data set. It is most often used as a measure of center for discrete data.
http://purl.obolibrary.org/obo/OBI_0200119	median calculation	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating		A median calculation is a  descriptive statistics calculation in which the midpoint of the data set (the 0.5 quantile) is calculated.  First, the observations are sorted in increasing order.  For an odd number of observations, the median is the middle value of the sorted data.  For an even number of observations, the median is the average of the two middle values.
http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective	http://purl.obolibrary.org/obo/IAO_0000005	objective specification		An objective specification to transformation input data into output data
http://purl.obolibrary.org/obo/OBI_0200167	data normalization objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective		A normalization objective is a data transformation objective where the aim is to remove
systematic sources of variation to put the data on equal footing in order
to create a common base for comparisons.
http://purl.obolibrary.org/obo/OBI_0200168	correction objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective		A correction objective is a data transformation objective where the aim is to correct for error, noise or other impairments to the input of the data transformation or derived from the data transformation itself
http://purl.obolibrary.org/obo/OBI_0200169	normalization data transformation	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating		A normalization data transformation is a data transformation that has objective normalization.
http://purl.obolibrary.org/obo/OBI_0200170	averaging data transformation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		An averaging data transformation is a data transformation that has objective averaging.
http://purl.obolibrary.org/obo/OBI_0200171	partitioning data transformation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		A partitioning data transformation is a data transformation that has objective partitioning.
http://purl.obolibrary.org/obo/OBI_0200172	partitioning objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective		A partitioning objective is a data transformation objective where the aim is to generate a collection of disjoint non-empty subsets whose union equals a non-empty input set.
http://purl.obolibrary.org/obo/OBI_0200177	center calculation objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective		A center calculation objective is a data transformation objective where the aim is to calculate the center of an input data set.
http://purl.obolibrary.org/obo/OBI_0200181	center calculation data transformation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		A center calculation data transformation is a data transformation that has objective of center calculation.
http://purl.obolibrary.org/obo/OBI_0200184	descriptive statistical calculation data transformation	http://purl.obolibrary.org/obo/APOLLO_SV_00000796	dataset creating		A descriptive statistical calculation data transformation is a data transformation that has objective descriptive statistical calculation and which concerns any calculation intended to describe a feature of a data set, for example, its center or its variability.
http://purl.obolibrary.org/obo/OBI_0200186	error correction objective	http://purl.obolibrary.org/obo/OBI_0200168	correction objective		An error correction objective is a data transformation objective where the aim is to remove (correct for) erroneous contributions arising from the input data, or the transformation itself.
http://purl.obolibrary.org/obo/OBI_0200191	gene list visualization	http://purl.obolibrary.org/obo/OBI_0200111	data visualization		Adata visualization which has input of a gene list and produces an output of a report graph which is capable of rendering data of this type.
http://purl.obolibrary.org/obo/OBI_0200194	survival analysis data transformation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		A data transformation which has the objective of performing survival analysis.
http://purl.obolibrary.org/obo/OBI_0200200	chi square test	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test		The chi-square test is a data transformation with the objective of statistical hypothesis testing, in which the sampling distribution of the test statistic is a chi-square distribution when the null hypothesis is true, or any in which this is asymptotically true, meaning that the sampling distribution (if the null hypothesis is true) can be made to approximate a chi-square distribution as closely as desired by making the sample size large enough.
http://purl.obolibrary.org/obo/OBI_0300311	observation design	http://purl.obolibrary.org/obo/OBI_0500000	study design		observation design is a study design in which subjects are monitored in the absence of any active intervention by experimentalists.
http://purl.obolibrary.org/obo/OBI_0302884	extraction	http://purl.obolibrary.org/obo/OBI_0600014	material component separation		A material separation in which a desired component of an input material is separated from the remainder
http://purl.obolibrary.org/obo/OBI_0302900	group randomization	http://purl.obolibrary.org/obo/OBI_0600015	group assignment		A group assignment which relies on chance to assign materials to a group of materials in order to avoid bias in experimental set up.
http://purl.obolibrary.org/obo/OBI_0302903	nucleic acid hybridization	http://purl.obolibrary.org/obo/OBI_0000094	material processing		a planned process by which totally or partially complementary, single-stranded nucleic acids are  combined into a single molecule called heteroduplex or homoduplex to an extent depending on the amount of complementarity.
http://purl.obolibrary.org/obo/OBI_0400044	flow cytometer	http://purl.obolibrary.org/obo/OBI_0400137	cytometer		A flow_cytometer is an instrument for counting, examining and sorting microscopic particles in suspension. It allows simultaneous multiparametric analysis of the physical and/or chemical characteristics of single cells flowing through an optical and/or electronic detection apparatus. A flow cytometer is an instrument that can be used to quantitatively measure the properties of individual cells in a flowing medium.
http://purl.obolibrary.org/obo/OBI_0400065	light source	http://purl.obolibrary.org/obo/OBI_0001032	light emission device		A light source is an optical subsystem that provides light for use in a distant area using a delivery system (e.g., fiber optics). Light sources may include one of a variety of lamps (e.g., xenon, halogen, mercury). Most light sources are operated from line power, but some may be powered from batteries. They are mostly used in endoscopic, microscopic, and other examination and/or in surgical procedures. The light source is part of the optical subsystem. In a flow cytometer the light source directs high intensity light at particles at the interrogation point. The light source in a flow cytometer is usually a laser.
http://purl.obolibrary.org/obo/OBI_0400078	obscuration bar	http://purl.obolibrary.org/obo/OBI_0000932	material separation device		An obscuration bar is a an optical subsystem which is a strip of metal or other material that serves to block out direct light from the illuminating beam. The obscuration bar prevents the bright light scattered in the forward directions from burning out the collection device.
http://purl.obolibrary.org/obo/OBI_0400079	optical filter	http://purl.obolibrary.org/obo/OBI_0000932	material separation device		An optical filter is an optical subsystem that selectively transmits light having certain properties (often, a particular range of wavelengths, that is, range of colours of light), while blocking the remainder. They are commonly used in photography, in many optical instruments, and to colour stage lighting Optical filters can be arranged to segregate and collect light by wave length.
http://purl.obolibrary.org/obo/OBI_0400082	photodetector	http://purl.obolibrary.org/obo/OBI_0000832	measurement device		A photodetector is a device used to detect and measure the intensity of radiant energy through photoelectric action. In a cytometer, photodetectors measure either the number of photons of laser light scattered on impact with a cell (for example), or the flourescence emitted by excitation of a fluorescent dye.
http://purl.obolibrary.org/obo/OBI_0400103	DNA sequencer	http://purl.obolibrary.org/obo/OBI_0000832	measurement device		A DNA sequencer is an instrument that determines the order of deoxynucleotides in deoxyribonucleic acid sequences.
http://purl.obolibrary.org/obo/OBI_0400110	hybridization chamber	http://purl.obolibrary.org/obo/OBI_0002089	physical store		A device which is used to maintain constant contact of a liquid on an array.  This can be either a glass vial or slide.
http://purl.obolibrary.org/obo/OBI_0400137	cytometer	http://purl.obolibrary.org/obo/OBI_0000832	measurement device		A cytometer is an instrument for counting and measuring cells.
http://purl.obolibrary.org/obo/OBI_0400147	microarray	http://purl.obolibrary.org/obo/OBI_0001865	assay array		A processed material that is made to be used in an analyte assay. It consists of a physical immobilisation matrix in which substances that bind the analyte are placed in regular spatial position.
http://purl.obolibrary.org/obo/OBI_0400153	droplet sorter	http://purl.obolibrary.org/obo/OBI_0000932	material separation device		A droplet sorter is part_of a flow cytometer sorter that converts the carrier fluid stream into individual droplets, and these droplets are directed into separate locations for recovery (enriching the original
sample for particles of interest based on qualities determined by gating) or disposal.
http://purl.obolibrary.org/obo/OBI_0500002	repeated measure design	http://purl.obolibrary.org/obo/OBI_0500000	study design		a study design which use the same individuals and exposure them to a set of conditions. The effect of order and practice can be confounding factor in such designs
http://purl.obolibrary.org/obo/OBI_0500003	cross over design	http://purl.obolibrary.org/obo/STATO_0000046	block design		a repeated measure design which ensures that experimental units receive, in sequence, the treatment (or the control), and then, after a specified time interval (aka *wash-out periods*), switch to the control (or treatment). In this design, subjects (patients in human context) serve as their own controls, and randomization may be used to determine the ordering which a subject receives the treatment and control
http://purl.obolibrary.org/obo/OBI_0500005	matched pairs design	http://purl.obolibrary.org/obo/STATO_0000046	block design		A matched pair design is a study design which use groups of individuals associated (hence matched) to each other based on a set of criteria, one member going to one treatment, the other member receiving the other treatment.
http://purl.obolibrary.org/obo/OBI_0500007	randomized complete block design	http://purl.obolibrary.org/obo/STATO_0000046	block design		A randomized complete block design is_a study design which assigns randomly treatments to block. The number of units per block equals the number of treatment so each block receives each treatment exactly once (hence the qualifier 'complete'). The design was originally devised from field trials used in agronomy and agriculture. The analysis assumes that there is no interaction between block and treatment. The method was then used in other settings So The randomised complete block design is a design in which the subjects are matched according to a variable which the experimenter wishes to control. The subjects are put into groups (blocks) of the same size as the number of treatments. The members of each block are then randomly assigned to different treatment groups.
http://purl.obolibrary.org/obo/OBI_0500011	latin square design	http://purl.obolibrary.org/obo/OBI_0500007	randomized complete block design		Latin square design is_a study design which allows in its simpler form controlling 2 levels of nuisance variables (also known as blocking variables).he 2 nuisance factors are divided into a tabular grid with the property that each row and each column receive each treatment exactly once.
http://purl.obolibrary.org/obo/OBI_0500012	graeco latin square design	http://purl.obolibrary.org/obo/OBI_0500007	randomized complete block design		Greco-Latin square design is a study design which relates to Latin square design
http://purl.obolibrary.org/obo/OBI_0500013	hyper graeco latin square design	http://purl.obolibrary.org/obo/OBI_0500007	randomized complete block design		PRS to do
http://purl.obolibrary.org/obo/OBI_0500014	factorial design	http://purl.obolibrary.org/obo/OBI_0500000	study design		factorial design is_a study design which is used to evaluate two or more factors simultaneously. The treatments are combinations of levels of the factors. The advantages of factorial designs over one-factor-at-a-time experiments is that they are more efficient and they allow interactions to be detected. In statistics, a factorial design experiment is an experiment whose design consists of two or more factors, each with discrete possible values or levels, and whose experimental units take on all possible combinations of these levels across all such factors. Such an experiment allows studying the effect of each factor on the response variable, as well as the effects of interactions between factors on the response variable.
http://purl.obolibrary.org/obo/OBI_0500015	2x2 factorial design	http://purl.obolibrary.org/obo/STATO_0000270	full factorial design		a factorial design which has 2 experimental factors (aka independent variables) and 2 factor levels per experimental factors
http://purl.obolibrary.org/obo/OBI_0500016	fractional factorial design	http://purl.obolibrary.org/obo/OBI_0500014	factorial design		A fractional factorial design is_a study design in which only an adequately chosen fraction of the treatment combinations required for the complete factorial experiment is selected to be run
http://purl.obolibrary.org/obo/OBI_0500017	dye swap design	http://purl.obolibrary.org/obo/STATO_0000046	block design		An experiment design type where the label orientations are reversed. exact synonym: flip dye, dye flip
http://purl.obolibrary.org/obo/OBI_0500020	time series design	http://purl.obolibrary.org/obo/OBI_0500000	study design		Groups of assays that are related as part of a time series.
http://purl.obolibrary.org/obo/OBI_0600005	collecting specimen from organism	http://purl.obolibrary.org/obo/OBI_0000659	specimen collection process		a process with the objective to obtain a material entity that was part of an organism for potential future use in an investigation
http://purl.obolibrary.org/obo/OBI_0600014	material component separation	http://purl.obolibrary.org/obo/OBI_0000094	material processing		a material processing in which components of an input material become segregated in space
http://purl.obolibrary.org/obo/OBI_0600024	maintaining cell culture	http://purl.obolibrary.org/obo/OBI_0000838	material maintenance		a protocol application in which cells are kept alive in a defined environment outside of an organism.  part of cell_culturing
http://purl.obolibrary.org/obo/OBI_0600028	artificially induced reverse transcription	http://purl.obolibrary.org/obo/OBI_0000094	material processing		a protocol with the objective to transcribe single-stranded RNA into complementary DNA (cDNA)
http://purl.obolibrary.org/obo/OBI_0600036	establishing cell culture	http://purl.obolibrary.org/obo/OBI_0000094	material processing		a process through which a new type of cell culture or cell line is created, either through the isolation and culture of one or more cells from a fresh source, or the deliberate experimental modification of an existing cell culture (e.g passaging a primary culture to become a secondary culture or line, or the immortalization or stable genetic modification of an existing culture or line).
http://purl.obolibrary.org/obo/OBI_0600047	sequencing assay	http://purl.obolibrary.org/obo/OBI_0000070	assay		An assay the uses chemical or biochemical means to infer the sequence of a biomaterial
http://purl.obolibrary.org/obo/OBI_0666666	RNA extraction	http://purl.obolibrary.org/obo/OBI_0666667	nucleic acid extraction		A RNA extraction is a nucleic acid extraction where the desired output material is RNA
http://purl.obolibrary.org/obo/OBI_0666667	nucleic acid extraction	http://purl.obolibrary.org/obo/OBI_0302884	extraction		a material separation to recover the nucleic acid fraction of an input material
http://purl.obolibrary.org/obo/OBI_1110108	material to be added	http://purl.obolibrary.org/obo/BFO_0000040	material entity		a material that is added to another one in a material combination process
http://purl.obolibrary.org/obo/OBI_1110109	target of material addition	http://purl.obolibrary.org/obo/BFO_0000040	material entity		A material entity into which another is being added in a material combinatino process
http://purl.obolibrary.org/obo/OGMS_0000023	phenotype	http://purl.obolibrary.org/obo/BFO_0000019	quality		A (combination of) quality(ies) of an organism determined by the interaction of its genetic make-up and environment that differentiates specific instances of a species from other instances of the same species.
http://purl.obolibrary.org/obo/PATO_0000018	fluorescence	http://purl.obolibrary.org/obo/BFO_0000016	disposition		A luminous flux quality inhering in a bearer by virtue of the bearer's emitting longer wavelength light following the absorption of shorter wavelength radiation; fluorescence is common with aromatic compounds with several rings joined together.
http://purl.obolibrary.org/obo/PATO_0000125	mass	http://purl.obolibrary.org/obo/BFO_0000019	quality		A physical quality that inheres in a bearer by virtue of the proportion of the bearer's amount of matter.
http://purl.obolibrary.org/obo/PR_000000001	protein	http://purl.obolibrary.org/obo/CHEBI_16670	peptide		An amino acid chain that is produced de novo by ribosome-mediated translation of a genetically-encoded mRNA, and any derivatives thereof.
http://purl.obolibrary.org/obo/UO_0000001	length unit	http://purl.obolibrary.org/obo/IAO_0000003	measurement unit label		A unit which is a standard measure of the distance between two points.
http://purl.obolibrary.org/obo/UO_0000002	mass unit	http://purl.obolibrary.org/obo/IAO_0000003	measurement unit label		A unit which is a standard measure of the amount of matter/energy of a physical object.
http://purl.obolibrary.org/obo/UO_0000003	time unit	http://purl.obolibrary.org/obo/IAO_0000003	measurement unit label		A unit which is a standard measure of the dimension in which events occur in sequence.
http://purl.obolibrary.org/obo/STATO_0000017	Wilk's lambda distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		Wilks' lambda distribution (named for Samuel S. Wilks), is a probability distribution used in multivariate hypothesis testing, especially with regard to the likelihood-ratio test and Multivariate analysis of variance. It is a multivariate generalization of the univariate F-distribution, and generalizes the F-distribution in the same way that the Hotelling's T-squared distribution generalizes Student's t-distribution.
http://purl.obolibrary.org/obo/STATO_0000057	null hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		A null hypothesis is a statistical hypothesis that is tested for possible rejection under the assumption that it is true (usually that observations are the result of chance). The concept was introduced by R. A. Fisher.
The hypothesis contrary to the null hypothesis, usually that the observations are the result of a real effect, is known as the alternative hypothesis.[wolfram alpha]
http://purl.obolibrary.org/obo/STATO_0000060	hypergeometric distribution	http://purl.obolibrary.org/obo/STATO_0000117	discrete probability distribution		hypergeometric distribution is a probability distribution that describes the probability of  k successes in n draws from a finite population of size  N containing  K successes without replacement
http://purl.obolibrary.org/obo/STATO_0000066	Cleveland dot plot	http://purl.obolibrary.org/obo/IAO_0000037	dot plot		Cleveland dot plot is a dot plot which plots points that each belong to one of several categories. They are an alternative to bar charts or pie charts, and look somewhat like a horizontal bar chart where the bars are replaced by a dots at the values associated with each category. Compared to (vertical) bar charts and pie charts, Cleveland argues that dot plots allow more accurate interpretation of the graph by readers by making the labels easier to read, reducing non-data ink (or graph clutter) and supporting table look-up.which
http://purl.obolibrary.org/obo/STATO_0000075	rarefaction curve	http://purl.obolibrary.org/obo/IAO_0000573	line graph		a rarefaction curve is a graph used for estimating species richness in ecology studies
http://purl.obolibrary.org/obo/STATO_0000103	multinomial distribution	http://purl.obolibrary.org/obo/STATO_0000117	discrete probability distribution		the multinomial distribution is a probability distribution which gives the probability of any particular combination of numbers of successes for various categories defined in the context of  n independent trials each of which leads to a success for exactly one of k categories, with each category having a given fixed success probability.
http://purl.obolibrary.org/obo/STATO_0000112	Funnel plot	http://purl.obolibrary.org/obo/IAO_0000184	scatter plot		a Funnel plot is a scatter plot of treatment effect versus a measure of study size and aims to provide a visual aid to detecting bias or systematic heterogeneity. A symmetric inverted funnel shape arises from a ‘well-behaved’ data set, in which publication bias is unlikely. An asymmetric funnel indicates a relationship between treatment effect and study size. 
Known caveats: If high precision studies really are different from low precision studies with respect to effect size (e.g., due to different populations examined) a funnel plot may give a wrong impression of publication bias. The appearance of the funnel plot can change quite dramatically depending on the scale on the y-axis — whether it is the inverse square error or the trial size.
Funnel plot was introduced by Light and Palmer in 1984.
http://purl.obolibrary.org/obo/STATO_0000120	bean plot	http://purl.obolibrary.org/obo/IAO_0000038	graph		beanplot is a plot in which (one or) multiple batches ("beans") are shown. Each bean consists of a density trace, which is mirrored to
form a polygon shape. Next to that, a one-dimensional scatter plot shows all the individual measurements, like in a stripchart.

The name beanplot stems from green beans. The density shape can be seen as the pod of a green bean, while the scatter plot shows the seeds inside the pod.
http://purl.obolibrary.org/obo/STATO_0000122	pedigree chart	http://purl.obolibrary.org/obo/IAO_0000038	graph		a pedigree chart is a graph which plots parent child relations
http://purl.obolibrary.org/obo/STATO_0000125	Wilkinson dot plot	http://purl.obolibrary.org/obo/IAO_0000037	dot plot		The dot plot as a representation of a distribution consists of group of data points plotted on a simple scale. Dot plots are used for continuous, quantitative, univariate data. Data points may be labelled if there are few of them.
http://purl.obolibrary.org/obo/STATO_0000126	volcano plot	http://purl.obolibrary.org/obo/IAO_0000184	scatter plot		volcano plot is a kind of scatter plot which graphs the negative log of the p-value (significance) on the y-axis versus log2 of fold-change between 2 conditions on the  x-axis. 
It is a popular method for visualizing differential occurence of variables between 2 conditions.
http://purl.obolibrary.org/obo/STATO_0000128	Altman box and whisker plot	http://purl.obolibrary.org/obo/STATO_0000243	box and whisker plot		Altman Box and Whisker plot is a variation of Tukey Box and Whisker plot which use the criteria of Altman to create the 'whisker' of the plot.
http://purl.obolibrary.org/obo/STATO_0000132	Hotelling T2 distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		Hotelling T squared distribution is a probability distribution used in multivariate hypothesis testing, which is a univariate distribution proportional to the F-distribution and arises importantly as the distribution of a set of statistics which are natural generalizations of the statistics underlying Student's t-distribution.
In particular, the distribution arises in multivariate statistics in undertaking tests of the differences between the (multivariate) means of different populations, where tests for univariate problems would make use of a t-test.
The distribution is named for Harold Hotelling, who developed it[1] as a generalization of Student's t-distribution.
This distribution is commonly used to describe the sample Mahalanobis distance between two populations.
http://purl.obolibrary.org/obo/STATO_0000145	geometric distribution	http://purl.obolibrary.org/obo/STATO_0000283	negative binomial distribution		the geometric distribution is a negative binomial distribution where r is 1.
It is useful for modeling the runs of consecutive successes (or failures) in repeated independent trials of a system.

The geometric distribution models the number of successes before one failure in an independent succession of tests where each test results in success or failure.


The geometric distribution with prob = p has density

p(x) = p (1-p)^x

for x = 0, 1, 2, …, 0 < p ≤ 1.

If an element of x is not integer, the result of dgeom is zero, with a warning.

The quantile is defined as the smallest value x such that F(x) ≥ p, where F is the distribution function.
http://purl.obolibrary.org/obo/STATO_0000147	LD plot	http://purl.obolibrary.org/obo/IAO_0000038	graph		Linkage Disequilibrium plot is a graph which represents pairwise linkage disequilibrium measures between SNP as a heatmap
http://purl.obolibrary.org/obo/STATO_0000154	violin plot	http://purl.obolibrary.org/obo/IAO_0000038	graph		a violin plot is a plot combining the features of box plot and kernel density plot. The violin plot is therefore similar to box plot but it incorporated in the display the probability density of the data at different values.
Typically violin plots will include a marker for the median of the data and a box indicating the interquartile range, as in standard box plots.
http://purl.obolibrary.org/obo/STATO_0000159	stacked bar chart	http://purl.obolibrary.org/obo/STATO_0000166	bar chart		Stacked bar chart is a bar which is used to compare overall quantities across items while showing the contribution of category to the total amount. Stacked bar chart can be used  for highlighting the total as they visually aggregate all of the categories in a group while indicating a part to whole relationship. The downside is that it becomes harder to compare the sizes of the individual categories.
http://purl.obolibrary.org/obo/STATO_0000165	pie chart	http://purl.obolibrary.org/obo/IAO_0000038	graph		a pie chart is a graph in which a circular graph is divided into sector illustrating numerical proportion, meaning that the arc length of each sector (and consequently its central angle and area), is proportional to the quantity it represents.
http://purl.obolibrary.org/obo/STATO_0000166	bar chart	http://purl.obolibrary.org/obo/IAO_0000038	graph		the bart chart is a graph resulting from plotting rectangular bars with lengths proportional to the values that they represent.
http://purl.obolibrary.org/obo/STATO_0000168	real time quantitative pcr plot	http://purl.obolibrary.org/obo/IAO_0000573	line graph		a real time quantitative pcr plot is a line graph which plots the signal fluorescence intensity as a function of the number of PCR cycle
http://purl.obolibrary.org/obo/STATO_0000171	Spear box and whisker plot	http://purl.obolibrary.org/obo/STATO_0000243	box and whisker plot		Spear Box and Whisker plot is a variation of Tukey Box and Whisker plot which use the criteria of Spear to create the 'whisker' of the plot.
http://purl.obolibrary.org/obo/STATO_0000174	Forest plot	http://purl.obolibrary.org/obo/IAO_0000038	graph		A forest plot  is a graph designed to illustrate the relative strength of treatment effects in multiple quantitative scientific studies addressing the same question.
http://purl.obolibrary.org/obo/STATO_0000177	beta distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		the beta distribution is a continuous probability distributions defined on the interval [0, 1] parametrized by two positive shape parameters, denoted by α and β, that appear as exponents of the random variable and control the shape of the distribution
http://purl.obolibrary.org/obo/STATO_0000180	standard normal distribution	http://purl.obolibrary.org/obo/STATO_0000227	normal distribution		standard normal distribution is a normal distribution with variance = 1 and mean=0
http://purl.obolibrary.org/obo/STATO_0000186	Lineweaver-Burk plot	http://purl.obolibrary.org/obo/IAO_0000038	graph		Lineweaver-Burk plot is a graph which  is the graphical representation of the Lineweaver–Burk equation of enzyme kinetics, described by Hans Lineweaver and Dean Burk in 1934. The plot provides a useful graphical method for analysis of the Michaelis–Menten equation.
It was widely used to determine important terms in enzymology and enzyme kinetics as  the x-intercept of the graph represents −1/Km and the y-intercept of such a graph is equivalent to the inverse of Vmax
http://purl.obolibrary.org/obo/STATO_0000219	RT-PCR standard curve	http://purl.obolibrary.org/obo/IAO_0000573	line graph		a real time pcr standard curve is a line graph which plots the fluorescence intensity signal as a function of the concentration of a sample used as reference  and used to determine relative abundance of test samples
http://purl.obolibrary.org/obo/STATO_0000241	Q-Q plot	http://purl.obolibrary.org/obo/IAO_0000038	graph		Q-Q plot or quantile-quantile plot is the output of a graphical method for comparing two probability distributions by plotting their quantiles against each other
http://purl.obolibrary.org/obo/STATO_0000256	manhattan plot for gwas	http://purl.obolibrary.org/obo/IAO_0000184	scatter plot		a manhattan plot for gwas is a kind of scatter plot used to facilitate presentation of genome-wide association study (GWAS) data. Genomic coordinates are displayed along the X-axis, with the negative logarithm of the association P-value for each single nucleotide polymorphism displayed on the Y-axis.
http://purl.obolibrary.org/obo/STATO_0000262	Bernoulli distribution	http://purl.obolibrary.org/obo/STATO_0000276	binomial distribution		Bernoulli distribution is a binomial distribution where the number of trials is equal to 1.

notation: B(1,p)

The mean is p

The variance is p*q
http://purl.obolibrary.org/obo/STATO_0000263	Galbraith plot	http://purl.obolibrary.org/obo/IAO_0000184	scatter plot		Galbraith (Radial) plot is a scatter plot which can be used in the meta-analytic context to examine the data for heterogeneity. For a fixed-effects model, the plot shows the inverse of the standard errors on the horizontal axis against the individual observed effect sizes or outcomes standardized by their corresponding standard errors on the vertical axis.
Radial plots were introduced by Rex Galbraith (1988a, 1988b, 1994).
http://purl.obolibrary.org/obo/STATO_0000267	grouped bar chart	http://purl.obolibrary.org/obo/STATO_0000166	bar chart		grouped bar chart is a kind of bar chart which juxtaposes the  discrete values for each of the possible value of a given categorical variable, thus providing  within group comparison. Grouped bar charts are good for comparing between each element in the categories, and comparing elements across categories. However, the grouping can make it harder to tell the difference between the total of each group.
http://purl.obolibrary.org/obo/CHEBI_16991	deoxyribonucleic acid	http://purl.obolibrary.org/obo/CHEBI_33696	nucleic acid		High molecular weight, linear polymers, composed of nucleotides containing deoxyribose and linked by phosphodiester bonds; DNA contain the genetic information of organisms.
http://purl.obolibrary.org/obo/CHEBI_33696	nucleic acid	http://purl.obolibrary.org/obo/CHEBI_33839	macromolecule		A macromolecule made up of nucleotide units and hydrolysable into certain pyrimidine or purine bases (usually adenine, cytosine, guanine, thymine, uracil), D-ribose or 2-deoxy-D-ribose and phosphoric acid.
http://purl.obolibrary.org/obo/CL_0000000	cell	http://purl.obolibrary.org/obo/BFO_0000040	material entity		A material entity of anatomical origin (part of or deriving from an organism) that has as its parts a maximally connected cell compartment surrounded by a plasma membrane.
http://purl.obolibrary.org/obo/CL_0000010	cultured cell	http://purl.obolibrary.org/obo/CL_0000578	experimentally modified cell in vitro		A cell in vitro that is or has been maintained or propagated as part of a cell culture.
http://purl.obolibrary.org/obo/CL_0000578	experimentally modified cell in vitro	http://purl.obolibrary.org/obo/CL_0000000	cell		A cell in vitro that has undergone physical changes as a consequence of a deliberate and specific experimental procedure.
http://purl.obolibrary.org/obo/STATO_0000438	log normal distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normally distributed. Thus, if the random variable {\displaystyle X} X is log-normally distributed, then {\displaystyle Y=\ln(X)} Y=\ln(X) has a normal distribution. Likewise, if {\displaystyle Y} Y has a normal distribution, then {\displaystyle X=\exp(Y)} X=\exp(Y) has a log-normal distribution. A random variable which is log-normally distributed takes only positive real values. The distribution is occasionally referred to as the Galton distribution or Galton's distribution, after Francis Galton.
http://purl.obolibrary.org/obo/STATO_0000703	posterior probability	http://purl.obolibrary.org/obo/STATO_0000039	statistic		A statistic that represents the likelihood of a parameter of interest, after updating its prior probability with the observed data from the study.
http://purl.obolibrary.org/obo/CHEBI_16670	peptide	http://purl.obolibrary.org/obo/CHEBI_33839	macromolecule		Amide derived from two or more amino carboxylic acid molecules (the same or different) by formation of a covalent bond from the carbonyl carbon of one to the nitrogen atom of another with formal loss of water. The term is usually applied to structures formed from alpha-amino acids, but it includes those derived from any amino carboxylic acid. X = OH, OR, NH2, NHR, etc.
http://purl.obolibrary.org/obo/CHEBI_33697	ribonucleic acid	http://purl.obolibrary.org/obo/CHEBI_33696	nucleic acid		High molecular weight, linear polymers, composed of nucleotides containing ribose and linked by phosphodiester bonds; RNA is central to the synthesis of proteins.
http://purl.obolibrary.org/obo/UBERON_0000465	material anatomical entity	http://purl.obolibrary.org/obo/BFO_0000040	material entity		Anatomical entity that has mass.
http://purl.obolibrary.org/obo/UBERON_0000477	anatomical cluster	http://purl.obolibrary.org/obo/BFO_0000040	material entity		Anatomical group that has its parts adjacent to one another.
http://purl.obolibrary.org/obo/STATO_0000209	area under curve	http://purl.obolibrary.org/obo/STATO_0000039	statistic		area under curve is a measurement datum which corresponds to the surface define by the x-axis and bound by the line graph represented in a 2 dimensional plot resulting from an integration or integrative calculus. The interpretation of this measurement datum depends on the variables plotted in the graph
http://purl.obolibrary.org/obo/STATO_0000009	cartesian coordinate system	http://purl.obolibrary.org/obo/STATO_0000010	coordinate system		A Cartesian coordinate system is a coordinate system that specifies each point uniquely in a plane by a pair of numerical coordinates, which are the signed distances from the point to two fixed perpendicular directed lines, measured in the same unit of length.
http://purl.obolibrary.org/obo/STATO_0000010	coordinate system	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		In geometry, a coordinate system is a system which uses one or more numbers, or coordinates, to uniquely determine the position of a point or other geometric element on a manifold such as Euclidean space.
http://purl.obolibrary.org/obo/STATO_0000013	two dimensional cartesian coordinate system	http://purl.obolibrary.org/obo/STATO_0000009	cartesian coordinate system		a 2 dimensional cartesian coordinate system is a cartesian coordinate system which defines 2 orthogonal one dimensional axes and which may be used to describe a 2 dimensional spatial region.
http://purl.obolibrary.org/obo/STATO_0000014	spherical coordinate system	http://purl.obolibrary.org/obo/STATO_0000010	coordinate system		In mathematics, a spherical coordinate system is a coordinate system for three-dimensional space where the position of a point is specified by three numbers: the radial distance of that point from a fixed origin, its polar angle measured from a fixed zenith direction, and the azimuth angle of its orthogonal projection on a reference plane that passes through the origin and is orthogonal to the zenith, measured from a fixed reference direction on that plane.
http://purl.obolibrary.org/obo/STATO_0000015	cylindrical coordinate system	http://purl.obolibrary.org/obo/STATO_0000010	coordinate system		A cylindrical coordinate system is a three-dimensional coordinate system that specifies point positions by the distance from a chosen reference axis, the direction from the axis relative to a chosen reference direction, and the distance from a chosen reference plane perpendicular to the axis. The latter distance is given as a positive or negative number depending on which side of the reference plane faces the point.
http://purl.obolibrary.org/obo/STATO_0000016	polar coordinate system	http://purl.obolibrary.org/obo/STATO_0000010	coordinate system		In mathematics, the polar coordinate system is a two-dimensional coordinate system in which each point on a plane is determined by a distance from a fixed point and an angle from a fixed direction.
http://purl.obolibrary.org/obo/STATO_0000022	one dimensional cartesian coordinate system	http://purl.obolibrary.org/obo/STATO_0000009	cartesian coordinate system		A one dimensional cartesian coordinate system is a cartesian coordinate system which defines a one dimensional axis and which may be used to describe a one dimensional spatial region, i.e. a straight line. It is defined by a point O, the origin, a unit of length and the orientation for the one dimensional space.
http://purl.obolibrary.org/obo/STATO_0000024	three dimensional cartesian coordinate system	http://purl.obolibrary.org/obo/STATO_0000009	cartesian coordinate system		a three dimensional cartesian coordinate system is a cartesian coordinate system which defines 3 orthogonal one dimensional axes and which may be used to describe a 3 dimensional spatial region.
http://purl.obolibrary.org/obo/STATO_0000047	count	http://purl.obolibrary.org/obo/STATO_0000039	statistic		a count is a data item denoted by an integer and representing the number of instances or occurences of an entity
http://purl.obolibrary.org/obo/STATO_0000050	signal to noise ratio	http://purl.obolibrary.org/obo/STATO_0000184	ratio		signal to noise ratio is a measurement datum comparing the amount of meaningful, useful or interesting data (the signal) to the amount of irrelevant or false data (the noise). Depending on the field and domain of application, different variables will be used to determinate a 'signal to noise ratio'. In statistics, the definition of signal to noise ratio is the ratio of the mean of a measurement to its standard deviation. It thus corresponds to the inverse of the coefficient of variation
http://purl.obolibrary.org/obo/STATO_0000053	false positive rate	http://purl.obolibrary.org/obo/STATO_0000607	proportion		a false positive rate is a data item which accounts for the proportion of incorrect rejection of a true null hypothesis.
http://purl.obolibrary.org/obo/STATO_0000055	chromosome coordinate system	http://purl.obolibrary.org/obo/STATO_0000197	genomic coordinate system		chromosome coordinate system is a genomic coordinate which uses chromosome of a particular assembly build process to define start and end positions. This coordinate system is unstable and will change with each new genome sequence assembly build.
http://purl.obolibrary.org/obo/STATO_0000064	sequence read count	http://purl.obolibrary.org/obo/STATO_0000047	count		sequence read count is a data item determining how many sequence reads have been generated by a DNA sequencing assay for a given stretch of DNA
http://purl.obolibrary.org/obo/STATO_0000069	number of degrees of freedom	http://purl.obolibrary.org/obo/STATO_0000047	count		The number degree of freedom is a count evaluating the number of values in a calculation that can vary. In statistics, the number of degrees of freedom ν is equal to N-1 in the case of the direct measurement of a quantity estimated by the arithmetic mean of N independent observations.
http://purl.obolibrary.org/obo/STATO_0000071	reaction rate	http://purl.obolibrary.org/obo/STATO_0000645	rate		reaction rate is a measurement datum which represents the speed of a chemical reaction turning reactive species into product species of event (i.e the number of such conversions)s occuring over a time interval
http://purl.obolibrary.org/obo/STATO_0000087	polychotomous variable	http://purl.obolibrary.org/obo/STATO_0000252	categorical variable		a polychotomous variable  is a categorical variable  which is defined to have minimally 2 categories or possible values
http://purl.obolibrary.org/obo/STATO_0000090	dichotomous variable	http://purl.obolibrary.org/obo/STATO_0000087	polychotomous variable		a dichotomous variable  is a categorical variable  which is defined to have only 2 categories or possible values
http://purl.obolibrary.org/obo/STATO_0000169	fold change	http://purl.obolibrary.org/obo/STATO_0000300	dimensionless ratio		Fold change is a number describing how much a quantity changes going from an initial to a final value or one condition to another condition
http://purl.obolibrary.org/obo/STATO_0000182	odds ratio	http://purl.obolibrary.org/obo/STATO_0000622	ratio-based measure of association		Odds ratio is a ratio that measures effect size, that is the strength of association between 2 dichotomous variables, one describing an exposure and one describing an outcome.
It represents the odds that an outcome will occur given a particular exposure, compared to the odds of the outcome occurring in the absence of that exposure ( the probability of the event occuring divided by the probability of an event not occurring). The odds ratio is a ratio of describing the strength of association or non-independence between two binary data values by forming the ratio of the odds for the first group and the odds for the second group. Odds ratio are used when one wants to compare the odds of something occurring to two different groups.
http://purl.obolibrary.org/obo/STATO_0000184	ratio	http://purl.obolibrary.org/obo/STATO_0000039	statistic		A ratio is a data item which is formed with two numbers r and s is written r/s, where r is the numerator and s is the denominator. The ratio of r to s is equivalent to the quotient r/s.
http://purl.obolibrary.org/obo/STATO_0000197	genomic coordinate system	http://purl.obolibrary.org/obo/STATO_0000010	coordinate system		a genomic coordinate system is a coordinate system to describe position of sequence on a genomic scaffold (assembly of chromosome, contig....)
http://purl.obolibrary.org/obo/STATO_0000210	channel1/channel2 fluorescence intensity ratio	http://purl.obolibrary.org/obo/STATO_0000169	fold change		is a data item formed by dividing the fluorescence intensity obtained in one channel to that obtained in the other channel, typically the case when considering 2-color microarray data when imaging is done for Cy3 and Cy5 dyes.
http://purl.obolibrary.org/obo/STATO_0000220	false negative rate	http://purl.obolibrary.org/obo/STATO_0000607	proportion		the false negative rate is a data item which denotes the proportion of missed detection of elements known to be meeting the detection criteria
http://purl.obolibrary.org/obo/STATO_0000232	number of PCR cycle	http://purl.obolibrary.org/obo/STATO_0000047	count		number of PCR cycle is a count which enumerates how many iterations of 'annealing, renaturation, amplification,' rounds (or cycles) are performed during a polymerase chain reaction (PCR) or an assay relying on PCR.
http://purl.obolibrary.org/obo/STATO_0000251	continuous variable	http://purl.obolibrary.org/obo/STATO_0000722	quantitative variable		A continuous variable is one for which, within the limits the variable ranges, any value is possible.
http://purl.obolibrary.org/obo/STATO_0000254	allele frequency	http://purl.obolibrary.org/obo/STATO_0000184	ratio		The allele frequency is a data item which denotes the incidence of a gene variant in a population. It is calculated as a ratio, by dividing the number of copies of a particular allele by the number of copies of all alleles at the genetic place (locus) in a population.
http://purl.obolibrary.org/obo/STATO_0000258	variable	http://purl.obolibrary.org/obo/STATO_0000741	statistical model component		a variable is a data item which can assume any of a set of values, either as determined by an agent or as randomly occuring through observation.
http://purl.obolibrary.org/obo/STATO_0000300	dimensionless ratio	http://purl.obolibrary.org/obo/STATO_0000184	ratio		A ratio where the numerator and denominator are expressed in the same unit.
http://purl.obolibrary.org/obo/STATO_0000002	digital file	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		an electronic file is an information content entity which conforms to a specification or format and which is meant to hold data and information in digital form, accessible to software agents
http://purl.obolibrary.org/obo/STATO_0000003	balanced design	http://purl.obolibrary.org/obo/OBI_0500000	study design		a balanced design is a an experimental design where all experimental group have the an equal number of subject observations
http://purl.obolibrary.org/obo/STATO_0000005	single factor design	http://purl.obolibrary.org/obo/OBI_0500000	study design		a single factor design is a study design which declares exactly 1 independent variable
http://purl.obolibrary.org/obo/STATO_0000006	x-axis	http://purl.obolibrary.org/obo/STATO_0000011	cartesian coordinate axis		x-axis is a cartesian coordinate axis which is orthogonal to the y-axis and the z-axis
http://purl.obolibrary.org/obo/STATO_0000007	axis	http://purl.obolibrary.org/obo/IAO_0000573	line graph		an axis is a line graph used as reference line for the measurement of coordinates.
http://purl.obolibrary.org/obo/STATO_0000008	y-axis	http://purl.obolibrary.org/obo/STATO_0000011	cartesian coordinate axis		y-axis is a cartesian coordinate axis which is orthogonal to the x-axis and the z-axis
http://purl.obolibrary.org/obo/STATO_0000012	z-axis	http://purl.obolibrary.org/obo/STATO_0000011	cartesian coordinate axis		z-axis is a cartesian coordinate axis which is orthogonal to the x-axis and the y-axis
http://purl.obolibrary.org/obo/STATO_0000018	three dimensional cartesian spatial coordinate origin	http://purl.obolibrary.org/obo/STATO_0000026	cartesian spatial coordinate origin		A cartesian spatial coordinate datum chosen as a fixed point of reference in a three dimensional spatial region.
http://purl.obolibrary.org/obo/STATO_0000019	normal distribution hypothesis	http://purl.obolibrary.org/obo/STATO_0000058	goodness of fit hypothesis		normal distribution hypothesis is a goodness of fit hypothesis stating that the distribution computed from the sample population fits a normal distribution.
http://purl.obolibrary.org/obo/STATO_0000020	two dimensional cartesian spatial coordinate origin	http://purl.obolibrary.org/obo/STATO_0000026	cartesian spatial coordinate origin		A cartesian spatial coordinate datum chosen as a fixed point of reference in a two dimensional spatial region.
http://purl.obolibrary.org/obo/STATO_0000021	90% confidence interval	http://purl.obolibrary.org/obo/STATO_0000196	confidence interval		a confidence interval which covers 90% of the sampling distribution, meaning that there is a 90% risk of false positive (type I error)
http://purl.obolibrary.org/obo/STATO_0000025	one dimensional cartesian spatial coordinate origin	http://purl.obolibrary.org/obo/STATO_0000026	cartesian spatial coordinate origin		A cartesian spatial coordinate datum chosen as a fixed point of reference in a one dimensional spatial region.
http://purl.obolibrary.org/obo/STATO_0000026	cartesian spatial coordinate origin	http://purl.obolibrary.org/obo/IAO_0000400	cartesian spatial coordinate datum		A cartesian spatial coordinate datum chosen as a fixed point of reference in a spatial region.
http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion	http://purl.obolibrary.org/obo/STATO_0000039	statistic		measure of variation or statistical dispersion is a data item which describes how much a theoritical distribution or dataset is spread.
http://purl.obolibrary.org/obo/STATO_0000029	measure of central tendency	http://purl.obolibrary.org/obo/STATO_0000039	statistic		a measure of central tendency is a data item which attempts to describe a set of data by identifying the value of its centre.
http://purl.obolibrary.org/obo/STATO_0000030	Chi-Squared statistic	http://purl.obolibrary.org/obo/STATO_0000609	measure of hypothesis testing		Chi-squared statistic is a statistic computed from observations and used to produce a p-value in statistical test when compared to a Chi-Squared distribution.
http://purl.obolibrary.org/obo/STATO_0000031	binary classification	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		binary classification (or binomial classification) is a data transformation which aims to cast members of a set into 2 disjoint groups depending on whether the element have a given property/feature or not.
http://purl.obolibrary.org/obo/STATO_0000033	mode	http://purl.obolibrary.org/obo/STATO_0000574	center value		The mode is a data item which corresponds to the most frequently occurring number in a set of numbers.
http://purl.obolibrary.org/obo/STATO_0000034	model parameter	http://purl.obolibrary.org/obo/IAO_0000027	data item		a model parameter is a data item which is part of a model and which is meant to characterize an theoritecal or unknown population. a model parameter may be estimated by considering the properties of samples presumably taken from the theoritecal population
http://purl.obolibrary.org/obo/STATO_0000035	range	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion		the range is a measure of variation which describes the difference between the lowest score and the highest score in a set of numbers (a data set)
http://purl.obolibrary.org/obo/STATO_0000036	outlier	http://purl.obolibrary.org/obo/IAO_0000027	data item		Outliers are deviant scores that have been legitimately gathered and are not due to equipment failures.
http://purl.obolibrary.org/obo/STATO_0000038	matched pair of subjects	http://purl.obolibrary.org/obo/OBI_0000181	population		a set of 2 subjects which result from a pairing process which assigns subject to a set based on a pairing rule/criteria
http://purl.obolibrary.org/obo/STATO_0000039	statistic	http://purl.obolibrary.org/obo/IAO_0000027	data item		a statistic is a measurement datum to describe a dataset or a variable. It is generated by a calculation on set of observed data.
http://purl.obolibrary.org/obo/STATO_0000042	Anderson-Darling test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test		The Anderson–Darling test is a statistical test of whether a given sample of data is drawn from a given probability distribution.
http://purl.obolibrary.org/obo/STATO_0000046	block design	http://purl.obolibrary.org/obo/OBI_0500000	study design		a block design is a kind of study design which declares a blocking variable (also known as nuisance variable) in order to account for a known source of variation and reduce its impact on the acquisition of the signal
http://purl.obolibrary.org/obo/STATO_0000048	multiway ANOVA	http://purl.obolibrary.org/obo/OBI_0200201	ANOVA		Multi-way anova is an analysis of variance where the difference groups being compared are associated to the factor levels of more than 2 independent variables. The null hypothesis is an absence of difference between the means calculated for each of the groups. The test assumes normality and equivariance of the data.
http://purl.obolibrary.org/obo/STATO_0000049	Hardy-Weinberg equilibrium hypothesis	http://purl.obolibrary.org/obo/STATO_0000058	goodness of fit hypothesis		Hardy-Weinberg equilibrium hypothesis is a good of fit hypothesis which states that allele and genotype frequencies in a population will remain constant from generation to generation in the absence of other evolutionary influences (non-random mating, mutation, selection, genetic drift, gene flow and meiotic drive).
http://purl.obolibrary.org/obo/STATO_0000051	Poisson distribution	http://purl.obolibrary.org/obo/STATO_0000117	discrete probability distribution		Poisson distribution is a probability distribution used to model the number of events occurring within a given time interval. It is defined by a real number (λ)  and an integer k representing the number of events and a function.
The expected value of a Poisson-distributed random variable is equal to λ and so is its variance.
http://purl.obolibrary.org/obo/STATO_0000052	Z-test	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test		Z-test is a statistical test which evaluate the null hypothesis that the means of 2 populations are equal and returns a p-value.
http://purl.obolibrary.org/obo/STATO_0000056	absence of association hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		a null hypothesis which states that no linkage exists between 2 categorical variables
http://purl.obolibrary.org/obo/STATO_0000058	goodness of fit hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		goodness of fit hypothesis is a null hypothesis stating that the distribution computed from the sample population fits a theoretical distribution or that a dataset can be correctly explained by a model
http://purl.obolibrary.org/obo/STATO_0000059	Student's t distribution	http://purl.obolibrary.org/obo/STATO_0000329	symmetric distribution		the Student's t distribution is a continuous probability distribution which  arises when estimating the mean of a normally distributed population in situations where the sample size is small and population standard deviation is unknown.
http://purl.obolibrary.org/obo/STATO_0000061	absence of between group difference hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		It is a null hypothesis stating that there are no differences observed between group of subjects.
http://purl.obolibrary.org/obo/STATO_0000062	absence of within subject difference hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		is a null hypothesis stating that there are no difference observed across a series of measurements made one same subject.
http://purl.obolibrary.org/obo/STATO_0000063	genomic coordinate datum	http://purl.obolibrary.org/obo/IAO_0000027	data item		genomic coordinate datum is a data item which denotes a genomic position expressed using a genomic coordinate system
http://purl.obolibrary.org/obo/STATO_0000065	hypothesis	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		In statistics, a statement that can be tested.[wolfram alpha]
http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		a continuous probability distribution is a probability distribution which is defined by a probability density function
http://purl.obolibrary.org/obo/STATO_0000068	skewness	http://purl.obolibrary.org/obo/STATO_0000039	statistic		Skewness is a data item indicating of the degree of asymmetry of a distribution.
http://purl.obolibrary.org/obo/STATO_0000070	Yate's corrected Chi-Squared test	http://purl.obolibrary.org/obo/STATO_0000081	Pearson's Chi square test of independence between categorical variables		Yate's corrected Chi-Squared test is a statistical test which is used to test the association/linkage/independence of 2 dichotomous variables while introducing a correction for using the continous Chi-squared distribution for the test.
To reduce the error in approximation, Frank Yates, an English statistician, suggested a correction for continuity that adjusts the formula for Pearson's chi-squared test by subtracting 0.5 from the difference between each observed value and its expected value in a 2 × 2 contingency table. This reduces the chi-squared value obtained and thus increases its p-value.
http://purl.obolibrary.org/obo/STATO_0000072	substrate concentration	http://purl.obolibrary.org/obo/STATO_0000704	concentration		substrate concentration is a scalar measurement datum which denotes the amount of molecular entity involved in an enzymatic reaction (or catalytic chemical reaction) and whose role in that reaction is as substrate.
http://purl.obolibrary.org/obo/STATO_0000073	Fisher's exact test	http://purl.obolibrary.org/obo/STATO_0000027	test of association between categorical variables		Fisher's exact test is a statistical test used to determine if there are nonrandom associations between two categorical variables.
http://purl.obolibrary.org/obo/STATO_0000074	Cochran-Mantel-Haenzel test for repeated tests of independence	http://purl.obolibrary.org/obo/STATO_0000027	test of association between categorical variables		Cochran-Mantel-Haenzel test for repeated tests of independence is a statitiscal test which allows the comparison of two groups on a dichotomous/categorical response. It is used when the effect of the explanatory variable on the response variable is influenced by covariates that can be controlled. It is often used in observational studies where random assignment of subjects to different treatments cannot be controlled, but influencing covariates can.
The null hypothesis is that the two nominal variables that are tested within each repetition are independent of each other. So there are 3 variables to consider: two categorical variables to be tested for independence of each other, and the third variable identifies the repeats.
http://purl.obolibrary.org/obo/STATO_0000078	Levene's test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test		Levene's test is a null hypothesis statistical test which evaluates the null hypothesis of equality of variance in several populations.
http://purl.obolibrary.org/obo/STATO_0000079	Barlett's test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test		Bartlett's test (see Snedecor and Cochran, 1989) is used to test if k samples are from populations with equal variances. Equal variances across samples is called homoscedasticity or homogeneity of variances. Some statistical tests, for example the analysis of variance, assume that variances are equal across groups or samples. The Bartlett test can be used to verify that assumption.

Bartlett's test is sensitive to departures from normality. That is, if the samples come from non-normal distributions, then Bartlett's test may simply be testing for non-normality. Levene's test and the Brown–Forsythe test are alternatives to the Bartlett test that are less sensitive to departures from normality.
http://purl.obolibrary.org/obo/STATO_0000080	Brown Forsythe test	http://purl.obolibrary.org/obo/STATO_0000137	homoskedasticity test		the Brown Forsythe test is a statistical test which evaluates if the variance of different groups are equal. It relies on computing the median rather than the mean, as used in the Levene's test for homoschedacity.
This test maybe used to, for instance, ensure that the conditions of applications of ANOVA are met.
http://purl.obolibrary.org/obo/STATO_0000082	fixed effect model	http://purl.obolibrary.org/obo/STATO_0000107	statistical model		a fixed effect model is a statistical model which represents the observed quantities in terms of explanatory variables that are treated as if the quantities were non-random.
http://purl.obolibrary.org/obo/STATO_0000084	multinomial probit regression for analysis of polychotomous dependent variable	http://purl.obolibrary.org/obo/OBI_0200102	regression analysis method		multinomial logistic regression model is a model which attempts to explain data distribution associated with *polychotomous* response/dependent variable in terms of values assumed by the independent variable uses a function of predictor/independent variable(s): the function used in this instance of regression modeling is probit function.
http://purl.obolibrary.org/obo/STATO_0000085	effect size estimate	http://purl.obolibrary.org/obo/STATO_0000471	estimate		effect size estimate is a data item about the direction and strength of the consequences of a causative agent as explored by statistical methods. Those methods produce estimates of the effect size, e.g. confidence interval
http://purl.obolibrary.org/obo/STATO_0000086	F-test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test		an F-test is a statistical test which evaluates that the computed test statistics follows an F-distribution under the null hypothesis. The F-test is sensitive to departure from normality. F-test arise when decomposing the variability in a data set in terms of sum of squares.
http://purl.obolibrary.org/obo/STATO_0000088	study group population size	http://purl.obolibrary.org/obo/STATO_0000047	count		statistical sample size is a count evaluating the number of individual experimental units
http://purl.obolibrary.org/obo/STATO_0000089	case-control study design	http://purl.obolibrary.org/obo/OBI_0500006	parallel group design		a case-control study design is a observation study design which assess the risk of particular outcome (a trait or a disease) associated with an event (either an exposure or endogenous factor). A case-control study design therefore declares an exposure variable which is dichotomous in nature (exposed/non-exposed) and an outcome variable, which is also dichotomous (case or control), thus giving the name to the design. During the execution of the design, a case control study defines a population and counts the events to determine their frequency.
http://purl.obolibrary.org/obo/STATO_0000091	genome-wide association study	http://purl.obolibrary.org/obo/STATO_0000235	genetic association study		Genome wide association study is a kind of study whose objective is to detect association between genetic markers (SNP or otherwise) accross the genome and a trait which may be a disease or another phenotype (e.g. trait of agronomic relevance in animal or plant studies). Genome wide association study compare the allele frequencies in 2 populations, one free of the trait used as control, the other one showing the trait use as 'case'. GWAS studies implement case-control design
http://purl.obolibrary.org/obo/STATO_0000092	Wilcoxon signed rank test	http://purl.obolibrary.org/obo/STATO_0000202	within subject comparison statistical test		The Wilcoxon signed rank test is a statistical test which tests the null hypothesis that the median difference between pairs of observations is zero. This is the non-parametric analogue to the paired t-test, and should be used if the distribution of differences between pairs may be non-normally distributed.

The procedure involves a ranking, hence the name. The absolute value of the differences between observations are ranked from smallest to largest, with the smallest difference getting a rank of 1, then next larger difference getting a rank of 2, etc. Ties are given average ranks. The ranks of all differences in one direction are summed, and the ranks of all differences in the other direction are summed. The smaller of these two sums is the test statistic, W (sometimes symbolized Ts). Unlike most test statistics, smaller values of W are less likely under the null hypothesis.
http://purl.obolibrary.org/obo/STATO_0000093	date	http://purl.obolibrary.org/obo/IAO_0000582	time stamped measurement datum		Information about a calendar date or timestamp indicating day, month, year and time of an event.
http://purl.obolibrary.org/obo/STATO_0000095	paired t-test	http://purl.obolibrary.org/obo/STATO_0000202	within subject comparison statistical test		paired t-test is a statistical test which is specifically designed to analysis differences between paired observations in the case of studies realizing repeated measures design with only 2 repeated measurements per subject (before and after treatment for example)
http://purl.obolibrary.org/obo/STATO_0000096	population stratification prior to sampling	http://purl.obolibrary.org/obo/OBI_0000094	material processing		stratification is a planned process which executes a stratification rule using as input a population and assign it member to mutually exclusive subpopulation based on the values defined by the stratification rule
http://purl.obolibrary.org/obo/STATO_0000097	statistical test power analysis	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		A stastical test power analysis is a data transformation which aims to determine the size of a statistical sample required to reach a desired significance level given a particular statistical test
http://purl.obolibrary.org/obo/STATO_0000098	two sample Hotelling T2 test	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test		Hotelling's T2 test is a statistical test which is a generalization of Student's T-test to a assess if the means of a set of variables remains unchanged when studying 2 populations. It is a type of multivariate analysis
http://purl.obolibrary.org/obo/STATO_0000104	z-score	http://purl.obolibrary.org/obo/STATO_0000609	measure of hypothesis testing		A z-score (also known as z-value, standard score, or normal score) is a measure of the divergence of an individual experimental result from the most probable result, the mean. Z is expressed in terms of the number of standard deviations from the mean value.
http://purl.obolibrary.org/obo/STATO_0000105	log signal intensity ratio	http://purl.obolibrary.org/obo/STATO_0000184	ratio		log signal intensity ratio is a data item which corresponding the logarithmitic base 2 of the ratio between 2 signal intensity, each corresponding to a condition.
http://purl.obolibrary.org/obo/STATO_0000106	probit regression for analysis of polychotomous dependent variable	http://purl.obolibrary.org/obo/OBI_0200102	regression analysis method		probit regression model is a model which attempts to explain data distribution associated with *dichotomous* response/dependent variable in terms of values assumed by the independent variable uses a function of predictor/independent variable(s): the function used in this instance of regression modeling is the probit function aka the quantile function, i.e., the inverse cumulative distribution function (CDF), associated with the standard normal distribution.
http://purl.obolibrary.org/obo/STATO_0000107	statistical model	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		a statistical model is an information content entity which is a formalization of relationships between variables in the form of mathematical equations. A statistical model describes how one or more random variables are related to one or more other variables. The model is statistical as the variables are not deterministically but stochastically related.
http://purl.obolibrary.org/obo/STATO_0000108	linear regression for analysis of continuous dependent variable	http://purl.obolibrary.org/obo/OBI_0200102	regression analysis method		linear regression model is a model which attempts to explain data distribution associated with response/dependent variable in terms of values assumed by the independent variable uses a linear function or linear combination of the regression parameters and the predictor/independent variable(s).
linear regression modeling makes a number of assumptions, which includes homoskedasticity (constance of variance)
http://purl.obolibrary.org/obo/STATO_0000109	multinomial logistic regression for analysis of dichotomous dependent variable	http://purl.obolibrary.org/obo/OBI_0200102	regression analysis method		multinomial logistic regression model is a model which attempts to explain data distribution associated with *polychotomous* response/dependent variable in terms of values assumed by the independent variable uses a function of predictor/independent variable(s): the function used in this instance of regression modeling is logistic function.
http://purl.obolibrary.org/obo/STATO_0000111	sequence read	http://purl.obolibrary.org/obo/OBI_0001573	DNA sequence data		a sequence read is a DNA sequence data which is generated by a DNA sequencer
http://purl.obolibrary.org/obo/STATO_0000116	assess statistical evidence	http://purl.obolibrary.org/obo/OBI_0000011	planned process		the process of using statistical analysis for interpreting and communicating "what the data say".
http://purl.obolibrary.org/obo/STATO_0000117	discrete probability distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		a discrete probability distribution is a probability distribution which is defined by a probability mass function where the random variable can only assume a finite number of values or infinitely countable values
http://purl.obolibrary.org/obo/STATO_0000118	ranking	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		ranking is a data transformation which turns a non-ordinal variable into a Ordinal variable by sorting the values of the input variable and replacing their value by their position in the sorting result
http://purl.obolibrary.org/obo/STATO_0000119	model parameter estimation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		model parameter estimation is a data transformation that finds parameter values (the model parameter estimates) most compatible with the data as judged by the model.
http://purl.obolibrary.org/obo/STATO_0000121	association between categorical variables testing objective	http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective		the objective of a data transformation to evaluate a null hypothesis of absence of linkage between variables.
http://purl.obolibrary.org/obo/STATO_0000124	stratification rule	http://purl.obolibrary.org/obo/IAO_0000055	rule		a stratification rule/criteria is a criteria used to determine population strata so that a stratification process implementing the rule can result in any member of the total population being assigned to one and only one stratum
http://purl.obolibrary.org/obo/STATO_0000130	Breslow-Day test for homogeneity of odds ratio	http://purl.obolibrary.org/obo/STATO_0000697	homogeneity test		the Breslow-Day test is a statistical test which evaluates if the odds ratios are homogenous across N 2x2 contingency tables, for instance several 2x2 contingency tables associated with different strata of a stratified population when evaluating the relationship between exposure and outcome or associated with the different samples coming from several centres in a multicentric study in clinical trial context.
http://purl.obolibrary.org/obo/STATO_0000131	sphericity test	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test		a sphericity test is a null hypothesis statistical testing procedure which posits a null hypothesis of equality of the variances of the differences between levels of the repeated measures factor
http://purl.obolibrary.org/obo/STATO_0000135	strictly standardized mean difference	http://purl.obolibrary.org/obo/STATO_0000100	standardized mean difference		strictly standardized mean difference (SSMS) is a standardized mean difference which corresponds to the  ratio of mean to the standard deviation of the difference between two groups. 
SSMD directly measures the magnitude of difference between two groups.
SSMD is widely used in High Content Screen for hit selection and quality control.

When the data is preprocessed using log-transformation as normally done in HTS experiments, SSMD is the mean of log fold change divided by the standard deviation of log fold change with respect to a negative reference. 

In other words, SSMD is the average fold change (on the log scale) penalized by the variability of fold change (on the log scale).

For quality control, one index for the quality of an HTS assay is the magnitude of difference between a positive control and a negative reference in an assay plate. For hit selection, the size of effects of a compound (i.e., a small molecule or an siRNA) is represented by the magnitude of difference between the compound and a negative reference. SSMD directly measures the magnitude of difference between two groups. Therefore, SSMD can be used for both quality control and hit selection in HTS experiments.
http://purl.obolibrary.org/obo/STATO_0000136	Tarone's test for homogeneity of odds ratio	http://purl.obolibrary.org/obo/STATO_0000697	homogeneity test		Tarone's test for homogeneity of odds ratio is a statistical test which evaluates the null hypothesis that odds ratio are homogeneous
http://purl.obolibrary.org/obo/STATO_0000137	homoskedasticity test	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test		an homoskedasticity test is a statistical test aiming at evaluate if the variances from several random samples are similar
http://purl.obolibrary.org/obo/STATO_0000138	2 by 2 contingency table	http://purl.obolibrary.org/obo/STATO_0000140	contingency table		a 2x2 contingency table is a contingency table build for 2 dichotomous variables (i.e. 2 categorical variables, each with only 2 possible outcomes). It is the simplest of contingency tables
http://purl.obolibrary.org/obo/STATO_0000139	subject pairing	http://purl.obolibrary.org/obo/OBI_0600015	group assignment		a subject pairing is a planned process which executes a pairing rule and results in the creation of sets of 2 subjects meeting the pairing criteria
http://purl.obolibrary.org/obo/STATO_0000140	contingency table	http://purl.obolibrary.org/obo/STATO_0000642	matrix		a contigency table is a data item which displays the (multivariate) frequency distribution of the possible values of categorical variables.
The first row of the table corresponds to categories of one categorical variable, the first column of the table corresponds to categories of the other categorical variable, the cells corresponding to each combination of categories is filled with the observed occurences in the sample being considered.
The table also contains marginal total (marginal sums) and grand total of the occurences

The term contingency table was first used by Karl Pearson in "On the Theory of Contingency and Its Relation to Association and Normal Correlation", part of the Drapers' Company Research Memoirs Biometric Series I published in 1904.
http://purl.obolibrary.org/obo/STATO_0000141	acute toxicity study	http://purl.obolibrary.org/obo/OBI_0000066	investigation		acute toxicity study is an investigation which use interventions organized according to a factorial design and a parallel group design to observe the effect of use of high dose xenobiotics in animal models or cellular models
http://purl.obolibrary.org/obo/STATO_0000142	correlation coefficient	http://purl.obolibrary.org/obo/STATO_0000611	measure of correlation		The correlation coefficient of two variables in a data sample is their covariance divided by the product of their individual standard deviations. It is a normalized measurement of how the two are linearly related.
http://purl.obolibrary.org/obo/STATO_0000143	Bayesian model selection	http://purl.obolibrary.org/obo/STATO_0000328	statistical model selection		A Bayesian model selection is a data transformation which is based on Bayesian statistics to compute Bayes factor in order to evaluate which model best explains data.
http://purl.obolibrary.org/obo/STATO_0000144	model parameter estimate	http://purl.obolibrary.org/obo/STATO_0000471	estimate		a model parameter estimate is a data item which results from a model parameter estimation process and which provides a numerical value about a model parameter.
http://purl.obolibrary.org/obo/STATO_0000146	presence of between group difference hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		a null hypothesis stating that there are differences observed between group of subjects
http://purl.obolibrary.org/obo/STATO_0000149	binomial logistic regression for analysis of dichotomous dependent variable	http://purl.obolibrary.org/obo/OBI_0200102	regression analysis method		binomial logistic regression model is a model which attempts to explain data distribution associated with *dichotomous* response/dependent variable in terms of values assumed by the independent variable uses a function of predictor/independent variable(s): the function used in this instance of regression modeling is logistic function.
http://purl.obolibrary.org/obo/STATO_0000152	quartile	http://purl.obolibrary.org/obo/STATO_0000291	quantile		a quartile is a quantile which splits data into sections accrued of 25% of data, so the first quartile delineates 25% of the data, the second quartile delineates 50% of the data and the third quartile, 75 % of the data
http://purl.obolibrary.org/obo/STATO_0000153	one sample Hotelling T2 test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test		The one-sample Hotelling’s T2 is the multivariate extension of the common one-sample or paired Student’s t-test. In a one-sample t-test, the mean response is compared against a specific value. Hotelling’s one-sample T2 is used when the number of response variables is two or more, although it can be used when there is only one response variable. T2 makes the usual assumption that the data are approximately multivariate normal. Randomization tests are provided that do not rely on this assumption. These randomization tests should be used whenever you want exact results that do not rely on several assumptions.
http://purl.obolibrary.org/obo/STATO_0000155	meta analysis	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		meta-analysis is a data transformation which uses the effect size estimates from several independent quantitative scientific studies addressing the same question in order to assess finding consistency.
http://purl.obolibrary.org/obo/STATO_0000156	Scheffe test	http://purl.obolibrary.org/obo/STATO_0000133	post-hoc analysis		the Scheffe test is a data transformation which evaluates all possible contrasts and adjusting the levels significance by accounting for multiple comparison. The test is therefore conservative. Confidence intervals can be constructed for the corresponding linear regression.  It was developped by  American statistician Henry  Scheffe in 1959.
http://purl.obolibrary.org/obo/STATO_0000157	Least significance different test	http://purl.obolibrary.org/obo/STATO_0000133	post-hoc analysis		the LSD test  is a statistical test for multiple comparisons of treatments by means of least significant difference following an ANOVA analysis
http://purl.obolibrary.org/obo/STATO_0000158	presence of association hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		a null hypothesis which states that a linkage exists between 2 categorical variables
http://purl.obolibrary.org/obo/STATO_0000160	exponential distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		The exponential distribution (a.k.a. negative exponential distribution) is the probability distribution that describes the time between events in a Poisson process, i.e. a process in which events occur continuously and independently at a constant average rate. It is the continuous analogue of the geometric distribution, and it has the key property of being memoryless.
http://purl.obolibrary.org/obo/STATO_0000161	data distribution	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		variable distribution is data item which denotes the spatial resolution of data point making up a variable. variable distribution may be compared to a known probability distribution using goodness of fit test or plotting a quantile-quantile plot for visual assessment of the fit.
http://purl.obolibrary.org/obo/STATO_0000162	experimental unit role	http://purl.obolibrary.org/obo/BFO_0000023	role		the role played by an entity part of study group as defined by an experimental design and realized in a data analysis and data interpretation
http://purl.obolibrary.org/obo/STATO_0000163	trimmed mean	http://purl.obolibrary.org/obo/STATO_0000029	measure of central tendency		trimmed mean or truncated mean is a measure of central tendency which involves the calculation of the mean after discarding given parts of a probability distribution or sample at the high and low end, and typically discarding an equal amount of both
http://purl.obolibrary.org/obo/STATO_0000164	inter quartile range	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion		The interquartile range is a data item which corresponds to the difference between the upper quartile (3rd quartile) and lower quartile (1st quartile).
The interquartile range contains the second quartile or median.
The interquartile range is a data item providing a measure of data dispersion.
http://purl.obolibrary.org/obo/STATO_0000167	first quartile	http://purl.obolibrary.org/obo/STATO_0000152	quartile		the first quartile is a quartile which splits the lower 25 % of the data
http://purl.obolibrary.org/obo/STATO_0000170	third quartile	http://purl.obolibrary.org/obo/STATO_0000152	quartile		the first quartile is a quartile which splits the 75 % of the data
http://purl.obolibrary.org/obo/STATO_0000173	homogeneity test objective	http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective		homogeneity testing objective is the objective of a data transformation to test a null hypothesis that two or more sub-groups of a population share the same distribution of a single categorical variable.
For example, do people of different countries have the same proportion of smokers to non-smokers
http://purl.obolibrary.org/obo/STATO_0000175	confidence interval calculation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		confidence interval calculation is a data transformation which determines a confidence interval for a given statistical parameter
http://purl.obolibrary.org/obo/STATO_0000176	t-statistic	http://purl.obolibrary.org/obo/STATO_0000039	statistic		t-statistic is a statistic computed from observations and used to produce a p-value in statistical test when compared to a Student's t distribution.
http://purl.obolibrary.org/obo/STATO_0000178	kurtosis	http://purl.obolibrary.org/obo/STATO_0000039	statistic		Kurtosis is a data item which denotes the degree of peakedness of a distribution. It is defined as a normalized form of the fourth central moment of a distribution.
http://purl.obolibrary.org/obo/STATO_0000179	ANCOVA	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test		ANCOVA or analysis of covariance is a data transformation which evaluates if population means of a dependent variable are equal across levels of a categorical independent variables while controlling for the effects of other continuous variable s, known as covariates. Therefore, when performing ANCOVA, we are adjusting the dependent variable means to what they would be if all groups were equal on the covariates.
It augments the ANOVA model with one or more additional quantitative variables, called covariates, which are related to the response variable. The covariates are included to reduce the variance in the error terms and provide more precise measurement of the treatment effects. ANCOVA is used to test the main and interaction effects of the factors, while controlling for the effects of the covariate
http://purl.obolibrary.org/obo/STATO_0000181	Hardy-Weinberg equilibrium testing	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test		Hardy-Weinberg equilibrium test is a statistical test which aims to evaluate if a population's proportion of allele is stable or not. It is used as means of quality control to evaluate possibility of genotyping error or population structure.
http://purl.obolibrary.org/obo/STATO_0000183	sphericity testing objective	http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective		sphericity testing objective is a statistical objective of a data transformation which aims to test a null hypothesis of sphericity holds.
http://purl.obolibrary.org/obo/STATO_0000185	2 by n contingency table	http://purl.obolibrary.org/obo/STATO_0000140	contingency table		a 2 by n contingency table is a contingency table built for one dichotomous variable (a categorical variable with only 2 outcomes) and one polychotomous variable (a polychomotomous variable with at least 2 outcomes)
http://purl.obolibrary.org/obo/STATO_0000187	Tukey HSD for Post-Hoc Analysis	http://purl.obolibrary.org/obo/STATO_0000133	post-hoc analysis		Tukey Honestly Significant Difference (HSD) test is a statistical test used following an ANOVA test yielding a statistically significant p-value in order to determine which means are different, to a given level of significance. The Tukey HSD test relies on the q-distribution.
The procedure is conservative, meaning that if sample sizes (the sizes of different study groups) are equal, the risk of a Type I error is exactly α, and if sample sizes are unequal it’s less than α.
http://purl.obolibrary.org/obo/STATO_0000188	average log signal intensity	http://purl.obolibrary.org/obo/OBI_0000451	normalized data set		average log signal intensity is a data time which corresponds to the sum of 2 distinct logarithm base 2 transformed signal intensity, each corresponding to a distinct condition of signal acquisition, divided by 2.
http://purl.obolibrary.org/obo/STATO_0000189	mixed effect model	http://purl.obolibrary.org/obo/STATO_0000107	statistical model		A mixed model is a statistical model containing both fixed effects and random effects. These models are useful in a wide variety of disciplines in the physical, biological and social sciences. They are particularly useful in settings where repeated measurements are made on the same statistical units (longitudinal study), or where measurements are made on clusters of related statistical units. Because of their advantage in dealing with missing values, mixed effects models are often preferred over more traditional approaches such as repeated measures ANOVA.
http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test		a goodness of fit statistical test is a statistical test which aim to evaluate if a sample distribution can be considered equivalent to a theoretical distribution used as input
http://purl.obolibrary.org/obo/STATO_0000192	cartesian product	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		a cartesian product is a data transformation which operates on a n Sets to produce a set of all possible ordered n-tuples where each element of the tuple comes from a Set
http://purl.obolibrary.org/obo/STATO_0000193	study group population	http://purl.obolibrary.org/obo/OBI_0000181	population		is a population whose individual members realize (may be expressed as) a combination of  inclusion rule values specifications or resulting from a sampling process (e.g. recruitment followed by randomization to group) on which a number of measurements will be carried out, which may be used as input to statistical tests and statistical inference.
http://purl.obolibrary.org/obo/STATO_0000194	cartesian product 2 sets	http://purl.obolibrary.org/obo/STATO_0000192	cartesian product		self explanatory
http://purl.obolibrary.org/obo/STATO_0000195	number of factor level combinations	http://purl.obolibrary.org/obo/STATO_0000047	count		A non-negative integer defining how many combination of factor levels (or treatments in the statistical sense) are to be used in a study.
http://purl.obolibrary.org/obo/STATO_0000196	confidence interval	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion		A confidence interval is a data item which defines an range of values in which a measurement or trial falls corresponding to a given probability.
http://purl.obolibrary.org/obo/STATO_0000198	non-parametric test	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test		a statistical test which makes no assumption about the underlying data distribution
http://purl.obolibrary.org/obo/STATO_0000199	Mauchly's test for sphericity	http://purl.obolibrary.org/obo/STATO_0000131	sphericity test		the Mauchly's test for sphericity is a statistical test which evaluates if the variance of the differences between all combinations of the groups are equal, a property known as 'sphericity' in the context of repeated measures. It is used for instance prior to repeated measure ANOVA.
The test works by assessing if a Wishart-distributed covariance matrix (or transformation thereof) is proportional to a given matrix.
http://purl.obolibrary.org/obo/STATO_0000200	statistical test power	http://purl.obolibrary.org/obo/IAO_0000027	data item		the statistical test power is data item which is about a statistical test and is obtained by subtracting the false negative rate (type II error rate) to 1. The power of a statistical test is the probability that it will correctly lead to the rejection of a false null hypothesis (Greene 2000). The statistical power is the ability of a test to detect an effect, if the effect actually exists (High 2000).
http://purl.obolibrary.org/obo/STATO_0000202	within subject comparison statistical test	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test		within subject comparison statistical test is a kind of statistical test which evaluates if a change occurs within one experimental unit over time following a treatment or an event
http://purl.obolibrary.org/obo/STATO_0000203	cohort	http://purl.obolibrary.org/obo/STATO_0000193	study group population		a cohort is a study group population where the members are human beings which meet inclusion criteria and undergo a longitudinal design
http://purl.obolibrary.org/obo/STATO_0000207	specifying null and alternate hypothesis	http://purl.obolibrary.org/obo/OBI_0000011	planned process		a planned process which etablishes and states the different  hypothesis to be evaluated during a null hypothesis statistical test
http://purl.obolibrary.org/obo/STATO_0000208	alternative hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		An alternative hypothesis is an hypothesis defined in a statistical test that is the opposite of the null hypothesis.
http://purl.obolibrary.org/obo/STATO_0000211	odds ratio homogeneity hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		odds ratio homogeneity hypothesis is a null hypothesis stating that all odds ratio are homogenous, that is remain within the same range.
http://purl.obolibrary.org/obo/STATO_0000212	tetrachoric correlation coefficient	http://purl.obolibrary.org/obo/STATO_0000269	polychoric correlation coefficient		a tetrachoric correlation coefficient is a polychoric correlation coefficient for 2 dichotomous variables used as proxy for correlation between 2 continuous latent variables.
http://purl.obolibrary.org/obo/STATO_0000213	continuous variable discretization	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		discretization as a processing converting a continuous variable into a polychotomous variable by concretizing a set of discretization rules
http://purl.obolibrary.org/obo/STATO_0000215	ordered probit regression for analysis of ordinal dependent variable	http://purl.obolibrary.org/obo/OBI_0200102	regression analysis method		probit regression model is a model which attempts to explain data distribution associated with *ordinal* response/dependent variable in terms of values assumed by the independent variable uses a function of predictor/independent variable(s): the function used in this instance of regression modeling is the ordered probit function.
http://purl.obolibrary.org/obo/STATO_0000216	stratum population	http://purl.obolibrary.org/obo/OBI_0000181	population		a stratum population is a population resulting from a population stratification prior to sampling process which aims to produce homogenous subpopulations from an heterogeneous population by applying one or more stratification criteria
http://purl.obolibrary.org/obo/STATO_0000217	sphericity hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		a null hypothesis which states that a given matrix is proportional to a Wishart-distributed covariance matrix
http://purl.obolibrary.org/obo/STATO_0000218	model fitting	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		Model fitting is a data transformation process which evaluates if a model appropriately represents a dataset. A model fitting process tests the goodness of fit of the model to the data
http://purl.obolibrary.org/obo/STATO_0000223	group assignment based on blocking variable specification	http://purl.obolibrary.org/obo/OBI_0600015	group assignment		group assignment based on blocking variable specification is a kind of group assignment process which takes into account the levels assumed by a blocking variable to allocate subjects or experimental units to a treatment group
http://purl.obolibrary.org/obo/STATO_0000224	goodness of fit testing objective	http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective		A testing objective to ensure that the sample used in a statistical test actually follows a normal distribution.
http://purl.obolibrary.org/obo/STATO_0000225	probability distribution	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		A probability distribution is a information content entity that specifies the probability of the value of a random variable.

For a discrete random variable, a mathematical formula that gives the probability of each value of the variable.

For a continuous random variable, a curve described by a mathematical formula which specifies, by way of areas under the curve, the probability that the variable falls within a particular interval.
http://purl.obolibrary.org/obo/STATO_0000229	Chi-square probability distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		Chi-square probability distribution with k degrees of freedom is a theoretical probability distribution which  corresponds to the distribution of a sum of the squares of k independent standard normal random variables.
http://purl.obolibrary.org/obo/STATO_0000234	residual	http://purl.obolibrary.org/obo/STATO_0000039	statistic		a residual is a data item which is the output of an error estimate or model fitting process and which is an observable estimate of the unobservable error
http://purl.obolibrary.org/obo/STATO_0000235	genetic association study	http://purl.obolibrary.org/obo/OBI_0000066	investigation		A genetic association study is a kind of study whose objective is to detect associations between phenotypes, between a phenotype and a genetic polymorphism or between two genetic polymorphisms.
http://purl.obolibrary.org/obo/STATO_0000236	coefficient of variation	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion		the coefficient of variation is a normalized measure of dispersion of a probability distribution of frequency distribution.
http://purl.obolibrary.org/obo/STATO_0000238	high-content screening	http://purl.obolibrary.org/obo/STATO_0000239	high throughput screening		high content screening is a kind of investigation which uses a standardized cellular assays to test the effect of substances (RNAi or small molecules) held in libraries on a cellular phenotype. it relies on microscopy imaging and or flow-cytometry, robotic handling to ensure fast and high-throughput.
http://purl.obolibrary.org/obo/STATO_0000239	high throughput screening	http://purl.obolibrary.org/obo/OBI_0000066	investigation		high throughput screening is a kind of investigation which uses a standardized  assays (cell based, enzymatic or chemometric) to test the effect of substances (RNAi or small molecules) held in libraries on a very specific and measureable outcome (e.g fluorence intensity). it relies on robotic handling to ensure fast and high-throughput in assay performance, data acquisition and hit selection.
http://purl.obolibrary.org/obo/STATO_0000242	statistical error	http://purl.obolibrary.org/obo/STATO_0000039	statistic		statistical error is an data item denoting the amount by which an observation differs from the expected value, being based on the whole statistical population from which the statistical unit was chosen randomly
http://purl.obolibrary.org/obo/STATO_0000244	ΔRn	http://purl.obolibrary.org/obo/IAO_0000027	data item		(Rn +) − (Rn −), where Rn + = (emission intensity of reporter dye)/(emission intensity of passive reference dye) in PCR with template and Rn − = (emission intensity of reporter dye)/(emission intensity of passive reference dye) in PCR without template or early cycles of a real-time reaction. Ct = threshold cycle, i.e., cycle at which a statistically significant increase in ΔRn is first detected
http://purl.obolibrary.org/obo/STATO_0000246	Woolf's test	http://purl.obolibrary.org/obo/STATO_0000697	homogeneity test		Woolf's test is a statistical test which evaluates the null hypothesis that odds ratio are the same accross all strata of population under investigation
http://purl.obolibrary.org/obo/STATO_0000247	odds ratio homogeneity test	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test		odds ratio homogeneity test is a statistical test which aims to evaluate that null the hypothesis of consistency odds ratio accross different strata of population is true or not
http://purl.obolibrary.org/obo/STATO_0000249	DNA microarray hybridization	http://purl.obolibrary.org/obo/OBI_0000070	assay		a DNA microarray hybridization is an assay relying on nucleic acid hybridization , which uses a DNA microarray device and a nucleic acid as input. It precedes a data acquisition process
http://purl.obolibrary.org/obo/STATO_0000250	group comparison objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective		group comparison objective is a data transformation objective which aims to determine if 2 or more study group differ with respect to the signal of a response variable
http://purl.obolibrary.org/obo/STATO_0000253	within subject comparison objective	http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective		the objective of a data transformation to test a null hypothesis of absence of difference within subject holds.
http://purl.obolibrary.org/obo/STATO_0000255	between group comparison objective	http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective		the objective of a data transformation to test a null hypothesis of absence of difference withing subject holds.
http://purl.obolibrary.org/obo/STATO_0000257	family	http://purl.obolibrary.org/obo/OBI_0000181	population		A domestic group, or a number of domestic groups linked through descent (demonstrated or stipulated) from a common ancestor, marriage, or adoption.
http://purl.obolibrary.org/obo/STATO_0000261	Newman-Keuls test post-hoc analysis	http://purl.obolibrary.org/obo/STATO_0000133	post-hoc analysis		The Newman–Keuls or Student–Newman–Keuls (SNK) method is a stepwise multiple comparisons procedure used to identify sample means that are significantly different from each other. It was named after Student (1927), D. Newman, and M. Keuls. This procedure is often used as a post-hoc test whenever a significant difference between three or more sample means has been revealed by an analysis of variance (ANOVA). The Newman–Keuls method is similar to Tukey's range test as both procedures use Studentized range statistics.Compared to Tukey's range test, the Newman–Keuls method is more powerful but less conservative.
http://purl.obolibrary.org/obo/STATO_0000266	Bayes factor	http://purl.obolibrary.org/obo/STATO_0000184	ratio		Bayes factor is a ratio between 2 probabilities of observing data according 2 distinct models. It is used in Bayes model selection to evaluate which model best explains the data. if K<0, the model used in the denominator term is supported, if K>1, the model used in the numerator term is supported.
The Bayes factor is about the plausibility of 2 different models
http://purl.obolibrary.org/obo/STATO_0000268	Gamma distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		A gamma distribution is a general type of continous statistical distribution (related to the beta distribution) that arises naturally in processes for which the waiting times between Poisson distributed events are relevant. Gamma distributions have two free parameters shape k and scale denoted theta .
http://purl.obolibrary.org/obo/STATO_0000269	polychoric correlation coefficient	http://purl.obolibrary.org/obo/STATO_0000142	correlation coefficient		polychoric correlation coefficient is a correlation coefficient which is computed over 2 variables to characterise an association by proxy with 2  (latent) variables which are assumed to be continuous and normally distributed.
http://purl.obolibrary.org/obo/STATO_0000270	full factorial design	http://purl.obolibrary.org/obo/OBI_0500014	factorial design		a full factorial design is a factorial design which ensures that all possible factor level combinations are defined and used so all between group differences can be explored
http://purl.obolibrary.org/obo/STATO_0000271	permutation numbering	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		permutation numbering is a data tranformation allowing to count the number of possible permutations of elements in a set of size n, each element occurring exactly once. This number is factorial n.
http://purl.obolibrary.org/obo/STATO_0000272	Michaelis-Menten constant	http://purl.obolibrary.org/obo/IAO_0000032	scalar measurement datum		The Michaelis constant  is the substrate concentration at which the reaction rate is at half-maximum, and is an inverse measure of the substrate's affinity for the enzyme—as a small  indicates high affinity, meaning that the rate will approach  more quickly.[5] The value of  is dependent on both the enzyme and the substrate, as well as conditions such as temperature and pH.
http://purl.obolibrary.org/obo/STATO_0000273	child-parents trio	http://purl.obolibrary.org/obo/STATO_0000257	family		A population of two parents and a child.
http://purl.obolibrary.org/obo/STATO_0000276	binomial distribution	http://purl.obolibrary.org/obo/STATO_0000117	discrete probability distribution		The binomial distribution is a discrete probability distribution which describes the probability of k successes in n draws with replacement from a finite population of size N.

The binomial distribution is frequently used to model the number of successes in a sample of size n drawn with replacement from a population of size N.

The binomial distribution gives the discrete probability distribution  of obtaining exactly n successes out of N Bernoulli trials (where the result of each Bernoulli trial is true with probability p and false with probability q=1-p )

notation: B(n,p)

The mean is N*p

The variance is N*p*q
http://purl.obolibrary.org/obo/STATO_0000277	hit selection	http://purl.obolibrary.org/obo/OBI_0000011	planned process		hit selection is a planned process which in screening processes such as high-throughput screening, lead to the identification of perturbing agent which cause the typical signal generated by a standardized assay to significantly differ from the negative control. The selection hitself results from meeting or exceeding selection threshold (for instance 6 sigma from the mean or SSMD value beyond 5 when compared to positive controls or below -5 when compared to negative controls
http://purl.obolibrary.org/obo/STATO_0000278	pairing rule	http://purl.obolibrary.org/obo/IAO_0000055	rule		pairing rule is a rule which specifies the criteria for deciding on how to associated any 2 entities.
http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test		between group comparison statistical test is a statistical test which aims to detect difference  between the means computing for each of the study group populations
http://purl.obolibrary.org/obo/STATO_0000284	Breusch-Pagan test	http://purl.obolibrary.org/obo/STATO_0000137	homoskedasticity test		Breusch-Pagan test is a statistical test which computes a score test of the hypothesis of constant error variance against the alternative that the error variance changes with the level of the response (fitted values), or with a linear combination of predictors.
http://purl.obolibrary.org/obo/STATO_0000285	hypergeometric test	http://purl.obolibrary.org/obo/STATO_0000287	two tailed test		hypergeometric test is a null hypothesis test which evaluates if a random variable follows a hypergeometric distribution. It is a test of goodness of fit to that distribution. The test is suited for situation aimed at assessing cases of sampling from a finite set without replacements. For instance, testing for enrichment or depletion of elements (e.g GO categories, genes)
http://purl.obolibrary.org/obo/STATO_0000288	absence of difference hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		A null hypothesis which states that no difference exists between 2 or more groups being considered.
http://purl.obolibrary.org/obo/STATO_0000290	contrast	http://purl.obolibrary.org/obo/IAO_0000027	data item		A contrast is the weighted sum of group means, the c_j coefficients represent the assigned weights of the means (these must sum to 0 for orthogonal contrasts)
http://purl.obolibrary.org/obo/STATO_0000291	quantile	http://purl.obolibrary.org/obo/STATO_0000039	statistic		a quantile is a data item which corresponds to specific elements x in the range of a variate X.
the k-th n-tile P_k is that value of x, say x_k, which corresponds to a cumulative frequency of Nk/n (Kenney and Keeping 1962). If n=4, the quantity is called a quartile, and if n=100, it is called a percentile.
http://purl.obolibrary.org/obo/STATO_0000292	decile	http://purl.obolibrary.org/obo/STATO_0000291	quantile		a decile is a quantile where n=10  and which splits data into sections accrued of 10% of data, so the first decile delineates 10% of the data, the second decile delineates 20% of the data and the nineth decile, 90 % of the data
http://purl.obolibrary.org/obo/STATO_0000293	percentile	http://purl.obolibrary.org/obo/STATO_0000291	quantile		a percentile is a quantile which splits data into sections accrued of 1% of data, so the first percentile delineates 1% of the data, the second quartile delineates 2% of the data and the 99th percentile, 99 % of the data
http://purl.obolibrary.org/obo/STATO_0000294	absence of negative difference hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		absence of negative difference hypothesis is a hypothesis which assumes that a difference  significantly less than a threshold does not exist.
http://purl.obolibrary.org/obo/STATO_0000295	absence of positive difference hypothesis	http://purl.obolibrary.org/obo/STATO_0000065	hypothesis		absence of negative difference hypothesis is a hypothesis which assumes that a difference significantly greater than a threshold does not exist.
http://purl.obolibrary.org/obo/STATO_0000298	exact binomial test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test		a binomial test is a statistical hypothesis test which evaluates if the observations made about a Bernoulli experiment , that is an experiment which tests the statistical significance of deviations from a theoretically expected distribution (the binomial distribution) of observations into 2 categories. It is a goodness of fit test.
http://purl.obolibrary.org/obo/STATO_0000299	statistical inference	http://purl.obolibrary.org/obo/OBI_0001834	drawing a conclusion		Statistical inference is the process of deducing properties of an underlying probability distribution by analysis of data.
http://purl.obolibrary.org/obo/STATO_0000301	covariance	http://purl.obolibrary.org/obo/STATO_0000611	measure of correlation		The covariance is a measurement data item about the strength of correlation between a set (2 or more) of random variables.
The covariance is obtained by forming:
cov(X,Y)=E([X-E(X)][Y-E(Y)] where E(X), E(Y) is the expected value (mean) of variable X and Y respectively.

covariance is symmetric so cov(X,Y)=cov(Y,X).

The covariance is usefull when looking at the variance of the sum of the 2 random variables since:
var(X+Y) = var(X) +var(Y) +2cov(X,Y)

The covariance cov(x,y) is used to obtain the coefficient of correlation cor(x,y) by normalizing (dividing) cov(x,y) but the product of the standard deviations of x and y.
http://purl.obolibrary.org/obo/STATO_0000302	one sample t-test	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test		one sample t-test is a kind of Student's t-test which evaluates if a given sample can be reasonably assumed to be taken from the population.
The test compares the sample statistic (m) to the population parameter (M).

The one sample t-test is the small sample analog of the z test, which is suitable for large samples.
http://purl.obolibrary.org/obo/STATO_0000305	Helmert contrast	http://purl.obolibrary.org/obo/STATO_0000290	contrast		A Helmert contrast is a contrast in which the coefficients for the Helmert regressors compare each level with the average of the “preceding” ones
http://purl.obolibrary.org/obo/STATO_0000306	polynomial contrast	http://purl.obolibrary.org/obo/STATO_0000290	contrast		a polynomial contrast is a contrast which...
http://purl.obolibrary.org/obo/STATO_0000307	treatment contrast	http://purl.obolibrary.org/obo/STATO_0000290	contrast		treatment contrast is a contrast which allows to test how linear model coefficients of categorical variables are interpreted in case where  the “first” level (aka, the baseline) is included into the intercept and all subsequent levels have a coefficient that represents their difference from the baseline.
http://purl.obolibrary.org/obo/STATO_0000308	sum contrast	http://purl.obolibrary.org/obo/STATO_0000290	contrast		the sum contrast is a contrast in which each coefficient compares the corresponding level of the factor to the average of the other levels
http://purl.obolibrary.org/obo/STATO_0000310	Barnard's test	http://purl.obolibrary.org/obo/STATO_0000027	test of association between categorical variables		Barnard's test is an exact statistical test used to determine if there are non-random associations between two categorical variables. It was developed in 1949 by Barnard and is a test which is, most times, more powerful that the Fisher exact test
http://purl.obolibrary.org/obo/STATO_0000311	central composite design	http://purl.obolibrary.org/obo/OBI_0500000	study design		a central composite design is a study design which contains an imbedded factorial or fractional factorial design with center points that is augmented with a group of so-called 'star points' that allow estimation of curvature.
A CCD design with k factors has 2k star points.
http://purl.obolibrary.org/obo/STATO_0000312	Box–Behnkens design	http://purl.obolibrary.org/obo/OBI_0500000	study design		The Box-Behnken design is an independent quadratic design in that it does not contain an embedded factorial or fractional factorial design. In this design the treatment combinations are at the midpoints of edges of the process space and at the center. These designs are rotatable (or near rotatable) and require 3 levels of each factor. The designs have limited capability for orthogonal blocking compared to the central composite designs.
http://purl.obolibrary.org/obo/STATO_0000313	Plackett-Burman design	http://purl.obolibrary.org/obo/OBI_0500000	study design		Plackett-Burman design is a type of study design optimizing multifactorial experiments characterized by their parsimony  and economy  with the run number a multiple of 4 (rather than a power of 2). 
Plackett-Burman design is often used for screening experiments where the main effect is often heavily confounded with two-factor interactions.
This type of design is very useful for economically detecting large main effects, assuming all interactions are negligible when compared with the few important main effects.
http://purl.obolibrary.org/obo/STATO_0000314	upper confidence limit	http://purl.obolibrary.org/obo/STATO_0000633	cutoff		upper confidence limit is a data item which is a largest value bounding a confidence interval
http://purl.obolibrary.org/obo/STATO_0000315	lower confidence limit	http://purl.obolibrary.org/obo/STATO_0000633	cutoff		lower confidence limit is a data item which is a lowest value bounding a confidence interval
http://purl.obolibrary.org/obo/STATO_0000317	eta-squared	http://purl.obolibrary.org/obo/STATO_0000471	estimate		Eta-squared is a biased estimator of the variance explained by the model in the population (it estimates only the effect size in the sample). Eta-squared describes the ratio of variance explained in the dependent variable by a predictor while controlling for other predictors, making it analogous to the r2. This estimate shares the weakness with r2 that each additional variable will automatically increase the value of η2. In addition, it measures the variance explained of the sample, not the population, meaning that it will always overestimate the effect size, although the bias grows smaller as the sample grows larger.
http://purl.obolibrary.org/obo/STATO_0000318	omega-squared	http://purl.obolibrary.org/obo/STATO_0000471	estimate		omega-squared is a effect size estimate for variance explained which is less biased than the eta-squared coefficient.
http://purl.obolibrary.org/obo/STATO_0000319	Hedges's g	http://purl.obolibrary.org/obo/STATO_0000100	standardized mean difference		Hedges's g is an estimator of effect size, which is similar to Cohen's d and is a measure based on a standardized difference. However, the denominator, corresponding to a pooled standard deviation, is computed differently from Cohen's d coefficient, by applying a correction factor (which involves a Gamma function).
http://purl.obolibrary.org/obo/STATO_0000320	Glass's delta	http://purl.obolibrary.org/obo/STATO_0000100	standardized mean difference		Glass's delta is an estimator of effect size which is similar to Cohen's d but where the denominator corresponds only to the standard deviation of the control group (or second group). It is considered less biais than the Cohen's d for estimating effect sizes based on means and distances between means.
http://purl.obolibrary.org/obo/STATO_0000321	non-parametric distribution	http://purl.obolibrary.org/obo/STATO_0000329	symmetric distribution		Probability distribution estimated empirically on the data without assumptions on the shape of the probability distribution.
http://purl.obolibrary.org/obo/STATO_0000322	contrast weight	http://purl.obolibrary.org/obo/IAO_0000027	data item		a contrast weight is a coefficient which multiplies a group mean, part of a linear combinaison defining a constrast as a weighted sum of group means, giving a 'weight' to a specific group mean hence the name.
http://purl.obolibrary.org/obo/STATO_0000323	contrast weight matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix		a contrast weight matrix is a information content entity which holds a set of contrast weight, coefficient used in a weighting sum of means defining a contrast
http://purl.obolibrary.org/obo/STATO_0000324	contrast weight estimate	http://purl.obolibrary.org/obo/STATO_0000144	model parameter estimate		contrast weight estimate is a model parameter estimate which results from the computation from the data and that is used as input to a model fitting process
http://purl.obolibrary.org/obo/STATO_0000325	Akaike information criterion	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum		The Akaike information criterion (AIC) is a measure of the relative quality of a statistical model for a given set of data. As such, AIC provides a means for model selection. AIC is defined as:
 
AIC = 2K - 2log(L)
where K is the number of predictors and L is the maximized likelihood value.

AIC deals with the trade-off between the goodness of fit of the model and the complexity of the model. It is founded on information theory: it offers a relative estimate of the information lost when a given model is used to represent the process that generates the data. AIC does not provide a test of a model in the sense of testing a null hypothesis; i.e. AIC can tell nothing about the quality of the model in an absolute sense. If all the candidate models fit poorly, AIC will not give any warning of that.
http://purl.obolibrary.org/obo/STATO_0000326	corrected Akaike information criterion	http://purl.obolibrary.org/obo/STATO_0000325	Akaike information criterion		corrected Akaike information criteria is a modified version of the Akaike information criterion.
http://purl.obolibrary.org/obo/STATO_0000327	Bayesian information criterion	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum		Bayesian information criterion or Schwartz's Bayesian information criterion is a criterion for model selection among a finite set of models. It is based, in part, on the likelihood function and it is closely related to the Akaike information criterion (AIC).
Given any two estimated models, the model with the lower value of BIC is the one to be preferred. The BIC is an increasing function of sigma_e^2 and an increasing function of k. That is, unexplained variation in the dependent variable and the number of explanatory variables increase the value of BIC. Hence, lower BIC implies either fewer explanatory variables, better fit, or both.
http://purl.obolibrary.org/obo/STATO_0000328	statistical model selection	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		A statistical model selection is a data transformation which is based on computing a relative quality value in order to evaluate and select which model best explains data.
http://purl.obolibrary.org/obo/STATO_0000329	symmetric distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		Probability distribution which has no skew so its skewness=0
http://purl.obolibrary.org/obo/STATO_0000330	empirical distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		Probability distribution estimated empirically from all acquired data
http://purl.obolibrary.org/obo/STATO_0000331	histogram distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		Probability distribution estimated empirically on the data following a binning process
http://purl.obolibrary.org/obo/STATO_0000332	smooth kernel distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		Probability distribution estimated using a smooth kernel function to avoid making assumptions about the distribution of the data. The kernel density estimator is the estimated probability density function (pdf) of the random variable.
http://purl.obolibrary.org/obo/STATO_0000333	kernel mixture distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		kernel density estimation (KDE) is a non-parametric way to estimate the probability density function of a random variable
http://purl.obolibrary.org/obo/STATO_0000334	mixture distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		Mixture distribution is the probability distribution of a random variable that is derived from a collection of other random variables as follows: first, a random variable is selected by chance from the collection according to given probabilities of selection, and then the value of the selected random variable is realized.
http://purl.obolibrary.org/obo/STATO_0000335	survival distribution	http://purl.obolibrary.org/obo/STATO_0000330	empirical distribution		Probability distribution estimated empirically from a censored lifetime data
http://purl.obolibrary.org/obo/STATO_0000346	covariance structure	http://purl.obolibrary.org/obo/STATO_0000107	statistical model		a covariance structure is a data item which is part of a regression model and which indicates a pattern in the covariance matrix. The nature of covariance structure is specified before the regression analysis and various covariance structure may be tested and evaluated using information criteria to help choose the most suiteable model
http://purl.obolibrary.org/obo/STATO_0000348	rational quadratic anisotropic covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		The rational quadratic covariance function is used in spatial statistics, geostatistics, machine learning, image analysis, and other fields where multivariate statistical analysis is conducted on metric spaces. It is commonly used to define the statistical covariance between measurements made at two points that are d units distant from each other. Since the covariance only depends on distances between points, it is stationary. If the distance is Euclidean distance, the rational quadratic covariance function is also isotropic.
http://purl.obolibrary.org/obo/STATO_0000350	spatial spherical geometric anisotropic covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		spatial spherical geometric anisotropic covariance structure is a type of covariance structure characterized by its anisotropy, i.e., the variation of properties can be different in directions x and y, which is this case give spherical features.
http://purl.obolibrary.org/obo/STATO_0000354	banded heterogeneous Toeplitz covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		the banded heterogeneous Toeplitz covariance structure is a type of coviance structure which is often used to  analyzed and intepret repeated measure design.
http://purl.obolibrary.org/obo/STATO_0000356	banded Toeplitz covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		A banded Toeplitz structure, defined by parameter q, can be viewed as a moving-average structure with order q-1.
http://purl.obolibrary.org/obo/STATO_0000357	Toeplitz covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		The Toeplitz covariance structure has homogenous variances and heterogenous correlations between elements. The correlation between adjacent elements is homogenous across pairs of adjacent elements. The correlation between elements separated by a third is again homogenous, and so on.
http://purl.obolibrary.org/obo/STATO_0000359	equal diagonal Factor Analytic covariance structure	http://purl.obolibrary.org/obo/STATO_0000361	Factor Analytic covariance structure		factor-analytic structure is a covariance structure which is specified for q factors 
equal diagonal factor-analytic covariance structure is a type of factor analytic covariance structure specified for q factors, which includes a diagonal component for repeated measures.
http://purl.obolibrary.org/obo/STATO_0000360	no diagonal Factor Analytic covariance structure	http://purl.obolibrary.org/obo/STATO_0000361	Factor Analytic covariance structure		no diagonal factor-analytic covariance structure is a type of factor analytic covariance structure specified for q factors, which does not include a diagonal component for repeated measures.
http://purl.obolibrary.org/obo/STATO_0000361	Factor Analytic covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		factor-analytic structure is a type of heterogeneous covariance structure which is specified for q factors
http://purl.obolibrary.org/obo/STATO_0000362	compound symmetry covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		compound symmetry covariance structure is a covariance structure which means that all the variances are equal and all the covariances are equal.
http://purl.obolibrary.org/obo/STATO_0000363	heterogeneous compound symmetry covariance structure	http://purl.obolibrary.org/obo/STATO_0000362	compound symmetry covariance structure		heterogenous compound symmetry structure is a compound symmetry covariance structure which has a different variance parameter for each diagonal element, and it uses the square roots of these parameters in the off-diagonal entries.
http://purl.obolibrary.org/obo/STATO_0000365	first order autoregressive covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		first order autoregressive covariance structure is a covariance structure where correlations among errors decline exponentially with distance
http://purl.obolibrary.org/obo/STATO_0000366	heterogeneous first-order autoregressive covariance structure	http://purl.obolibrary.org/obo/STATO_0000365	first order autoregressive covariance structure		This is an homogeneous structure, i.e. the variance along the main diagonal is constant. The covariances decline exponentially. It has only 2 parameters.
http://purl.obolibrary.org/obo/STATO_0000367	Ante-dependence covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		Ante-dependence covariance structure is a covariance structure which specifies that the covariance between two time points is a function of the product of variances at both points (hence allowing hetrogenity of error variance across measures to affect the correlation) and the product of the correlations at the distances up to the one chosen.
http://purl.obolibrary.org/obo/STATO_0000368	Mallows' Cp	http://purl.obolibrary.org/obo/STATO_0000039	statistic		Mallows' Cp is a data item which compares the precision and bias of the full model to models with a subset of the predictors thus helping to choose between multiple regression models.

the mallows cp is a function of the number of parameter used in the model relying on the residuals sum of squares to compute a score.
the smaller Cp is, the better the model fit is.
http://purl.obolibrary.org/obo/STATO_0000369	repeated measure analysis	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		repeated measure analysis is a kind of data transformation which deals with signals measured in the same experimental units at different times and, possibly, under different conditions over a period of time. Data produced by longitudinal studies qualify for such analysis. Since measurements are made on the same experimental units a number of times, they are likely to be correlated. Repeated measure analysis usually takes into consideration the possibility of correlation with time. It does so by specifying covariance structure in the analysis
http://purl.obolibrary.org/obo/STATO_0000370	ordinary least squares estimation	http://purl.obolibrary.org/obo/STATO_0000119	model parameter estimation		the ordinary least squares estimation is a model parameter estimation for a linear regression model when the errors are uncorrelated and equal in variance. Is the Best Linear Unbiased Estimation (BLUE) method under these assumptions, Uniformly Minimum-Variance Unbiased Estimator (UMVUE) with addition of a Gaussian assumption.
http://purl.obolibrary.org/obo/STATO_0000371	weighted least squares estimation	http://purl.obolibrary.org/obo/STATO_0000119	model parameter estimation		the weighted least squares estimation is a model parameter estimation for a linear regression model with errors that independent but have heterogeneous variance. Difficult to use use in practice, as weights must be set based on the variance which is usually unknown. If true variance is known, it is the Best Linear Unbiased Estimation (BLUE) method under these assumptions, Uniformly Minimum-Variance Unbiased Estimator (UMVUE) with addition of a Gaussian assumption.
http://purl.obolibrary.org/obo/STATO_0000372	generalized least squares estimation	http://purl.obolibrary.org/obo/STATO_0000119	model parameter estimation		the generalized least squares estimation is a model parameter estimation for a linear regression model with errors that are dependent and (possibly) have heterogeneous variance. Difficult to use use in practice, as covariance matrix of the errors must known to "whiten" data and model. If true covariance is known, it is the Best Linear Unbiased Estimation (BLUE) method under these assumptions, Uniformly Minimum-Variance Unbiased Estimator (UMVUE) with addition of a Gaussian assumption.
http://purl.obolibrary.org/obo/STATO_0000373	iteratively reweighted least squares estimation	http://purl.obolibrary.org/obo/STATO_0000119	model parameter estimation		the iteratively reweighted least squares estimation is a model parameter estimation which is a practical implementation of Weighted Least Squares, where the heterogeneous variances of the errors are estimated from the residuals of the regression model, providing an estimate for the weights. Each successive estimate of the weights improves the estimation of the regression parameters, which in turn are used to compute residuals and update the weights
http://purl.obolibrary.org/obo/STATO_0000374	feasible generalized least squares estimation	http://purl.obolibrary.org/obo/STATO_0000372	generalized least squares estimation		the feasible generalized least squares estimation is a model parameter estimation which is a practical implementation of Generalised Least Squares, where the covariance of the errors is estimated from the residuals of the regression model, providing the information needed to whiten the data and model. Each successive estimate of the whitening matrix improves the estimation of the regression parameters, which in turn are used to compute residuals and update the whitening matrix.
http://purl.obolibrary.org/obo/STATO_0000375	residual mean square	http://purl.obolibrary.org/obo/STATO_0000039	statistic		a residual mean square is a data item which is obtained by dividing the sum of squared residuals (SSR) by the number of degrees of freedom
http://purl.obolibrary.org/obo/STATO_0000376	Z-statistic	http://purl.obolibrary.org/obo/STATO_0000039	statistic		Z-statistic is a statistic computed from observations and used to produce a p-value when compared to a Standard Normal Distribution in a statistical test called the Z-test.
http://purl.obolibrary.org/obo/STATO_0000377	deviance	http://purl.obolibrary.org/obo/STATO_0000039	statistic		Deviance is an indicator of fit and can be estimated by computing  -2 times the log-likelihood ratio of the fitted model compared to a saturated(full) model.
It is a generalization of the idea of using the sum of squares of residuals in ordinary least squares to cases where model-fitting is achieved by maximum likelihood.
http://purl.obolibrary.org/obo/STATO_0000378	deviance information criterion	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum		The deviance information criterion (DIC) is a hierarchical modeling generalization of the AIC (Akaike information criterion) and BIC (Bayesian information criterion, also known as the Schwarz criterion). It is particularly useful in Bayesian model selection problems where the posterior distributions of the models have been obtained by Markov chain Monte Carlo (MCMC) simulation. Like AIC and BIC it is an asymptotic approximation as the sample size becomes large. It is only valid when the posterior distribution is approximately multivariate normal.
The deviance information criterion was published in 2002 by Spiegelhalter et al.

Spiegelhalter, D. J., N. G. Best, B. P. Carlin, and A. van der Linde, 2002. Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society, B, 64, 583-639.
http://purl.obolibrary.org/obo/STATO_0000379	focused information criterion	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum		The focused information criterion is a measurement data item which aims at facilitating model selection. It was published in 2003 by Claeskens, G. and Hjort, N.L. (2003). "The focused information criterion".
http://purl.obolibrary.org/obo/STATO_0000383	contrast estimation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		a data transformation that finds a contrast value (the contrast estimate) by computing the weighted sum of model parameter estimates using a set of contrast weights.
http://purl.obolibrary.org/obo/STATO_0000384	contrast estimate	http://purl.obolibrary.org/obo/STATO_0000471	estimate		estimate of a contrast obtained by computing the weighted sum of model parameter estimates using a set of contrast weights.
http://purl.obolibrary.org/obo/STATO_0000385	standard error of a contrast estimate	http://purl.obolibrary.org/obo/STATO_0000562	standard error of estimate		an estimate of the standard deviation of a contrast estimate sampling distribution.
http://purl.obolibrary.org/obo/STATO_0000386	scree plot	http://purl.obolibrary.org/obo/IAO_0000038	graph		A scree plot is a graphical display of the variance of each component in the dataset which is used to determine how many components should be retained in order to explain a high percentage of the variation in the data
http://purl.obolibrary.org/obo/STATO_0000387	scatterplot matrix	http://purl.obolibrary.org/obo/IAO_0000038	graph		A scatterplot matrix contains all the pairwise scatter plots of a set of variables on a single page in a matrix format.
http://purl.obolibrary.org/obo/STATO_0000388	alpha distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		The alpha distribution is a continuous probability distribution whose density function is as defined at: https://docs.scipy.org/doc/scipy-1.0.0/reference/tutorial/stats/continuous_alpha.html
http://purl.obolibrary.org/obo/STATO_0000389	power law distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		a power-law probability distribution is a probability distribution whose density function (or mass function in the discrete case) has the form

p(x) = L(x) . x^{-alpha}

where alpha is a parameter >1 and L(x) is a slowly varying function.
http://purl.obolibrary.org/obo/STATO_0000390	regression model	http://purl.obolibrary.org/obo/STATO_0000107	statistical model		A regression model is a statistical model used in a type of analysis knowns as regression analysis, whereby a function is used to determine the relation between a response variable and an independent variable , with a set of unknown parameters.
http://purl.obolibrary.org/obo/STATO_0000392	Pareto type-I probability distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		The Pareto distribution is a continuous probability distribution, which is defined by the follwoing probability density (1) function and distribution function (2)

(1): P(x)=(ab^a)/(x^(a+1))	

(2): D(x)=1-(b/x)^a	

defined over the interval x>=b.
http://purl.obolibrary.org/obo/STATO_0000393	Pareto type-II probability distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		the Pareto type-II probability distribution is a continuous probability distribution which is defined by a probability density function characterized by 2 parameters, alpha and lambda, 2 real, strictly positive numbers. alpha is known as the shape parameter while lambda is known as the scale parameter.

the function defines the probably of a continous random variable according to the following:

 p(x) = {\alpha \over \lambda} \left[{1+ {x \over \lambda}}\right]^{-(\alpha+1)}, \qquad x \geq 0,
http://purl.obolibrary.org/obo/STATO_0000394	Pareto type-III probability distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		The Pareto(III) distribution is a continous probability distribution which is described with a cumulative distribution function of the following form:

F(x) = 1 − [1 + ((x − mu)/sigma)1/gamma]−1
for x > mu, sigma > 0, gamma > 0 and s =1. 

a is the location parameter, 
b is the scale parameter, 
g is the inequality parameter
s is the shape parameter of value 1

The Pareto III distribution corresponds to a Pareto Type IV distribution where the shape parameter has a value of 1.
http://purl.obolibrary.org/obo/STATO_0000395	Pareto type-IV probability distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		The Pareto(IV) distribution is a continous probability distribution which is described with a cumulative distribution function of the following form:

F(y) = 1 − [1 + ((y − a)/b)1/g]−s
for y > a, b > 0, g > 0 and s > 0. 

a is the location parameter, 
b is the scale parameter, 
g is the inequality parameter
s is the shape parameter

The distribution is used in actuarial science, economics, finance and telecommunications, but not restricted to those fields.
http://purl.obolibrary.org/obo/STATO_0000396	geometric mean	http://purl.obolibrary.org/obo/STATO_0000029	measure of central tendency		The geometric mean is defined as the nth root of the product of n numbers, i.e., for a set of numbers \{x_i\}_{i=1}^N, the geometric mean is defined as \left(\prod_{i=1}^N x_i\right)^{1/N}.
http://purl.obolibrary.org/obo/STATO_0000397	harmonic mean	http://purl.obolibrary.org/obo/STATO_0000029	measure of central tendency		The harmonic mean is a kind of mean which is calculated by dividing the total number of observations by the reciprocal of each number in a series.

Harmonic Mean = N/(1/a1+1/a2+1/a3+1/a4+.......+1/aN)  
where a(i)= Individual score and  N = Sample size (Number of scores)
http://purl.obolibrary.org/obo/STATO_0000398	weighted arithmetic mean	http://purl.obolibrary.org/obo/STATO_0000029	measure of central tendency		The weighted arithmetic mean is a measure of central tendency that is the sum of the products of each observed value and their respective non-negative weights, divided by the sum of the weights, such that the contribution of each observed value to the mean may defer according to its respective weight. It is defined by the formula: A = sum(vi*wi)/sum(wi), where 'i' ranges from 1 to n, 'vi' is the value of each observation, and 'wi' is the value of the respective weight for each observed value.

The weighted arithmetic mean is a kind of mean similar to an ordinary arithmetic mean (the most common type of average), except that instead of each of the data points contributing equally to the final average, some data points are weighted, meaning they contribute more than others. 

The weighted arithmetic mean is often used if one wants to combine average values from samples of the same population with different sample sizes.
http://purl.obolibrary.org/obo/STATO_0000401	sample mean	http://purl.obolibrary.org/obo/STATO_0000573	average value		the sample mean of sample of size n with n observations is an arithmetic mean computed over n number of observations on a statistical sample.
The sample mean, denoted x¯ and read “x-bar,” is simply the average of the n data points x1, x2, ..., xn:

x¯=x1+x2+⋯+xnn=1n∑i=1nxi
The sample mean summarizes the "location" or "center" of the data.

the sample mean is a measure of dispersion of the observations made on the sample and provides an unbias estimate of the population mean
http://purl.obolibrary.org/obo/STATO_0000405	unstructured covariance structure	http://purl.obolibrary.org/obo/STATO_0000346	covariance structure		A covariance structure where no restrictions are made on the covariance between any pair of measurements.
http://purl.obolibrary.org/obo/STATO_0000472	degree of freedom approximation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		An estimate of the number of degrees of freedom.
http://purl.obolibrary.org/obo/STATO_0000663	F-beta score	http://purl.obolibrary.org/obo/STATO_0000623	measure of agreement		The F-beta score is a measure of a model's accuracy, specifically designed to evaluate the performance of binary classification models. It is a generalized form of the F1 score, which itself is the harmonic mean of precision and recall. The F-beta score introduces a way to weigh recall more heavily than precision (or vice versa) depending on the value of beta.
http://purl.obolibrary.org/obo/STATO_0000683	simple chance-corrected agreement coefficient	http://purl.obolibrary.org/obo/STATO_0000682	kappa		A Kappa statistic in which the expected agreement by chance is based on an assumption that all possible categories for assignment are equally likely.
http://purl.obolibrary.org/obo/STATO_0000686	mean calibration	http://purl.obolibrary.org/obo/STATO_0000685	mesure of calibration		A measure of calibration that is the average of a function of the difference between the expected values and the observed values.
http://purl.obolibrary.org/obo/STATO_0000694	probability distribution standard deviation	http://purl.obolibrary.org/obo/STATO_0000237	standard deviation		A probability distribution attribute that is the square root of the distribution variance.
http://purl.obolibrary.org/obo/STATO_0000696	value-time curve	http://purl.obolibrary.org/obo/IAO_0000038	graph		a value-time curve is a graph which plot time on the x-axis versus the value of a variable of interest as delivered by a process. It is used to represent the relationship there is the value and the time it takes to achieve that value. It is commonly used in project management or business analysis to evaluate and optimize how efficiently value is delivered over time.
http://purl.obolibrary.org/obo/STATO_0000697	homogeneity test	http://purl.obolibrary.org/obo/OBI_0000673	statistical hypothesis test		an homogeneity test is a statistical test aiming at evaluate if the statisical measure from several random samples are similar
http://purl.obolibrary.org/obo/STATO_0000701	Chi-square test for homogeneity	http://purl.obolibrary.org/obo/STATO_0000697	homogeneity test		a statistical test which test for homogeneity of proportions, which is used when comparing proportions observed across multiple groups. It relies on frequencies calculated in contingency tables. It determines is proportions are consistent.
http://purl.obolibrary.org/obo/STATO_0000702	prior probability	http://purl.obolibrary.org/obo/STATO_0000039	statistic		A statistic that represents the likelihood of a parameter of interest, before accounting for the observed data from the study.
http://purl.obolibrary.org/obo/STATO_0000705	measurement accuracy as a percentage	http://purl.obolibrary.org/obo/STATO_0000639	percentage		Measurement accuracy refers to how close a measured value is to the true or accepted value of the quantity being measured.
It is a percentage in which the numerator represents the absolute value of one minus the difference between the true value and the observed value, and the denominator represents the true value.


Accuracy is often expressed as a percentage of the true value, based on the relative error:

Accuracy (%)= (1 −∣Measured Value−True Value∣/True Value)×100

True Value: The reference or accepted value.
Measured Value: The value obtained during the measurement.
http://purl.obolibrary.org/obo/STATO_0000708	Cochran-Armitage Chi-Square Statistic for Trend	http://purl.obolibrary.org/obo/STATO_0000030	Chi-Squared statistic		A chi-square statistic used to test whether a dichotomous variable and an ordinal variable are related.

The statistic is calculated using a weighted sum of the differences between observed and expected frequencies, where the weights are based on the scores assigned to the ordered categories. Mathematically, it's often expressed as:
Z = ∑(wi(Oi-Ei)/sqr( wi^2.Ei(1-Ei))
 
where wi  are the scores, Oi observed frequencies, and Ei expected frequencies.
http://purl.obolibrary.org/obo/stato.owl#http://purl.obolibrary.org/obo/STATO_0000698	Chi-squared statistic for homogeneity	http://purl.obolibrary.org/obo/STATO_0000030	Chi-Squared statistic		A Chi-squared statistic used to test whether observed data from two or more groups follow the same distribution.
http://purl.obolibrary.org/obo/STATO_0000482	trait-specific relationship matrix best linear unbiaised prediction	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		a data transformation which calculate estimates of genomic estimated breeding values (GEBVs) on an animal or plant model utilizing trait-specific marker information.
http://purl.obolibrary.org/obo/STATO_0000468	model fixed effect term	http://purl.obolibrary.org/obo/STATO_0000466	model term		a model fixed effect term is a model term which accounts for  variation explained by an independent variable and its levels.
http://purl.obolibrary.org/obo/STATO_0000478	pedigree data set	http://purl.obolibrary.org/obo/IAO_0000100	data set		a dataset which is made up of pedigree information, that is presenting ancestry or lineage information in a set of individuals of an organism.
http://purl.obolibrary.org/obo/STATO_0000620	risk	http://purl.obolibrary.org/obo/STATO_0000607	proportion		A proportion in which the numerator represents the cases in which an event or characteristic occurs and the denominator represents all possible cases.
http://purl.obolibrary.org/obo/GO_0003674	molecular_function	http://purl.obolibrary.org/obo/BFO_0000015	process		A molecular process that can be carried out by the action of a single macromolecular machine, usually via direct physical interactions with other molecular entities. Function in this sense denotes an action, or activity, that a gene product (or a complex) performs. These actions are described from two distinct but related perspectives: (1) biochemical activity, and (2) role as a component in a larger system/process.
http://purl.obolibrary.org/obo/GO_0003824	catalytic activity	http://purl.obolibrary.org/obo/GO_0003674	molecular_function		Catalysis of a biochemical reaction at physiological temperatures. In biologically catalyzed reactions, the reactants are known as substrates, and the catalysts are naturally occurring macromolecular substances known as enzymes. Enzymes possess specific binding sites for substrates, and are usually composed wholly or largely of protein, but RNA that has catalytic activity (ribozyme) is often also regarded as enzymatic.
http://purl.obolibrary.org/obo/GO_0003964	RNA-directed DNA polymerase activity	http://purl.obolibrary.org/obo/GO_0003824	catalytic activity		Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). Catalyzes RNA-template-directed extension of the 3'- end of a DNA strand by one deoxynucleotide at a time.
http://purl.obolibrary.org/obo/GO_0008150	biological_process	http://purl.obolibrary.org/obo/BFO_0000015	process		A biological process represents a specific objective that the organism is genetically programmed to achieve. Biological processes are often described by their outcome or ending state, e.g., the biological process of cell division results in the creation of two daughter cells (a divided cell) from a single parent cell. A biological process is accomplished by a particular set of molecular functions carried out by specific gene products (or macromolecular complexes), often in a highly regulated manner and in a particular temporal sequence.
http://purl.obolibrary.org/obo/GO_0010467	gene expression	http://purl.obolibrary.org/obo/GO_0008150	biological_process		The process in which a gene's sequence is converted into a mature gene product or products (proteins or RNA). This includes the production of an RNA transcript as well as any processing to produce a mature RNA product or an mRNA or circRNA (for protein-coding genes) and the translation of that mRNA or circRNA into protein. Protein maturation is included when required to form an active form of a product from an inactive precursor form.
http://purl.obolibrary.org/obo/GO_0043234	protein complex	http://purl.obolibrary.org/obo/BFO_0000040	material entity		A stable macromolecular complex composed (only) of two or more polypeptide subunits along with any covalently attached molecules (such as lipid anchors or oligosaccharide) or non-protein prosthetic groups (such as nucleotides or metal ions). Prosthetic group in this context refers to a tightly bound cofactor. The component polypeptide subunits may be identical.
http://purl.obolibrary.org/obo/STATO_0000615	relative difference	http://purl.obolibrary.org/obo/STATO_0000613	difference		A statistic that is a difference between 1 and a ratio of the two quantities being compared.

Relative Difference = 1 - ( a / b ).   Because 1 - ( a / b ) is not equal to 1 - ( b / a ), Relative Difference may be expressed as "Relative Difference with respect to b" when referring to 1 - ( a / b ).


The relative difference can also be defined as a statistic that is a ratio of the absolute difference (of the two quantities being compared) to the reference value (one of the quantities being compared). Relative Difference = ( b - a ) / ( b ) where b is the reference value and this may also be called "Relative Difference with respect to b".
http://purl.obolibrary.org/obo/STATO_0000483	Bayes A	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		Bayes A is a data transformation used in the context of estimating breeding value, which relies on a Bayesian model and treats the prior probability π that a SNP has zero effect as unknown (i.e π=0)
http://purl.obolibrary.org/obo/STATO_0000485	estimated breeding value	http://purl.obolibrary.org/obo/STATO_0000471	estimate		the estimated breeding value of an organism is a data item computed to estimate the true breeding value defined as genetic merit of an organism, half of which will be passed on to its progeny.  While the exact breeding value can not been known, for performance traits it is possible to make good estimates. These estimates are called Estimated Breeding Values (EBVs).  EBVs are expressed in the units of measurement for each particular trait. These estimates are output of various estimation methods which differ depending on the underlying assumptions (equal variance of marker effect, all markers contributing to the trait) , the mathemical methods used (bayesian or non-bayesians) and the genetic inheritance models being considered (additive, dominant, epistatic) selected by the analysts.
http://purl.obolibrary.org/obo/STATO_0000492	Bayesian least absolute shrinkage and selection operator	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		Bayesian LASSO is a data transformation where the regression parameters have independent Laplace (i.e., double-exponential) priors and are used to interprete Lasso estimate for linear regression parameters as Bayesian posterior mode estimates in accordance to a Bayesian framework.
http://purl.obolibrary.org/obo/STATO_0000693	probability distribution variance	http://purl.obolibrary.org/obo/STATO_0000113	variance		A distribution variance is defined as a probability distribution attribute that is the expected value of the square of the difference of the value of a variable that has that distribution from its expected value.
http://purl.obolibrary.org/obo/STATO_0000556	number of orthogonal components	http://purl.obolibrary.org/obo/STATO_0000047	count		number of orthogonal components is a count used as input to the orthogonal partial least square discriminant analysis (OPLS-DA)
http://purl.obolibrary.org/obo/STATO_0000685	mesure of calibration	http://purl.obolibrary.org/obo/STATO_0000039	statistic		A measure of association between a variable representing known or true values and a variable representing measured or predicted values.
http://purl.obolibrary.org/obo/STATO_0000688	calibration intercept	http://purl.obolibrary.org/obo/STATO_0000685	mesure of calibration		A measure of calibration that is the difference between the mean expected value and the mean observed value.
http://purl.obolibrary.org/obo/STATO_0000689	partial area under the receiver operator characteristic curve	http://purl.obolibrary.org/obo/STATO_0000209	area under curve		An area under the curve where the curve is the true positive rate and the range of interest is a specified portion of the range of possible values for the false positive rate and/or range of possible values for the true positive rate.
http://purl.obolibrary.org/obo/OBI_0002444	measurand role	http://purl.obolibrary.org/obo/BFO_0000023	role		A role borne by a material entity and realized in an assay which achieves the objective to measure the magnitude/concentration/amount of the measurand in the entity bearing evaluant role.
http://purl.obolibrary.org/obo/STATO_0000554	number of cross-validation segments	http://purl.obolibrary.org/obo/STATO_0000047	count		number of cross-validation segments is a count which is used as input parameter in a cross validation procedure to evaluate a statistical model.
http://purl.obolibrary.org/obo/STATO_0000595	number of true positive	http://purl.obolibrary.org/obo/STATO_0000047	count		the number of true positive is a count which denotes how many elements are correctly classified as having a feature they are actually known to be having (e.g. carrier of a pathogen).
http://purl.obolibrary.org/obo/STATO_0000596	number of false positive	http://purl.obolibrary.org/obo/STATO_0000047	count		the number of false positive is a count which denotes how many elements known to be void of feature are wrongly classified as having it (e.g. being diagnosed with a disease when one is totally healthy)
http://purl.obolibrary.org/obo/STATO_0000597	number of true negative	http://purl.obolibrary.org/obo/STATO_0000047	count		the number of true negative is a count which denotes how many elements are correctly classified as void of a feature they are actually known to be missing (e.g. free of pathogen).
http://purl.obolibrary.org/obo/STATO_0000598	number of false negative	http://purl.obolibrary.org/obo/STATO_0000047	count		the number of false negative is a count which denotes how many elements known to be having a feature are wrongly classified as being devoided of it (e.g. being given an all clear while being actually infected and carrier of a pathogen)
http://purl.obolibrary.org/obo/STATO_0000599	point estimate	http://purl.obolibrary.org/obo/STATO_0000471	estimate		a point estimate is a data item which provides a particular value evaluating a population parameter
http://purl.obolibrary.org/obo/STATO_0000600	interval estimate	http://purl.obolibrary.org/obo/STATO_0000471	estimate		an interval estimate is a data item corresponding to a range of values likely to contain the population parameter of interest
http://purl.obolibrary.org/obo/STATO_0000601	simultaneous multiple testing correction	http://purl.obolibrary.org/obo/OBI_0200089	multiple testing correction method		simultaneous multiple testing method is a multiple testing correction method which...
http://purl.obolibrary.org/obo/STATO_0000602	sequential multiple testing correction method	http://purl.obolibrary.org/obo/OBI_0200089	multiple testing correction method		sequential multiple testing method is a multiple testing correction method which...
http://purl.obolibrary.org/obo/STATO_0000603	alpha debt	http://purl.obolibrary.org/obo/STATO_0000602	sequential multiple testing correction method		a sequential multiple correction procedure which does not maintain a constant false positive rate but allows it to grow controllably.
http://purl.obolibrary.org/obo/STATO_0000604	alpha investing procedure	http://purl.obolibrary.org/obo/STATO_0000602	sequential multiple testing correction method		a type of sequential multiple testing correction method
http://purl.obolibrary.org/obo/STATO_0000605	alpha spending procedure	http://purl.obolibrary.org/obo/STATO_0000602	sequential multiple testing correction method		a type of sequential multiple testing correction
http://purl.obolibrary.org/obo/STATO_0000606	q-statistic	http://purl.obolibrary.org/obo/STATO_0000039	statistic		q is the basic statistic for the studentized range distribution, which is used for multiple comparison procedures, such as the single step procedure Tukey's range test, the Newman–Keuls method, and the Duncan's step down procedure, and establishing confidence intervals that are still valid after data snooping has occurred
http://purl.obolibrary.org/obo/STATO_0000607	proportion	http://purl.obolibrary.org/obo/STATO_0000184	ratio		a proportion is a ratio which corresponds to the fraction of the total presenting a particular feature
http://purl.obolibrary.org/obo/STATO_0000609	measure of hypothesis testing	http://purl.obolibrary.org/obo/STATO_0000039	statistic		
http://purl.obolibrary.org/obo/STATO_0000610	measure of association	http://purl.obolibrary.org/obo/STATO_0000039	statistic		measure of association is a statistic which quantitatively represents a relationship between two or more variables
http://purl.obolibrary.org/obo/STATO_0000613	difference	http://purl.obolibrary.org/obo/STATO_0000039	statistic		A statistic that is a subtraction of one quantity from another.
http://purl.obolibrary.org/obo/STATO_0000614	absolute difference	http://purl.obolibrary.org/obo/STATO_0000613	difference		A statistic that is a subtraction of one quantity from another, with no modification of the resulting value.

As a type of statistic, "Absolute Difference" is the actual difference between two quantities and can be positive or negative depending on the order of subtraction. The term "Absolute Difference" should not be confused with the mathematical term 'absolute value' which is a numerical value without a negative sign.
http://purl.obolibrary.org/obo/STATO_0000616	count difference	http://purl.obolibrary.org/obo/STATO_0000614	absolute difference		A statistic that is a subtraction of one count from another.

The term Count Difference is used to specify the Absolute Difference is with respect to a count or number of items (such as number of events, platelet counts, sample size e.g. number of people in the group) to distinguish from differences in other types of statistics (mean difference, median difference, risk difference, etc.)
http://purl.obolibrary.org/obo/STATO_0000618	Cohen's d	http://purl.obolibrary.org/obo/STATO_0000100	standardized mean difference		A standardized mean difference which is calculated as a difference between two means, divided by a square root of an average of the variances of the two groups.

In Cohen's d statistic, the statistical measure of dispersion is specified as the square root of an average of the variances of the two groups being compared. The variances of the two groups are based on within-group standard deviations.

For sample sizes < 50, a correction factor is used.
http://purl.obolibrary.org/obo/STATO_0000619	negative predictive value	http://purl.obolibrary.org/obo/STATO_0000607	proportion		A proportion in which the numerator represents the correctly non-detected items within the denominator that represents all items not detected.
http://purl.obolibrary.org/obo/STATO_0000621	diagnostic yield	http://purl.obolibrary.org/obo/STATO_0000607	proportion		diagnostic yield is a proportion in which the numerator represents the correctly detected items within the denominator that represents all items tested.
http://purl.obolibrary.org/obo/STATO_0000622	ratio-based measure of association	http://purl.obolibrary.org/obo/STATO_0000610	measure of association		ratio-based measure of association is a measure of association which relies on a quotient of 2 quantities to indicate the strength of the association.
http://purl.obolibrary.org/obo/STATO_0000623	measure of agreement	http://purl.obolibrary.org/obo/STATO_0000610	measure of association		A measure of association of two variables representing measurements of the same attribute of an entity.
http://purl.obolibrary.org/obo/STATO_0000624	Gini index	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion		The Gini index, or Gini coefficient, is a statistical measure of dispersion which assesses the pattern of data in a frequency distribution, in particular it measures the inequality among values. 

Geometrically, the Gini index can be defined as the ratio of 2 areas delineated by 2 curves, the line of equality (curve 1) and the Lorenz curve (curve 2). The numerator of the ratio corresponds to the area defined between curve 1 and curve 2 , while the denominator corresponds to the entire area below curve 1.

Mathematically, the Gini index is defined as half of the relative mean absolute difference.

It was therefore originally used to assess income inequality and is named after the italian statistician Corrado Gini who published it in 1912.
http://purl.obolibrary.org/obo/STATO_0000625	relative mean difference	http://purl.obolibrary.org/obo/STATO_0000615	relative difference		A statistic that is a difference between 1 and a ratio of the two mean values being compared.

Relative Mean Difference = 1 - ( a / b ) where a and b are mean values.

The relative mean difference can also be defined as a statistic that is a ratio of the difference in means to the reference mean value. Relative Mean Difference = ( b - a ) / ( b ) where b is the reference mean value and a is another mean value.
http://purl.obolibrary.org/obo/STATO_0000626	relative risk difference	http://purl.obolibrary.org/obo/STATO_0000615	relative difference		A statistic that is a difference between 1 and a ratio of the two risk values being compared.

Relative Risk Difference = 1 - ( a / b ) where a and b are risk values.

The relative risk difference can also be defined as a statistic that is a ratio of the risk difference to the risk used as a reference. Relative Risk Difference = ( b - a ) / ( b ) where b is the reference risk value and a is another risk value.
http://purl.obolibrary.org/obo/STATO_0000627	odds	http://purl.obolibrary.org/obo/STATO_0000184	ratio		odds correspond to a ratio in which the numerator represents the probability that an event will occur and the denominator represents the probability that an event will not occur.

'Odds' and 'Odds ratio' are different terms. 'Odds' is a ratio of probabilities. 'Odds ratio' is a ratio of two different odds.

Odds are calculated as p / (1-p) where p is the probability of event occurrence.  When p = 0, the odds = 0. When p = 1, the odds may be expressed as not calculable or as "odds against = 0".

Odds may be expressed as p:(1-p). Odds may be expressed as p:q where q = 1-p. Odds may be expressed as a:b where a and b are multiples of p and (1-p). Examples of different expressions of the same odds include 3:2, 3/2, 0.6:0.4, 0.6/0.4, and 1.5.

Odds may be expressed as "odds for" or "odds in favor" (e.g. 1:5 for a "3" on a 6-sided die) or "odds against" (e.g. 5:1 against a "3" on a 6-sided die).

The term "betting odds" used in gambling that involves financial amounts in the formulation is not an "Odds" in the definition of the Scientific Evidence Code System.
http://purl.obolibrary.org/obo/STATO_0000629	Scott's pi	http://purl.obolibrary.org/obo/STATO_0000682	kappa		A Kappa statistic where the expected agreement between two raters is expressed in terms of the square of arithmetic means of marginal proportions of each assessment category.

Scott's pi is a kappa statistic for two raters that assumes the likelihood of each category for assignment is based on the same distribution of rater responses, leading to the use of squared arithmetic means of the marginal proportion of each assessment category as its estimate of "chance agreement."


Pr(expected) is calculated using squared "joint proportions" which are squared arithmetic means of the marginal proportions of each assessment category, in contrast to Cohen's Kappa which uses squared geometric means.


Scott's pi = ( p[a] - p[e] ) / ( 1 - p[e] ) where p[a] is the observed percent agreement and p[e] is the expected chance agreement expressed as the squared joint proportions of the marginal sums.
http://purl.obolibrary.org/obo/STATO_0000631	modified Cohen’s kappa for more than 2 raters	http://purl.obolibrary.org/obo/STATO_0000682	kappa		A Kappa statistic in which the expected agreement by chance is based on an assumption that the likelihood of each category for assignment is based on the proportion observed, and the number of raters is more than 2.

A Kappa statistic is a measure of agreement among categorical assessments, corrected for chance agreement.

In the modified Cohen's kappa for more than 2 raters, the expected chance agreement is modeled as the summation of the differences, between the square of the expected probability of the category and the quotient of its variance divided by the number of raters, for each category.

The modified Cohen's kappa for more than 2 raters is calculated as ( p[a] - p[e] ) / ( 1 - p[e] ) where p[a] is the observed percent agreement and p[e] is the expected chance agreement.
http://purl.obolibrary.org/obo/STATO_0000634	reciprocal of difference	http://purl.obolibrary.org/obo/STATO_0000039	statistic		A statistic that is a quotient of one and a difference.
http://purl.obolibrary.org/obo/STATO_0000641	Friedman test	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test		The Friedman test is a non-parametric statistical test used to determine whether there are statistically significant differences among multiple related groups. It is an extension of the Wilcoxon signed-rank test for more than two related samples.
http://purl.obolibrary.org/obo/STATO_0000644	sign test	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		The sign test is a non-parametric hypothesis test used to assess whether the median of a single population is equal to a specified value, typically referred to as the null hypothesis. The sign test is particularly useful when the data are not normally distributed or when the assumptions required for parametric tests are not met.

Note that the 'sign test' is related but different from the "Wilcoxon signed rank test'.

Sign test: It does not assume any specific distribution for the data. It only requires paired data and makes no assumptions about the shape of the underlying distribution.
Wilcoxon signed-rank test: It assumes that the differences between paired observations come from a symmetric distribution. It's also more powerful than the sign test when the distribution is continuous and symmetric.
http://purl.obolibrary.org/obo/STATO_0000645	rate	http://purl.obolibrary.org/obo/STATO_0000184	ratio		A ratio in which the numerator represents any quantity and the denominator represents an interval of time.
http://purl.obolibrary.org/obo/STATO_0000647	standard error of the proportion	http://purl.obolibrary.org/obo/STATO_0000562	standard error of estimate		A measure of dispersion applied to proportions across hypothetical repeated random samples.

A standard error is used to quantify the uncertainty around a statistical estimate due to random sampling error.
http://purl.obolibrary.org/obo/STATO_0000648	standard error of the difference between independent means	http://purl.obolibrary.org/obo/STATO_0000562	standard error of estimate		A measure of dispersion applied to differences between means of independent groups across hypothetical repeated random samples.

A standard error is used to quantify the uncertainty around a statistical estimate due to random sampling error.
http://purl.obolibrary.org/obo/STATO_0000649	standard error of the difference between independent proportions	http://purl.obolibrary.org/obo/STATO_0000562	standard error of estimate		A measure of dispersion applied to differences between proportions arising from independent groups across hypothetical repeated random samples.

A standard error is used to quantify the uncertainty around a statistical estimate due to random sampling error.
http://purl.obolibrary.org/obo/STATO_0000651	statistical sample	http://purl.obolibrary.org/obo/BFO_0000004	independent continuant		the output of a statistical sampling, a draw from a distribution or a population of physical or immaterial entities.
http://purl.obolibrary.org/obo/STATO_0000653	Hosmer-Lemeshow goodness-of-fit test	http://purl.obolibrary.org/obo/STATO_0000191	goodness of fit statistical test		The Hosmer-Lemeshow test is a statistical test used to assess the goodness of fit of a logistic regression model. It evaluates how well the predicted probabilities from the model match the observed outcomes in the data. The test helps determine whether the logistic regression model adequately captures the relationship between the predictor variables and the binary outcome variable.
The test statistic follows a chi-square distribution with degrees of freedom equal to the number of groups minus the number of parameters estimated in the logistic regression model.
http://purl.obolibrary.org/obo/STATO_0000654	paired observation comparison objective	http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective		the objective of a data transformation to test a null hypothesis of absence of difference within subject holds.
http://purl.obolibrary.org/obo/STATO_0000655	calibration plot	http://purl.obolibrary.org/obo/IAO_0000573	line graph		calibration plot is a line graph which plots values resulting from predictions againts values obtained through observation.
http://purl.obolibrary.org/obo/STATO_0000656	slope	http://purl.obolibrary.org/obo/IAO_0000027	data item		the slope of a line graph is a data item denoting the rate of change between the two variables represented on the graph.
the slope is used to visualize and interpret the relationship between two variables.
http://purl.obolibrary.org/obo/STATO_0000657	intercept	http://purl.obolibrary.org/obo/IAO_0000027	data item		an intercept is a data item which corresponds to where a graph line cuts (intercepts) an coordinates axis.
http://purl.obolibrary.org/obo/STATO_0000658	mean time-to-event	http://purl.obolibrary.org/obo/STATO_0000573	average value		A mean of values in which each value is the duration of time between the start of observation and the occurrence of an event.
http://purl.obolibrary.org/obo/STATO_0000659	median time-to-event	http://purl.obolibrary.org/obo/STATO_0000574	center value		A median of values in which each value is the duration of time between the start of observation and the occurrence of an event.
http://purl.obolibrary.org/obo/STATO_0000665	difference in differences	http://purl.obolibrary.org/obo/STATO_0000614	absolute difference		A statistic that is a subtraction of one difference from another.
http://purl.obolibrary.org/obo/STATO_0000676	number density	http://purl.obolibrary.org/obo/STATO_0000675	density		A ratio in which the numerator represents a count and the denominator represents an interval of space (distance, area, or volume).
http://purl.obolibrary.org/obo/STATO_0000555	number of predictive components	http://purl.obolibrary.org/obo/STATO_0000047	count		number of predictive components is a count used as input to the principle component analysis (PCA)
http://purl.obolibrary.org/obo/STATO_0000583	completely randomized design	http://purl.obolibrary.org/obo/OBI_0500000	study design		a completely randomized design is a type of design of experiment where the observation unit receive treatments (independent variable level) entirely at random. In other words, the observations unit are randomly assigned to treatments.
Completely randomized designs differ from randomized complete block design and should not be confused as in the latter, a blocking variable is first use to assign experimental units to blocks. Then only, the members of each block are then randomly assigned to different treatment groups
http://purl.obolibrary.org/obo/STATO_0000480	ridge regression best linear unbiaised predictor	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		RR-BLUP is a data transformation used in the context of estimating breeding value using a Bayesian ridge regression. It can be obtained from Bayes B procedure by setting pi parameter to zero ( )  and assuming that all the markers have the same variance.
http://purl.obolibrary.org/obo/STATO_0000427	restricted maximum likelihood estimation	http://purl.obolibrary.org/obo/STATO_0000428	maximum likelihood estimation		restricted maximum likelihood estimation is a kind of maximum likelihood estimation data transformation which estimates the variance components of random-effects in univariate and multivariate meta-analysis. in contrast to 'maximum likelihood estimation', reml can produce unbiased estimates of variance and covariance parameters.
http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		breeding value estimation is a data transformation process aiming at computing breeding value estimates of an organism given a set of genomic (SNP) observations, pedigree information and/or phenotypic observations.
http://purl.obolibrary.org/obo/STATO_0000338	breeding value estimation using genotype data	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		breeding value estimation is a data transformation process aiming at computing breeding value estimates of an organism given a set of genomic (SNP) observations.
http://purl.obolibrary.org/obo/STATO_0000339	breeding value estimation using pedigree data	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		breeding value estimation is a data transformation process aiming at computing breeding value estimates of an organism given a set of pedigree information.
http://purl.obolibrary.org/obo/STATO_0000340	breeding value estimation using phenotypic data	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		breeding value estimation is a data transformation process aiming at computing breeding value estimates of an organism given a set of phenotypic observations.
http://purl.obolibrary.org/obo/STATO_0000342	genomic selection objective	http://purl.obolibrary.org/obo/OBI_0200166	data transformation objective		genomic selection objective is a data transformation objective which is a special case of marker-assisted selection in which genetic markers covering the whole genome are used so that all quantitative trait loci (QTL) are in linkage disequilibrium with at least one marker.
http://purl.obolibrary.org/obo/STATO_0000343	genotype data set	http://purl.obolibrary.org/obo/IAO_0000100	data set		a dataset which is made up of genotypic information, that is presenting allele information at specific loci in a set of individuals of an organism.
http://purl.obolibrary.org/obo/STATO_0000407	Fagan nomogram	http://purl.obolibrary.org/obo/IAO_0000038	graph		Fagan nomogram is a graph plotting pre-test probabilities, likelyhood ratios and post-test probabilities on 3 parallel axis. The plot was first proposed by Fagan in 1975 as a way to visualize Baye's Theorem data where
P(D) is the probability that the patient has the disease before the test. P(D|T) is the probability that the patient has the disease after the test result. P(T|D) is the probability of the test result if the patient has the disease, and P(T|D̄) is the probability of the test result if the patient does not have the disease. With this terminology the usefulness of both positive and negative test results can be assessed. A line drawn from P(D) on the right  through the ratio of P(T|D) to P(T|D̄)  gives P(D|T) on the left of the nomogram.
http://purl.obolibrary.org/obo/STATO_0000408	2 step Fagan nomogram	http://purl.obolibrary.org/obo/IAO_0000038	graph		Two-Step Fagan Nomogram, which adds two extra axis between the LR axis that represents sensibility and specificity to calculate negative and positive likelihood ratios in the same nomogram
http://purl.obolibrary.org/obo/STATO_0000409	likelihood ratio	http://purl.obolibrary.org/obo/STATO_0000622	ratio-based measure of association		the likelihood ratio is a ratio which is formed by dividing the post-test odds with the pre-test odds in the context of a Bayesian formulation
http://purl.obolibrary.org/obo/STATO_0000412	prevalence	http://purl.obolibrary.org/obo/STATO_0000607	proportion		A proportion in which the numerator represents all events of interest (for example, both new and preexisting cases of a disease) in the population, which is represented by the denominator.
http://purl.obolibrary.org/obo/STATO_0000414	mortality	http://purl.obolibrary.org/obo/STATO_0000184	ratio		mortality is a ratio  formed by the number of deaths due to a disease divided by the total population size.
http://purl.obolibrary.org/obo/STATO_0000415	accuracy	http://purl.obolibrary.org/obo/STATO_0000639	percentage		in the context of binary classification, accuracy is defined as the proportion of true results (both true positives and true negatives) to the total number of cases examined (the sum of true positive, true negative, false positive and false negative).

It can be understood as a measure of the proximity of measurement results to the true value.

Accuracy is a metric used in the context of classification tasks to evaluate the proportion of correctly predicted instances among the total instances.

Key Points:

Use Case: Classification performance evaluation.
Metric: Measures the proportion of correct predictions.
Interpretation: Higher values indicate better classification performance.
http://purl.obolibrary.org/obo/STATO_0000417	pretest probability	http://purl.obolibrary.org/obo/STATO_0000184	ratio		The probability of a patient having the target disorder before a diagnostic test result is known
http://purl.obolibrary.org/obo/STATO_0000418	measure of heterogeneity	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion		a measure of heterogeneity in meta-analysis  is a data item which aims to describe the variation in study outcomes between studies.
http://purl.obolibrary.org/obo/STATO_0000419	Cochran's Q statistic	http://purl.obolibrary.org/obo/STATO_0000418	measure of heterogeneity		The Cochran's Q  statistic is a measure of heterogeneity accros study computed by summing the squared deviations of each study's estimate from the overall meta-analytic estimate, weighting each study's contribution in the same manner as in the meta-analysis.
http://purl.obolibrary.org/obo/STATO_0000421	Tau squared	http://purl.obolibrary.org/obo/STATO_0000418	measure of heterogeneity		Tau-squared is an estimate of the between-study variance in a random-effects meta-analysis. The square root of this number (i.e. tau) is the estimated standard deviation of underlying effects across studies.
http://purl.obolibrary.org/obo/STATO_0000422	L'Abbe plot	http://purl.obolibrary.org/obo/IAO_0000038	graph		The L’Abbé plot was introduced in 1987 in the context of meta-analyses of clinical trials with dichotomous (binary) outcomes, as a plot of observed risks in the treatment group against observed risks in the control group.
Another formulation is that it plots the event rate in the experimental (intervention) group against the event rate in the control group, as an aid to exploring the heterogeneity of effect estimates within a meta-analysis.
It is diagram used in meta-analysis that compares the risks observed in the experimental and control arms of clinical trials. Each trial is located in the space of a diagram where the sizes of the circles indicate the sizes of the trials. Trials in which the experimental treatment had a higher risk than the control will be in the upper left of the plot. If risk in the both groups is the same the circle will fall on the line of equality. If the control treatment has a higher risk than the experimental treatment then the point will be in the lower right of the plot. It is often used as an indicator of heterogeneity and hence as an indicator of the likelihood that results from different trials can be validly combined. Named after Kristin L'Abbé.
http://purl.obolibrary.org/obo/STATO_0000423	observed risk	http://purl.obolibrary.org/obo/STATO_0000620	risk		the proportion of individuals in a population with the outcome of interest
http://purl.obolibrary.org/obo/STATO_0000425	Sidik-Jonkman estimator	http://purl.obolibrary.org/obo/STATO_0000471	estimate		Sidik-Jonkman estimator is a data item computed to estimate heterogeneity parameter (estimate of between-study variance) in a random effect model for meta analysis.
http://purl.obolibrary.org/obo/STATO_0000426	Hunter-Schmidt estimator	http://purl.obolibrary.org/obo/STATO_0000471	estimate		Hunter-Schmidt estimator is a data item computed to estimate heterogeneity parameter (estimate of between-study variance) in a random effect model for meta analysis.
http://purl.obolibrary.org/obo/STATO_0000428	maximum likelihood estimation	http://purl.obolibrary.org/obo/STATO_0000119	model parameter estimation		maximum likelihood estimation (MLE) is a method of estimating the parameters of a statistical model, given observations. MLE attempts to find the parameter values that maximize the likelihood function, given the observations.

The method of maximum likelihood is based on the likelihood function, {\displaystyle {\mathcal {L}}(\theta \,;x)} {\displaystyle {\mathcal {L}}(\theta \,;x)}. We are given a statistical model, i.e. a family of distributions {\displaystyle \{f(\cdot \,;\theta )\mid \theta \in \Theta \}} {\displaystyle \{f(\cdot \,;\theta )\mid \theta \in \Theta \}}, where {\displaystyle \theta } \theta  denotes the (possibly multi-dimensional) parameter for the model. The method of maximum likelihood finds the values of the model parameter, {\displaystyle \theta } \theta , that maximize the likelihood function, {\displaystyle {\mathcal {L}}(\theta \,;x)} {\displaystyle {\mathcal {L}}(\theta \,;x)}. I
http://purl.obolibrary.org/obo/STATO_0000429	DerSimonian-Laird estimator	http://purl.obolibrary.org/obo/STATO_0000471	estimate		DerSimonian-Laird estimator s a data item computed to estimate heterogeneity parameter (estimate of between-study variance) in a random effect model for meta analysis.  The estimator is used in simple noniterative procedure for characterizing the distribution of treatment effects in a series of studies
http://purl.obolibrary.org/obo/STATO_0000431	meta analysis by DerSimonian and Leard method	http://purl.obolibrary.org/obo/STATO_0000155	meta analysis		a meta analysis which relies on the computation of the DerSimonian and Leard estimator as a measure of heterogeneity over a set of studies.
http://purl.obolibrary.org/obo/STATO_0000432	meta analysis by Hunter-Schmidt method	http://purl.obolibrary.org/obo/STATO_0000155	meta analysis		a meta analysis which relies on the computation of the Hunter and Schmidt estimator as a measure of heterogeneity over a set of studies by considering the weighted mean of the raw correlation coefficient. Hunter and Schmidt developed what is commonly termed validity generalization procedures (Schmidt and Hunter, 1977). These involve correcting the effect sizes in the meta-analysis for sampling, and measurement error
and range restriction.
http://purl.obolibrary.org/obo/STATO_0000434	Cochran's q test for heterogeneity	http://purl.obolibrary.org/obo/STATO_0000027	test of association between categorical variables		Cochran's Q test is a statistical test used for unreplicated randomized block design experiments with a binary response variable and paired data.
In the analysis of two-way randomized block designs where the response variable can take only two possible outcomes (coded as 0 and 1), Cochran's Q test is a non-parametric statistical test to verify whether k treatments have identical effects.
http://purl.obolibrary.org/obo/STATO_0000435	probability distribution scale parameter	http://purl.obolibrary.org/obo/STATO_0000028	measure of dispersion		a probability distribution scale parameter is a measure of variation which is set by the operator when selecting a parametric probability distribution and which defines how spread the distribution is. The larger  the value of the scale parameter is, the more spread out the distribution.
http://purl.obolibrary.org/obo/STATO_0000436	probability distribution shape parameter	http://purl.obolibrary.org/obo/IAO_0000027	data item		a probability distribution shape parameter is a data item which is set by the operator when selecting a parametric probability distribution and which dictates the way the profile but not the location or size of the distribution plot looks like.
http://purl.obolibrary.org/obo/STATO_0000437	scale estimator	http://purl.obolibrary.org/obo/STATO_0000471	estimate		a scale estimator is a measurement datum (a statistic) which is calculated to approach the actual scale parameter of a probability distribution from observed data.
http://purl.obolibrary.org/obo/STATO_0000439	outlier detection testing objective	http://purl.obolibrary.org/obo/OBI_0000675	statistical hypothesis test objective		outlier detection testing objective is a statistical objective of a data transformation which aims to test a null hypothesis that an observation is not an outlier.
http://purl.obolibrary.org/obo/STATO_0000444	split-plot design	http://purl.obolibrary.org/obo/STATO_0000586	restricted randomized design		a split-plot design is kind of factorial design  which is used when running a full factorial completely randomized design is inpractical, either for cost or practicalities (e.g. equipment, fields), in other words, when a restricted randomization has to be applied. A split-plot design is used whenever practioners fix the level of 'hard to change factor' and run all the combinations of the other factors. The hard to change factor is also refered to as the 'whole plot' factor, while the remainders of the factors are refered to as 'split plot factor'.
Performing a split-plot design therefore means fixing one factor level, and then applying the treatments formed by the cartesian products of the levels for the other factors. A mininum of 2 factors are required and one being applied before the other(s).
http://purl.obolibrary.org/obo/STATO_0000445	split split plot design	http://purl.obolibrary.org/obo/STATO_0000444	split-plot design		a split split plot design is a study design where restricted randomization affect 2 study factors (and not 1 as in split-plot design). Such design is only possible if at least 3 independent variables are present.
http://purl.obolibrary.org/obo/STATO_0000446	restricted randomization	http://purl.obolibrary.org/obo/OBI_0302900	group randomization		Restricted randomization is a kind of randomization which is used or occured when hard to change factors exist in a study design. In other words, when complete randomization is not possible, a case of restricted randomization exists, for instance in the case of split-plot design.

Restricted randomization allows intuitively poor allocations of treatments to experimental units to be avoided, while retaining the theoretical benefits of randomization.

Restricted randomization can also result from an unplanned event and is then something that should be avoided. RandomizeR R package can be used to detect such events and assess the quality of randomization process.
http://purl.obolibrary.org/obo/STATO_0000447	whole plot number	http://purl.obolibrary.org/obo/IAO_0000027	data item		a 'whole plot number' is a data item used to count and identify the actual piece of land (in the case of real field based trials) used in a split plot design experiment and receiving treatments corresponding to the levels of a factor whose randomization is restricted (these factors are known as 'hard to change' factors).
In the case of non-field based trials, the 'whole plot' is a metaphor.
http://purl.obolibrary.org/obo/STATO_0000448	sub-plot number	http://purl.obolibrary.org/obo/IAO_0000027	data item		a 'sub plot number' is a data item used to count and identify the actual piece of land located within a 'whole plot', in the case of real field based trials using a split-plot design, and received  completely randomized treatments corresponding to the factor levels combinations of the remainder factors declared in the experiment.

in the case of 'split-split plot design',  sub-plots also receive treatments corresponding to a factor whose randomization is restriction. In such configuration, each 'sub-plot' is itself divided into 'sub sub-plot', which then received the remainder of the treatments in completely randomized fashion.
 
In the case of non-field based trials, the notion 'sub-plot' is a metaphor.
http://purl.obolibrary.org/obo/STATO_0000449	sub sub-plot number	http://purl.obolibrary.org/obo/IAO_0000027	data item		a 'sub sub-plot number' is a data item used to count and identify the actual piece of land located within a 'sub plot', in the case of real field based trials using a split-split-plot design, and received  completely randomized treatments corresponding to the factor levels combinations of the remainder factors declared in the experiment.

in the case of 'split-split plot design',  sub-plots also receive treatments corresponding to a factor whose randomization is restriction. In such configuration, each 'sub-plot' is itself divided into 'sub sub-plot', which then received the remainder of the treatments in completely randomized fashion.
 
In the case of non-field based trials, the notion 'sub sub-plot' is a metaphor.
http://purl.obolibrary.org/obo/STATO_0000450	Wilk's Lambda	http://purl.obolibrary.org/obo/STATO_0000039	statistic		"Wilks'  lambda  is  a  test  statistic  used  in  multivariate  analysis  of  variance (MANOVA)  to  test  whether  there  are  differences  between  the  means  of identified  groups  of  subjects  on  a  combination  of  dependent  variables."
http://purl.obolibrary.org/obo/STATO_0000451	Pillai's trace	http://purl.obolibrary.org/obo/STATO_0000039	statistic		Pillai trace is a statistic used for the following three tests during a multivariate analysis of variance: (a) equality of mean vectors of lp‐variate normal distributions with the common but unknown covariance matrix, (b) independence between two sets of variates distributed jointly as a normal distribution with unknown mean vector, and (c) equality of covariance matrices of two p‐variate normal distributions with unknown mean vectors.
http://purl.obolibrary.org/obo/STATO_0000452	Hotelling-Lawley trace	http://purl.obolibrary.org/obo/STATO_0000039	statistic		"The Lawley–Hotelling trace is used to test the equality of mean vectors of k p‐variate normal distributions with common but unknown covariance matrix. The explicit form of the null distribution of T$_{0}^{2}$equation image is the F distribution. The asymptotic null distribution is the chi‐square distribution. The power function of the test is described and its power is compared with the likelihood ratio test. "
http://purl.obolibrary.org/obo/STATO_0000453	Roy’s Maximum Root	http://purl.obolibrary.org/obo/STATO_0000039	statistic		Roy's maximum root is a statistic used in multivariate analysis of variance to find the maximum characteristic root or eigenvalue statistic for testing equality of k p-variate normal distributions with same covariance matrix, independence between two sets of variables jointly distributed as a normal distribution, equality of covariance matrices of two p-variate normal distributions, whether the covariance matrix of a p-variabte normal distribution with unknown mean vector equals a specified matrix.
http://purl.obolibrary.org/obo/STATO_0000454	multivariate analysis of variance	http://purl.obolibrary.org/obo/OBI_0200201	ANOVA		"The multivariate analysis of variance, or MANOVA, is a procedure for comparing multivariate sample means. As a multivariate procedure, it is used when there are two or more dependent variables, and is typically followed by significance tests involving individual dependent variables separately. 

It helps to answer:
1. Do changes in the independent variable(s) have significant effects on the dependent variables?
2. What are the relationships among the dependent variables?
3. What are the relationships among the independent variables?"
http://purl.obolibrary.org/obo/STATO_0000457	mean difference	http://purl.obolibrary.org/obo/STATO_0000614	absolute difference		The mean difference, or difference in means, measures the absolute difference between the mean value in two different groups.

The mean difference is a subtraction of one mean from another.

The primary use of this term is in analyzing between-group differences
http://purl.obolibrary.org/obo/STATO_0000459	group sequential design	http://purl.obolibrary.org/obo/OBI_0500000	study design		group sequential design is a study design used in clinical trial settings in which interim analyses of the data are conducted after groups of patients are recruited. After each interim analysis, the trial may stop early if the evidence so far shows the new treatment is particularly effective or ineffective. Such designs are ethical and cost-effective, and so are of great interest in practice.
http://purl.obolibrary.org/obo/STATO_0000460	interim analysis	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		interim analysis is a data transformation used to analyzed studies implementing a group-sequential design, to evaluate and  interpret the accumulating information during a clinical trial. It means that the analysis of data that is conducted before full data collection has been completed. Clinical trials are unusual in that enrollment of patients is a continual process staggered in time. This means that if a treatment is particularly beneficial or harmful compared to the concurrent placebo group while the study is on-going, the investigators are ethically obliged to assess that difference using the data at hand and to make a deliberate consideration of terminating the study earlier than planned.
http://purl.obolibrary.org/obo/STATO_0000461	O'brien-Flemming boundary analysis	http://purl.obolibrary.org/obo/STATO_0000460	interim analysis		the O'brien-Flemming boundary analysis is a kind of interim-analysis method implemented by O'brien and Flemming to account for the 


As all frequentist methods of the same type, it focuses on controlling the type I error rate as the repeated hypothesis testing of accumulating data increases the type I error rate of a clinical trial.
http://purl.obolibrary.org/obo/STATO_0000462	Pocock boundary analysis	http://purl.obolibrary.org/obo/STATO_0000460	interim analysis		The Pocock boundary analysis gives a p-value threshold for each interim analysis which guides the data monitoring committee on whether to stop the trial. The boundary used depends on the number of interim analyses.
The Pocock boundary is simple to use in that the p-value threshold is the same at each interim analysis. The disadvantages are that the number of interim analyses must be fixed at the start and it is not possible under this scheme to add analyses after the trial has started. Another disadvantage is that investigators and readers frequently do not understand how the p-values are reported: for example, if there are five interim analyses planned, but the trial is stopped after the third interim analysis because the p-value was 0.01, then the overall p-value for the trial is still reported as <0.05 and not as 0.01.


As all frequentist methods of the same type, it focuses on controlling the type I error rate as the repeated hypothesis testing of accumulating data increases the type I error rate of a clinical trial.
http://purl.obolibrary.org/obo/STATO_0000463	Haybittle-Peto boundary analysis	http://purl.obolibrary.org/obo/STATO_0000460	interim analysis		The Haybittle–Peto boundary analysis is an interim analysis where a rule for deciding when to stop a clinical trial prematurely is defined. It is named for John Haybittle and Richard Peto.
The Haybittle–Peto boundary is one such stopping rule, and it states that if an interim analysis shows a probability of equal to, or less than 0.001 that a difference as extreme or more between the treatments is found, given that the null hypothesis is true, then the trial should be stopped early. The final analysis is still evaluated at the normal level of significance (usually 0.05).[3][4] The main advantage of the Haybittle–Peto boundary is that the same threshold is used at every interim analysis, unlike the O'Brien–Fleming boundary, which changes at every analysis. Also, using the Haybittle–Peto boundary means that the final analysis is performed using a 0.05 level of significance as normal, which makes it easier for investigators and readers to understand. The main argument against the Haybittle–Peto boundary is that some investigators believe that the Haybittle–Peto boundary is too conservative and makes it too difficult to stop a trial.

As all frequentist methods of the same type, it focuses on controlling the type I error rate as the repeated hypothesis testing of accumulating data increases the type I error rate of a clinical trial.
http://purl.obolibrary.org/obo/STATO_0000465	empirical measure	http://purl.obolibrary.org/obo/IAO_0000027	data item		An empirical measure is a random measure arising from a particular realization of a (usually finite) sequence of random variables.
http://purl.obolibrary.org/obo/STATO_0000466	model term	http://purl.obolibrary.org/obo/IAO_0000027	data item		A model term is a data item set in statistical model formula to apportion source of variation.
http://purl.obolibrary.org/obo/STATO_0000469	model interaction effect term	http://purl.obolibrary.org/obo/STATO_0000466	model term		a model interaction effect term is a model term which accounts for variation explained by the combined effects of the factor levels of more than one (usually 2) independent variables.
http://purl.obolibrary.org/obo/STATO_0000470	model error term	http://purl.obolibrary.org/obo/STATO_0000466	model term		a model error term is a model term which accounts for  residual variation not explained by the other components (fixed and random effect terms)
http://purl.obolibrary.org/obo/STATO_0000471	estimate	http://purl.obolibrary.org/obo/STATO_0000039	statistic		a estimate is a data item which is computed from a dataset to provide an approximated value (an estimator) for a 'statistical parameter'  (a 'characteristics/parameter' of the true underlying distribution) of a real population.
http://purl.obolibrary.org/obo/STATO_0000473	Kenward-Roger degree of freedom approximation	http://purl.obolibrary.org/obo/STATO_0000472	degree of freedom approximation		The Kenward-Roger method's fundamental idea is to calculate the approximate mean and variance of their statistic and then match moments with an F distribution to obtain the denominator degrees of freedom.
http://purl.obolibrary.org/obo/STATO_0000484	Bayes B	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		Bayes B is a data transformation used in the context of estimating breeding value, which relies on a Bayesian model, treats the prior probability π that a SNP has zero effect to a set value (i.e π >0) and uses a mixture distribution.
http://purl.obolibrary.org/obo/STATO_0000487	additive genetic inheritance model	http://purl.obolibrary.org/obo/STATO_0000534	genetic inheritance model		an additive genetic model is a data item which refers to the contributions to the final phenotype from more than one gene, or from alleles of a single gene (in heterozygotes), that combine in such a way that the sum of their effects in unison is equal to the sum of their effects individually.
http://purl.obolibrary.org/obo/STATO_0000488	additive dominant genetic inheritance model	http://purl.obolibrary.org/obo/STATO_0000534	genetic inheritance model		an additive genetic model is a data item which refer to the contributions to the final phenotype from more than one gene, or from alleles of a single gene (in heterozygotes), that combine in such a way that the sum of their effects in unison is equal to the sum of their individual effects and their dominance effect (of alleles at a single locus).
http://purl.obolibrary.org/obo/STATO_0000489	additive dominant genetic and epistatic inheritance model	http://purl.obolibrary.org/obo/STATO_0000534	genetic inheritance model		an additive genetic model is a data item which refer to the contributions to the final phenotype from more than one gene, or from alleles of a single gene (in heterozygotes), that combine in such a way that the sum of their effects in unison is equal to the sum of their individual effect, their additive dominant (effect (of alleles at a single locus) and their epistasic effect (of alleles at more different loci).
http://purl.obolibrary.org/obo/STATO_0000490	Dunn’s multiple comparison test	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test		Dunn’s Multiple Comparison Test is a post hoc (i.e. it’s run after an ANOVA) non parametric test (a “distribution free” test that doesn’t assume your data comes from a particular distribution). It is one of the least powerful of the multiple comparisons tests and can be a very conservative test–especially for larger numbers of comparisons.  The Dunn is an alternative to the Tukey test when you only want to test for differences in a small subset of all possible pairs; For larger numbers of pairwise comparisons, use Tukey’s instead. Use Dunn’s when you choose to test a specific number of comparisons before you run the ANOVA and when you are not comparing to controls. If you are comparing to a control group, use the Dunnett test instead.
http://purl.obolibrary.org/obo/STATO_0000491	Conover-Iman test of multiple comparisons using rank sums	http://purl.obolibrary.org/obo/STATO_0000279	between group comparison statistical test		Conover-Iman test for stochastic dominance is a stastical test for multiple group comparisons and reports the results among multiple pairwise comparisons after a Kruskal-Wallis test for stochastic dominance among k groups (Kruskal and Wallis, 1952). The interpretation of stochastic dominance requires an assumption that the CDF of one group does not cross the CDF of the other. 
The null hypothesis for each pairwise comparison is that the probability of observing a randomly selected value from the first group that is larger than a randomly selected value from the second group equals one half; this null hypothesis corresponds to that of the Wilcoxon-Mann-Whitney rank-sum test.
Like the rank-sum test, if the data can be assumed to be continuous, and the distributions are assumed identical except for a difference in location, Conover-Iman test may be understood as a test for median difference. conover.test accounts for tied ranks.
The Conover-Iman test is strictly valid if and only if the corresponding Kruskal-Wallis null hypothesis is rejected.
http://purl.obolibrary.org/obo/STATO_0000493	genotype matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix		a genotype matrix is a kind of genomic relationship matrix in the rawest of form and which simply corresponds to a matrix of individuals genotype for a given set of markers or genomic positions. Columns are snps or markers, Rows are individuals. Each column/row cell contains a genotype expressed as, in the genome is diploid, as a pair of characters chosen from ATGC where the dominant variant is uppercased and the recessive variant is lower cased.
http://purl.obolibrary.org/obo/STATO_0000494	MAF matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix		the MAF matrix is a genomic relationship matrix which is obtained from the genotype matrix by counting the number of minor alleles at each locus
http://purl.obolibrary.org/obo/STATO_0000495	M matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix		the M matrix is a genomic relationship matrix which is obtained by subtracting 1 to every value of the MAF matrix (gene content matrix). The values of the M matrix are only -1, 0 or 1 and makes computation easier.
M = MAF-1
http://purl.obolibrary.org/obo/STATO_0000496	P matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix		P matrix is a kind of genomic relationship matrix which contains allele frequencies expressed as a difference from 0.5 and multiplied by 2.
http://purl.obolibrary.org/obo/STATO_0000497	Z matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix		the Z-matrix is a genomic relationship matrix which is obtained by substracted the M matrix with the P matrix. It is also known as the incidence matrix for the markers.
http://purl.obolibrary.org/obo/STATO_0000498	numerator degrees of freedom	http://purl.obolibrary.org/obo/STATO_0000069	number of degrees of freedom		The degree of freedom numerator is the number of degrees of freedom that the estimate of variance used in the numerator is based on. It is one of the parameters for the F-distribution used to compute probabilities in analysis of variance.
http://purl.obolibrary.org/obo/STATO_0000499	augmented experimental design	http://purl.obolibrary.org/obo/OBI_0500000	study design		augmented design is a kind of experimental design where the goal is to compare existing (control) treatments with new treatments that have an experimental constraint of "limited replication".  To understand limited replication, consider about experiments that may only allow a single representation of the new treatment, this limitation may be many times due to the cost associated  with the experiment, limited resources, or limited number of new units that can be used in the experiment. In contrast, the existing treatments are referred as checks and are generally replicated multiple times.  With augmented design one can estimate the following:

a) Differences between checks and new treatments,
b) Differences among new treatments,
c) Differences among check treatments, and
d) Differences among new and check treatments combined.
http://purl.obolibrary.org/obo/STATO_0000500	probability distribution location parameter	http://purl.obolibrary.org/obo/IAO_0000027	data item		a probability distribution location parameter is a data item which is set by the operator when selecting a parametric probability distribution and which dictates the way the location but not the profile or size of the distribution plot looks like.
http://purl.obolibrary.org/obo/STATO_0000503	simple random sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		Simple random sampling is a statistical sampling process which creates a sample of a size n entirely by chance. In such process, each unit has the same probability of being selected. Depending on the size of the population being sampled, the sampling process may be done with or without replacement
http://purl.obolibrary.org/obo/STATO_0000505	quadrat sampling	http://purl.obolibrary.org/obo/STATO_0000509	systematic sampling		Quadrat sampling is a classic tool for the study of ecology, especially biodiversity. In general, a series of squares (quadrats) of a set size are placed in a habitat of interest and the species within those quadrats are identified and recorded. Passive quadrat sampling (done without removing the organisms found within the quadrat) can be either done by hand, with researchers carefully sorting through each individual quadrat or, more efficiently, can be done by taking a photograph of the quadrat for future analysis.
http://purl.obolibrary.org/obo/STATO_0000506	cluster sampling	http://purl.obolibrary.org/obo/STATO_0000509	systematic sampling		Cluster sampling is a sampling plan used when mutually homogeneous yet internally heterogeneous groupings are evident in a statistical population.
http://purl.obolibrary.org/obo/STATO_0000507	probability-proportional-to-size sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		Probability proportional to size ('PPS') sampling is a sampling method in which the selection probability for each element is set to be proportional to its size measure, up to a maximum of 1. In a simple PPS design, these selection probabilities can then be used as the basis for Poisson sampling. However, this has the drawback of variable sample size, and different portions of the population may still be over- or under-represented due to chance variation in selections.
http://purl.obolibrary.org/obo/STATO_0000508	stratified sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		stratified sampling is a statistical sampling method which divides the population into homogenous subpopulations, which are then sampled using random or systematic sampling methods
http://purl.obolibrary.org/obo/STATO_0000509	systematic sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		systematic sampling is a process for collecting samples and assembling a statistical sample using a system or method (.e.g unequal probabilities, without replacement, fixed sample size), as opposed to a random sampling.
http://purl.obolibrary.org/obo/STATO_0000510	quota sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		Quota sampling is a method for selecting survey participants that is a non-probabilistic version of stratified sampling. 

In quota sampling, a population is first segmented into mutually exclusive sub-groups, just as in stratified sampling. Then judgment is used to select the subjects or units from each segment based on a specified proportion. For example, an interviewer may be told to sample 200 females and 300 males between the age of 45 and 60. This means that individuals can put a demand on who they want to sample (targeting).
http://purl.obolibrary.org/obo/STATO_0000511	panel sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		Panel sampling is the method of first selecting a group of participants through a random sampling method and then asking that group for (potentially the same) information several times over a period of time. Therefore, each participant is interviewed at two or more time points; each period of data collection is called a "wave". The method was developed by sociologist Paul Lazarsfeld in 1938 as a means of studying political campaigns.
http://purl.obolibrary.org/obo/STATO_0000512	chain-referral sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		Snowball sampling (or chain sampling, chain-referral sampling, referral sampling) is a non-probability sampling technique where existing study subjects recruit future subjects from among their acquaintances. Thus the sample group is said to grow like a rolling snowball.
http://purl.obolibrary.org/obo/STATO_0000513	voluntary sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		The voluntary sampling method is a type of non-probability sampling. A voluntary sample is made up of people who self-select into the survey. Often, these subjects have a strong interest in the main topic of the survey. Volunteers may be invited through advertisements on Social Media Sites
http://purl.obolibrary.org/obo/STATO_0000514	convenience sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		Convenience sampling (also known as grab sampling, accidental sampling, or opportunity sampling) is a type of non-probability sampling that involves the sample being drawn from that part of the population that is close to hand. This type of sampling is most useful for pilot testing.
http://purl.obolibrary.org/obo/STATO_0000515	Brewer sampling	http://purl.obolibrary.org/obo/STATO_0000509	systematic sampling		Brewer's sampling is a statistical sampling method which was proposed by Brewer in 1975 and uses unequal probabibility sampling technique
http://purl.obolibrary.org/obo/STATO_0000516	minimax sampling	http://purl.obolibrary.org/obo/STATO_0000502	statistical sampling		In imbalanced datasets, where the sampling ratio does not follow the population statistics, one can resample the dataset in a conservative manner called minimax sampling. The minimax sampling has its origin in Anderson minimax ratio whose value is proved to be 0.5: in a binary classification, the class-sample sizes should be chosen equally. This ratio can be proved to be minimax ratio only under the assumption of LDA classifier with Gaussian distributions. The notion of minimax sampling is recently developed for a general class of classification rules, called class-wise smart classifiers.
http://purl.obolibrary.org/obo/STATO_0000517	complete randomization	http://purl.obolibrary.org/obo/OBI_0302900	group randomization		complete randomization is a group randomization where experimental units are randomly assigned to the entire set of groups defined by the experimental treatments.
http://purl.obolibrary.org/obo/STATO_0000518	data imputation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		Data imputation is a data transformation process whereby missing data is replaced with an estimated value for the missing element. The substituted values are intended to create a data record that does not fail edits. Various methods may be used to produce these substituted values.
http://purl.obolibrary.org/obo/STATO_0000519	last observation carried forward data imputation	http://purl.obolibrary.org/obo/STATO_0000518	data imputation		last observation carried forward data imputation is a type of data imputation which uses a very simple, self explanatory method for substituted a missing value for an observation. It should be noted that this method gives a biased estimate of the treatment effect and underestimates the variability of the estimated result and should be used cautiously.
http://purl.obolibrary.org/obo/STATO_0000520	regression data imputation	http://purl.obolibrary.org/obo/STATO_0000518	data imputation		regression data imputation is a type of data imputation where missing values are replaced with the value of a regression function coefficient.
http://purl.obolibrary.org/obo/STATO_0000521	substitution by the mean data imputation	http://purl.obolibrary.org/obo/STATO_0000518	data imputation		substitution by the mean data imputation is a type of data imputation where missing values are replaced with the value the variable mean.
http://purl.obolibrary.org/obo/STATO_0000522	multivariate imputation with chained equations	http://purl.obolibrary.org/obo/STATO_0000518	data imputation		multivariate imputation with chained equations (MICE) is a type of data imputation which uses an algorithm devised by Stef van Buuren and Karin Groothuis-Oudshoorn
http://purl.obolibrary.org/obo/STATO_0000523	k-nearest neighbour data imputation	http://purl.obolibrary.org/obo/STATO_0000518	data imputation		k-nearest neighbour imputation is a data imputation which uses the k-nearest neighbour algorithm to compute a substitution value for the missing values. For every observation to be imputed, it identifies ‘k’ closest observations based on the euclidean distance and computes the weighted average (weighted based on distance) of these ‘k’ obs.
http://purl.obolibrary.org/obo/STATO_0000525	covariance matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix		a covariance matrix is a square matrix that contains the variances and covariances associated with several variables. The diagonal elements of the matrix contain the variances of the variables and the off-diagonal elements contain the covariances between all possible pairs of variables.
http://purl.obolibrary.org/obo/STATO_0000527	denominator degrees of freedom	http://purl.obolibrary.org/obo/STATO_0000069	number of degrees of freedom		The degree of freedom denominator is the number of degrees of freedom that the estimate of variance used in the denominator is based on. It is one of the parameters for the F-distribution used to compute probabilities in analysis of variance.
http://purl.obolibrary.org/obo/STATO_0000528	G matrix	http://purl.obolibrary.org/obo/STATO_0000642	matrix		A matrix of relationships among a group of individuals, which can be used to predict breeding values, to manage inbreeding and in genetic conservation. It can be calculated from the pedigree, but it is also possible to calculate the relationship matrix from genotypes at genetic markers such as single-nucleotide polymorphisms (SNPs). Elements of the genomic relationship matrix are estimates of the realized proportion of the genome that two individuals share, whereas the pedigree-derived relationship matrix is the expectation of this proportion.
http://purl.obolibrary.org/obo/STATO_0000529	scaled t distribution	http://purl.obolibrary.org/obo/STATO_0000059	Student's t distribution		a scaled t distribution is a kind of Student's t distribution which is shifted by 'mean' and scaled by standard deviation 'sd'.
http://purl.obolibrary.org/obo/STATO_0000530	Bayesian model	http://purl.obolibrary.org/obo/STATO_0000107	statistical model		a Bayesian model is a statistical model  where inference is based on using Bayes theorem to obtain a posterior distribution for a quantity (or quantities) of interest for some model (such as parameter values) based on some prior distribution for the relevant unknown parameters and the likelihood from the model.
http://purl.obolibrary.org/obo/STATO_0000531	prior probability distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		a prior probability distribution is a probability distribution used as input to a Bayesian model to represent a priori knowledge about a model parameter. Along with the acquired/observed data, it is used to compute a posterior distribution according to the Bayes theorem.
http://purl.obolibrary.org/obo/STATO_0000532	posterior probability distribution	http://purl.obolibrary.org/obo/STATO_0000225	probability distribution		a posterior probability distribution is a probability distribution computed in a Bayesian model approach given a prior distribution and a set of events/observations.
http://purl.obolibrary.org/obo/STATO_0000533	Bayes C pi	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		Bayes C pi is a data transformation used to compute estimated breeding values using a Bayesian model and which assesses the SNP effect using MonteCarlo Markov chain methods. Bayes C pi treats the prior probability π that a SNP has zero effect as unknown.
The method was devised to address short comings of Bayes A and Bayes B approaches
http://purl.obolibrary.org/obo/STATO_0000534	genetic inheritance model	http://purl.obolibrary.org/obo/STATO_0000107	statistical model		genetic inheritance model is a data item defining the assumption used by a breeding value estimation method to consider when running the calculations.
http://purl.obolibrary.org/obo/STATO_0000535	sampling from a probability distribution	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		sampling from a probability distribution is a data transformation which aims at obtaining a sequence of random samples from a probability distribution for which direct sampling is difficult.
http://purl.obolibrary.org/obo/STATO_0000536	Gibbs sampling	http://purl.obolibrary.org/obo/STATO_0000535	sampling from a probability distribution		Gibbs sampling or a Gibbs sampler is a Markov chain Monte Carlo (MCMC) algorithm for obtaining a sequence of observations which are approximated from a specified multivariate probability distribution, when direct sampling is difficult.
http://purl.obolibrary.org/obo/STATO_0000537	Metropolis–Hastings sampling	http://purl.obolibrary.org/obo/STATO_0000535	sampling from a probability distribution		the Metropolis–Hastings algorithm is a Markov chain Monte Carlo (MCMC) method for obtaining a sequence of random samples from a probability distribution for which direct sampling is difficult.
http://purl.obolibrary.org/obo/STATO_0000538	continuous multivariate probability distribution	http://purl.obolibrary.org/obo/STATO_0000067	continuous probability distribution		a continuous multivariate probability distribution is a continuous probability distribution which describes the possible values, and corresponding probabilities, of two or more (usually three or more) associated random variables.
http://purl.obolibrary.org/obo/STATO_0000539	discrete multivariate probability distribution	http://purl.obolibrary.org/obo/STATO_0000117	discrete probability distribution		a discrete multivariate probability distribution is a discrete probability distribution which describes the possible values, and corresponding probabilities, of two or more (usually three or more) associated random variables.
http://purl.obolibrary.org/obo/STATO_0000540	reproducing kernel Hilbert space procedure	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		A data transformation that produces a reproducing kernel Hilbert space (or RKHS), which is a Hilbert space of functions in which point evaluation is a continuous linear functional.
http://purl.obolibrary.org/obo/STATO_0000541	state space model	http://purl.obolibrary.org/obo/STATO_0000107	statistical model		a state space model is a kind of statistical model which describes the probabilistic dependence between the latent state variable and the observed measurement. The state or the measurement can be either continuous or discrete. The term “state space” originated in 1960s in the area of control engineering (Kalman, 1960). SSM provides a general framework for analyzing deterministic and stochastic dynamical systems that are measured or observed through a stochastic process.
http://purl.obolibrary.org/obo/STATO_0000542	genomic estimated breeding value	http://purl.obolibrary.org/obo/STATO_0000485	estimated breeding value		genomic estimated breeding value (GEBV) is an estimated breeding value derived from information in an organism DNA (genotype). GEBV is calculated differently to conventional Estimated Breeding Values using advanced modeling technique to deal with high dimensionality data.
http://purl.obolibrary.org/obo/STATO_0000544	heritability	http://purl.obolibrary.org/obo/STATO_0000184	ratio		In a planned experiment where to covariance (genotype  x environment) can be controlled and held at 0, the heritability is defined as the ratio of the variance of the genotypic variables to the variance of the phenotypic variables.

H2 = Var(G)/Var(P)

H2 is the broad-sense heritability. This reflects all the genetic contributions to a population's phenotypic variance including additive, dominant, and epistatic (multi-genic interactions), as well as maternal and paternal effects, where individuals are directly affected by their parents' phenotype, for example, milk production in mammals.
http://purl.obolibrary.org/obo/STATO_0000545	narrow sense heritability	http://purl.obolibrary.org/obo/STATO_0000184	ratio		A particularly important component of the genetic variance is the additive variance, Var(A), which is the variance due to the average effects (additive effects) of the alleles. Since each parent passes a single allele per locus to each offspring, parent-offspring resemblance depends upon the average effect of single alleles. Additive variance represents, therefore, the genetic component of variance responsible for parent-offspring resemblance. The additive genetic portion of the phenotypic variance is known as Narrow-sense heritability and is defined as:

 h2 = Var(A)/Var(P)
http://purl.obolibrary.org/obo/STATO_0000546	Bayes R	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		Bayes R is a data transformation used in the context of estimating breeding value, which relies on a Bayesian model to compute 'genomic estimated breeding values'. In contrast to Bayes B methods, the new method assumes that the true SNP effects are derived from a series of normal distributions, the first with zero variance, up to one with a variance of approximately 1% of the genetic variance.
http://purl.obolibrary.org/obo/STATO_0000547	Laplace probability distribution	http://purl.obolibrary.org/obo/STATO_0000329	symmetric distribution		The double exponential distribution (a.k.a. Laplace distribution) is the distribution of differences between two independent variates with identical exponential distributions (Abramowitz and Stegun 1972, p. 930).
http://purl.obolibrary.org/obo/STATO_0000548	sampling distribution estimation by bootstrapping	http://purl.obolibrary.org/obo/STATO_0000535	sampling from a probability distribution		Bootstrapping is the practice of estimating properties of an estimator (such as its variance) by measuring those properties when sampling from an approximating distribution.
http://purl.obolibrary.org/obo/STATO_0000549	random forest procedure	http://purl.obolibrary.org/obo/OBI_0200175	class discovery data transformation		random forest procedure is a type of data transformation used in classification and statistical learning using regression. The random forest procedure is a meta estimator that fits a number of classifying decision trees on various sub-samples of the dataset (it operates by constructing a multitude of decision trees at training time) and use averaging to improve the predictive accuracy and control over-fitting. The sub-sample size is always the same as the original input sample size but the samples are drawn with replacement if bootstrap=True (default). The random forest procedure outputs the class that is the mode of the classes (classification) or mean prediction (regression) of the individual trees.
http://purl.obolibrary.org/obo/STATO_0000550	log likelihood	http://purl.obolibrary.org/obo/IAO_0000027	data item		log likelihood is a data item which corresponds to the natural logarithm of the likelihood.
log likelihood is a data item commonly used to provide a measure of accuracy of a model.
http://purl.obolibrary.org/obo/STATO_0000551	Holm false discovery rate correction	http://purl.obolibrary.org/obo/OBI_0200163	false discovery rate correction method		A data transformation process in which the Holm p-value procedure is applied with the aim of correcting false discovery rate
http://purl.obolibrary.org/obo/STATO_0000552	Hommel false discovery rate correction	http://purl.obolibrary.org/obo/OBI_0200163	false discovery rate correction method		A data transformation process in which the Hommel p-value procedure is applied with the aim of correcting false discovery rate
http://purl.obolibrary.org/obo/STATO_0000558	statistical model term testing	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		A statistical model term testing is a data transformation that accounts for the evaluation of a component of a statistical model or model term.
http://purl.obolibrary.org/obo/STATO_0000559	Wald test	http://purl.obolibrary.org/obo/OBI_0200200	chi square test		the Wald test is statistical test which computes a Wald chi-squared test for 1 or more coefficients, given their variance-covariance matrix.
The Wald test (also called the Wald Chi-Squared Test) is a way to find out if explanatory variables in a model are significant. “Significant” means that they add something to the model; variables that add nothing can be deleted without affecting the model in any meaningful way
http://purl.obolibrary.org/obo/STATO_0000561	confidence level	http://purl.obolibrary.org/obo/STATO_0000300	dimensionless ratio		the frequency (i.e., the proportion) of possible confidence intervals that contain the true value of their corresponding parameter. In other words, if confidence intervals are constructed using a given confidence level in an infinite number of independent experiments, the proportion of those intervals that contain the true value of the parameter will match the confidence level.
A probability measure of the reliability of an inferential statistical test that has been applied to sample data and which is provided along with the confidence interval for the output statistic.
http://purl.obolibrary.org/obo/STATO_0000562	standard error of estimate	http://purl.obolibrary.org/obo/STATO_0000471	estimate		The standard error is a measure of dispersion applied to estimates across hypothetical repeated random samples.

It is a measure of how precise an estimate of a statistical parameter is.

The standard error is the estimated standard deviation of an estimate.

It measures the uncertainty associated with the estimate. Compared with the standard deviations of the underlying distribution, which are usually unknown, standard errors can be calculated from observed data.
http://purl.obolibrary.org/obo/STATO_0000563	biplot	http://purl.obolibrary.org/obo/IAO_0000038	graph		Biplots are a type of exploratory graph used in statistics, a generalization of the simple two-variable scatterplot. A biplot is constructed by using the singular value decomposition (SVD) to obtain a low-rank approximation to a transformed version of the data matrix X, whose n rows are the samples (also called the cases, or objects), and whose p columns are the variables. The biplot was introduced by K. Ruben Gabriel (1971).
http://purl.obolibrary.org/obo/STATO_0000566	eigen value	http://purl.obolibrary.org/obo/OBI_0000648	clustered data set		An eigenvalue is a data item resulting from a data transformation known as eigen value decomposition. It also corresponds to a process of matrix diagonalization or any equivalent operation, ie. transforming the underlying system of equations into a special set of coordinate axes in which the matrix takes this canonical form. Each eigenvalue is paired with a corresponding so-called eigenvector.
http://purl.obolibrary.org/obo/STATO_0000567	factor analysis	http://purl.obolibrary.org/obo/OBI_0200050	dimensionality reduction		Factor analysis is a dimension reduction data transformation that is used to describe variability among observed, correlated variables in terms of a potentially lower number of unobserved variables called factors. Factor analysis is related to principal component analysis (PCA), but the two are not identical. Both PCA and factor analysis aim to reduce the dimensionality of a set of data, but the approaches taken to do so are different for the two techniques. Factor analysis is clearly designed with the objective to identify certain unobservable factors from the observed variables, whereas PCA does not directly address this objective; at best, PCA provides an approximation to the required factors.
http://purl.obolibrary.org/obo/STATO_0000568	loadings	http://purl.obolibrary.org/obo/STATO_0000034	model parameter		In factor analysis, factor loadings express the relationship of each variable to the underlying factor.
http://purl.obolibrary.org/obo/STATO_0000569	score	http://purl.obolibrary.org/obo/STATO_0000039	statistic		The score indicates how sensitive a likelihood function L(\Theta,X) is to its parameter \Theta. Explicitly, the score for \Theta is the gradient of the log-likelihood with respect to \Theta.
http://purl.obolibrary.org/obo/STATO_0000570	selectivity ratio	http://purl.obolibrary.org/obo/STATO_0000184	ratio		The selectivity ratio (SR) is defined as the ratio of explained vexpl,i to residual variance vres,i for
the variable i on the  target projection (TP) component in the context of Partial Least Squares Analysis.
http://purl.obolibrary.org/obo/STATO_0000573	average value	http://purl.obolibrary.org/obo/STATO_0000029	measure of central tendency		The arithmetic mean is defined as the sum of the numerical values of each and every observation divided by the total number of observations. S The arithmetic mean A is defined by the formula

A=sum[Ai] / n where i ranges from 1 to n and Ai represents the value of individual observations.

The arithmetic mean is significantly affected by extreme values and outliers. A better measure of central tendency is the median (http://purl.obolibrary.org/obo/STATO_0000574).
http://purl.obolibrary.org/obo/STATO_0000576	SIMPLS	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		A novel algorithm for partial least squares (PLS) regression, SIMPLS, is proposed which calculates the PLS factors directly as linear combinations of the original variables. The PLS factors are determined such as to maximize a covariance criterion, while obeying certain orthogonality and normalization restrictions. This approach follows that of other traditional multivariate methods. The construction of deflated data matrices as in the nonlinear iterative partial least squares (NIPALS)-PLS algorithm is avoided. For univariate y SIMPLS is equivalent to PLS1 and closely related to existing bidiagonalization algorithms. This follows from an analysis of PLS1 regression in terms of Krylov sequences. For multivariate Y there is a slight difference between the SIMPLS approach and NIPALS-PLS2. In practice the SIMPLS algorithm appears to be fast and easy to interpret as it does not involve a breakdown of the data sets.

The acronym SIMPLS comes from 'straightforward implementation of a statistically inspired modification of the PLS method'
http://purl.obolibrary.org/obo/STATO_0000577	PLS1	http://purl.obolibrary.org/obo/STATO_0000571	Partial Least Square regression		a partial least square regression applied when there is only one variable in Y (the matrix of response variables), or it is desirable to model and optimize separately the performance of each of the variables in Y. This case is usually referred to as PLS1 regression (J = 1).
http://purl.obolibrary.org/obo/STATO_0000578	PLS2	http://purl.obolibrary.org/obo/STATO_0000571	Partial Least Square regression		a partial least square regression applied to a multivariate response variable.
http://purl.obolibrary.org/obo/STATO_0000579	improved Kernel PLS	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		improved kernel PLS is a data transformation which implement a very fast kernel algorithm for updating PLS models in a recursive manner and for exponentially discounting past data.
http://purl.obolibrary.org/obo/STATO_0000581	singular value decomposition	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		a data transformation which compute the singular-value decomposition of a rectangular matrix.
The singular-value decomposition is very general in the sense that it can be applied to any m × n matrix whereas eigenvalue decomposition can only be applied to certain classes of square matrices.
http://purl.obolibrary.org/obo/STATO_0000582	best linear unbiased estimator	http://purl.obolibrary.org/obo/STATO_0000337	breeding value estimation		best linear unbiased estimator
http://purl.obolibrary.org/obo/STATO_0000584	Wald statistic	http://purl.obolibrary.org/obo/STATO_0000039	statistic		the Wald statistic is a statistic is used during a Wald test, a test of significance of the regression coefficient; it is based on the asymptotic normality property of maximum likelihood estimates, and is computed as:

W = b * 1/Var(b) * b

In this formula, b stands for the parameter estimates, and Var(b) stands for the asymptotic variance of the parameter estimates. The Wald statistic is tested against the Chi-square distribution in the Wald test.
http://purl.obolibrary.org/obo/STATO_0000585	degree of freedom calculation	http://purl.obolibrary.org/obo/OBI_0200000	data transformation		degree of freedom calculation is a data transformation which is part of a stastical test and which aims to determine or estimate the number of degrees of freedom in a system.
http://purl.obolibrary.org/obo/STATO_0000586	restricted randomized design	http://purl.obolibrary.org/obo/STATO_0000046	block design		a restricted randomized design is a kind of study design which uses randomization to allocate observation unit to treatment but where intuitively poor allocations of treatments to experimental units are avoided, while retaining the theoretical benefits of randomization. This is often the case when so-called 'hard to change' factors are used in an experimental design.
http://purl.obolibrary.org/obo/STATO_0000587	percentage of variance	http://purl.obolibrary.org/obo/STATO_0000639	percentage		the percentage of variance is an output of principal component analysis (PCRA), which is obtained by forming the ratio of an eigen-value divided by the sum of all eigen-values. This produces a "percentage of variance" for each eigen-vector.
http://purl.obolibrary.org/obo/STATO_0000668	absolute value	http://purl.obolibrary.org/obo/STATO_0000039	statistic		A statistic that represents the distance of a value from zero.
http://purl.obolibrary.org/obo/STATO_0000672	event rate	http://purl.obolibrary.org/obo/STATO_0000645	rate		The number of occurrences per unit of time.
http://purl.obolibrary.org/obo/STATO_0000282	F-statistic	http://purl.obolibrary.org/obo/STATO_0000039	statistic		F statistic is a statistic computed from observations and used to produce a p-value in statistical test when compared to a F distribution. the F statistic is the ratio of two scaled sums of squares reflecting different sources of variability
http://purl.obolibrary.org/obo/STATO_0000666	maximum possible value	http://purl.obolibrary.org/obo/STATO_0000039	statistic		A statistic that represents the largest value that could occur.
http://purl.obolibrary.org/obo/STATO_0000667	minimum possible value	http://purl.obolibrary.org/obo/STATO_0000039	statistic		A statistic that represents the smallest value that could occur.
http://purl.obolibrary.org/obo/STATO_0000673	event frequency rate	http://purl.obolibrary.org/obo/STATO_0000645	rate		A ratio in which the numerator represents an event frequency and the denominator represents an interval of time.
http://purl.obolibrary.org/obo/STATO_0000674	frequency	http://purl.obolibrary.org/obo/STATO_0000184	ratio		A ratio in which the numerator represents a count and the denominator represents a count (without involving an interval of time).
http://purl.obolibrary.org/obo/STATO_0000675	density	http://purl.obolibrary.org/obo/STATO_0000184	ratio		A ratio in which the numerator represents any quantity and the denominator represents an interval of space (distance, area, or volume).
http://purl.obolibrary.org/obo/STATO_0000677	hazard ratio	http://purl.obolibrary.org/obo/STATO_0000184	ratio		A measure of association that is the ratio of the hazard rate of an event in one group to the hazard rate of the same event in another group.
http://purl.obolibrary.org/obo/STATO_0000678	prevalence ratio	http://purl.obolibrary.org/obo/STATO_0000622	ratio-based measure of association		A measure of association that is the ratio of two prevalences.
http://purl.obolibrary.org/obo/STATO_0000684	standard deviation for sample	http://purl.obolibrary.org/obo/STATO_0000237	standard deviation		A standard deviation that is the square root of the quotient of the summation across data points of the square of the distance from each data point to the sample mean, and the degrees of freedom (where the degrees of freedom is sample size minus one).
http://purl.obolibrary.org/obo/STATO_0000642	matrix	http://purl.obolibrary.org/obo/IAO_0000030	information content entity		a matrix is a rectangular array of numbers, which are called entries of the matrix.
http://purl.obolibrary.org/obo/NCBITaxon_9606	Homo sapiens	http://purl.obolibrary.org/obo/NCBITaxon_314146	Euarchontoglires	man	
http://purl.obolibrary.org/obo/NCBITaxon_2	Bacteria	http://purl.obolibrary.org/obo/OBI_0100026	organism	eubacteria	
http://purl.obolibrary.org/obo/NCBITaxon_117571	Euteleostomi	http://purl.obolibrary.org/obo/NCBITaxon_7742	Vertebrata <Metazoa>	bony vertebrates	
http://purl.obolibrary.org/obo/NCBITaxon_2759	Eukaryota	http://purl.obolibrary.org/obo/OBI_0100026	organism	eukaryotes	
http://purl.obolibrary.org/obo/NCBITaxon_32523	Tetrapoda	http://purl.obolibrary.org/obo/NCBITaxon_117571	Euteleostomi	tetrapods	
http://purl.obolibrary.org/obo/NCBITaxon_32524	Amniota	http://purl.obolibrary.org/obo/NCBITaxon_32523	Tetrapoda	amniotes	
http://purl.obolibrary.org/obo/NCBITaxon_40674	Mammalia	http://purl.obolibrary.org/obo/NCBITaxon_32524	Amniota	mammals	
http://purl.obolibrary.org/obo/NCBITaxon_7742	Vertebrata <Metazoa>	http://purl.obolibrary.org/obo/NCBITaxon_33213	Bilateria	vertebrates	
http://purl.obolibrary.org/obo/BFO_0000141	immaterial entity	http://purl.obolibrary.org/obo/BFO_0000004	independent continuant		
http://purl.obolibrary.org/obo/BFO_0000009	two-dimensional spatial region	http://purl.obolibrary.org/obo/BFO_0000006	spatial region		
http://purl.obolibrary.org/obo/BFO_0000018	zero-dimensional spatial region	http://purl.obolibrary.org/obo/BFO_0000006	spatial region		
http://purl.obolibrary.org/obo/BFO_0000026	one-dimensional spatial region	http://purl.obolibrary.org/obo/BFO_0000006	spatial region		
http://purl.obolibrary.org/obo/BFO_0000028	three-dimensional spatial region	http://purl.obolibrary.org/obo/BFO_0000006	spatial region		
http://purl.obolibrary.org/obo/BFO_0000034	function	http://purl.obolibrary.org/obo/BFO_0000016	disposition		
http://purl.obolibrary.org/obo/BFO_0000016	disposition	http://purl.obolibrary.org/obo/BFO_0000017	realizable entity		
http://purl.obolibrary.org/obo/BFO_0000019	quality	http://purl.obolibrary.org/obo/BFO_0000020	specifically dependent continuant		
http://purl.obolibrary.org/obo/BFO_0000006	spatial region	http://purl.obolibrary.org/obo/BFO_0000141	immaterial entity		
http://purl.obolibrary.org/obo/STATO_0000741	statistical model component	http://purl.obolibrary.org/obo/IAO_0000027	data item		
http://purl.obolibrary.org/obo/IAO_0000582	time stamped measurement datum	http://purl.obolibrary.org/obo/IAO_0000109	measurement datum		
http://purl.obolibrary.org/obo/NCBITaxon_10239	Viruses	http://purl.obolibrary.org/obo/OBI_0100026	organism		
http://purl.obolibrary.org/obo/NCBITaxon_2157	Archaea	http://purl.obolibrary.org/obo/OBI_0100026	organism		
http://purl.obolibrary.org/obo/NCBITaxon_33154	Opisthokonta	http://purl.obolibrary.org/obo/NCBITaxon_2759	Eukaryota		
http://purl.obolibrary.org/obo/NCBITaxon_33213	Bilateria	http://purl.obolibrary.org/obo/NCBITaxon_33154	Opisthokonta		
http://purl.obolibrary.org/obo/NCBITaxon_314146	Euarchontoglires	http://purl.obolibrary.org/obo/NCBITaxon_40674	Mammalia		
http://purl.obolibrary.org/obo/BFO_0000001	entity				
http://purl.obolibrary.org/obo/RO_0000079	function of	http://purl.obolibrary.org/obo/RO_0000052	inheres in	is function of	a relation between a function and an independent continuant (the bearer), in which the function specifically depends on the bearer for its existence
http://purl.obolibrary.org/obo/RO_0000080	quality of	http://purl.obolibrary.org/obo/RO_0000052	inheres in	quality_of	a relation between a quality and an independent continuant (the bearer), in which the quality specifically depends on the bearer for its existence
http://purl.obolibrary.org/obo/RO_0000085	has function	http://purl.obolibrary.org/obo/RO_0000053	bearer of	has_function	a relation between an independent continuant (the bearer) and a function, in which the function specifically depends on the bearer for its existence
http://purl.obolibrary.org/obo/RO_0000086	has quality	http://purl.obolibrary.org/obo/RO_0000053	bearer of	has_quality	a relation between an independent continuant (the bearer) and a quality, in which the quality specifically depends on the bearer for its existence
http://purl.obolibrary.org/obo/RO_0000087	has role	http://purl.obolibrary.org/obo/RO_0000053	bearer of	has_role	a relation between an independent continuant (the bearer) and a role, in which the role specifically depends on the bearer for its existence
http://purl.obolibrary.org/obo/RO_0000081	role of	http://purl.obolibrary.org/obo/RO_0000052	inheres in	role_of	a relation between a role and an independent continuant (the bearer), in which the role specifically depends on the bearer for its existence
http://purl.obolibrary.org/obo/IAO_0000219	denotes	http://purl.obolibrary.org/obo/IAO_0000136	is about		denotes is a primitive, instance-level, relation obtaining between an information content entity and some portion of reality. Denotation is what happens when someone creates an information content entity E in order to specifically refer to something. The only relation between E and the thing is that E can be used to 'pick out' the thing. This relation connects those two together. Freedictionary.com sense 3: To signify directly; refer to specifically
http://purl.obolibrary.org/obo/IAO_0000221	is quality measurement of	http://purl.obolibrary.org/obo/IAO_0000136	is about		m is a quality measurement of q at t. When q is a quality, there is a measurement process p that has specified output m, a measurement datum, that is about q
http://purl.obolibrary.org/obo/IAO_0000413	is duration of	http://purl.obolibrary.org/obo/IAO_0000136	is about		relates a process to a time-measurement-datum that represents the duration of the process
http://purl.obolibrary.org/obo/IAO_0000581	has time stamp	http://purl.obolibrary.org/obo/BFO_0000051	has part		relates a time stamped measurement datum to the time measurement datum that denotes the time when the measurement was taken
http://purl.obolibrary.org/obo/IAO_0000583	has measurement datum	http://purl.obolibrary.org/obo/BFO_0000051	has part		relates a time stamped measurement datum to the measurement datum that was measured
http://purl.obolibrary.org/obo/OBI_0000293	has_specified_input	http://purl.obolibrary.org/obo/RO_0000057	has participant		A relation between a planned process and a continuant participating in that process that is not created during  the process. The presence of the continuant during the process is explicitly specified in the plan specification which the process realizes the concretization of.
http://purl.obolibrary.org/obo/OBI_0000295	is_specified_input_of	http://purl.obolibrary.org/obo/RO_0000056	participates in		The inverse of the relation between a planned process and a continuant participating in that process that is not created during  the process. The presence of the continuant during the process is explicitly specified in the plan specification which the process realizes the concretization of.
http://purl.obolibrary.org/obo/OBI_0000299	has_specified_output	http://purl.obolibrary.org/obo/RO_0000057	has participant		A relation between a planned process and a continuant participating in that process. The presence of the continuant at the end of the process is explicitly specified in the objective specification which the process realizes the concretization of.
http://purl.obolibrary.org/obo/OBI_0000312	is_specified_output_of	http://purl.obolibrary.org/obo/RO_0000056	participates in		A relation between a continuant participating in a planned process. The presence of the continuant at the end of the process is explicitly specified in the objective specification which the process realizes the concretization of.
http://purl.obolibrary.org/obo/OBI_0000643	has grain	http://purl.obolibrary.org/obo/BFO_0000051	has part		the relation of the cells in the finger of the skin to the finger, in which an indeterminate number of grains are parts of the whole by virtue of being grains in a collective that is part of the whole, and in which removing one granular part does not nec- essarily damage or diminish the whole. Ontological Whether there is a fixed, or nearly fixed number of parts - e.g. fingers of the hand, chambers of the heart, or wheels of a car - such that there can be a notion of a single one being missing, or whether, by contrast, the number of parts is indeterminate - e.g., cells in the skin of the hand, red cells in blood, or rubber molecules in the tread of the tire of the wheel of the car.
http://purl.obolibrary.org/obo/OBI_0000846	is member of organization	http://purl.obolibrary.org/obo/STATO_0000114	is member of		Relating a legal person or organization to an organization in the case where the legal person or organization has a role as member of the organization.
http://purl.obolibrary.org/obo/OBI_0001688	has organization member	http://purl.obolibrary.org/obo/STATO_0000115	has member		Relating an organization to a legal person or organization.
http://purl.obolibrary.org/obo/OBI_0001927	specifies value of	http://purl.obolibrary.org/obo/IAO_0000136	is about		A relation between a value specification and an entity which the specification is about.
http://purl.obolibrary.org/obo/OBI_0001937	has specified numeric value	http://purl.obolibrary.org/obo/OBI_0002135	has specified value		A relation between a value specification and a number that quantifies it.
http://purl.obolibrary.org/obo/OBI_0001938	has value specification	http://purl.obolibrary.org/obo/BFO_0000051	has part		A relation between an information content entity and a value specification that specifies its value.
http://purl.obolibrary.org/obo/STATO_0000001	declares	http://purl.obolibrary.org/obo/IAO_0000136	is about		property to indicate that a design declares a variable; the inverse property is 'is declared by'
http://purl.obolibrary.org/obo/STATO_0000344	has effect on	http://purl.obolibrary.org/obo/IAO_0000136	is about		'has effect on' is a special case of the 'is about' relationship to be used for mixed effect models
http://purl.obolibrary.org/obo/STATO_0000345	has fixed effect on	http://purl.obolibrary.org/obo/STATO_0000344	has effect on		'has fixed effect on' is a special case of the 'is about' relationship to be used with fixed effect models
http://purl.obolibrary.org/obo/STATO_0000380	has interaction effect on	http://purl.obolibrary.org/obo/STATO_0000344	has effect on		'has interaction effect on' is a special case of the 'is about' relationship to be used for mixed effect models
http://purl.obolibrary.org/obo/STATO_0000381	has random effect on	http://purl.obolibrary.org/obo/STATO_0000344	has effect on		'has random effect on' is a special case of the 'is about' relationship to be used for random effect models
http://purl.obolibrary.org/obo/STATO_0000382	has order in sequence	http://purl.obolibrary.org/obo/IAO_0000136	is about		'has order in sequence' is a special case of the 'is about' relation being used to be able to enumerate the different terms within a linear mixed model formula (thus assinging and order to random effect terms, fixed effect terms, interaction effect terms and error terms).
http://purl.obolibrary.org/obo/STATO_0000476	is modeled by	http://purl.obolibrary.org/obo/IAO_0000136	is about		This property is a specialization of the 'is about' property to represent the relationship between a dependent variable and an statistical model.
http://purl.obolibrary.org/obo/IAO_0000039	has measurement unit label	http://purl.obolibrary.org/obo/BFO_0000051	has part		
http://purl.obolibrary.org/obo/IAO_0000407	has coordinate unit label	http://purl.obolibrary.org/obo/BFO_0000051	has part		
http://purl.obolibrary.org/obo/BFO_0000063	precedes	http://purl.obolibrary.org/obo/RO_0002222	temporal relation		
http://purl.obolibrary.org/obo/RO_0000052	inheres in			inheres_in	a relation between a specifically dependent continuant (the dependent) and an independent continuant (the bearer), in which the dependent specifically depends on the bearer for its existence
http://purl.obolibrary.org/obo/RO_0000053	bearer of			is bearer of	a relation between an independent continuant (the bearer) and a specifically dependent continuant (the dependent), in which the dependent specifically depends on the bearer for its existence
http://purl.obolibrary.org/obo/RO_0000056	participates in			participates_in	a relation between a continuant and a process, in which the continuant is somehow involved in the process
http://purl.obolibrary.org/obo/RO_0000057	has participant			has_participant	a relation between a process and a continuant, in which the continuant is somehow involved in the process
http://purl.obolibrary.org/obo/RO_0001000	derives from			derives_from	a relation between two distinct material entities, the new entity and the old entity, in which the new entity begins to exist when the old entity ceases to exist, and the new entity inherits the significant portion of the matter of the old entity
http://purl.obolibrary.org/obo/RO_0001015	location of			location_of	a relation between two independent continuants, the location and the target, in which the target is entirely within the location
http://purl.obolibrary.org/obo/RO_0001025	located in			located_in	a relation between two independent continuants, the target and the location, in which the target is entirely within the location
http://purl.obolibrary.org/obo/BFO_0000051	has part			has_part	a core relation that holds between a whole and its part
http://purl.obolibrary.org/obo/BFO_0000050	part of			part_of	a core relation that holds between a part and its whole
http://purl.obolibrary.org/obo/OBI_0002135	has specified value				A relation between a value specification and a literal.
http://purl.obolibrary.org/obo/IAO_0000136	is about				is_about is a (currently) primitive relation that relates an information artifact to an entity.
http://purl.obolibrary.org/obo/IAO_0000417	is quality measured as				inverse of the relation of is quality measurement of
http://purl.obolibrary.org/obo/OBI_0000124	is_supported_by_data				The relation between a data item and a conclusion where the conclusion is the output of a data interpreting process and the data item is used as an input to that process
http://purl.obolibrary.org/obo/OBI_0000314	is_proxy_for				A relation between continuant instances c1 and c2 where within an experiment/ protocol application,  measurement of c1 is used to determine what a measurement of c2 would be.
http://purl.obolibrary.org/obo/OBI_0000417	achieves_planned_objective				This relation obtains between a planned process and a objective specification when the criteria specified in the objective specification are met at the end of the planned process.
http://purl.obolibrary.org/obo/OBI_0000833	objective_achieved_by				This relation obtains between a a objective specification and a planned process when the criteria specified in the objective specification are met at the end of the planned process.
http://purl.obolibrary.org/obo/RO_0000058	is concretized as				A relationship between a generically dependent continuant and a specifically dependent continuant, in which the generically dependent continuant depends on some independent continuant in virtue of the fact that the specifically dependent continuant also depends on that same independent continuant. A generically dependent continuant may be concretized as multiple specifically dependent continuants.
http://purl.obolibrary.org/obo/RO_0000059	concretizes				A relationship between a specifically dependent continuant and a generically dependent continuant, in which the generically dependent continuant depends on some independent continuant in virtue of the fact that the specifically dependent continuant also depends on that same independent continuant. Multiple specifically dependent continuants can concretize the same generically dependent continuant.
http://purl.obolibrary.org/obo/STATO_0000101	has numerator				the relationship between a fraction and the number above the line
http://purl.obolibrary.org/obo/STATO_0000129	has value				A relationship (data property) between an entity and its value.
http://purl.obolibrary.org/obo/STATO_0000259	has denominator				the relationship between a fraction and the number below the line (or divisor)
http://purl.obolibrary.org/obo/STATO_0000004	is declared by				property to indicate the variables declared by a design; the inverse property is 'declares'
http://purl.obolibrary.org/obo/STATO_0000102	executes				relationship between a planned process and the plan specification that it carries out; it is defined as equivalent to the composed relationship (realizes o concretizes)
http://purl.obolibrary.org/obo/STATO_0000110	assumes values specified by				A relation between an entity and a value specification, where the value specification is about the entity.
http://purl.obolibrary.org/obo/STATO_0000114	is member of				relationship between an element and a set it belongs to
http://purl.obolibrary.org/obo/STATO_0000115	has member				relationship between a set and one of its elements
http://purl.obolibrary.org/obo/STATO_0000205	is denoted by				Inverse relation of 'denotes', where denotation is what happens when someone creates an information content entity E in order to specifically refer to something (from 'denotes' definition).
http://purl.obolibrary.org/obo/STATO_0000403	estimate of				Relationship between a parameter of a model and the estimate produced by estimation process as used in statistical modeling.
http://purl.obolibrary.org/obo/STATO_0000341	is model for				This property is the inverse relationship of 'is model by' and represents the relationship between a dependent variable and an statistical model.
http://purl.obolibrary.org/obo/STATO_0000557	computed from				computed_from is a relation between 2 information content entity denoting how one is derived from another on through the application of a data transformation or computation process.
http://purl.obolibrary.org/obo/IAO_0000235	denoted by				inverse of the relation 'denotes'
http://purl.obolibrary.org/obo/BFO_0000054	realized in			realized_in	
http://purl.obolibrary.org/obo/BFO_0000062	preceded by			preceded_by	
http://purl.obolibrary.org/obo/BFO_0000055	realizes				
http://purl.obolibrary.org/obo/IAO_0000004	has measurement value				
http://purl.obolibrary.org/obo/IAO_0000404	has x coordinate value				
http://purl.obolibrary.org/obo/IAO_0000406	has y coordinate value				
http://purl.obolibrary.org/obo/RO_0002222	temporal relation