TY - JOUR
T1 - Statistics for outcomes research
AU - Etzioni, David
AU - Liu, Jerome
AU - Ko, Clifford Y.
PY - 2003/3/1
Y1 - 2003/3/1
N2 - Statistics is the language of data and numbers. The appropriate use of statistical methods ensures accurate interpretation of research results. This article is intended to orient the reader to major concepts underlying the selection and application of univariate and multivariate statistical techniques. Data can be categorized as being qualitative or quantitative. Both types of data should first be analyzed with descriptive statistics in order to understand the underlying distribution of values. Qualitative data is analyzed with tests examining the frequency of observations (e.g., the chi-square test). Quantitative data should be further classified as being parametric (normally distributed) or non-parametric (non-normally distributed). This classification is important when selecting a test to compare two or more groups of observations. The t-test is usually used when data is normally distributed, whereas other tests (e.g., Mann-Whitney, Wilcoxon) are more appropriate with non-normal distributions. Multivariate statistics such as linear regression or logistic regression are able to separate the effects of multiple variables on a single outcome of interest. Linear regression is used for continuous outcomes such as height, weight, or age. Logistic regression deals with binary (yes/no) outcome variables such as whether or not a complication occurred. This brief review is meant to advance the reader's knowledge of basic statistical methods and warn of potential pitfalls in data analyses.
AB - Statistics is the language of data and numbers. The appropriate use of statistical methods ensures accurate interpretation of research results. This article is intended to orient the reader to major concepts underlying the selection and application of univariate and multivariate statistical techniques. Data can be categorized as being qualitative or quantitative. Both types of data should first be analyzed with descriptive statistics in order to understand the underlying distribution of values. Qualitative data is analyzed with tests examining the frequency of observations (e.g., the chi-square test). Quantitative data should be further classified as being parametric (normally distributed) or non-parametric (non-normally distributed). This classification is important when selecting a test to compare two or more groups of observations. The t-test is usually used when data is normally distributed, whereas other tests (e.g., Mann-Whitney, Wilcoxon) are more appropriate with non-normal distributions. Multivariate statistics such as linear regression or logistic regression are able to separate the effects of multiple variables on a single outcome of interest. Linear regression is used for continuous outcomes such as height, weight, or age. Logistic regression deals with binary (yes/no) outcome variables such as whether or not a complication occurred. This brief review is meant to advance the reader's knowledge of basic statistical methods and warn of potential pitfalls in data analyses.
UR - http://www.scopus.com/inward/record.url?scp=0037346052&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0037346052&partnerID=8YFLogxK
U2 - 10.1053/scrs.2003.127413
DO - 10.1053/scrs.2003.127413
M3 - Short survey
AN - SCOPUS:0037346052
SN - 1043-1489
VL - 14
SP - 3
EP - 11
JO - Seminars in Colon and Rectal Surgery
JF - Seminars in Colon and Rectal Surgery
IS - 1
ER -