TY - JOUR

T1 - Statistics for outcomes research

AU - Etzioni, David

AU - Liu, Jerome

AU - Ko, Clifford Y.

PY - 2003/3/1

Y1 - 2003/3/1

N2 - Statistics is the language of data and numbers. The appropriate use of statistical methods ensures accurate interpretation of research results. This article is intended to orient the reader to major concepts underlying the selection and application of univariate and multivariate statistical techniques. Data can be categorized as being qualitative or quantitative. Both types of data should first be analyzed with descriptive statistics in order to understand the underlying distribution of values. Qualitative data is analyzed with tests examining the frequency of observations (i.e. Chi-square test). Quantitative data should be further classified as being parametric (normally distributed) or non-parametric (non-normally distributed). This classification is important when selecting a test to compare two or more groups of observations. The t-test is usually used when data is normally distributed, whereas other tests (i.e. Mann-Whitney, Wilcoxon) are more appropriate with non-normal distributions. Multivariate statistics such as linear regression or logistic regression are able to separate the effects of multiple variables on a single outcome of interest. Linear regression is used for continuous outcomes such as height, weight, or age. Logistic regression deals with binary (yes/no) outcome variables such as whether or not a complication occurred. This brief review is meant to advance the reader's knowledge of basic statistical methods and warn of potenial pitfalls in data analyses.

AB - Statistics is the language of data and numbers. The appropriate use of statistical methods ensures accurate interpretation of research results. This article is intended to orient the reader to major concepts underlying the selection and application of univariate and multivariate statistical techniques. Data can be categorized as being qualitative or quantitative. Both types of data should first be analyzed with descriptive statistics in order to understand the underlying distribution of values. Qualitative data is analyzed with tests examining the frequency of observations (i.e. Chi-square test). Quantitative data should be further classified as being parametric (normally distributed) or non-parametric (non-normally distributed). This classification is important when selecting a test to compare two or more groups of observations. The t-test is usually used when data is normally distributed, whereas other tests (i.e. Mann-Whitney, Wilcoxon) are more appropriate with non-normal distributions. Multivariate statistics such as linear regression or logistic regression are able to separate the effects of multiple variables on a single outcome of interest. Linear regression is used for continuous outcomes such as height, weight, or age. Logistic regression deals with binary (yes/no) outcome variables such as whether or not a complication occurred. This brief review is meant to advance the reader's knowledge of basic statistical methods and warn of potenial pitfalls in data analyses.

UR - http://www.scopus.com/inward/record.url?scp=0037346052&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0037346052&partnerID=8YFLogxK

U2 - 10.1053/scrs.2003.127413

DO - 10.1053/scrs.2003.127413

M3 - Short survey

AN - SCOPUS:0037346052

SN - 1043-1489

VL - 14

SP - 3

EP - 11

JO - Seminars in Colon and Rectal Surgery

JF - Seminars in Colon and Rectal Surgery

IS - 1

ER -