Estimating the variance of estimated trends in proportions when there is no unique subject identifier

William K. Mountford; Stuart R. Lipsitz; Garrett M. Fitzmaurice; Rickey E. Carter; Jeremy B. Soule; John A. Colwell; Daniel T. Lackland

doi:10.1111/j.1467-985X.2006.00453.x


Quantitative Health Sciences

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

Longitudinal population-based surveys are widely used in the health sciences to study patterns of change over time. In many of these data sets unique patient identifiers are not publicly available, making it impossible to link the repeated measures from the same individual directly. This poses a statistical challenge for making inferences about time trends because repeated measures from the same individual are likely to be positively correlated, i.e., although the time trend that is estimated under the naïve assumption of independence is unbiased, an unbiased estimate of the variance cannot be obtained without knowledge of the subject identifiers linking repeated measures over time. We propose a simple method for obtaining a conservative estimate of variability for making inferences about trends in proportions over time, ensuring that the type I error is no greater than the specified level. The method proposed is illustrated by using longitudinal data on diabetes hospitalization proportions in South Carolina.

Original language	English (US)
Pages (from-to)	185-193
Number of pages	9
Journal	Journal of the Royal Statistical Society. Series A: Statistics in Society
Volume	170
Issue number	1
DOIs	https://doi.org/10.1111/j.1467-985X.2006.00453.x
State	Published - Jan 2007

Keywords

Generalized estimating equations
Longitudinal data
Maximal correlation
Type I error

ASJC Scopus subject areas

Statistics and Probability
Social Sciences (miscellaneous)
Economics and Econometrics
Statistics, Probability and Uncertainty


Cite this

@article{1ce5673b1bdd46d6be2b272952551ca4,
title = "Estimating the variance of estimated trends in proportions when there is no unique subject identifier",
abstract = "Longitudinal population-based surveys are widely used in the health sciences to study patterns of change over time. In many of these data sets unique patient identifiers are not publicly available, making it impossible to link the repeated measures from the same individual directly. This poses a statistical challenge for making inferences about time trends because repeated measures from the same individual are likely to be positively correlated, i.e., although the time trend that is estimated under the na{\"i}ve assumption of independence is unbiased, an unbiased estimate of the variance cannot be obtained without knowledge of the subject identifiers linking repeated measures over time. We propose a simple method for obtaining a conservative estimate of variability for making inferences about trends in proportions over time, ensuring that the type I error is no greater than the specified level. The method proposed is illustrated by using longitudinal data on diabetes hospitalization proportions in South Carolina.",
keywords = "Generalized estimating equations, Longitudinal data, Maximal correlation, Type I error",
author = "Mountford, {William K.} and Lipsitz, {Stuart R.} and Fitzmaurice, {Garrett M.} and Carter, {Rickey E.} and Soule, {Jeremy B.} and Colwell, {John A.} and Lackland, {Daniel T.}",
year = "2007",
month = jan,
doi = "10.1111/j.1467-985X.2006.00453.x",
language = "English (US)",
volume = "170",
pages = "185--193",
journal = "Journal of the Royal Statistical Society. Series A: Statistics in Society",
issn = "0964-1998",
publisher = "Wiley-Blackwell",
number = "1",
}

TY - JOUR

T1 - Estimating the variance of estimated trends in proportions when there is no unique subject identifier

AU - Mountford, William K.

AU - Lipsitz, Stuart R.

AU - Fitzmaurice, Garrett M.

AU - Carter, Rickey E.

AU - Soule, Jeremy B.

AU - Colwell, John A.

AU - Lackland, Daniel T.

PY - 2007/1

Y1 - 2007/1

N2 - Longitudinal population-based surveys are widely used in the health sciences to study patterns of change over time. In many of these data sets unique patient identifiers are not publicly available, making it impossible to link the repeated measures from the same individual directly. This poses a statistical challenge for making inferences about time trends because repeated measures from the same individual are likely to be positively correlated, i.e., although the time trend that is estimated under the naïve assumption of independence is unbiased, an unbiased estimate of the variance cannot be obtained without knowledge of the subject identifiers linking repeated measures over time. We propose a simple method for obtaining a conservative estimate of variability for making inferences about trends in proportions over time, ensuring that the type I error is no greater than the specified level. The method proposed is illustrated by using longitudinal data on diabetes hospitalization proportions in South Carolina.

AB - Longitudinal population-based surveys are widely used in the health sciences to study patterns of change over time. In many of these data sets unique patient identifiers are not publicly available, making it impossible to link the repeated measures from the same individual directly. This poses a statistical challenge for making inferences about time trends because repeated measures from the same individual are likely to be positively correlated, i.e., although the time trend that is estimated under the naïve assumption of independence is unbiased, an unbiased estimate of the variance cannot be obtained without knowledge of the subject identifiers linking repeated measures over time. We propose a simple method for obtaining a conservative estimate of variability for making inferences about trends in proportions over time, ensuring that the type I error is no greater than the specified level. The method proposed is illustrated by using longitudinal data on diabetes hospitalization proportions in South Carolina.

KW - Generalized estimating equations

KW - Longitudinal data

KW - Maximal correlation

KW - Type I error

UR - http://www.scopus.com/inward/record.url?scp=33845741139&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33845741139&partnerID=8YFLogxK

U2 - 10.1111/j.1467-985X.2006.00453.x

DO - 10.1111/j.1467-985X.2006.00453.x

M3 - Article

AN - SCOPUS:33845741139

SN - 0964-1998

VL - 170

SP - 185

EP - 193

JO - Journal of the Royal Statistical Society. Series A: Statistics in Society

JF - Journal of the Royal Statistical Society. Series A: Statistics in Society

IS - 1

ER -
