Evaluating re-identification risks with respect to the HIPAA privacy rule

Kathleen Benitez; Bradley Malin

doi:10.1136/jamia.2009.000026

Loading next page...

References (31)

C. Skinner, M. Elliot (2002)
A measure of disclosure risk for microdata
Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64
C. Safran, M. Bloomrosen, W. Hammond, S. Labkoff, S. Markel-Fox, P. Tang, D. Detmer (2007)
White Paper: Toward a National Framework for the Secondary Use of Health Data: An American Medical Informatics Association White Paper
Journal of the American Medical Informatics Association : JAMIA, 14 1
P. Golle (2006)
Revisiting the uniqueness of simple demographics in the US population
L. Sweeney (1997)
Weaving Technology and Policy Together to Maintain Confidentiality
The Journal of Law, Medicine & Ethics, 25
C. Skinner, D. Holmes (1998)
Estimating the re-identification risk per record in microdata
(2007)
Social Security data puts 1.3 mil. voters at risk: suit. Chicago Sun-Times
R. Parker, P. Aggleton (2002)
HIV/AIDS-related stigma and discrimination: a conceptual framework and implications for action
(2007)
Policy for sharing of data obtained in NIH supported or conducted genome-wide association studies (GWAS) NOT-OD-07e088
Stuart Schechter (2005)
Toward econometric models of the security risk from remote attacks
IEEE Security & Privacy, 3
M. Weiner, P. Embí (2009)
Toward Reuse of Clinical Data for Research and Quality Improvement: The End of the Beginning?
Annals of Internal Medicine, 151
B. Greenberg, Laura Voshell (2002)
RELATING RISK OF DISCLOSURE FOR MICRODATA AND GEOGRAPHIC AREA SIZE
A. McGuire, R. Gibbs (2006)
No Longer De-Identified
Science, 312
K. Emam, Ann Brown, P. AbdelMalik (2009)
Model Formulation: Evaluating Predictors of Geographic Area Population Size Cut-offs to Manage Re-identification Risk
Journal of the American Medical Informatics Association : JAMIA, 16 2
(2008)
New details reveal numerous mistakes prior to election commission break-in
B. Fung, Ke Wang, Philip Yu (2007)
Anonymizing Classification Data for Privacy Preservation
IEEE Transactions on Knowledge and Data Engineering, 19
U.S. Census Bureau. American FactFinder
G. Kenagy, C. Hsieh (2005)
The risk less known: female-to-male transgender persons’ vulnerability to HIV infection
AIDS Care, 17
T. Truta, F. Fotouhi, D. Barth-Jones (2003)
Disclosure risk measures for microdata
15th International Conference on Scientific and Statistical Database Management, 2003.
R. Agrawal, Christopher Johnson (2007)
Securing electronic health records without impeding the flow of information
International journal of medical informatics, 76 5-6
A. Gionis, Tamir Tassa (2009)
k-Anonymization with Minimal Loss of Information
IEEE Transactions on Knowledge and Data Engineering, 21
Ashwin Machanavajjhala, J. Gehrke, Daniel Kifer, Muthuramakrishnan Venkitasubramaniam (2006)
L-diversity: privacy beyond k-anonymity
22nd International Conference on Data Engineering (ICDE'06)
(1999)
Princeton Survey Research Associates. Medical privacy and confidentiality survey
A. McGuire, R. Gibbs (2006)
Genetics. No longer de-identified.
Science, 312 5772
(2004)
Voter privacy in the digital age Report from the California Voter Foundation
Hhs Rights (2002)
Standards for privacy of individually identifiable health information. Final rule.
Federal register, 67 157
M. Mulry (2006)
Summary of Accuracy and Coverage Evaluation for Census 2000
D. Reidpath, K. Chan (2005)
HIV discrimination: integrating the results from a six-country situational analysis in the Asia Pacific
AIDS Care, 17
Wei Jiang, M. Atzori (2006)
Secure Distributed k-Anonymous Pattern Mining
Sixth International Conference on Data Mining (ICDM'06)
D. Blumenthal (2009)
Stimulating the adoption of health information technology.
The New England journal of medicine, 360 15
P. Samarati (2001)
Protecting Respondents' Identities in Microdata Release
IEEE Trans. Knowl. Data Eng., 13
(2000)
Uniqueness of simple demographics in the U.S. population Working paper LIDAP-WP4

Publisher: Oxford University Press
Copyright: © 2010, Published by the BMJ Publishing Group Limited For permission to use, (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
ISSN: 1067-5027
eISSN: 1527-974X
DOI: 10.1136/jamia.2009.000026
pmid: 20190059
Publisher site: See Article on Publisher Site

Abstract

AbstractObjective Many healthcare organizations follow data protection policies that specify which patient identifiers must be suppressed to share “de-identified” records. Such policies, however, are often applied without knowledge of the risk of “re-identification”. The goals of this work are: (1) to estimate re-identification risk for data sharing policies of the Health Insurance Portability and Accountability Act (HIPAA) Privacy Rule; and (2) to evaluate the risk of a specific re-identification attack using voter registration lists.Measurements We define several risk metrics: (1) expected number of re-identifications; (2) estimated proportion of a population in a group of size g or less, and (3) monetary cost per re-identification. For each US state, we estimate the risk posed to hypothetical datasets, protected by the HIPAA Safe Harbor and Limited Dataset policies by an attacker with full knowledge of patient identifiers and with limited knowledge in the form of voter registries.Results The percentage of a state's population estimated to be vulnerable to unique re-identification (ie, g=1) when protected via Safe Harbor and Limited Datasets ranges from 0.01% to 0.25% and 10% to 60%, respectively. In the voter attack, this number drops for many states, and for some states is 0%, due to the variable availability of voter registries in the real world. We also find that re-identification cost ranges from $0 to $17 000, further confirming risk variability.Conclusions This work illustrates that blanket protection policies, such as Safe Harbor, leave different organizations vulnerable to re-identification at different rates. It provides justification for locally performed re-identification risk estimates prior to sharing data.

Journal

Journal of the American Medical Informatics Association – Oxford University Press

Published: Mar 1, 2010

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Evaluating re-identification risks with respect to the HIPAA privacy rule

Evaluating re-identification risks with respect to the HIPAA privacy rule

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Evaluating re-identification risks with respect to the HIPAA privacy rule

Evaluating re-identification risks with respect to the HIPAA privacy rule

References (31)

Abstract

Journal

Recommended Articles

There are no references for this article.

Our policy towards the use of cookies