Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review

The secondary use of health data is central to biomedical research in the era of data science and precision medicine. National and international initiatives, such as the Global Open Findable, Accessible, Interoperable, and Reusable (GO FAIR) initiative, are supporting this approach in different ways...

Full description

Saved in:
Bibliographic Details
Published inJournal of medical Internet research Vol. 21; no. 5; p. e13484
Main Authors Chevrier, Raphaël, Foufi, Vasiliki, Gaudet-Blavignac, Christophe, Robert, Arnaud, Lovis, Christian
Format Journal Article
LanguageEnglish
Published Canada Journal of Medical Internet Research 31.05.2019
JMIR Publications
Subjects
Online AccessGet full text
ISSN1438-8871
1439-4456
1438-8871
DOI10.2196/13484

Cover

Loading…
Abstract The secondary use of health data is central to biomedical research in the era of data science and precision medicine. National and international initiatives, such as the Global Open Findable, Accessible, Interoperable, and Reusable (GO FAIR) initiative, are supporting this approach in different ways (eg, making the sharing of research data mandatory or improving the legal and ethical frameworks). Preserving patients' privacy is crucial in this context. De-identification and anonymization are the two most common terms used to refer to the technical approaches that protect privacy and facilitate the secondary use of health data. However, it is difficult to find a consensus on the definitions of the concepts or on the reliability of the techniques used to apply them. A comprehensive review is needed to better understand the domain, its capabilities, its challenges, and the ratio of risk between the data subjects' privacy on one side, and the benefit of scientific advances on the other. This work aims at better understanding how the research community comprehends and defines the concepts of de-identification and anonymization. A rich overview should also provide insights into the use and reliability of the methods. Six aspects will be studied: (1) terminology and definitions, (2) backgrounds and places of work of the researchers, (3) reasons for anonymizing or de-identifying health data, (4) limitations of the techniques, (5) legal and ethical aspects, and (6) recommendations of the researchers. Based on a scoping review protocol designed a priori, MEDLINE was searched for publications discussing de-identification or anonymization and published between 2007 and 2017. The search was restricted to MEDLINE to focus on the life sciences community. The screening process was performed by two reviewers independently. After searching 7972 records that matched at least one search term, 135 publications were screened and 60 full-text articles were included. (1) Terminology: Definitions of the terms de-identification and anonymization were provided in less than half of the articles (29/60, 48%). When both terms were used (41/60, 68%), their meanings divided the authors into two equal groups (19/60, 32%, each) with opposed views. The remaining articles (3/60, 5%) were equivocal. (2) Backgrounds and locations: Research groups were based predominantly in North America (31/60, 52%) and in the European Union (22/60, 37%). The authors came from 19 different domains; computer science (91/248, 36.7%), biomedical informatics (47/248, 19.0%), and medicine (38/248, 15.3%) were the most prevalent ones. (3) Purpose: The main reason declared for applying these techniques is to facilitate biomedical research. (4) Limitations: Progress is made on specific techniques but, overall, limitations remain numerous. (5) Legal and ethical aspects: Differences exist between nations in the definitions, approaches, and legal practices. (6) Recommendations: The combination of organizational, legal, ethical, and technical approaches is necessary to protect health data. Interest is growing for privacy-enhancing techniques in the life sciences community. This interest crosses scientific boundaries, involving primarily computer science, biomedical informatics, and medicine. The variability observed in the use of the terms de-identification and anonymization emphasizes the need for clearer definitions as well as for better education and dissemination of information on the subject. The same observation applies to the methods. Several legislations, such as the American Health Insurance Portability and Accountability Act (HIPAA) and the European General Data Protection Regulation (GDPR), regulate the domain. Using the definitions they provide could help address the variable use of these two concepts in the research community.
AbstractList The secondary use of health data is central to biomedical research in the era of data science and precision medicine. National and international initiatives, such as the Global Open Findable, Accessible, Interoperable, and Reusable (GO FAIR) initiative, are supporting this approach in different ways (eg, making the sharing of research data mandatory or improving the legal and ethical frameworks). Preserving patients’ privacy is crucial in this context. De-identification and anonymization are the two most common terms used to refer to the technical approaches that protect privacy and facilitate the secondary use of health data. However, it is difficult to find a consensus on the definitions of the concepts or on the reliability of the techniques used to apply them. A comprehensive review is needed to better understand the domain, its capabilities, its challenges, and the ratio of risk between the data subjects’ privacy on one side, and the benefit of scientific advances on the other. This work aims at better understanding how the research community comprehends and defines the concepts of de-identification and anonymization. A rich overview should also provide insights into the use and reliability of the methods. Six aspects will be studied: (1) terminology and definitions, (2) backgrounds and places of work of the researchers, (3) reasons for anonymizing or de-identifying health data, (4) limitations of the techniques, (5) legal and ethical aspects, and (6) recommendations of the researchers. Based on a scoping review protocol designed a priori, MEDLINE was searched for publications discussing de-identification or anonymization and published between 2007 and 2017. The search was restricted to MEDLINE to focus on the life sciences community. The screening process was performed by two reviewers independently. After searching 7972 records that matched at least one search term, 135 publications were screened and 60 full-text articles were included. (1) Terminology: Definitions of the terms de-identification and anonymization were provided in less than half of the articles (29/60, 48%). When both terms were used (41/60, 68%), their meanings divided the authors into two equal groups (19/60, 32%, each) with opposed views. The remaining articles (3/60, 5%) were equivocal. (2) Backgrounds and locations: Research groups were based predominantly in North America (31/60, 52%) and in the European Union (22/60, 37%). The authors came from 19 different domains; computer science (91/248, 36.7%), biomedical informatics (47/248, 19.0%), and medicine (38/248, 15.3%) were the most prevalent ones. (3) Purpose: The main reason declared for applying these techniques is to facilitate biomedical research. (4) Limitations: Progress is made on specific techniques but, overall, limitations remain numerous. (5) Legal and ethical aspects: Differences exist between nations in the definitions, approaches, and legal practices. (6) Recommendations: The combination of organizational, legal, ethical, and technical approaches is necessary to protect health data. Interest is growing for privacy-enhancing techniques in the life sciences community. This interest crosses scientific boundaries, involving primarily computer science, biomedical informatics, and medicine. The variability observed in the use of the terms de-identification and anonymization emphasizes the need for clearer definitions as well as for better education and dissemination of information on the subject. The same observation applies to the methods. Several legislations, such as the American Health Insurance Portability and Accountability Act (HIPAA) and the European General Data Protection Regulation (GDPR), regulate the domain. Using the definitions they provide could help address the variable use of these two concepts in the research community.
The secondary use of health data is central to biomedical research in the era of data science and precision medicine. National and international initiatives, such as the Global Open Findable, Accessible, Interoperable, and Reusable (GO FAIR) initiative, are supporting this approach in different ways (eg, making the sharing of research data mandatory or improving the legal and ethical frameworks). Preserving patients' privacy is crucial in this context. De-identification and anonymization are the two most common terms used to refer to the technical approaches that protect privacy and facilitate the secondary use of health data. However, it is difficult to find a consensus on the definitions of the concepts or on the reliability of the techniques used to apply them. A comprehensive review is needed to better understand the domain, its capabilities, its challenges, and the ratio of risk between the data subjects' privacy on one side, and the benefit of scientific advances on the other. This work aims at better understanding how the research community comprehends and defines the concepts of de-identification and anonymization. A rich overview should also provide insights into the use and reliability of the methods. Six aspects will be studied: (1) terminology and definitions, (2) backgrounds and places of work of the researchers, (3) reasons for anonymizing or de-identifying health data, (4) limitations of the techniques, (5) legal and ethical aspects, and (6) recommendations of the researchers. Based on a scoping review protocol designed a priori, MEDLINE was searched for publications discussing de-identification or anonymization and published between 2007 and 2017. The search was restricted to MEDLINE to focus on the life sciences community. The screening process was performed by two reviewers independently. After searching 7972 records that matched at least one search term, 135 publications were screened and 60 full-text articles were included. (1) Terminology: Definitions of the terms de-identification and anonymization were provided in less than half of the articles (29/60, 48%). When both terms were used (41/60, 68%), their meanings divided the authors into two equal groups (19/60, 32%, each) with opposed views. The remaining articles (3/60, 5%) were equivocal. (2) Backgrounds and locations: Research groups were based predominantly in North America (31/60, 52%) and in the European Union (22/60, 37%). The authors came from 19 different domains; computer science (91/248, 36.7%), biomedical informatics (47/248, 19.0%), and medicine (38/248, 15.3%) were the most prevalent ones. (3) Purpose: The main reason declared for applying these techniques is to facilitate biomedical research. (4) Limitations: Progress is made on specific techniques but, overall, limitations remain numerous. (5) Legal and ethical aspects: Differences exist between nations in the definitions, approaches, and legal practices. (6) Recommendations: The combination of organizational, legal, ethical, and technical approaches is necessary to protect health data. Interest is growing for privacy-enhancing techniques in the life sciences community. This interest crosses scientific boundaries, involving primarily computer science, biomedical informatics, and medicine. The variability observed in the use of the terms de-identification and anonymization emphasizes the need for clearer definitions as well as for better education and dissemination of information on the subject. The same observation applies to the methods. Several legislations, such as the American Health Insurance Portability and Accountability Act (HIPAA) and the European General Data Protection Regulation (GDPR), regulate the domain. Using the definitions they provide could help address the variable use of these two concepts in the research community.
The secondary use of health data is central to biomedical research in the era of data science and precision medicine. National and international initiatives, such as the Global Open Findable, Accessible, Interoperable, and Reusable (GO FAIR) initiative, are supporting this approach in different ways (eg, making the sharing of research data mandatory or improving the legal and ethical frameworks). Preserving patients' privacy is crucial in this context. De-identification and anonymization are the two most common terms used to refer to the technical approaches that protect privacy and facilitate the secondary use of health data. However, it is difficult to find a consensus on the definitions of the concepts or on the reliability of the techniques used to apply them. A comprehensive review is needed to better understand the domain, its capabilities, its challenges, and the ratio of risk between the data subjects' privacy on one side, and the benefit of scientific advances on the other.BACKGROUNDThe secondary use of health data is central to biomedical research in the era of data science and precision medicine. National and international initiatives, such as the Global Open Findable, Accessible, Interoperable, and Reusable (GO FAIR) initiative, are supporting this approach in different ways (eg, making the sharing of research data mandatory or improving the legal and ethical frameworks). Preserving patients' privacy is crucial in this context. De-identification and anonymization are the two most common terms used to refer to the technical approaches that protect privacy and facilitate the secondary use of health data. However, it is difficult to find a consensus on the definitions of the concepts or on the reliability of the techniques used to apply them. A comprehensive review is needed to better understand the domain, its capabilities, its challenges, and the ratio of risk between the data subjects' privacy on one side, and the benefit of scientific advances on the other.This work aims at better understanding how the research community comprehends and defines the concepts of de-identification and anonymization. A rich overview should also provide insights into the use and reliability of the methods. Six aspects will be studied: (1) terminology and definitions, (2) backgrounds and places of work of the researchers, (3) reasons for anonymizing or de-identifying health data, (4) limitations of the techniques, (5) legal and ethical aspects, and (6) recommendations of the researchers.OBJECTIVEThis work aims at better understanding how the research community comprehends and defines the concepts of de-identification and anonymization. A rich overview should also provide insights into the use and reliability of the methods. Six aspects will be studied: (1) terminology and definitions, (2) backgrounds and places of work of the researchers, (3) reasons for anonymizing or de-identifying health data, (4) limitations of the techniques, (5) legal and ethical aspects, and (6) recommendations of the researchers.Based on a scoping review protocol designed a priori, MEDLINE was searched for publications discussing de-identification or anonymization and published between 2007 and 2017. The search was restricted to MEDLINE to focus on the life sciences community. The screening process was performed by two reviewers independently.METHODSBased on a scoping review protocol designed a priori, MEDLINE was searched for publications discussing de-identification or anonymization and published between 2007 and 2017. The search was restricted to MEDLINE to focus on the life sciences community. The screening process was performed by two reviewers independently.After searching 7972 records that matched at least one search term, 135 publications were screened and 60 full-text articles were included. (1) Terminology: Definitions of the terms de-identification and anonymization were provided in less than half of the articles (29/60, 48%). When both terms were used (41/60, 68%), their meanings divided the authors into two equal groups (19/60, 32%, each) with opposed views. The remaining articles (3/60, 5%) were equivocal. (2) Backgrounds and locations: Research groups were based predominantly in North America (31/60, 52%) and in the European Union (22/60, 37%). The authors came from 19 different domains; computer science (91/248, 36.7%), biomedical informatics (47/248, 19.0%), and medicine (38/248, 15.3%) were the most prevalent ones. (3) Purpose: The main reason declared for applying these techniques is to facilitate biomedical research. (4) Limitations: Progress is made on specific techniques but, overall, limitations remain numerous. (5) Legal and ethical aspects: Differences exist between nations in the definitions, approaches, and legal practices. (6) Recommendations: The combination of organizational, legal, ethical, and technical approaches is necessary to protect health data.RESULTSAfter searching 7972 records that matched at least one search term, 135 publications were screened and 60 full-text articles were included. (1) Terminology: Definitions of the terms de-identification and anonymization were provided in less than half of the articles (29/60, 48%). When both terms were used (41/60, 68%), their meanings divided the authors into two equal groups (19/60, 32%, each) with opposed views. The remaining articles (3/60, 5%) were equivocal. (2) Backgrounds and locations: Research groups were based predominantly in North America (31/60, 52%) and in the European Union (22/60, 37%). The authors came from 19 different domains; computer science (91/248, 36.7%), biomedical informatics (47/248, 19.0%), and medicine (38/248, 15.3%) were the most prevalent ones. (3) Purpose: The main reason declared for applying these techniques is to facilitate biomedical research. (4) Limitations: Progress is made on specific techniques but, overall, limitations remain numerous. (5) Legal and ethical aspects: Differences exist between nations in the definitions, approaches, and legal practices. (6) Recommendations: The combination of organizational, legal, ethical, and technical approaches is necessary to protect health data.Interest is growing for privacy-enhancing techniques in the life sciences community. This interest crosses scientific boundaries, involving primarily computer science, biomedical informatics, and medicine. The variability observed in the use of the terms de-identification and anonymization emphasizes the need for clearer definitions as well as for better education and dissemination of information on the subject. The same observation applies to the methods. Several legislations, such as the American Health Insurance Portability and Accountability Act (HIPAA) and the European General Data Protection Regulation (GDPR), regulate the domain. Using the definitions they provide could help address the variable use of these two concepts in the research community.CONCLUSIONSInterest is growing for privacy-enhancing techniques in the life sciences community. This interest crosses scientific boundaries, involving primarily computer science, biomedical informatics, and medicine. The variability observed in the use of the terms de-identification and anonymization emphasizes the need for clearer definitions as well as for better education and dissemination of information on the subject. The same observation applies to the methods. Several legislations, such as the American Health Insurance Portability and Accountability Act (HIPAA) and the European General Data Protection Regulation (GDPR), regulate the domain. Using the definitions they provide could help address the variable use of these two concepts in the research community.
Background The secondary use of health data is central to biomedical research in the era of data science and precision medicine. National and international initiatives, such as the Global Open Findable, Accessible, Interoperable, and Reusable (GO FAIR) initiative, are supporting this approach in different ways (eg, making the sharing of research data mandatory or improving the legal and ethical frameworks). Preserving patients’ privacy is crucial in this context. De-identification and anonymization are the two most common terms used to refer to the technical approaches that protect privacy and facilitate the secondary use of health data. However, it is difficult to find a consensus on the definitions of the concepts or on the reliability of the techniques used to apply them. A comprehensive review is needed to better understand the domain, its capabilities, its challenges, and the ratio of risk between the data subjects’ privacy on one side, and the benefit of scientific advances on the other. Objective This work aims at better understanding how the research community comprehends and defines the concepts of de-identification and anonymization. A rich overview should also provide insights into the use and reliability of the methods. Six aspects will be studied: (1) terminology and definitions, (2) backgrounds and places of work of the researchers, (3) reasons for anonymizing or de-identifying health data, (4) limitations of the techniques, (5) legal and ethical aspects, and (6) recommendations of the researchers. Methods Based on a scoping review protocol designed a priori, MEDLINE was searched for publications discussing de-identification or anonymization and published between 2007 and 2017. The search was restricted to MEDLINE to focus on the life sciences community. The screening process was performed by two reviewers independently. Results After searching 7972 records that matched at least one search term, 135 publications were screened and 60 full-text articles were included. (1) Terminology: Definitions of the terms de-identification and anonymization were provided in less than half of the articles (29/60, 48%). When both terms were used (41/60, 68%), their meanings divided the authors into two equal groups (19/60, 32%, each) with opposed views. The remaining articles (3/60, 5%) were equivocal. (2) Backgrounds and locations: Research groups were based predominantly in North America (31/60, 52%) and in the European Union (22/60, 37%). The authors came from 19 different domains; computer science (91/248, 36.7%), biomedical informatics (47/248, 19.0%), and medicine (38/248, 15.3%) were the most prevalent ones. (3) Purpose: The main reason declared for applying these techniques is to facilitate biomedical research. (4) Limitations: Progress is made on specific techniques but, overall, limitations remain numerous. (5) Legal and ethical aspects: Differences exist between nations in the definitions, approaches, and legal practices. (6) Recommendations: The combination of organizational, legal, ethical, and technical approaches is necessary to protect health data. Conclusions Interest is growing for privacy-enhancing techniques in the life sciences community. This interest crosses scientific boundaries, involving primarily computer science, biomedical informatics, and medicine. The variability observed in the use of the terms de-identification and anonymization emphasizes the need for clearer definitions as well as for better education and dissemination of information on the subject. The same observation applies to the methods. Several legislations, such as the American Health Insurance Portability and Accountability Act (HIPAA) and the European General Data Protection Regulation (GDPR), regulate the domain. Using the definitions they provide could help address the variable use of these two concepts in the research community.
Audience Academic
Author Foufi, Vasiliki
Robert, Arnaud
Chevrier, Raphaël
Lovis, Christian
Gaudet-Blavignac, Christophe
AuthorAffiliation 2 Faculty of Medicine University of Geneva Geneva Switzerland
1 Division of Medical Information Sciences University Hospitals of Geneva Geneva Switzerland
AuthorAffiliation_xml – name: 2 Faculty of Medicine University of Geneva Geneva Switzerland
– name: 1 Division of Medical Information Sciences University Hospitals of Geneva Geneva Switzerland
Author_xml – sequence: 1
  givenname: Raphaël
  orcidid: 0000-0002-0334-4883
  surname: Chevrier
  fullname: Chevrier, Raphaël
– sequence: 2
  givenname: Vasiliki
  orcidid: 0000-0002-8625-0734
  surname: Foufi
  fullname: Foufi, Vasiliki
– sequence: 3
  givenname: Christophe
  orcidid: 0000-0001-6527-5898
  surname: Gaudet-Blavignac
  fullname: Gaudet-Blavignac, Christophe
– sequence: 4
  givenname: Arnaud
  orcidid: 0000-0001-8692-7402
  surname: Robert
  fullname: Robert, Arnaud
– sequence: 5
  givenname: Christian
  orcidid: 0000-0002-2681-8076
  surname: Lovis
  fullname: Lovis, Christian
BackLink https://www.ncbi.nlm.nih.gov/pubmed/31152528$$D View this record in MEDLINE/PubMed
BookMark eNptktuKFDEQhhtZcQ_uK0iDCIr0mlOn014I43oaGBRc5zpkkspspDsZO-k9-PRmZnZlWyQXKSpf_anir-PiwAcPRXGK0RnBLX-DKRPsUXGEGRWVEA0-eBAfFscx_kSIINbiJ8UhxbgmNRFHhVtGKJU35dIbGGLKofPrMthylj-47d1vlVzwO-QDVHMDPjnr9D7rfJkuoXzvQg8mJ7ty4RIMKo0DvC0vdNhsxb7DlYPrp8Vjq7oIp3f3SbH89PHH-Zdq8e3z_Hy2qDTjLFUCayYQtsjYhq5arg1vVgIzgQmn3IAhoKmqqdXKMEqUMMIiwoA3lq3yUPSkeLfX3Yyr3JXODQ-qk5vB9Wq4lUE5OX3x7lKuw5XkvBakRVng5Z3AEH6NEJPsXdTQdcpDGKMkhFJRI9HyjD7fo2vVgXTehqyot7icNbyldd22babO_kPlY6B3OttoXc5PCl5NCjKT4Cat1RijnF98nbLPHo77d857izPweg_oIcQ4gJXapZ19uQvXSYzkdoHkboEy_eIf-l5wyv0Br7rCQQ
CitedBy_id crossref_primary_10_1002_wmh3_580
crossref_primary_10_2196_29871
crossref_primary_10_1002_jhrm_70002
crossref_primary_10_1093_jamia_ocab281
crossref_primary_10_1038_s41597_021_00967_y
crossref_primary_10_1055_s_0040_1701988
crossref_primary_10_1515_ohe_2023_0048
crossref_primary_10_1186_s12910_022_00852_2
crossref_primary_10_1093_ejo_cjae067
crossref_primary_10_2196_26914
crossref_primary_10_1002_widm_1495
crossref_primary_10_1093_jamiaopen_ooaa041
crossref_primary_10_17159_sajs_2023_15062
crossref_primary_10_2196_68998
crossref_primary_10_2196_33264
crossref_primary_10_1093_jamia_ocaa260
crossref_primary_10_1016_j_xhgg_2022_100100
crossref_primary_10_1093_jnci_djaf003
crossref_primary_10_47912_jscdm_368
crossref_primary_10_2196_49445
crossref_primary_10_3390_curroncol28040217
crossref_primary_10_1016_j_compbiomed_2023_107655
crossref_primary_10_1016_j_semradonc_2022_06_009
crossref_primary_10_3389_frai_2024_1473837
crossref_primary_10_1016_j_mcpdig_2023_02_003
crossref_primary_10_1093_noajnl_vdae230
crossref_primary_10_1080_10447318_2025_2450411
crossref_primary_10_1371_journal_pdig_0000027
crossref_primary_10_1139_bcb_2021_0506
crossref_primary_10_1016_j_eswa_2025_126621
crossref_primary_10_1055_s_0041_1731286
crossref_primary_10_2196_37594
crossref_primary_10_1016_j_compbiomed_2024_109576
crossref_primary_10_2196_22739
crossref_primary_10_1177_19322968241306129
crossref_primary_10_17159_sajs_2021_10808
crossref_primary_10_1162_dint_a_00106
crossref_primary_10_1177_0846537120967349
crossref_primary_10_1016_j_neubiorev_2022_104709
crossref_primary_10_1186_s12874_024_02404_1
crossref_primary_10_1016_j_ssmqr_2023_100321
crossref_primary_10_1186_s13244_024_01711_x
crossref_primary_10_1007_s10618_024_01066_3
crossref_primary_10_23736_S2532_1285_20_00015_4
crossref_primary_10_1055_a_2373_3291
crossref_primary_10_34133_research_0532
crossref_primary_10_57187_s_3538
crossref_primary_10_3390_clinpract13040089
crossref_primary_10_1111_inr_70013
crossref_primary_10_3390_vaccines13030321
crossref_primary_10_1073_pnas_2304415120
crossref_primary_10_2196_16607
crossref_primary_10_3390_app14145975
crossref_primary_10_3346_jkms_2021_36_e299
crossref_primary_10_1016_j_artmed_2024_102845
crossref_primary_10_2196_17456
crossref_primary_10_3389_fnins_2022_975795
crossref_primary_10_1148_ryai_2020190137
crossref_primary_10_1148_radiol_2020192536
crossref_primary_10_1055_s_0040_1702014
crossref_primary_10_1016_j_oor_2024_100591
crossref_primary_10_1016_j_dib_2023_109930
crossref_primary_10_1177_02666669241247781
crossref_primary_10_1371_journal_pone_0303828
crossref_primary_10_1002_cpt_2270
crossref_primary_10_1177_17407745221087469
crossref_primary_10_1186_s41747_024_00485_7
crossref_primary_10_4258_hir_2021_27_1_39
Cites_doi 10.1213/ANE.0000000000001331
10.1186/1472-6939-12-16
10.1136/bmj.h1139
10.1038/nbt0805-925
10.1186/1472-6947-13-71
10.1109/TITB.2012.2212281
10.1186/s12911-016-0287-2
10.1177/1073110516644204
10.1093/jamia/ocv154
10.1186/s12911-016-0293-4
10.1016/j.ijmedinf.2013.11.005
10.1073/pnas.0911686107
10.1007/s11673-017-9806-9
10.1016/j.jbi.2012.12.003
10.3390/s17051059
10.4274/balkanmedj.2017.0966
10.1197/jamia.M3144
10.1371/journal.pone.0028071
10.1111/jlme.12036
10.2310/JIM.0b013e3181c9b2ea
10.1016/j.jbi.2013.12.002
10.1016/S0140-6736(09)61486-0
10.1136/jme.2006.020032
10.1093/jamia/ocv004
10.1186/1748-5908-5-69
10.1089/bio.2015.0100
10.4338/ACI-2012-07-RA-0028
10.1097/MLR.0b013e3182585355
10.1109/TKDE.2009.88
10.1111/jlme.12312
10.3414/ME16-01-0012
10.1109/JBHI.2017.2676880
10.1177/1460458216647760
10.1136/jamia.2009.002725
10.1136/jamia.2010.004622
10.1016/j.is.2012.11.005
10.1097/XEB.0000000000000050
10.1186/s12874-016-0169-4
10.2310/6650.2007.06044
10.1186/1745-6215-11-9
10.1016/j.jclinepi.2017.04.016
10.1038/nbt.3696
10.1109/TITB.2012.2185850
10.1371/journal.pone.0053875
10.1007/s12687-017-0300-1
10.1186/1471-2288-10-70
10.1197/jamia.M2273
10.1016/j.jbi.2014.05.002
10.1056/NEJMp1605348
10.1186/s12874-016-0116-4
10.1093/jamia/ocw152
10.1197/jamia.M2207
10.3346/jkms.2015.30.1.7
10.7717/peerj.1506
10.1016/j.jbi.2014.06.002
10.15252/embr.201540943
10.1016/j.jbi.2015.09.007
ContentType Journal Article
Copyright Raphaël Chevrier, Vasiliki Foufi, Christophe Gaudet-Blavignac, Arnaud Robert, Christian Lovis. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 31.05.2019.
COPYRIGHT 2019 Journal of Medical Internet Research
Raphaël Chevrier, Vasiliki Foufi, Christophe Gaudet-Blavignac, Arnaud Robert, Christian Lovis. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 31.05.2019. 2019
Copyright_xml – notice: Raphaël Chevrier, Vasiliki Foufi, Christophe Gaudet-Blavignac, Arnaud Robert, Christian Lovis. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 31.05.2019.
– notice: COPYRIGHT 2019 Journal of Medical Internet Research
– notice: Raphaël Chevrier, Vasiliki Foufi, Christophe Gaudet-Blavignac, Arnaud Robert, Christian Lovis. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 31.05.2019. 2019
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
ISN
7X8
5PM
DOI 10.2196/13484
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Gale In Context: Canada
MEDLINE - Academic
PubMed Central (Full Participant titles)
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList

MEDLINE
MEDLINE - Academic

Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Library & Information Science
EISSN 1438-8871
ExternalDocumentID PMC6658290
A769355999
31152528
10_2196_13484
Genre Journal Article
Scoping Review
GroupedDBID ---
.4I
.DC
29L
2WC
36B
53G
5GY
5VS
77K
7RV
7X7
8FI
8FJ
AAFWJ
AAKPC
AAWTL
AAYXX
ABDBF
ABIVO
ABUWG
ACGFO
ADBBV
AEGXH
AENEX
AFKRA
AFPKN
AIAGR
ALIPV
ALMA_UNASSIGNED_HOLDINGS
ALSLI
AOIJS
BAWUL
BCNDV
BENPR
CCPQU
CITATION
CNYFK
CS3
DIK
DU5
DWQXO
E3Z
EAP
EBD
EBS
EJD
ELW
EMB
EMOBN
ESX
F5P
FRP
FYUFA
GROUPED_DOAJ
GX1
HMCUK
HYE
IAO
ICO
IEA
IHR
INH
ISN
ITC
KQ8
M1O
M48
NAPCQ
OK1
OVT
P2P
PGMZT
PHGZM
PHGZT
PIMPY
PQQKQ
RNS
RPM
SJN
SV3
TR2
UKHRP
XSB
ACUHS
CGR
CUY
CVF
ECM
EIF
NPM
PPXIY
PRQQA
PMFND
77I
7X8
PUEGO
5PM
ID FETCH-LOGICAL-c464t-81c4801f0df73b96cd67b814812636ded2ec3a53fcad432a8d8f024e67f4b5253
IEDL.DBID M48
ISSN 1438-8871
1439-4456
IngestDate Sun Aug 31 08:12:08 EDT 2025
Fri Sep 05 11:07:18 EDT 2025
Tue Jun 17 22:20:52 EDT 2025
Tue Jun 10 21:19:48 EDT 2025
Fri Jun 27 05:55:24 EDT 2025
Mon Jul 21 05:26:51 EDT 2025
Tue Jul 01 02:05:43 EDT 2025
Thu Apr 24 22:50:04 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 5
Keywords anonymization
pseudonymization
confidentiality
deidentification
data protection
privacy
de-identification
secondary use
anonymisation
scoping review
Language English
License Raphaël Chevrier, Vasiliki Foufi, Christophe Gaudet-Blavignac, Arnaud Robert, Christian Lovis. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 31.05.2019.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c464t-81c4801f0df73b96cd67b814812636ded2ec3a53fcad432a8d8f024e67f4b5253
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
ObjectType-Review-3
content type line 23
ORCID 0000-0002-8625-0734
0000-0001-8692-7402
0000-0001-6527-5898
0000-0002-0334-4883
0000-0002-2681-8076
OpenAccessLink http://journals.scholarsportal.info/openUrl.xqy?doi=10.2196/13484
PMID 31152528
PQID 2233850896
PQPubID 23479
ParticipantIDs pubmedcentral_primary_oai_pubmedcentral_nih_gov_6658290
proquest_miscellaneous_2233850896
gale_infotracmisc_A769355999
gale_infotracacademiconefile_A769355999
gale_incontextgauss_ISN_A769355999
pubmed_primary_31152528
crossref_citationtrail_10_2196_13484
crossref_primary_10_2196_13484
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2019-05-31
PublicationDateYYYYMMDD 2019-05-31
PublicationDate_xml – month: 05
  year: 2019
  text: 2019-05-31
  day: 31
PublicationDecade 2010
PublicationPlace Canada
PublicationPlace_xml – name: Canada
– name: Toronto, Canada
PublicationTitle Journal of medical Internet research
PublicationTitleAlternate J Med Internet Res
PublicationYear 2019
Publisher Journal of Medical Internet Research
JMIR Publications
Publisher_xml – name: Journal of Medical Internet Research
– name: JMIR Publications
References ref13
ref57
ref12
ref56
ref15
ref59
ref14
ref53
ref11
ref55
ref10
ref54
ref17
ref16
ref19
ref18
(ref2) 2015
Lu, Y (ref64) 2017; 239
ref51
ref50
ref46
ref45
ref48
ref47
ref42
ref41
ref44
ref43
ref49
ref8
ref7
El Emam, K (ref60) 2009; 62
ref9
ref4
ref3
ref6
ref5
ref40
ref35
Bartholomäus, S (ref58) 2015; 210
ref34
ref37
ref31
ref75
ref30
ref33
ref77
ref76
ref1
ref39
ref38
Tamersoy, A (ref65) 2010; 2010
ref71
ref70
ref73
ref72
ref24
ref23
ref67
ref26
Grouin, C (ref36) 2009; 150
Smith, C (ref74) 2017; 235
ref25
ref69
ref20
Foufi, V (ref68) 2017; 244
ref22
ref66
ref21
ref28
ref27
Sinnott, R (ref63) 2008; 138
ref29
Ye, H (ref32) 2011; 2011
ref62
Prasser, F (ref52) 2016; 228
ref61
References_xml – ident: ref1
– ident: ref35
  doi: 10.1213/ANE.0000000000001331
– ident: ref5
– volume: 239
  start-page: 84
  year: 2017
  ident: ref64
  publication-title: Stud Health Technol Inform
– ident: ref70
  doi: 10.1186/1472-6939-12-16
– ident: ref6
  doi: 10.1136/bmj.h1139
– ident: ref20
  doi: 10.1038/nbt0805-925
– ident: ref59
  doi: 10.1186/1472-6947-13-71
– ident: ref47
  doi: 10.1109/TITB.2012.2212281
– ident: ref39
  doi: 10.1186/s12911-016-0287-2
– ident: ref27
– ident: ref37
  doi: 10.1177/1073110516644204
– ident: ref49
  doi: 10.1093/jamia/ocv154
– ident: ref57
  doi: 10.1186/s12911-016-0293-4
– volume: 138
  start-page: 90
  year: 2008
  ident: ref63
  publication-title: Stud Health Technol Inform
– ident: ref33
  doi: 10.1016/j.ijmedinf.2013.11.005
– ident: ref43
  doi: 10.1073/pnas.0911686107
– ident: ref71
  doi: 10.1007/s11673-017-9806-9
– ident: ref16
  doi: 10.1016/j.jbi.2012.12.003
– ident: ref54
  doi: 10.3390/s17051059
– ident: ref29
  doi: 10.4274/balkanmedj.2017.0966
– year: 2015
  ident: ref2
  publication-title: Sharing Clinical Trial Data: Maximizing Benefits, Minimizing Risk
– ident: ref61
– ident: ref7
  doi: 10.1197/jamia.M3144
– ident: ref15
  doi: 10.1371/journal.pone.0028071
– ident: ref53
  doi: 10.1111/jlme.12036
– ident: ref12
  doi: 10.2310/JIM.0b013e3181c9b2ea
– ident: ref41
  doi: 10.1016/j.jbi.2013.12.002
– volume: 210
  start-page: 424
  year: 2015
  ident: ref58
  publication-title: Stud Health Technol Inform
– ident: ref4
  doi: 10.1016/S0140-6736(09)61486-0
– ident: ref9
  doi: 10.1136/jme.2006.020032
– ident: ref23
– ident: ref55
  doi: 10.1093/jamia/ocv004
– ident: ref25
  doi: 10.1186/1748-5908-5-69
– ident: ref38
  doi: 10.1089/bio.2015.0100
– ident: ref75
  doi: 10.4338/ACI-2012-07-RA-0028
– ident: ref28
  doi: 10.1097/MLR.0b013e3182585355
– ident: ref19
  doi: 10.1109/TKDE.2009.88
– ident: ref66
  doi: 10.1111/jlme.12312
– ident: ref48
  doi: 10.3414/ME16-01-0012
– ident: ref51
  doi: 10.1109/JBHI.2017.2676880
– ident: ref34
  doi: 10.1177/1460458216647760
– ident: ref46
  doi: 10.1136/jamia.2009.002725
– ident: ref45
  doi: 10.1136/jamia.2010.004622
– ident: ref17
  doi: 10.1016/j.is.2012.11.005
– ident: ref26
  doi: 10.1097/XEB.0000000000000050
– volume: 235
  start-page: 313
  year: 2017
  ident: ref74
  publication-title: Stud Health Technol Inform
– volume: 62
  start-page: 307
  issue: 4
  year: 2009
  ident: ref60
  publication-title: Can J Hosp Pharm
– ident: ref10
  doi: 10.1186/s12874-016-0169-4
– ident: ref22
– volume: 228
  start-page: 312
  year: 2016
  ident: ref52
  publication-title: Stud Health Technol Inform
– ident: ref30
  doi: 10.2310/6650.2007.06044
– volume: 2011
  start-page: 1573
  year: 2011
  ident: ref32
  publication-title: AMIA Annu Symp Proc
– ident: ref56
  doi: 10.1186/1745-6215-11-9
– ident: ref73
  doi: 10.1016/j.jclinepi.2017.04.016
– ident: ref21
  doi: 10.1038/nbt.3696
– ident: ref44
  doi: 10.1109/TITB.2012.2185850
– ident: ref42
  doi: 10.1371/journal.pone.0053875
– volume: 2010
  start-page: 782
  year: 2010
  ident: ref65
  publication-title: AMIA Annu Symp Proc
– ident: ref50
  doi: 10.1007/s12687-017-0300-1
– ident: ref13
  doi: 10.1186/1471-2288-10-70
– ident: ref3
  doi: 10.1197/jamia.M2273
– ident: ref76
  doi: 10.1016/j.jbi.2014.05.002
– ident: ref11
  doi: 10.1056/NEJMp1605348
– ident: ref67
– ident: ref24
  doi: 10.1186/s12874-016-0116-4
– volume: 150
  start-page: 735
  year: 2009
  ident: ref36
  publication-title: Stud Health Technol Inform
– ident: ref69
  doi: 10.1093/jamia/ocw152
– ident: ref8
  doi: 10.1197/jamia.M2207
– ident: ref40
  doi: 10.3346/jkms.2015.30.1.7
– ident: ref31
  doi: 10.7717/peerj.1506
– ident: ref77
– ident: ref14
  doi: 10.1016/j.jbi.2014.06.002
– ident: ref62
  doi: 10.15252/embr.201540943
– ident: ref18
– ident: ref72
  doi: 10.1016/j.jbi.2015.09.007
– volume: 244
  start-page: 23
  year: 2017
  ident: ref68
  publication-title: Stud Health Technol Inform
SSID ssj0020491
Score 2.5271907
SecondaryResourceType review_article
Snippet The secondary use of health data is central to biomedical research in the era of data science and precision medicine. National and international initiatives,...
Background The secondary use of health data is central to biomedical research in the era of data science and precision medicine. National and international...
SourceID pubmedcentral
proquest
gale
pubmed
crossref
SourceType Open Access Repository
Aggregation Database
Index Database
Enrichment Source
StartPage e13484
SubjectTerms Biomedical Research - methods
Data Anonymization - standards
Ethical aspects
Health insurance industry
Humans
Medical research
Medicine, Experimental
Privacy
Privacy, Right of
Reproducibility of Results
Review
Title Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review
URI https://www.ncbi.nlm.nih.gov/pubmed/31152528
https://www.proquest.com/docview/2233850896
https://pubmed.ncbi.nlm.nih.gov/PMC6658290
Volume 21
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV3da9RAEB9sC0dBROtXtA2rFH2KXrKbTVYQabWlineKGuhbSLK77UHJaXMH-t87k-yFpOhLXnZIlp2ZzMzOzG8ADhNpDUZcFZVVFIFAGxmoyOogVWUoaRCBbbPns7k8y8Sn83hQTegOsPlnaEfzpLLrq1e_f_15hwr_lsqYUYBeh1ykYgt20BhJir9mok8kROgAhxO4PSLdhQlhzEQxjWAfWKOb_-SBURoXTA4s0OlduONcR3bU8foe3DL1Hhy4xgP2grnOIjpp5lR2DyYzlzy_D4usMayoNcuGDS1saVl3B-A6MluSDyboWnitu9Nji5qhr8iO23Z94iz73CMyv8HvtY1XrEs1PIDs9OTH-7PATVoIKiHFKkjDimBk7FTbhJdKVlomZYqRUhhJLrXRkal4EXNbFVrwqEh1atG4G5lYUeJB8oewjTs1j4GJqSUQOamT2AiluZJlVESaMGRKUcTWg8PNieeVgyGnaRhXOYYjxKO85ZEHfk_2s8PduEnwnNiVE4ZFTUUyF8W6afKP3-f5Ec13JCQ15cFLR2SX-JGqcD0HuFWCvRpR7o8oUcmq0fKzjVTktESVabVZrpsc3SueoperpAePOinpN7yRMg-Skfz0BITtPV6pF5ctxrdEzzBS0yf_fedT2EXfTXWFDPuwvbpemwP0j1alD1vJeeLDzvHJ_Os3v71lwOcs_OK3uvEXYU0SoA
linkProvider Scholars Portal
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Use+and+Understanding+of+Anonymization+and+De-Identification+in+the+Biomedical+Literature%3A+Scoping+Review&rft.jtitle=Journal+of+medical+Internet+research&rft.au=Chevrier%2C+Rapha%C3%ABl&rft.au=Foufi%2C+Vasiliki&rft.au=Gaudet-Blavignac%2C+Christophe&rft.au=Robert%2C+Arnaud&rft.date=2019-05-31&rft.eissn=1438-8871&rft.volume=21&rft.issue=5&rft.spage=e13484&rft_id=info:doi/10.2196%2F13484&rft_id=info%3Apmid%2F31152528&rft.externalDocID=31152528
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1438-8871&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1438-8871&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1438-8871&client=summon