Using electronic patient records to discover disease correlations and stratify patient cohorts

Electronic patient records remain a rather unexplored, but potentially rich data source for discovering correlations between diseases. We describe a general approach for gathering phenotypic descriptions of patients from medical records in a systematic and non-cohort dependent manner. By extracting...

Full description

Saved in:
Bibliographic Details
Published inPLoS computational biology Vol. 7; no. 8; p. e1002141
Main Authors Roque, Francisco S, Jensen, Peter B, Schmock, Henriette, Dalgaard, Marlene, Andreatta, Massimo, Hansen, Thomas, Søeby, Karen, Bredkjær, Søren, Juul, Anders, Werge, Thomas, Jensen, Lars J, Brunak, Søren
Format Journal Article
LanguageEnglish
Published United States Public Library of Science 01.08.2011
Public Library of Science (PLoS)
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Electronic patient records remain a rather unexplored, but potentially rich data source for discovering correlations between diseases. We describe a general approach for gathering phenotypic descriptions of patients from medical records in a systematic and non-cohort dependent manner. By extracting phenotype information from the free-text in such records we demonstrate that we can extend the information contained in the structured record data, and use it for producing fine-grained patient stratification and disease co-occurrence statistics. The approach uses a dictionary based on the International Classification of Disease ontology and is therefore in principle language independent. As a use case we show how records from a Danish psychiatric hospital lead to the identification of disease correlations, which subsequently can be mapped to systems biology frameworks.
AbstractList Electronic patient records remain a rather unexplored, but potentially rich data source for discovering correlations between diseases. We describe a general approach for gathering phenotypic descriptions of patients from medical records in a systematic and non-cohort dependent manner. By extracting phenotype information from the free-text in such records we demonstrate that we can extend the information contained in the structured record data, and use it for producing fine-grained patient stratification and disease co-occurrence statistics. The approach uses a dictionary based on the International Classification of Disease ontology and is therefore in principle language independent. As a use case we show how records from a Danish psychiatric hospital lead to the identification of disease correlations, which subsequently can be mapped to systems biology frameworks.
  Electronic patient records remain a rather unexplored, but potentially rich data source for discovering correlations between diseases. We describe a general approach for gathering phenotypic descriptions of patients from medical records in a systematic and non-cohort dependent manner. By extracting phenotype information from the free-text in such records we demonstrate that we can extend the information contained in the structured record data, and use it for producing fine-grained patient stratification and disease co-occurrence statistics. The approach uses a dictionary based on the International Classification of Disease ontology and is therefore in principle language independent. As a use case we show how records from a Danish psychiatric hospital lead to the identification of disease correlations, which subsequently can be mapped to systems biology frameworks.
Electronic patient records remain a rather unexplored, but potentially rich data source for discovering correlations between diseases. We describe a general approach for gathering phenotypic descriptions of patients from medical records in a systematic and non-cohort dependent manner. By extracting phenotype information from the free-text in such records we demonstrate that we can extend the information contained in the structured record data, and use it for producing fine-grained patient stratification and disease co-occurrence statistics. The approach uses a dictionary based on the International Classification of Disease ontology and is therefore in principle language independent. As a use case we show how records from a Danish psychiatric hospital lead to the identification of disease correlations, which subsequently can be mapped to systems biology frameworks. Text mining and information extraction can be seen as the challenge of converting information hidden in text into manageable data. We have used text mining to automatically extract clinically relevant terms from 5543 psychiatric patient records and map these to disease codes in the International Classification of Disease ontology (ICD10). Mined codes were supplemented by existing coded data. For each patient we constructed a phenotypic profile of associated ICD10 codes. This allowed us to cluster patients together based on the similarity of their profiles. The result is a patient stratification based on more complete profiles than the primary diagnosis, which is typically used. Similarly we investigated comorbidities by looking for pairs of disease codes cooccuring in patients more often than expected. Our high ranking pairs were manually curated by a medical doctor who flagged 93 candidates as interesting. For a number of these we were able to find genes/proteins known to be associated with the diseases using the OMIM database. The disease-associated proteins allowed us to construct protein networks suspected to be involved in each of the phenotypes. Shared proteins between two associated diseases might provide insight to the disease comorbidity.
Author Schmock, Henriette
Andreatta, Massimo
Dalgaard, Marlene
Bredkjær, Søren
Jensen, Lars J
Jensen, Peter B
Søeby, Karen
Roque, Francisco S
Hansen, Thomas
Juul, Anders
Werge, Thomas
Brunak, Søren
AuthorAffiliation Vanderbilt University, United States of America
4 Department of Growth and Reproduction GR, Rigshospitalet, Copenhagen, Denmark
2 NNF Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
1 Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark
5 Department of Clinical Biochemistry, Hvidovre Hospital, Copenhagen University Hospital, Hvidovre, Denmark
3 Institute of Biological Psychiatry, Mental Health Center Sct. Hans, Copenhagen University Hospital, Roskilde, Denmark
6 Psychiatry Region Sealand, Ringsted, Denmark
AuthorAffiliation_xml – name: 6 Psychiatry Region Sealand, Ringsted, Denmark
– name: 4 Department of Growth and Reproduction GR, Rigshospitalet, Copenhagen, Denmark
– name: 2 NNF Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
– name: 3 Institute of Biological Psychiatry, Mental Health Center Sct. Hans, Copenhagen University Hospital, Roskilde, Denmark
– name: 5 Department of Clinical Biochemistry, Hvidovre Hospital, Copenhagen University Hospital, Hvidovre, Denmark
– name: 1 Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark
– name: Vanderbilt University, United States of America
Author_xml – sequence: 1
  givenname: Francisco S
  surname: Roque
  fullname: Roque, Francisco S
  organization: Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark
– sequence: 2
  givenname: Peter B
  surname: Jensen
  fullname: Jensen, Peter B
– sequence: 3
  givenname: Henriette
  surname: Schmock
  fullname: Schmock, Henriette
– sequence: 4
  givenname: Marlene
  surname: Dalgaard
  fullname: Dalgaard, Marlene
– sequence: 5
  givenname: Massimo
  surname: Andreatta
  fullname: Andreatta, Massimo
– sequence: 6
  givenname: Thomas
  surname: Hansen
  fullname: Hansen, Thomas
– sequence: 7
  givenname: Karen
  surname: Søeby
  fullname: Søeby, Karen
– sequence: 8
  givenname: Søren
  surname: Bredkjær
  fullname: Bredkjær, Søren
– sequence: 9
  givenname: Anders
  surname: Juul
  fullname: Juul, Anders
– sequence: 10
  givenname: Thomas
  surname: Werge
  fullname: Werge, Thomas
– sequence: 11
  givenname: Lars J
  surname: Jensen
  fullname: Jensen, Lars J
– sequence: 12
  givenname: Søren
  surname: Brunak
  fullname: Brunak, Søren
BackLink https://www.ncbi.nlm.nih.gov/pubmed/21901084$$D View this record in MEDLINE/PubMed
BookMark eNpVUktvEzEQtlARbQP_AMHeOCWM14_1XpBQBbRSpV7oFcuP2XQjxw72plL_PQ7ZRq188NjzPTzjuSRnMUUk5COFFWUd_bpJ-xxNWO2cHVcUoKWcviEXVAi27JhQZy_ic3JZygaghr18R85b2gMFxS_In_syxnWDAd2UUxxdszPTiHFqMrqUfWmm1PixuPSI-RCgKdjUTMZQgSmWxkTflCnX0_B0Yrv0kPJU3pO3gwkFP8z7gtz__PH76np5e_fr5ur77dLxvpuW6FveUQO9AgQwXgwdk66TFgY69FJ4PoDgDjpurPPgRM-4FKKVnTUCLWUL8vmouwup6Lk1RVNWl-Kcq4q4OSJ8Mhu9y-PW5CedzKj_X6S81iZPowuo-94aLhVI7xyX6K3qpANlkYvWsuq9IN9mt73done14GzCK9HXmTg-6HV61IzK2nleBb7MAjn93WOZ9La2GEMwEdO-aKUUY5L1UJH8iHQ5lZJxOLlQ0Ic5eC5WH-ZAz3NQaZ9evvBEev549g_cbrU2
CitedBy_id crossref_primary_10_3390_jpm12050768
crossref_primary_10_1186_s12874_017_0299_3
crossref_primary_10_1007_s41870_021_00692_0
crossref_primary_10_1109_TITB_2012_2212281
crossref_primary_10_1136_amiajnl_2012_001448
crossref_primary_10_1038_bjc_2012_315
crossref_primary_10_1007_s12652_019_01217_1
crossref_primary_10_1109_TCBB_2018_2849968
crossref_primary_10_1016_j_mad_2020_111403
crossref_primary_10_1371_journal_pone_0066341
crossref_primary_10_1186_s12911_021_01387_z
crossref_primary_10_1016_j_ypmed_2020_106061
crossref_primary_10_1038_nrg3208_c2
crossref_primary_10_1371_journal_pone_0237911
crossref_primary_10_1111_acps_12279
crossref_primary_10_3109_0167482X_2011_626940
crossref_primary_10_1109_JBHI_2015_2496424
crossref_primary_10_1186_s13326_015_0004_6
crossref_primary_10_1016_j_jbi_2019_103360
crossref_primary_10_1038_s41598_018_22578_1
crossref_primary_10_1038_nbt_2422
crossref_primary_10_3390_s22010131
crossref_primary_10_1007_s10916_014_0071_8
crossref_primary_10_1016_j_annemergmed_2018_10_034
crossref_primary_10_1016_j_chaos_2022_112633
crossref_primary_10_1136_amiajnl_2014_002767
crossref_primary_10_1016_j_jbi_2015_12_003
crossref_primary_10_1016_j_artmed_2023_102642
crossref_primary_10_1016_j_jbi_2015_12_006
crossref_primary_10_1016_j_jbi_2021_103711
crossref_primary_10_1016_j_jbi_2018_04_008
crossref_primary_10_1007_s11845_018_1825_y
crossref_primary_10_1109_JBHI_2017_2783049
crossref_primary_10_1016_j_jbi_2014_01_012
crossref_primary_10_1371_journal_pgen_1009054
crossref_primary_10_1088_1755_1315_332_3_032024
crossref_primary_10_1109_TNB_2015_2422612
crossref_primary_10_1136_bmjopen_2015_008721
crossref_primary_10_1038_nrg3208
crossref_primary_10_4158_EP15914_OR
crossref_primary_10_1016_j_jad_2019_09_044
crossref_primary_10_1093_bib_bby049
crossref_primary_10_1136_bmjopen_2019_036099
crossref_primary_10_1177_1178222617713018
crossref_primary_10_1016_j_artmed_2013_12_003
crossref_primary_10_1016_j_bbagrm_2019_194416
crossref_primary_10_1038_s41598_021_97831_1
crossref_primary_10_1038_s41598_017_07111_0
crossref_primary_10_1097_MEG_0000000000001869
crossref_primary_10_1093_eurheartj_ehx487
crossref_primary_10_1051_e3sconf_202235101028
crossref_primary_10_2196_medinform_7744
crossref_primary_10_1016_j_jbi_2015_10_001
crossref_primary_10_1186_1472_6947_13_30
crossref_primary_10_1093_jamia_ocv216
crossref_primary_10_1016_j_procs_2016_09_077
crossref_primary_10_1111_j_1365_2125_2011_04153_x
crossref_primary_10_3390_genes11121476
crossref_primary_10_1186_1742_4682_9_22
crossref_primary_10_1038_s41598_017_15647_4
crossref_primary_10_1177_2151459318801756
crossref_primary_10_1186_1878_5085_4_23
crossref_primary_10_4137_CIN_S13893
crossref_primary_10_4258_hir_2016_22_3_196
crossref_primary_10_1089_big_2013_0023
crossref_primary_10_1136_amiajnl_2012_001076
crossref_primary_10_1016_j_eswa_2015_09_028
crossref_primary_10_1098_rsif_2020_1040
crossref_primary_10_1136_bmjopen_2016_012012
crossref_primary_10_1111_epi_17734
crossref_primary_10_1016_j_protcy_2013_12_152
crossref_primary_10_1177_0193945918824070
crossref_primary_10_3390_healthcare6020054
crossref_primary_10_1186_1472_6947_15_S4_S1
crossref_primary_10_1093_jamia_ocu019
crossref_primary_10_1093_jamiaopen_ooz067
crossref_primary_10_1016_j_ijmedinf_2023_105088
crossref_primary_10_1371_journal_pcbi_1005487
crossref_primary_10_1016_j_jbi_2016_08_005
crossref_primary_10_1093_bioinformatics_btt327
crossref_primary_10_1186_1471_2105_15_105
crossref_primary_10_1038_s43588_021_00141_9
crossref_primary_10_1186_s13326_016_0093_x
crossref_primary_10_1093_bioinformatics_btv254
crossref_primary_10_2196_12239
crossref_primary_10_1038_s41467_018_05690_8
crossref_primary_10_1017_cts_2016_25
crossref_primary_10_1016_j_jbi_2018_09_016
crossref_primary_10_1016_j_ymeth_2014_11_020
crossref_primary_10_1007_s00127_016_1266_8
crossref_primary_10_1155_2016_1671965
crossref_primary_10_1016_j_jbi_2016_05_002
crossref_primary_10_1098_rsif_2018_0248
crossref_primary_10_1177_0269881120903466
crossref_primary_10_1002_sim_8893
crossref_primary_10_1038_nbt_2749
crossref_primary_10_1146_annurev_biodatasci_080917_013508
crossref_primary_10_1016_j_drudis_2013_09_012
crossref_primary_10_1016_j_compbiomed_2016_04_014
crossref_primary_10_1093_bioinformatics_btz061
crossref_primary_10_1093_jamia_ocx052
crossref_primary_10_1016_j_jacc_2019_01_074
crossref_primary_10_1109_ACCESS_2020_3016782
crossref_primary_10_1093_bioinformatics_btt359
crossref_primary_10_1002_wsbm_1310
crossref_primary_10_1016_j_prevetmed_2016_11_012
crossref_primary_10_1038_nbt_2345
crossref_primary_10_1186_s12911_020_01140_y
crossref_primary_10_1186_1472_6947_15_S2_S3
crossref_primary_10_1093_jamia_ocv046
crossref_primary_10_3390_ijerph17228303
crossref_primary_10_1186_s12911_021_01565_z
crossref_primary_10_3389_fphys_2014_00332
crossref_primary_10_7554_eLife_44941
crossref_primary_10_1371_journal_pcbi_1006115
crossref_primary_10_1371_journal_pone_0070585
crossref_primary_10_1038_nrg_2016_87
crossref_primary_10_2217_pme_14_55
crossref_primary_10_1038_s41598_017_04939_4
crossref_primary_10_3390_data8010011
crossref_primary_10_1007_s11518_020_5462_4
crossref_primary_10_1542_peds_2013_0819
crossref_primary_10_1016_j_jbi_2013_10_007
crossref_primary_10_4062_biomolther_2020_001
crossref_primary_10_1016_j_jbi_2014_10_004
crossref_primary_10_3390_diagnostics12071552
crossref_primary_10_1093_hmg_ddx014
crossref_primary_10_1093_ije_dyaa157
crossref_primary_10_1136_amiajnl_2013_001708
crossref_primary_10_1002_mpr_1481
crossref_primary_10_1016_j_ebiom_2024_105081
crossref_primary_10_1186_s12911_018_0654_2
crossref_primary_10_1073_pnas_2110406119
crossref_primary_10_1159_000501432
crossref_primary_10_1186_s12859_018_2468_8
crossref_primary_10_3389_fphys_2016_00561
crossref_primary_10_1088_1742_6596_1732_1_012013
crossref_primary_10_1016_j_jbi_2016_02_003
crossref_primary_10_1093_jamia_ocad119
crossref_primary_10_1016_j_ipm_2015_07_005
crossref_primary_10_2215_CJN_10981016
crossref_primary_10_1186_s13326_018_0179_8
crossref_primary_10_3233_JIFS_190143
crossref_primary_10_1093_jamiaopen_ooy050
crossref_primary_10_1016_j_jbi_2014_10_010
crossref_primary_10_2196_14782
crossref_primary_10_1186_s13023_018_0830_6
crossref_primary_10_1371_journal_pcbi_1004747
crossref_primary_10_1016_j_jbi_2019_103235
crossref_primary_10_1007_s40264_014_0145_z
crossref_primary_10_1159_000381851
crossref_primary_10_1145_3127881
crossref_primary_10_13106_jafeb_2019_vol6_no4_189
crossref_primary_10_1371_journal_pone_0288903
crossref_primary_10_1007_s12170_014_0424_3
crossref_primary_10_1136_bmjopen_2016_015297
Cites_doi 10.1016/j.jhealeco.2004.07.002
10.1016/j.ijmedinf.2007.09.001
10.1016/j.jbi.2010.09.009
10.1007/s10799-009-0061-6
10.1016/j.jbi.2004.11.016
10.1055/s-0038-1634006
10.1136/jnnp.63.4.549
10.1073/pnas.0810772105
10.1016/S0140-6736(66)90189-9
10.1016/j.ajhg.2010.03.003
10.1046/j.1440-1754.2003.00116.x
10.1080/15265161.2010.492894
10.1056/NEJMsa0802005
10.1197/jamia.M1727
10.1136/jamia.2010.003939
10.1007/s10048-004-0181-4
10.1016/j.jaad.2006.10.020
10.1002/humu.21456
10.1186/1472-6947-6-30
10.1007/s10729-009-9112-0
10.3109/10401230009147085
10.1016/j.jbi.2006.02.010
10.1038/nbt1295
10.1136/bmj.c3111
10.1053/j.gastro.2005.02.018
10.1136/jamia.2001.0080254
10.1136/bmj.328.7437.438
10.1016/S0920-1211(02)00080-3
10.1016/j.compbiomed.2005.08.003
10.1136/jamia.2009.001560
10.1145/635484.635487
10.1371/journal.pcbi.1000353
10.1038/msb.2010.36
10.1002/acr.20184
10.1038/nrg1768
10.3414/ME9128
10.1093/nar/gki033
10.1002/asi.4630270302
10.1046/j.1523-1747.2003.12370.x
10.1016/j.tibtech.2006.10.002
10.1093/bioinformatics/btq126
10.1186/2041-1480-1-5
10.1073/pnas.0701361104
10.1038/nature04209
10.1038/msb.2009.16
10.1197/jamia.M1552
10.1136/jamia.2010.003657
10.3414/ME9135
10.1016/S1386-5056(02)00030-8
10.1158/0008-5472.CAN-09-3460
10.1197/jamia.M2408
10.1016/S0168-8510(02)00208-7
10.1016/j.ajic.2005.06.008
10.3414/ME9132
10.1176/appi.ajp.2009.09101473
10.1006/jbin.2001.1029
ContentType Journal Article
Copyright Roque et al. 2011
2011 Roque et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited: Roque FS, Jensen PB, Schmock H, Dalgaard M, Andreatta M, et al. (2011) Using Electronic Patient Records to Discover Disease Correlations and Stratify Patient Cohorts. PLoS Comput Biol 7(8): e1002141. doi:10.1371/journal.pcbi.1002141
Copyright_xml – notice: Roque et al. 2011
– notice: 2011 Roque et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited: Roque FS, Jensen PB, Schmock H, Dalgaard M, Andreatta M, et al. (2011) Using Electronic Patient Records to Discover Disease Correlations and Stratify Patient Cohorts. PLoS Comput Biol 7(8): e1002141. doi:10.1371/journal.pcbi.1002141
DBID CGR
CUY
CVF
ECM
EIF
NPM
AAYXX
CITATION
7X8
5PM
DOA
DOI 10.1371/journal.pcbi.1002141
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
CrossRef
MEDLINE - Academic
PubMed Central (Full Participant titles)
Directory of Open Access Journals
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
CrossRef
MEDLINE - Academic
DatabaseTitleList
MEDLINE


Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
Medicine
Computer Science
DocumentTitleAlternate EPR Based Co-Morbidity and Patient Stratification
EISSN 1553-7358
Editor Ritchie, Marylyn D.
Editor_xml – sequence: 1
  givenname: Marylyn D.
  surname: Ritchie
  fullname: Ritchie, Marylyn D.
EndPage e1002141
ExternalDocumentID 1313184448
oai_doaj_org_article_99ba46806dcc46edb876c08be452b359
10_1371_journal_pcbi_1002141
21901084
Genre Research Support, Non-U.S. Gov't
Journal Article
GroupedDBID ---
123
29O
2WC
3V.
53G
5VS
7X7
88E
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAKPC
ABDBF
ABUWG
ACGFO
ACIHN
ACIWK
ACPRK
ADBBV
ADRAZ
AEAQA
AENEX
AFKRA
AFRAH
AHMBA
ALIPV
ALMA_UNASSIGNED_HOLDINGS
AOIJS
ARAPS
AZQEC
B0M
BAWUL
BBNVY
BCNDV
BENPR
BGLVJ
BHPHI
BPHCQ
BVXVI
BWKFM
C1A
CCPQU
CGR
CS3
CUY
CVF
DIK
DWQXO
E3Z
EAP
EAS
EBD
EBS
ECM
EIF
EJD
EMK
EMOBN
ESX
F5P
FPL
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
H13
HCIFZ
HMCUK
HYE
IAO
IGS
INH
INR
IPNFZ
ISN
ISR
ITC
J9A
K6V
K7-
KQ8
LK8
M0N
M1P
M48
M7P
M~E
NPM
O5R
O5S
OK1
P2P
P62
PGMZT
PIMPY
PQQKQ
PROAC
PSQYO
PV9
RIG
RNS
RPM
RZL
SV3
TR2
TUS
UKHRP
WOQ
WOW
XSB
~8M
AAYXX
AFPKN
CITATION
7X8
5PM
AAPBV
ABPTK
ID FETCH-LOGICAL-c497t-ed2471a0980e00ad5f736c76b0f1f965d4f054c074abcd0c5934655267ba5eb13
IEDL.DBID RPM
ISSN 1553-7358
1553-734X
IngestDate Sun Oct 01 00:20:29 EDT 2023
Thu Sep 05 15:38:57 EDT 2024
Tue Sep 17 21:09:21 EDT 2024
Fri Aug 16 10:07:20 EDT 2024
Fri Aug 23 03:46:09 EDT 2024
Tue Oct 15 23:43:48 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 8
Keywords International Classification of Diseases
Reproducibility of Results
Comorbidity
Humans
Computational Biology
Data Mining
Data Collection
Electronic Health Records
Cluster Analysis
Cohort Studies
Language English
License This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
Creative Commons Attribution License
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c497t-ed2471a0980e00ad5f736c76b0f1f965d4f054c074abcd0c5934655267ba5eb13
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Conceived and designed the experiments: F. Roque, P. Jensen, S. Bredkjær, L. Jensen, S. Brunak. Performed the experiments: F. Roque, P. Jensen, H. Schmock, M. Andreatta. Analyzed the data: F. Roque, P. Jensen, H. Schmock, M. Dalgaard, M. Andreatta, T. Hansen, K. Søeby, A. Juul, T. Werge, S. Brunak. Wrote the paper: F. Roque, P. Jensen.
OpenAccessLink https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3161904/
PMID 21901084
PQID 888336390
PQPubID 23479
ParticipantIDs plos_journals_1313184448
doaj_primary_oai_doaj_org_article_99ba46806dcc46edb876c08be452b359
pubmedcentral_primary_oai_pubmedcentral_nih_gov_3161904
proquest_miscellaneous_888336390
crossref_primary_10_1371_journal_pcbi_1002141
pubmed_primary_21901084
PublicationCentury 2000
PublicationDate 2011-08-01
PublicationDateYYYYMMDD 2011-08-01
PublicationDate_xml – month: 08
  year: 2011
  text: 2011-08-01
  day: 01
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
– name: San Francisco, USA
PublicationTitle PLoS computational biology
PublicationTitleAlternate PLoS Comput Biol
PublicationYear 2011
Publisher Public Library of Science
Public Library of Science (PLoS)
Publisher_xml – name: Public Library of Science
– name: Public Library of Science (PLoS)
References 18565855 - N Engl J Med. 2008 Jul 3;359(1):50-60
15187068 - J Am Med Inform Assoc. 2004 Sep-Oct;11(5):392-402
20439396 - Am J Psychiatry. 2010 May;167(5):498-500
20819856 - J Am Med Inform Assoc. 2010 Sep-Oct;17(5):524-7
16779011 - AMIA Annu Symp Proc. 2005;:106-10
20402283 - Health Care Manag Sci. 2010 Mar;13(1):65-73
12123149 - J Biomed Inform. 2001 Oct;34(5):301-10
14976100 - BMJ. 2004 Feb 21;328(7437):438-9
19023491 - Methods Inf Med. 2008;47(6):541-8
12880440 - J Invest Dermatol. 2003 Aug;121(2):430-2
20819853 - J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13
17700902 - Yearb Med Inform. 2007;:34-46
16616698 - J Biomed Inform. 2007 Feb;40(1):44-6
11825149 - Proc AMIA Symp. 2001;:17-21
16872495 - BMC Med Inform Decis Mak. 2006;6:30
17147971 - J Am Acad Dermatol. 2007 May;56(5):848-53
15617794 - J Health Econ. 2005 Jan;24(1):189-210
16375883 - Comput Biol Med. 2006 Dec;36(12):1351-77
17045684 - Trends Biotechnol. 2006 Dec;24(12):571-9
16418747 - Nat Rev Genet. 2006 Feb;7(2):119-29
15797003 - J Biomed Inform. 2005 Apr;38(2):145-53
20068186 - Cancer Res. 2010 Jan 15;70(2):575-87
19357641 - Mol Syst Biol. 2009;5:262
20335276 - Bioinformatics. 2010 May 1;26(9):1205-10
12453552 - Int J Med Inform. 2002 Nov 20;66(1-3):3-21
20362271 - Am J Hum Genet. 2010 Apr 9;86(4):560-72
18660887 - Yearb Med Inform. 2008;:128-44
8947648 - Proc AMIA Annu Fall Symp. 1996;:159-63
20884377 - J Biomed Inform. 2010 Dec;43(6):891-901
12151129 - Epilepsy Res. 2002 Jun;50(1-2):195-202
15608251 - Nucleic Acids Res. 2005 Jan 1;33(Database issue):D514-7
16216656 - Am J Infect Control. 2005 Oct;33(8):439-43
17567225 - Am J Manag Care. 2007 Jun;13(6 Part 1):281-8
20235204 - Arthritis Care Res (Hoboken). 2010 Aug;62(8):1120-7
21309042 - Hum Mutat. 2011 May;32(5):512-6
17951106 - Int J Med Inform. 2008 May;77(5):291-304
12849909 - Health Policy. 2003 Aug;65(2):101-7
16350276 - Methods Inf Med. 2005;44(4):546-50
16779036 - AMIA Annu Symp Proc. 2005;:231-5
20618981 - J Biomed Semantics. 2010 Mar 31;1(1):5
12603809 - J Paediatr Child Health. 2003 Mar;39(2):152-4
20554687 - BMJ. 2010;340:c3111
19151878 - Methods Inf Med. 2009;48(1):4-10
17947624 - J Am Med Inform Assoc. 2008 Jan-Feb;15(1):14-24
9357597 - Proc AMIA Annu Fall Symp. 1997;:101-5
19151882 - Methods Inf Med. 2009;48(1):38-44
20819858 - J Am Med Inform Assoc. 2010 Sep-Oct;17(5):532-5
19360091 - PLoS Comput Biol. 2009 Apr;5(4):e1000353
17051292 - Yearb Med Inform. 2006;:29-39
4159965 - Lancet. 1966 Apr 16;1(7442):856-9
20571530 - Mol Syst Biol. 2010 Jun 22;6:381
15133719 - Neurogenetics. 2004 Jun;5(2):129-33
16779084 - AMIA Annu Symp Proc. 2005;:470-4
11320070 - J Am Med Inform Assoc. 2001 May-Jun;8(3):254-66
20818550 - Am J Bioeth. 2010 Sep;10(9):19-20
17502601 - Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90
19104045 - Proc Natl Acad Sci U S A. 2008 Dec 30;105(52):20870-5
9343147 - J Neurol Neurosurg Psychiatry. 1997 Oct;63(4):549-50
17344885 - Nat Biotechnol. 2007 Mar;25(3):309-16
10798824 - Ann Clin Psychiatry. 2000 Mar;12(1):35-42
15684124 - J Am Med Inform Assoc. 2005 May-Jun;12(3):269-74
15825133 - Gastroenterology. 2005 Apr;128(4 Suppl 1):S92-7
16189514 - Nature. 2005 Oct 20;437(7062):1173-8
K-I Goh (ref26) 2007; 104
HU Prokosch (ref2) 2009; 48
S Ananiadou (ref64) 2006; 24
DM Crammer K (ref32) 2007
P Ganesan (ref48) 2003; 21
CD Manning (ref65) 2009
PJ Embi (ref35) 2005
W Eaton (ref56) 2004; 328
K Lage (ref69) 2010; 6
J-F Rual (ref42) 2005; 437
MH Ullman-Cullere (ref10) 2011; 32
CA Kulikowski (ref9) 2009; 48
P Knaup (ref12) 2007
U Sax (ref7) 2005; 44
A Hamosh (ref68) 2005; 33
C Friedman (ref29) 2004; 11
QT Zeng (ref40) 2006; 6
CA Hidalgo (ref66) 2009; 5
K Häyrinen (ref11) 2008; 77
MW Jaspers (ref6) 2006
RM Plovnick (ref60) 2010; 167
JM Sutherland (ref14) 2009; 13
W Chapman (ref47) 2007
D Krasowska (ref54) 2007; 56
T Suzuki (ref30) 2008; 47
K Lage (ref70) 2008; 105
CM DesRoches (ref3) 2008; 359
L Serden (ref13) 2003; 65
J Patrick (ref33) 2010; 17
T Greenhalgh (ref5) 2010; 340
O Uzuner (ref21) 2008; 15
JP Haas (ref25) 2005; 33
S Fessatou (ref58) 2003; 39
KP Liao (ref19) 2010; 62
MA Hoffman (ref8) 2007; 40
SE Robertson (ref45) 1976
R Haux (ref1) 2002; 66
KO Bushara (ref57) 2005; 128
DJ Muzina (ref51) 2002; 50
MC Tremblay (ref20) 2009; 10
D Becker (ref15) 2005; 24
JC Prather (ref16) 1997
H Ma (ref59) 2010; 70
AR Aronson (ref38) 2001
Y Benjamini (ref67) 1995; 57
IM Mullins (ref17) 2006; 36
AS Paller (ref49) 2003; 121
LJ Jensen (ref63) 2006; 7
J Park (ref27) 2009; 5
A Wright (ref18) 2010; 43
NJ Colson (ref50) 2004; 5
B Honigman (ref24) 2001; 8
W Long (ref31) 2005
A Ikeda (ref53) 1997; 63
MD Ritchie (ref44) 2010; 86
SV Pakhomov (ref36) 2005; 38
JC Denny (ref43) 2010; 26
Y Mercke (ref52) 2000; 12
K Lage (ref41) 2007; 25
SM Meystre (ref37) 2008
I Spasic (ref34) 2010; 17
H Cao (ref28) 2005
W Chapman (ref46) 2001; 34
WL Galanter (ref23) 2005; 12
CW Whitty (ref55) 1966; 1
GK Savova (ref39) 2010; 17
KM Hettne (ref61) 2010; 1
S Pakhomov (ref22) 2007; 13
S Hoffman (ref4) 2010; 10
WR Hersh (ref62) 1996
References_xml – volume: 24
  start-page: 189
  year: 2005
  ident: ref15
  article-title: Detecting Medicare abuse.
  publication-title: Journal of Health Economics
  doi: 10.1016/j.jhealeco.2004.07.002
  contributor:
    fullname: D Becker
– start-page: 101
  year: 1997
  ident: ref16
  article-title: Medical data mining: knowledge discovery in a clinical data warehouse.
  publication-title: Proc AMIA Annu Fall Symp
  contributor:
    fullname: JC Prather
– volume: 77
  start-page: 291
  year: 2008
  ident: ref11
  article-title: Definition, structure, content, use and impacts of electronic health records: a review of the research literature.
  publication-title: Int J Med Inform
  doi: 10.1016/j.ijmedinf.2007.09.001
  contributor:
    fullname: K Häyrinen
– start-page: 29
  year: 2006
  ident: ref6
  article-title: The computerized patient record: where do we stand ?
  publication-title: Yearb Med Inform
  contributor:
    fullname: MW Jaspers
– volume: 43
  start-page: 891
  year: 2010
  ident: ref18
  article-title: An automated technique for identifying associations between medications, laboratory results and problems.
  publication-title: J Biomed Inform
  doi: 10.1016/j.jbi.2010.09.009
  contributor:
    fullname: A Wright
– volume: 10
  start-page: 253
  year: 2009
  ident: ref20
  article-title: Identifying fall-related injuries: Text mining the electronic medical record.
  publication-title: Inf Technol Manag
  doi: 10.1007/s10799-009-0061-6
  contributor:
    fullname: MC Tremblay
– volume: 38
  start-page: 145
  year: 2005
  ident: ref36
  article-title: Prospective recruitment of patients with congestive heart failure using an ad-hoc binary classifier.
  publication-title: J Biomed Inform
  doi: 10.1016/j.jbi.2004.11.016
  contributor:
    fullname: SV Pakhomov
– volume: 44
  start-page: 546
  year: 2005
  ident: ref7
  article-title: Integration of genomic data in Electronic Health Records–opportunities and dilemmas.
  publication-title: Methods Inf Med
  doi: 10.1055/s-0038-1634006
  contributor:
    fullname: U Sax
– volume: 63
  start-page: 549
  year: 1997
  ident: ref53
  article-title: Alopecia with carbamazepine in two patients with focal seizures.
  publication-title: J Neurol Neurosurg Psychiatr
  doi: 10.1136/jnnp.63.4.549
  contributor:
    fullname: A Ikeda
– volume: 105
  start-page: 20870
  year: 2008
  ident: ref70
  article-title: A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes.
  publication-title: Proc Natl Acad Sci U S A
  doi: 10.1073/pnas.0810772105
  contributor:
    fullname: K Lage
– volume: 1
  start-page: 856
  year: 1966
  ident: ref55
  article-title: The effect of oral contraceptives on migraine.
  publication-title: Lancet
  doi: 10.1016/S0140-6736(66)90189-9
  contributor:
    fullname: CW Whitty
– volume: 86
  start-page: 560
  year: 2010
  ident: ref44
  article-title: Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record.
  publication-title: Am J Hum Genet
  doi: 10.1016/j.ajhg.2010.03.003
  contributor:
    fullname: MD Ritchie
– volume: 39
  start-page: 152
  year: 2003
  ident: ref58
  article-title: Coeliac disease and alopecia areata in childhood.
  publication-title: J Paediatr Child Health
  doi: 10.1046/j.1440-1754.2003.00116.x
  contributor:
    fullname: S Fessatou
– volume: 10
  start-page: 19
  year: 2010
  ident: ref4
  article-title: Electronic health records and research: privacy versus scientific priorities.
  publication-title: Am J Bioeth
  doi: 10.1080/15265161.2010.492894
  contributor:
    fullname: S Hoffman
– start-page: 81
  year: 2007
  ident: ref47
  article-title: ConText: An Algorithm for Identifying Contextual Features from Clinical Text.
  publication-title: BioNLP 2007: Biological, translational, and clinical language processing
  contributor:
    fullname: W Chapman
– volume: 57
  start-page: 289
  year: 1995
  ident: ref67
  article-title: Controlling the false discovery rate: a practical and powerful approach to multiple testing.
  publication-title: J R Stat Soc Series B Stat Methodol
  contributor:
    fullname: Y Benjamini
– volume: 359
  start-page: 50
  year: 2008
  ident: ref3
  article-title: Electronic health records in ambulatory care–a national survey of physicians.
  publication-title: N Engl J Med
  doi: 10.1056/NEJMsa0802005
  contributor:
    fullname: CM DesRoches
– volume: 12
  start-page: 269
  year: 2005
  ident: ref23
  article-title: A trial of automated decision support alerts for contraindicated medications using computerized physician order entry.
  publication-title: J Am Med Inform Assoc
  doi: 10.1197/jamia.M1727
  contributor:
    fullname: WL Galanter
– volume: 17
  start-page: 524
  year: 2010
  ident: ref33
  article-title: High accuracy information extraction of medication information from clinical notes: 2009 i2b2 medication extraction challenge.
  publication-title: J Am Med Inform Assoc
  doi: 10.1136/jamia.2010.003939
  contributor:
    fullname: J Patrick
– volume: 5
  start-page: 129
  year: 2004
  ident: ref50
  article-title: The estrogen receptor 1 G594A polymorphism is associated with migraine susceptibility in two independent case/control groups.
  publication-title: Neurogenetics
  doi: 10.1007/s10048-004-0181-4
  contributor:
    fullname: NJ Colson
– volume: 56
  start-page: 848
  year: 2007
  ident: ref54
  article-title: Cutaneous effects of the most commonly used antidepressant medication, the selective serotonin reuptake inhibitors.
  publication-title: J Am Acad Dermatol
  doi: 10.1016/j.jaad.2006.10.020
  contributor:
    fullname: D Krasowska
– start-page: 470
  year: 2005
  ident: ref31
  article-title: Extracting diagnoses from discharge summaries.
  publication-title: AMIA Annual Symposium proceedings/AMIA Symposium AMIA Symposium
  contributor:
    fullname: W Long
– volume: 32
  start-page: 512
  year: 2011
  ident: ref10
  article-title: Emerging landscape of genomics in the electronic health record for personalized medicine.
  publication-title: Hum Mutat
  doi: 10.1002/humu.21456
  contributor:
    fullname: MH Ullman-Cullere
– year: 2009
  ident: ref65
  article-title: An Introduction to Information Retrieval
  contributor:
    fullname: CD Manning
– volume: 6
  start-page: 30
  year: 2006
  ident: ref40
  article-title: Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system.
  publication-title: BMC Med Inform Decis Mak
  doi: 10.1186/1472-6947-6-30
  contributor:
    fullname: QT Zeng
– volume: 13
  start-page: 65
  year: 2009
  ident: ref14
  article-title: Adjusting case mix payment amounts for inaccurately reported comorbidity data.
  publication-title: Health Care Manag Sci
  doi: 10.1007/s10729-009-9112-0
  contributor:
    fullname: JM Sutherland
– volume: 12
  start-page: 35
  year: 2000
  ident: ref52
  article-title: Hair loss in psychopharmacology.
  publication-title: Ann Clin Psychiatry
  doi: 10.3109/10401230009147085
  contributor:
    fullname: Y Mercke
– volume: 40
  start-page: 44
  year: 2007
  ident: ref8
  article-title: The genome-enabled electronic medical record.
  publication-title: J Biomed Inform
  doi: 10.1016/j.jbi.2006.02.010
  contributor:
    fullname: MA Hoffman
– volume: 25
  start-page: 309
  year: 2007
  ident: ref41
  article-title: A human phenome-interactome network of protein complexes implicated in genetic disorders.
  publication-title: Nat Biotechnol
  doi: 10.1038/nbt1295
  contributor:
    fullname: K Lage
– volume: 340
  start-page: c3111
  year: 2010
  ident: ref5
  article-title: Adoption and non-adoption of a shared electronic summary record in England: a mixed-method case study.
  publication-title: BMJ
  doi: 10.1136/bmj.c3111
  contributor:
    fullname: T Greenhalgh
– volume: 128
  start-page: S92
  year: 2005
  ident: ref57
  article-title: Neurologic presentation of celiac disease.
  publication-title: Gastroenterology
  doi: 10.1053/j.gastro.2005.02.018
  contributor:
    fullname: KO Bushara
– start-page: 128
  year: 2008
  ident: ref37
  article-title: Extracting information from textual documents in the electronic health record: a review of recent research.
  publication-title: Yearb Med Inform
  contributor:
    fullname: SM Meystre
– volume: 8
  start-page: 254
  year: 2001
  ident: ref24
  article-title: Using computerized data to identify adverse drug events in outpatients.
  publication-title: J Am Med Inform Assoc
  doi: 10.1136/jamia.2001.0080254
  contributor:
    fullname: B Honigman
– volume: 328
  start-page: 438
  year: 2004
  ident: ref56
  article-title: Coeliac disease and schizophrenia: population based case control study with linkage of Danish national registers.
  publication-title: BMJ
  doi: 10.1136/bmj.328.7437.438
  contributor:
    fullname: W Eaton
– volume: 50
  start-page: 195
  year: 2002
  ident: ref51
  article-title: Antiepileptic drugs in psychiatry-focus on randomized controlled trial.
  publication-title: Epilepsy Res
  doi: 10.1016/S0920-1211(02)00080-3
  contributor:
    fullname: DJ Muzina
– volume: 13
  start-page: 281
  year: 2007
  ident: ref22
  article-title: Electronic medical records for clinical research: application to the identification of heart failure.
  publication-title: Am J Manag Care
  contributor:
    fullname: S Pakhomov
– start-page: 129
  year: 2007
  ident: ref32
  article-title: Automatic Code Assignment to Medical Text
  contributor:
    fullname: DM Crammer K
– volume: 36
  start-page: 1351
  year: 2006
  ident: ref17
  article-title: Data mining and clinical data repositories: Insights from a 667,000 patient data set.
  publication-title: Comput Biol Med
  doi: 10.1016/j.compbiomed.2005.08.003
  contributor:
    fullname: IM Mullins
– volume: 17
  start-page: 507
  year: 2010
  ident: ref39
  article-title: Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.
  publication-title: J Am Med Inform Assoc
  doi: 10.1136/jamia.2009.001560
  contributor:
    fullname: GK Savova
– volume: 21
  year: 2003
  ident: ref48
  article-title: Exploiting Hierarchical Domain Structure to Compute Similarity.
  publication-title: ACM Transactions on Information Systems (TOIS)
  doi: 10.1145/635484.635487
  contributor:
    fullname: P Ganesan
– volume: 5
  start-page: e1000353
  year: 2009
  ident: ref66
  article-title: A dynamic network approach for the study of human phenotypes.
  publication-title: PLoS Comput Biol
  doi: 10.1371/journal.pcbi.1000353
  contributor:
    fullname: CA Hidalgo
– volume: 6
  start-page: 381
  year: 2010
  ident: ref69
  article-title: Dissecting spatio-temporal protein networks driving human heart development and related disorders.
  publication-title: Mol Syst Biol
  doi: 10.1038/msb.2010.36
  contributor:
    fullname: K Lage
– volume: 62
  start-page: 1120
  year: 2010
  ident: ref19
  article-title: Electronic medical records for discovery research in rheumatoid arthritis.
  publication-title: Arthritis Care Res (Hoboken)
  doi: 10.1002/acr.20184
  contributor:
    fullname: KP Liao
– start-page: 17
  year: 2001
  ident: ref38
  article-title: Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.
  publication-title: Proc AMIA Symp
  contributor:
    fullname: AR Aronson
– volume: 7
  start-page: 119
  year: 2006
  ident: ref63
  article-title: Literature mining for the biologist: from information retrieval to biological discovery.
  publication-title: Nat Rev Genet
  doi: 10.1038/nrg1768
  contributor:
    fullname: LJ Jensen
– volume: 47
  start-page: 541
  year: 2008
  ident: ref30
  article-title: Automatic DPC code selection from electronic medical records: text mining trial of discharge summary.
  publication-title: Methods Inf Med
  doi: 10.3414/ME9128
  contributor:
    fullname: T Suzuki
– volume: 33
  start-page: D514
  year: 2005
  ident: ref68
  article-title: Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders.
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gki033
  contributor:
    fullname: A Hamosh
– start-page: 129
  year: 1976
  ident: ref45
  article-title: Relevance weighting of search terms.
  publication-title: J Am Soc Inf Sci
  doi: 10.1002/asi.4630270302
  contributor:
    fullname: SE Robertson
– volume: 121
  start-page: 430
  year: 2003
  ident: ref49
  article-title: Compound heterozygous mutations in the hairless gene in atrichia with papular lesions.
  publication-title: J Invest Dermatol
  doi: 10.1046/j.1523-1747.2003.12370.x
  contributor:
    fullname: AS Paller
– volume: 24
  start-page: 571
  year: 2006
  ident: ref64
  article-title: Text mining and its potential applications in systems biology.
  publication-title: Trends Biotechnol
  doi: 10.1016/j.tibtech.2006.10.002
  contributor:
    fullname: S Ananiadou
– volume: 26
  start-page: 1205
  year: 2010
  ident: ref43
  article-title: PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations.
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq126
  contributor:
    fullname: JC Denny
– volume: 1
  start-page: 5
  year: 2010
  ident: ref61
  article-title: Rewriting and suppressing UMLS terms for improved biomedical term identification.
  publication-title: J Biomed Semantics
  doi: 10.1186/2041-1480-1-5
  contributor:
    fullname: KM Hettne
– volume: 104
  start-page: 8685
  year: 2007
  ident: ref26
  article-title: The human disease network.
  publication-title: Proc Natl Acad Sci U S A
  doi: 10.1073/pnas.0701361104
  contributor:
    fullname: K-I Goh
– volume: 437
  start-page: 1173
  year: 2005
  ident: ref42
  article-title: Towards a proteome-scale map of the human protein-protein interaction network.
  publication-title: Nature
  doi: 10.1038/nature04209
  contributor:
    fullname: J-F Rual
– volume: 5
  start-page: 262
  year: 2009
  ident: ref27
  article-title: The impact of cellular networks on disease comorbidity.
  publication-title: Mol Syst Biol
  doi: 10.1038/msb.2009.16
  contributor:
    fullname: J Park
– volume: 11
  start-page: 392
  year: 2004
  ident: ref29
  article-title: Automated encoding of clinical documents based on natural language processing.
  publication-title: J Am Med Inform Assoc
  doi: 10.1197/jamia.M1552
  contributor:
    fullname: C Friedman
– volume: 17
  start-page: 532
  year: 2010
  ident: ref34
  article-title: Medication information extraction with linguistic pattern matching and semantic rules.
  publication-title: J Am Med Inform Assoc
  doi: 10.1136/jamia.2010.003657
  contributor:
    fullname: I Spasic
– start-page: 159
  year: 1996
  ident: ref62
  article-title: Empirical, automated vocabulary discovery using large text corpora and advanced natural language processing tools.
  publication-title: Proc AMIA Annu Fall Symp
  contributor:
    fullname: WR Hersh
– volume: 48
  start-page: 4
  year: 2009
  ident: ref9
  article-title: Biomedical and health informatics in translational medicine.
  publication-title: Methods Inf Med
  doi: 10.3414/ME9135
  contributor:
    fullname: CA Kulikowski
– start-page: 34
  year: 2007
  ident: ref12
  article-title: Electronic patient records: moving from islands and bridges towards electronic health records for continuity of care.
  publication-title: Yearb Med Inform
  contributor:
    fullname: P Knaup
– volume: 66
  start-page: 3
  year: 2002
  ident: ref1
  article-title: Health care in the information society. A prognosis for the year 2013.
  publication-title: Int J Med Inform
  doi: 10.1016/S1386-5056(02)00030-8
  contributor:
    fullname: R Haux
– volume: 70
  start-page: 575
  year: 2010
  ident: ref59
  article-title: Use of four biomarkers to evaluate the risk of breast cancer subtypes in the women's contraceptive and reproductive experiences study.
  publication-title: Cancer Res
  doi: 10.1158/0008-5472.CAN-09-3460
  contributor:
    fullname: H Ma
– volume: 15
  start-page: 14
  year: 2008
  ident: ref21
  article-title: Identifying patient smoking status from medical discharge records.
  publication-title: J Am Med Inform Assoc
  doi: 10.1197/jamia.M2408
  contributor:
    fullname: O Uzuner
– start-page: 231
  year: 2005
  ident: ref35
  article-title: Development of an electronic health record-based Clinical Trial Alert system to enhance recruitment at the point of care.
  publication-title: AMIA Annual Symposium proceedings/AMIA Symposium AMIA Symposium
  contributor:
    fullname: PJ Embi
– volume: 65
  start-page: 101
  year: 2003
  ident: ref13
  article-title: Have DRG-based prospective payment systems influenced the number of secondary diagnoses in health care administrative data?
  publication-title: Health Policy
  doi: 10.1016/S0168-8510(02)00208-7
  contributor:
    fullname: L Serden
– volume: 33
  start-page: 439
  year: 2005
  ident: ref25
  article-title: Use of computerized surveillance to detect nosocomial pneumonia in neonatal intensive care unit patients.
  publication-title: Am J Infect Control
  doi: 10.1016/j.ajic.2005.06.008
  contributor:
    fullname: JP Haas
– volume: 48
  start-page: 38
  year: 2009
  ident: ref2
  article-title: Perspectives for medical informatics. Reusing the electronic medical record for clinical research.
  publication-title: Methods Inf Med
  doi: 10.3414/ME9132
  contributor:
    fullname: HU Prokosch
– volume: 167
  start-page: 498
  year: 2010
  ident: ref60
  article-title: The progression of electronic health records and implications for psychiatry.
  publication-title: Am J Psychiatry
  doi: 10.1176/appi.ajp.2009.09101473
  contributor:
    fullname: RM Plovnick
– start-page: 106
  year: 2005
  ident: ref28
  article-title: Mining a clinical data warehouse to discover disease-finding associations using co-occurrence statistics.
  publication-title: AMIA Annu Symp Proc
  contributor:
    fullname: H Cao
– volume: 34
  start-page: 301
  year: 2001
  ident: ref46
  article-title: A simple algorithm for identifying negated findings and diseases in discharge summaries.
  publication-title: J Biomed Inform
  doi: 10.1006/jbin.2001.1029
  contributor:
    fullname: W Chapman
SSID ssj0035896
Score 2.514676
Snippet Electronic patient records remain a rather unexplored, but potentially rich data source for discovering correlations between diseases. We describe a general...
  Electronic patient records remain a rather unexplored, but potentially rich data source for discovering correlations between diseases. We describe a general...
SourceID plos
doaj
pubmedcentral
proquest
crossref
pubmed
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
StartPage e1002141
SubjectTerms Bias
Biology
Classification
Cluster Analysis
Codes
Cohort Studies
Comorbidity
Computational Biology - methods
Computer Science
Data Collection - methods
Data mining
Data Mining - methods
Disease
Electronic Health Records
Humans
International Classification of Diseases
Medical research
Medicine
Ontology
Patients
Reproducibility of Results
Studies
SummonAdditionalLinks – databaseName: Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1NT9wwELUQUiUuVSltSaHIh15TnPgrPgIqQhw4gbSnRvHYVpGqLCLhwL-vx3a2bFWpF5RbPhR7Zmw_2-P3CPnqGw2tj-0bDIdatEHUg2pdbUFbAQAu2JQge6Ou7sT1Sq5eSH1hTlimB86GOzXGDkJ1TDkAobyzsfkC66wXsrVc5qN7jVwmU7kP5rJLylwoilNrLlbl0BzXzWnx0bcHsPeJgLQRzdaglLj7kev013r6F-78O33yxXh0-Y68LUCSnuUK7JMdP74nb7K05PMB-ZFyAegflRtaGFRpXpaZ6LymeCQXUzhp2aahgFodJTuODqOjmVU3PG--RkHdx3n6QO4uv99eXNVFS6EGYfRce9fGYWhgpmOescHJoLkCrSwLTTBKOhEieIMIKAYLjoE0HJnVWqXtIGN_zj-S3XE9-kNCGWjPbRz6fGgEIMCxAVlf4sTSoAhqRerFmP1Dpszo076ZjlONbJwejd8X41fkHC2-eRcJr9ONGAZ9CYP-f2FQkUP01_KDKf4uXl0sVFcRuviwj20HN0SG0a-fpr5DpeUI0VhFPmWXbgrRIlBinaiI3nL2Vim3n4z3PxM9N48g2jDx-TWqdUT2lkVs1hyT3fnxyX-JKGi2JyngfwO4XQeI
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: Scholars Portal Open Access Journals
  dbid: M48
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1La9wwEB5CSqGX0vQVt0nRoVcH2ZYl6xBKu00IhZYeupBTjTWSmoVgp-sNdP99NPKDbkgPxRe_ZInR65Nn9H0A712mMHehf6MuMBW5F2kjc5saVEYgovUmBsh-kxdL8eWyvNyDSbN1NGD_4NKO9KSW6-uTP7-3H0KHP42qDSqbEp3coFlFStGMdrI_ykUhqM1_FbNfoSirqNhFYjmpClfjZrp_fWVnsoqc_sSBet31D-HR-2GVf81T58_g6Qgw2cehRRzAnmufw-NBcnL7An7GGAF2NqvfsO8DsyobFqI923Ts86pHCu2kE3LfsAVpeIxRc6xpLYuktiu_nVMvuivyPbyE5fnZj8VFOmospCi02qTO5mF6ariuuOO8saVXhUQlDfeZ17K0wgdQhwFoNAYtx1IXxLiWS2WaMozzxSvYb7vWHQLjqFxhwpTofCaQgI_xxAYTFpyaxFETSCdj1jcDlUYd_WkqLEEG49Rk_Ho0fgKfyOLzu0SEHW9061_12K9qrU0jZMWlRRTSWRNGd-SVcaLMTVHqBA6pvqYM-pBdOKpQqCoBNtVhHfoUOUqa1nW3fV2RAnOAbjyB10OVzoXICUDxSiSgdip7p5S7T9rVVaTtLgK41ly8-U8zvIUn039snh3B_mZ9644DENqYd7Ft3wHyCQll
  priority: 102
  providerName: Scholars Portal
Title Using electronic patient records to discover disease correlations and stratify patient cohorts
URI https://www.ncbi.nlm.nih.gov/pubmed/21901084
https://search.proquest.com/docview/888336390
https://pubmed.ncbi.nlm.nih.gov/PMC3161904
https://doaj.org/article/99ba46806dcc46edb876c08be452b359
http://dx.doi.org/10.1371/journal.pcbi.1002141
Volume 7
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Pa9swFH60HRu7dFv2o96PoMOuTmRLtqRjkyYrg5QwVshpxpLlNdDaoUkP_e_7JMthGTsNgx2SCAm_96Qnv8_fB_DVJsKkFuPbKGZintY8LvO0irURmhtjqlp7gOxVfnnNv6-y1RFk_bswHrRv9HrU3N6NmvWNx1Zu7sy4x4mNl4spwzRFUT4-hmPBWL9F76ZflkkvyuX0cGLB-Cq8L8dEMg7mGW1cB4mnC0s8G7DDKEh-sDR5Bn_HeHrbbv-Vff4NovxjVZq_htOQTpLzbthv4Mg2A3jeCUw-DuBVL9pAQgwP4MUiVNPfwi-PFyCzvRIOWXYsq6TblG7JriUX661xME_3wZVyyNTpeQQEHSmbiniC23X9uG89bW9cHeIdXM9nP6eXcdBbiA1XYhfbKsWlqqRKUktpWWW1YLkRuaZ1Uqs8q3iNCZ7BpKPUpqImU8yxr6W50GWGcz57DydN29gzINQIyzQuj7ZOuHFJkK4dMwxuPpUTSo0g7m91seloNQpfWxO4HeluXeGsVAQrRTBx9tj_15Fi-y_a-99FcI1CKV3yXNK8MobnttI40xsqteVZqlmmIjhz1uw72GJ3eEgclIyA9BYuML5c0aRsbPuwLaRTY8Y0jkbwoTP4fhC930QgDlzhYJSHv6BHewrv4MEf_7vlJ3jZP92myWc42d0_2C-YHu30EINiJfAs59-G8Ox8cjGZ43Uyu1r-GPpHDnhecDn0YfMEIyMWvw
link.rule.ids 230,315,733,786,790,870,891,2115,2236,24346,27955,27956,31753,33407,33778,53825,53827
linkProvider National Library of Medicine
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Nb9QwEB2VIigXCgu04dMHrtl1EieOj7C0WqBb9dCiPRHFE4euaJNVkz2UX4_t2Cu24gLKJcqHbGfGmXHm5T2A9yriGCs9v1EkGLK4ZmGZxVUokUuGiFUtLUD2NJtdsC-LdLEDqf8XxoL2US7HzdX1uFleWmzl6honHic2OZtPE52mCMom9-C-nq9x6hfpwws4SXMry2UUcUKesIX7Yy7h0cQZaLwyTUSWMCyyfMAGpZCzreBkOfwN5-lV2_0t_7wLo_wjLh3vwzc_ogGO8nO87uUYf90he_znIT-Bxy5TJR-G009hRzUjeDBoV96OYN_rQRD3ehjBw7kr1D-D7xaKQI42IjvkbCBwJcN6tyN9Sz4tOzQIUrNjqkRkaqRCHDiPlE1FLHfusr7d3D1tL02J4zlcHB-dT2ehk3IIkQneh6qKdRQsqciporSs0ponGfJM0jqqRZZWrNa5I-p8ppRYUUxFYojd4ozLMtXhJHkBu03bqEMgFLlKpI68qo4YmvxK1oZ0Rq9rhdFgDSD0NixWA2NHYct2XK90hkdXGPMXzvwBfDSG3lxr-LbtgfbmR-FMUAghS5blNKsQWaYqqYMI0lwqlsYySUUAh8ZNfAOdbk5vue5UHgDxrlPoqWvqMWWj2nVX5EboWWeINICDwZM2nfAOGQDf8rGtXm6f0Z5j2cGdp7z87zvfwd7sfH5SnHw-_foKHvmP6DR6Dbv9zVq90VlYL9_aOfcb1WoyBQ
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Pb5swFH7aOq3apd2yH2U_fdgVMNhgOG5po-5HqxxWKdphCD9MG62FqJBD-9fPNiZqqp2qXKIAsh1_5j3zPr4P4LOKBMZKr2_MGfo8rrlfpnHlSxSSI2JVS0uQPU2Pz_j3RbK4Y_VlSfsol0FzeRU0ywvLrVxdYTjyxML5yZTpNCWnPFxVdfgYnug1G4txoz7chFmSWWsu44rjC8YX7q05JqLQTVKwMs1EVjQssprAhqmQ8a0AZXX8je7pZdv9Lwe9T6W8E5tm-_B7HNVASfkbrHsZ4O09wccHDfs57LmMlXwZTnkBj1QzgaeDh-XNBPZHXwjibhMT2D1xBfuX8MdSEsjRxmyHzAchVzLsezvSt-Rw2aFhkpovplpEpsYyxJH0SNlUxGroLuubzdXT9sKUOl7B2ezo1_TYd5YOPvJc9L6qYh0NS5pnVFFaVkktWIoilbSO6jxNKl7rHBJ1XlNKrCgmOTMCb3EqZJnosMJew07TNuoACEWhmNQRWNURR5NnydqIz-j9bW68WD3wx3ksVoNyR2HLd0LveIa_rjAQKBwEPPhqJntzrtHdtj-01-eFm4Yiz2XJ04ymFSJPVSV1MEGaScWTWLIk9-DAQGVsoNPN6U-mO5V5QEb4FHoJm7pM2ah23RWZMXzWmSL14M2Apk0nRlB6ILZwttXL7SMaPVYl3KHl7YOv_AS788NZ8fPb6Y938Gx8lk6j97DTX6_VB52M9fKjXXb_AF7gNIU
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Using+Electronic+Patient+Records+to+Discover+Disease+Correlations+and+Stratify+Patient+Cohorts&rft.jtitle=PLoS+computational+biology&rft.au=Roque%2C+Francisco+S.&rft.au=Jensen%2C+Peter+B.&rft.au=Schmock%2C+Henriette&rft.au=Dalgaard%2C+Marlene&rft.date=2011-08-01&rft.issn=1553-7358&rft.eissn=1553-7358&rft.volume=7&rft.issue=8&rft.spage=e1002141&rft_id=info:doi/10.1371%2Fjournal.pcbi.1002141&rft.externalDBID=n%2Fa&rft.externalDocID=10_1371_journal_pcbi_1002141
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1553-7358&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1553-7358&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1553-7358&client=summon