SNPPhenA: a corpus for extracting ranked associations of single-nucleotide polymorphisms and phenotypes from literature

Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes. Recently, some corpora and methods have been developed with the purpose of extracting mutations and diseases from texts. However, there is no available corpus,...

Full description

Saved in:
Bibliographic Details
Published inJournal of biomedical semantics Vol. 8; no. 1; pp. 14 - 13
Main Authors Bokharaeian, Behrouz, Diaz, Alberto, Taghizadeh, Nasrin, Chitsaz, Hamidreza, Chavoshinejad, Ramyar
Format Journal Article
LanguageEnglish
Published England BioMed Central 07.04.2017
BMC
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes. Recently, some corpora and methods have been developed with the purpose of extracting mutations and diseases from texts. However, there is no available corpus, for extracting associations from texts, that is annotated with linguistic-based negation, modality markers, neutral candidates, and confidence level of associations. In this research, different steps were presented so as to produce the SNPPhenA corpus. They include automatic Named Entity Recognition (NER) followed by the manual annotation of SNP and phenotype names, annotation of the SNP-phenotype associations and their level of confidence, as well as modality markers. Moreover, the produced corpus was annotated with negation scopes and cues as well as neutral candidates that play crucial role as far as negation and the modality phenomenon in relation to extraction tasks. The agreement between annotators was measured by Cohen's Kappa coefficient where the resulting scores indicated the reliability of the corpus. The Kappa score was 0.79 for annotating the associations and 0.80 for the confidence degree of associations. Further presented were the basic statistics of the annotated features of the corpus in addition to the results of our first experiments related to the extraction of ranked SNP-Phenotype associations. The prepared guideline documents render the corpus more convenient and facile to use. The corpus, guidelines and inter-annotator agreement analysis are available on the website of the corpus: http://nil.fdi.ucm.es/?q=node/639 . Specifying the confidence degree of SNP-phenotype associations from articles helps identify the strength of associations that could in turn assist genomics scientists in determining phenotypic plasticity and the importance of environmental factors. What is more, our first experiments with the corpus show that linguistic-based confidence alongside other non-linguistic features can be utilized in order to estimate the strength of the observed SNP-phenotype associations. Not Applicable.
AbstractList Background Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes. Recently, some corpora and methods have been developed with the purpose of extracting mutations and diseases from texts. However, there is no available corpus, for extracting associations from texts, that is annotated with linguistic-based negation, modality markers, neutral candidates, and confidence level of associations. Method In this research, different steps were presented so as to produce the SNPPhenA corpus. They include automatic Named Entity Recognition (NER) followed by the manual annotation of SNP and phenotype names, annotation of the SNP-phenotype associations and their level of confidence, as well as modality markers. Moreover, the produced corpus was annotated with negation scopes and cues as well as neutral candidates that play crucial role as far as negation and the modality phenomenon in relation to extraction tasks. Result The agreement between annotators was measured by Cohen’s Kappa coefficient where the resulting scores indicated the reliability of the corpus. The Kappa score was 0.79 for annotating the associations and 0.80 for the confidence degree of associations. Further presented were the basic statistics of the annotated features of the corpus in addition to the results of our first experiments related to the extraction of ranked SNP-Phenotype associations. The prepared guideline documents render the corpus more convenient and facile to use. The corpus, guidelines and inter-annotator agreement analysis are available on the website of the corpus: http://nil.fdi.ucm.es/?q=node/639. Conclusion Specifying the confidence degree of SNP-phenotype associations from articles helps identify the strength of associations that could in turn assist genomics scientists in determining phenotypic plasticity and the importance of environmental factors. What is more, our first experiments with the corpus show that linguistic-based confidence alongside other non-linguistic features can be utilized in order to estimate the strength of the observed SNP-phenotype associations.
Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes. Recently, some corpora and methods have been developed with the purpose of extracting mutations and diseases from texts. However, there is no available corpus, for extracting associations from texts, that is annotated with linguistic-based negation, modality markers, neutral candidates, and confidence level of associations. In this research, different steps were presented so as to produce the SNPPhenA corpus. They include automatic Named Entity Recognition (NER) followed by the manual annotation of SNP and phenotype names, annotation of the SNP-phenotype associations and their level of confidence, as well as modality markers. Moreover, the produced corpus was annotated with negation scopes and cues as well as neutral candidates that play crucial role as far as negation and the modality phenomenon in relation to extraction tasks. The agreement between annotators was measured by Cohen's Kappa coefficient where the resulting scores indicated the reliability of the corpus. The Kappa score was 0.79 for annotating the associations and 0.80 for the confidence degree of associations. Further presented were the basic statistics of the annotated features of the corpus in addition to the results of our first experiments related to the extraction of ranked SNP-Phenotype associations. The prepared guideline documents render the corpus more convenient and facile to use. The corpus, guidelines and inter-annotator agreement analysis are available on the website of the corpus: http://nil.fdi.ucm.es/?q=node/639 . Specifying the confidence degree of SNP-phenotype associations from articles helps identify the strength of associations that could in turn assist genomics scientists in determining phenotypic plasticity and the importance of environmental factors. What is more, our first experiments with the corpus show that linguistic-based confidence alongside other non-linguistic features can be utilized in order to estimate the strength of the observed SNP-phenotype associations. Not Applicable.
Abstract Background Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes. Recently, some corpora and methods have been developed with the purpose of extracting mutations and diseases from texts. However, there is no available corpus, for extracting associations from texts, that is annotated with linguistic-based negation, modality markers, neutral candidates, and confidence level of associations. Method In this research, different steps were presented so as to produce the SNPPhenA corpus. They include automatic Named Entity Recognition (NER) followed by the manual annotation of SNP and phenotype names, annotation of the SNP-phenotype associations and their level of confidence, as well as modality markers. Moreover, the produced corpus was annotated with negation scopes and cues as well as neutral candidates that play crucial role as far as negation and the modality phenomenon in relation to extraction tasks. Result The agreement between annotators was measured by Cohen’s Kappa coefficient where the resulting scores indicated the reliability of the corpus. The Kappa score was 0.79 for annotating the associations and 0.80 for the confidence degree of associations. Further presented were the basic statistics of the annotated features of the corpus in addition to the results of our first experiments related to the extraction of ranked SNP-Phenotype associations. The prepared guideline documents render the corpus more convenient and facile to use. The corpus, guidelines and inter-annotator agreement analysis are available on the website of the corpus: http://nil.fdi.ucm.es/?q=node/639 . Conclusion Specifying the confidence degree of SNP-phenotype associations from articles helps identify the strength of associations that could in turn assist genomics scientists in determining phenotypic plasticity and the importance of environmental factors. What is more, our first experiments with the corpus show that linguistic-based confidence alongside other non-linguistic features can be utilized in order to estimate the strength of the observed SNP-phenotype associations. Trial Registration: Not Applicable
Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes. Recently, some corpora and methods have been developed with the purpose of extracting mutations and diseases from texts. However, there is no available corpus, for extracting associations from texts, that is annotated with linguistic-based negation, modality markers, neutral candidates, and confidence level of associations.BACKGROUNDSingle Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes. Recently, some corpora and methods have been developed with the purpose of extracting mutations and diseases from texts. However, there is no available corpus, for extracting associations from texts, that is annotated with linguistic-based negation, modality markers, neutral candidates, and confidence level of associations.In this research, different steps were presented so as to produce the SNPPhenA corpus. They include automatic Named Entity Recognition (NER) followed by the manual annotation of SNP and phenotype names, annotation of the SNP-phenotype associations and their level of confidence, as well as modality markers. Moreover, the produced corpus was annotated with negation scopes and cues as well as neutral candidates that play crucial role as far as negation and the modality phenomenon in relation to extraction tasks.METHODIn this research, different steps were presented so as to produce the SNPPhenA corpus. They include automatic Named Entity Recognition (NER) followed by the manual annotation of SNP and phenotype names, annotation of the SNP-phenotype associations and their level of confidence, as well as modality markers. Moreover, the produced corpus was annotated with negation scopes and cues as well as neutral candidates that play crucial role as far as negation and the modality phenomenon in relation to extraction tasks.The agreement between annotators was measured by Cohen's Kappa coefficient where the resulting scores indicated the reliability of the corpus. The Kappa score was 0.79 for annotating the associations and 0.80 for the confidence degree of associations. Further presented were the basic statistics of the annotated features of the corpus in addition to the results of our first experiments related to the extraction of ranked SNP-Phenotype associations. The prepared guideline documents render the corpus more convenient and facile to use. The corpus, guidelines and inter-annotator agreement analysis are available on the website of the corpus: http://nil.fdi.ucm.es/?q=node/639 .RESULTThe agreement between annotators was measured by Cohen's Kappa coefficient where the resulting scores indicated the reliability of the corpus. The Kappa score was 0.79 for annotating the associations and 0.80 for the confidence degree of associations. Further presented were the basic statistics of the annotated features of the corpus in addition to the results of our first experiments related to the extraction of ranked SNP-Phenotype associations. The prepared guideline documents render the corpus more convenient and facile to use. The corpus, guidelines and inter-annotator agreement analysis are available on the website of the corpus: http://nil.fdi.ucm.es/?q=node/639 .Specifying the confidence degree of SNP-phenotype associations from articles helps identify the strength of associations that could in turn assist genomics scientists in determining phenotypic plasticity and the importance of environmental factors. What is more, our first experiments with the corpus show that linguistic-based confidence alongside other non-linguistic features can be utilized in order to estimate the strength of the observed SNP-phenotype associations.CONCLUSIONSpecifying the confidence degree of SNP-phenotype associations from articles helps identify the strength of associations that could in turn assist genomics scientists in determining phenotypic plasticity and the importance of environmental factors. What is more, our first experiments with the corpus show that linguistic-based confidence alongside other non-linguistic features can be utilized in order to estimate the strength of the observed SNP-phenotype associations.Not Applicable.TRIAL REGISTRATIONNot Applicable.
ArticleNumber 14
Author Chitsaz, Hamidreza
Diaz, Alberto
Chavoshinejad, Ramyar
Taghizadeh, Nasrin
Bokharaeian, Behrouz
Author_xml – sequence: 1
  givenname: Behrouz
  surname: Bokharaeian
  fullname: Bokharaeian, Behrouz
– sequence: 2
  givenname: Alberto
  surname: Diaz
  fullname: Diaz, Alberto
– sequence: 3
  givenname: Nasrin
  surname: Taghizadeh
  fullname: Taghizadeh, Nasrin
– sequence: 4
  givenname: Hamidreza
  surname: Chitsaz
  fullname: Chitsaz, Hamidreza
– sequence: 5
  givenname: Ramyar
  surname: Chavoshinejad
  fullname: Chavoshinejad, Ramyar
BackLink https://www.ncbi.nlm.nih.gov/pubmed/28388928$$D View this record in MEDLINE/PubMed
BookMark eNp1kl1rFDEUhgep2Fr7A7yRgDfejOYkk5mMF0IpVgtFC-p1yORjN-tMMiYZ7f57s91W2oKBkJDznifn63l14IM3VfUS8FsA3r5LQClpawxd2dDW5El1RHADNTQcH9y7H1YnKW1wWZQC5vRZdUg45bwn_Kj68-3L1dXa-NP3SCIV4rwkZENE5jpHqbLzKxSl_2k0kikF5WR2wScULErFNpraL2o0ITtt0BzG7VQQa5emhKTXaC7kkLezKdAYJjS6bKLMSzQvqqdWjsmc3J7H1Y_zj9_PPteXXz9dnJ1e1qrpaa6Z1oOVLUhslQJCqGRaMTPYQRrV6I5BDyUp1mNKpBxMr4Fo21kNQKAvtuPqYs_VQW7EHN0k41YE6cTNQ4grIWN2JQeBsZQwKCY7Zhqr28HC0AB0LdCBlN8K68OeNS_DZLQyvtRofAB9aPFuLVbht2CU075hBfDmFhDDr8WkLCaXlBlH6U1YkgDOWc9Iz3bS14-km7BEX0pVVIXFu67ZRfTqfkT_QrnrbxF0e4GKIaVorFAu3_SwBOhGAVjshknsh0mUYRK7YRKkeMIjzzv4_33-AimdzsA
CitedBy_id crossref_primary_10_1186_s13326_021_00248_y
crossref_primary_10_1016_j_procs_2018_10_475
crossref_primary_10_1016_j_procs_2024_09_637
crossref_primary_10_1038_s41597_019_0342_9
crossref_primary_10_1093_database_bay020
crossref_primary_10_1186_s12859_021_04421_z
crossref_primary_10_1186_s12859_023_05236_w
crossref_primary_10_1016_j_jtbi_2019_110112
crossref_primary_10_1145_3448251
crossref_primary_10_1186_s13326_017_0163_8
Cites_doi 10.1016/j.febslet.2008.02.073
10.1186/gb-2008-9-s2-s2
10.1093/nar/28.1.352
10.1038/nature09298
10.1093/nar/gki470
10.1186/1471-2164-13-S4-S10
10.1186/s12911-016-0276-5
10.1093/bioinformatics/btg449
10.1098/rspb.2003.2372
10.1075/li.30.1.03nad
10.1093/nar/30.1.163
10.1075/tsl.32.22byb
10.1093/bib/6.4.357
10.1371/journal.pone.0152725
10.1093/bioinformatics/btw234
10.1086/383092
10.1186/2041-1480-5-11
10.1371/journal.pcbi.1000837
10.1177/0741088396013002004
10.1093/bioinformatics/btm235
10.1038/clpt.2012.96
10.1093/acref/9780198714378.001.0001
10.1371/journal.pone.0163480
10.1186/s12864-015-1497-1
10.1093/nar/gkr798
10.1186/2041-1480-3-S3-S2
10.1093/bioinformatics/btt156
10.1093/nar/gkj151
10.1016/j.sbspro.2015.07.200
10.1093/bioinformatics/btq667
10.1093/database/baw043
10.1038/ng0208-124
10.1093/nar/gkn580
10.1038/70570
10.1093/database/bat019
10.1002/gepi.20377
10.1145/1656274.1656278
ContentType Journal Article
Copyright Copyright BioMed Central 2017
The Author(s). 2017
Copyright_xml – notice: Copyright BioMed Central 2017
– notice: The Author(s). 2017
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
3V.
7X7
7XB
88E
8FE
8FG
8FH
8FI
8FJ
8FK
ABJCF
ABUWG
AFKRA
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
CCPQU
DWQXO
FYUFA
GHDGH
GNUQQ
HCIFZ
K9.
L6V
LK8
M0S
M1P
M7P
M7S
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PTHSS
7X8
5PM
DOA
DOI 10.1186/s13326-017-0116-2
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
ProQuest Central (Corporate)
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Collection
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
Materials Science & Engineering Collection
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
Technology Collection
Natural Science Collection
ProQuest One
ProQuest Central Korea
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection
ProQuest Health & Medical Complete (Alumni)
ProQuest Engineering Collection
Biological Sciences
ProQuest Health & Medical Collection
Medical Database
Biological Science Database
Engineering Database
ProQuest Central Premium
ProQuest One Academic (New)
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
Engineering Collection
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
ProQuest Central Student
Technology Collection
ProQuest One Academic Middle East (New)
ProQuest Central Essentials
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Central China
ProQuest Central
ProQuest One Applied & Life Sciences
ProQuest Health & Medical Research Collection
ProQuest Engineering Collection
Health Research Premium Collection
Health and Medicine Complete (Alumni Edition)
Natural Science Collection
ProQuest Central Korea
Health & Medical Research Collection
Biological Science Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
Engineering Collection
Engineering Database
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest SciTech Collection
ProQuest Hospital Collection (Alumni)
ProQuest Health & Medical Complete
ProQuest Medical Library
ProQuest One Academic UKI Edition
Materials Science & Engineering Collection
ProQuest One Academic
ProQuest One Academic (New)
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList Publicly Available Content Database
MEDLINE

MEDLINE - Academic
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 4
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Languages & Literatures
EISSN 2041-1480
EndPage 13
ExternalDocumentID oai_doaj_org_article_00aa1bc5a75e4fd6bf1b4117613b2ec4
PMC5383945
28388928
10_1186_s13326_017_0116_2
Genre Journal Article
GroupedDBID 0R~
53G
5VS
7X7
88E
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AASML
AAYXX
ABDBF
ABJCF
ABUWG
ACGFO
ACGFS
ACIWK
ACPRK
ACUHS
ADBBV
ADRAZ
ADUKV
AEGXH
AENEX
AFKRA
AFPKN
AHBYD
AHSBF
AHYZX
AIAGR
ALIPV
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CITATION
DIK
E3Z
EBD
EBLON
EBS
EJD
ESX
F5P
FYUFA
GROUPED_DOAJ
GX1
H13
HCIFZ
HMCUK
HYE
IAO
IEA
IHR
INH
INR
ITC
KQ8
L6V
LK8
M1P
M48
M7P
M7S
ML~
M~E
O5R
O5S
OK1
PGMZT
PHGZM
PHGZT
PIMPY
PQQKQ
PROAC
PSQYO
PTHSS
RBZ
RNS
ROL
RPM
RSV
SMT
SOJ
TUS
UKHRP
-A0
3V.
ACRMQ
ADINQ
C24
CGR
CUY
CVF
ECM
EIF
NPM
7XB
8FK
AZQEC
DWQXO
GNUQQ
K9.
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQUKI
PRINS
7X8
PUEGO
5PM
ID FETCH-LOGICAL-c493t-5ddbfa61a0fcc1223a5dc5ebfbaec4d7519131059032aabe9d12df7fd11219913
IEDL.DBID M48
ISSN 2041-1480
IngestDate Wed Aug 27 01:27:16 EDT 2025
Thu Aug 21 18:15:58 EDT 2025
Sun Aug 24 03:05:10 EDT 2025
Fri Jul 25 11:59:43 EDT 2025
Thu Jan 02 22:23:01 EST 2025
Thu Apr 24 22:59:41 EDT 2025
Tue Jul 01 03:54:47 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Relation extraction
Degree of confidence
Phenotype
Negation
SNP
Modality
Language English
License Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c493t-5ddbfa61a0fcc1223a5dc5ebfbaec4d7519131059032aabe9d12df7fd11219913
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
OpenAccessLink https://www.proquest.com/docview/1894587744?pq-origsite=%requestingapplication%
PMID 28388928
PQID 1894587744
PQPubID 2040220
PageCount 13
ParticipantIDs doaj_primary_oai_doaj_org_article_00aa1bc5a75e4fd6bf1b4117613b2ec4
pubmedcentral_primary_oai_pubmedcentral_nih_gov_5383945
proquest_miscellaneous_1885952955
proquest_journals_1894587744
pubmed_primary_28388928
crossref_citationtrail_10_1186_s13326_017_0116_2
crossref_primary_10_1186_s13326_017_0116_2
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2017-04-07
PublicationDateYYYYMMDD 2017-04-07
PublicationDate_xml – month: 04
  year: 2017
  text: 2017-04-07
  day: 07
PublicationDecade 2010
PublicationPlace England
PublicationPlace_xml – name: England
– name: London
PublicationTitle Journal of biomedical semantics
PublicationTitleAlternate J Biomed Semantics
PublicationYear 2017
Publisher BioMed Central
BMC
Publisher_xml – name: BioMed Central
– name: BMC
References L Smith (116_CR13) 2008; 9
TD Price (116_CR42) 2003; 270
others, I. H (116_CR2) 2010; 467
EM Smigielski (116_CR34) 2000; 28
S Wooding (116_CR41) 2004; 74
JG Caporaso (116_CR15) 2007; 23
C-H Wei (116_CR16) 2013; 29
V Vincze (116_CR27) 2008; 9
116_CR46
116_CR45
116_CR44
M Hewett (116_CR39) 2002; 30
M Whirl-Carrillo (116_CR7) 2012; 92
D Nadeau (116_CR31) 2007; 30
EE Loos (116_CR10) 2004
116_CR3
D Lin (116_CR9) 2009; 33
K Ravikumar (116_CR21) 2012; 3
AA Mahmood (116_CR6) 2016; 11
116_CR4
LC Kim (116_CR24) 2015; 197
M Hall (116_CR50) 2009; 11
116_CR33
E Doughty (116_CR17) 2011; 27
W Yu (116_CR36) 2008; 40
KM Verspoor (116_CR5) 2016; 16
B Bokharaeian (116_CR29) 2013; 51
C Giuliano (116_CR47) 2006; 18
U Leser (116_CR12) 2005; 6
B Bokharaeian (116_CR40) 2016; 11
116_CR19
M Seringhaus (116_CR8) 2008; 582
AP Davis (116_CR32) 2009; 37
GT Marth (116_CR1) 1999; 23
BR Packer (116_CR37) 2006; 34
116_CR25
116_CR22
116_CR28
116_CR26
T Joachims (116_CR49) 1999
P Thomas (116_CR14) 2016; 32
M Cariaso (116_CR38) 2012; 40
M Ballesteros (116_CR43) 2012
116_CR20
A Doms (116_CR30) 2005; 33
A Klein (116_CR23) 2014; 5
116_CR11
E Nicolazzi (116_CR35) 2015; 16
116_CR18
D Tikk (116_CR48) 2010; 6
18782832 - Nucleic Acids Res. 2009 Jan;37(Database issue):D786-92
23046792 - J Biomed Semantics. 2012 Oct 5;3 Suppl 3:S2
22759648 - BMC Genomics. 2012 Jun 18;13 Suppl 4:S10
23564842 - Bioinformatics. 2013 Jun 1;29(11):1433-9
19025695 - BMC Bioinformatics. 2008 Nov 19;9 Suppl 11:S9
16420734 - Brief Bioinform. 2005 Dec;6(4):357-69
23584833 - Database (Oxford). 2013 Apr 12;2013:bat019
10592272 - Nucleic Acids Res. 2000 Jan 1;28(1):352-5
14997422 - Am J Hum Genet. 2004 Apr;74(4):637-46
20811451 - Nature. 2010 Sep 2;467(7311):52-8
14990452 - Bioinformatics. 2004 Mar 1;20(4):557-68
27695078 - PLoS One. 2016 Oct 3;11(10 ):e0163480
22992668 - Clin Pharmacol Ther. 2012 Oct;92(4):414-7
27074804 - Database (Oxford). 2016 Apr 13;2016
18834493 - Genome Biol. 2008;9 Suppl 2:S2
22140107 - Nucleic Acids Res. 2012 Jan;40(Database issue):D1308-12
21138947 - Bioinformatics. 2011 Feb 1;27(3):408-15
25881165 - BMC Genomics. 2015 Apr 10;16:283
10581034 - Nat Genet. 1999 Dec;23(4):452-6
16381944 - Nucleic Acids Res. 2006 Jan 1;34(Database issue):D617-21
11752281 - Nucleic Acids Res. 2002 Jan 1;30(1):163-5
27454860 - BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1:68
17495998 - Bioinformatics. 2007 Jul 15;23(14):1862-5
18227866 - Nat Genet. 2008 Feb;40(2):124-5
12965006 - Proc Biol Sci. 2003 Jul 22;270(1523):1433-40
27073839 - PLoS One. 2016 Apr 13;11(4):e0152725
27256315 - Bioinformatics. 2016 Sep 15;32(18):2883-5
15980585 - Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W783-6
19051285 - Genet Epidemiol. 2009 Apr;33(3):256-65
18328823 - FEBS Lett. 2008 Apr 9;582(8):1170
24568600 - J Biomed Semantics. 2014 Feb 25;5(1):11
20617200 - PLoS Comput Biol. 2010 Jul 01;6:e1000837
References_xml – volume: 582
  start-page: 1170
  issue: 8
  year: 2008
  ident: 116_CR8
  publication-title: FEBS Lett
  doi: 10.1016/j.febslet.2008.02.073
– volume: 9
  start-page: 1
  issue: Suppl 2
  year: 2008
  ident: 116_CR13
  publication-title: Genome Biol
  doi: 10.1186/gb-2008-9-s2-s2
– volume: 28
  start-page: 352
  issue: 1
  year: 2000
  ident: 116_CR34
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/28.1.352
– volume: 467
  start-page: 52
  year: 2010
  ident: 116_CR2
  publication-title: Nature
  doi: 10.1038/nature09298
– volume: 33
  start-page: W783
  issue: suppl 2
  year: 2005
  ident: 116_CR30
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gki470
– ident: 116_CR22
  doi: 10.1186/1471-2164-13-S4-S10
– volume: 16
  start-page: 37
  issue: 1
  year: 2016
  ident: 116_CR5
  publication-title: BMC Med Inform Decis Mak
  doi: 10.1186/s12911-016-0276-5
– ident: 116_CR4
– ident: 116_CR20
  doi: 10.1093/bioinformatics/btg449
– ident: 116_CR26
– volume: 270
  start-page: 1433
  issue: 1523
  year: 2003
  ident: 116_CR42
  publication-title: Proc Biol Sci
  doi: 10.1098/rspb.2003.2372
– volume: 30
  start-page: 3
  issue: 1
  year: 2007
  ident: 116_CR31
  publication-title: Lingvisticae Investigationes
  doi: 10.1075/li.30.1.03nad
– volume: 30
  start-page: 163
  issue: 1
  year: 2002
  ident: 116_CR39
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/30.1.163
– ident: 116_CR33
– ident: 116_CR11
  doi: 10.1075/tsl.32.22byb
– volume: 6
  start-page: 357
  issue: 4
  year: 2005
  ident: 116_CR12
  publication-title: Brief Bioinform
  doi: 10.1093/bib/6.4.357
– ident: 116_CR44
– volume: 11
  start-page: e0152725
  issue: 4
  year: 2016
  ident: 116_CR6
  publication-title: PLoS ONE
  doi: 10.1371/journal.pone.0152725
– volume: 32
  start-page: 2883
  issue: 18
  year: 2016
  ident: 116_CR14
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btw234
– volume: 9
  start-page: 1
  issue: 11
  year: 2008
  ident: 116_CR27
  publication-title: BMC Bioinformatics
– volume: 51
  start-page: 49
  year: 2013
  ident: 116_CR29
  publication-title: Procesamiento del Lenguaje Natural
– volume: 74
  start-page: 637
  issue: 4
  year: 2004
  ident: 116_CR41
  publication-title: Am J Hum Genet
  doi: 10.1086/383092
– volume: 5
  start-page: 11
  year: 2014
  ident: 116_CR23
  publication-title: J Biomed Semantics
  doi: 10.1186/2041-1480-5-11
– volume: 6
  start-page: e1000837
  issue: 7
  year: 2010
  ident: 116_CR48
  publication-title: PLoS Comput Biol
  doi: 10.1371/journal.pcbi.1000837
– ident: 116_CR45
  doi: 10.1177/0741088396013002004
– volume: 23
  start-page: 1862
  issue: 14
  year: 2007
  ident: 116_CR15
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btm235
– start-page: 363
  volume-title: Inferring the Scope of Negation in Biomedical Documents. 13th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2012)
  year: 2012
  ident: 116_CR43
– volume: 92
  start-page: 414
  year: 2012
  ident: 116_CR7
  publication-title: Clin Pharmacol Ther
  doi: 10.1038/clpt.2012.96
– ident: 116_CR3
  doi: 10.1093/acref/9780198714378.001.0001
– ident: 116_CR28
– volume: 11
  start-page: e0163480
  issue: 10
  year: 2016
  ident: 116_CR40
  publication-title: PLoS ONE
  doi: 10.1371/journal.pone.0163480
– volume: 16
  start-page: 283
  year: 2015
  ident: 116_CR35
  publication-title: BMC Genomics
  doi: 10.1186/s12864-015-1497-1
– volume: 40
  start-page: D1308
  issue: D1
  year: 2012
  ident: 116_CR38
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkr798
– volume: 3
  start-page: 1480
  year: 2012
  ident: 116_CR21
  publication-title: J Biomed Semantics
  doi: 10.1186/2041-1480-3-S3-S2
– volume: 29
  start-page: 1433
  year: 2013
  ident: 116_CR16
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btt156
– volume: 34
  start-page: D617
  issue: suppl 1
  year: 2006
  ident: 116_CR37
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkj151
– volume: 197
  start-page: 600
  year: 2015
  ident: 116_CR24
  publication-title: Procedia Soc Behavioral Sci
  doi: 10.1016/j.sbspro.2015.07.200
– volume: 27
  start-page: 408
  issue: 3
  year: 2011
  ident: 116_CR17
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq667
– ident: 116_CR18
  doi: 10.1093/database/baw043
– volume: 40
  start-page: 124
  issue: 2
  year: 2008
  ident: 116_CR36
  publication-title: Nat Genet
  doi: 10.1038/ng0208-124
– start-page: 169
  volume-title: Advances in kernel methods
  year: 1999
  ident: 116_CR49
– volume: 37
  start-page: D786
  issue: suppl 1
  year: 2009
  ident: 116_CR32
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkn580
– volume: 23
  start-page: 452
  issue: 4
  year: 1999
  ident: 116_CR1
  publication-title: Nat Genet
  doi: 10.1038/70570
– ident: 116_CR46
– ident: 116_CR25
– ident: 116_CR19
  doi: 10.1093/database/bat019
– volume: 33
  start-page: 256
  issue: 3
  year: 2009
  ident: 116_CR9
  publication-title: Genet Epidemiol
  doi: 10.1002/gepi.20377
– volume-title: Glossary of linguistic terms
  year: 2004
  ident: 116_CR10
– volume: 18
  start-page: 401
  year: 2006
  ident: 116_CR47
  publication-title: EACL
– volume: 11
  start-page: 10
  issue: 1
  year: 2009
  ident: 116_CR50
  publication-title: ACM SIGKDD Explorations Newsl
  doi: 10.1145/1656274.1656278
– reference: 23046792 - J Biomed Semantics. 2012 Oct 5;3 Suppl 3:S2
– reference: 12965006 - Proc Biol Sci. 2003 Jul 22;270(1523):1433-40
– reference: 20811451 - Nature. 2010 Sep 2;467(7311):52-8
– reference: 10592272 - Nucleic Acids Res. 2000 Jan 1;28(1):352-5
– reference: 15980585 - Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W783-6
– reference: 27695078 - PLoS One. 2016 Oct 3;11(10 ):e0163480
– reference: 21138947 - Bioinformatics. 2011 Feb 1;27(3):408-15
– reference: 27454860 - BMC Med Inform Decis Mak. 2016 Jul 18;16 Suppl 1:68
– reference: 18782832 - Nucleic Acids Res. 2009 Jan;37(Database issue):D786-92
– reference: 24568600 - J Biomed Semantics. 2014 Feb 25;5(1):11
– reference: 10581034 - Nat Genet. 1999 Dec;23(4):452-6
– reference: 27256315 - Bioinformatics. 2016 Sep 15;32(18):2883-5
– reference: 18227866 - Nat Genet. 2008 Feb;40(2):124-5
– reference: 18834493 - Genome Biol. 2008;9 Suppl 2:S2
– reference: 23564842 - Bioinformatics. 2013 Jun 1;29(11):1433-9
– reference: 17495998 - Bioinformatics. 2007 Jul 15;23(14):1862-5
– reference: 19025695 - BMC Bioinformatics. 2008 Nov 19;9 Suppl 11:S9
– reference: 19051285 - Genet Epidemiol. 2009 Apr;33(3):256-65
– reference: 23584833 - Database (Oxford). 2013 Apr 12;2013:bat019
– reference: 14997422 - Am J Hum Genet. 2004 Apr;74(4):637-46
– reference: 14990452 - Bioinformatics. 2004 Mar 1;20(4):557-68
– reference: 22140107 - Nucleic Acids Res. 2012 Jan;40(Database issue):D1308-12
– reference: 16420734 - Brief Bioinform. 2005 Dec;6(4):357-69
– reference: 27074804 - Database (Oxford). 2016 Apr 13;2016:
– reference: 16381944 - Nucleic Acids Res. 2006 Jan 1;34(Database issue):D617-21
– reference: 18328823 - FEBS Lett. 2008 Apr 9;582(8):1170
– reference: 25881165 - BMC Genomics. 2015 Apr 10;16:283
– reference: 11752281 - Nucleic Acids Res. 2002 Jan 1;30(1):163-5
– reference: 27073839 - PLoS One. 2016 Apr 13;11(4):e0152725
– reference: 22992668 - Clin Pharmacol Ther. 2012 Oct;92(4):414-7
– reference: 22759648 - BMC Genomics. 2012 Jun 18;13 Suppl 4:S10
– reference: 20617200 - PLoS Comput Biol. 2010 Jul 01;6:e1000837
SSID ssj0000331083
Score 2.1905124
Snippet Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes. Recently, some...
Background Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes....
Abstract Background Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes....
SourceID doaj
pubmedcentral
proquest
pubmed
crossref
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
StartPage 14
SubjectTerms Degree of confidence
Gene Ontology
Information Storage and Retrieval - methods
Modality
Mutation
Negation
Phenotype
Polymorphism
Polymorphism, Single Nucleotide
Relation extraction
Semantics
Single-nucleotide polymorphism
SNP
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Na9wwEBUlp1xKP1O3SVGh9FAwsSzJlntLQkMoaQi0gdyEZEndhV15iXcp_fedkR13t5T20qsly7JmpHkjjd4Q8tY4JQvrMOCJCXBQVMitFDZvuHI1mGMZ0t2qz1fVxY34dCtvt1J9YUzYQA88DNxxURjDbCtNLb0IrrKBWcEYeN_clr5NTKBg87acqbQGc4Atio_HmExVxz04YyU6zxhpyaq83DFEia__TyDz91jJLeNz_og8HFEjPRl6-5g88PEJObgc9xp7-o5eTvTI_VPy_cvV9fXMx5MP1FBwL1ebngI4pbAQp0tR8RvFXO3eUfNLOj3tAsWdg4XPI7Icd-u583TVLX4soYnZvF_21ERHMSqsw61baPSuW9LF9Oln5Ob849ezi3zMsJC3ouHrXDpng6mYKULbMkAKRrpWehusgaF1NcA7xhGBFbw0xvrGsdKFOjhAaRgzxZ-TvdhF_wJDpJjwXBpwrwAEhqrxvg4qKI8no4aFjBT3w63bkX4cs2AsdHJDVKUHCWmQkEYJ6TIj76dXVgP3xt8qn6IMp4pIm50egDLpUZn0v5QpI4f3GqDHudxrphohFcBkKH4zFcMsxKMVE323wTrIEweqLTNyMCjM1BMAcEo1pcpIvaNKO13dLYnzWWL6BmvE4eMv_8e_vSL7ZdJ-kRf1Idlb3238EQCqtX2d5s5Pc48gTA
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: Health & Medical Collection
  dbid: 7X7
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3fb9MwELZgvPCC-DkCAxkJ8YAULU7s2OEFDcQ0oTFNgkl9s-zYXiu1cWlaIf577lI3owjtNXZsJ3e2vzufvyPkrXFKFNZhwBPjYKCokFvBbd5UyknYjkUY7lZ9u6jPrvjXiZgkh1ufwip3a-KwULvYoo_8mKmGCwVghX9c_swxaxSerqYUGnfJPaQuw5AuOZGjj6WoALyoKh1mMlUf92CSlWhCY7wlq_NybzsaWPv_BzX_jZj8aws6fUgeJOxIT7bCfkTu-O4xOTxPHseevqPnI0ly_4T8-n5xeTn13ckHaigYmctNTwGiUliOh6tR3TXFjO3eUXMjo57GQNF_MPd5h1zHcT1zni7j_PcCmpjO-kVPTecoxoZFdOBCo6u4oPOx66fk6vTLj89necqzkLe8qda5cM4GUzNThLZlgBeMcK3wNljjW-4kgDxWIQ4rqtIY6xvHShdkcIDVMHKqekYOutj55xgoxbivhAEjC6BgqBvvZVBBeTwfNSxkpNj9bt0mEnLMhTHXgzGiar2VkAYJaZSQLjPyfnxluWXguK3yJ5ThWBHJs4cHcXWt01zURWEMs60wUngeXG0Ds5wxCcjGlvDFGTnaaYBOM7rXN_qXkTdjMcxFPGAxnY8brINscaDgIiOHW4UZRwIwTqmmVBmRe6q0N9T9km42Hfi-YU-qoPMXtw_rJblfDnrN80IekYP1auNfAWBa29fDrPgDLNkXTw
  priority: 102
  providerName: ProQuest
Title SNPPhenA: a corpus for extracting ranked associations of single-nucleotide polymorphisms and phenotypes from literature
URI https://www.ncbi.nlm.nih.gov/pubmed/28388928
https://www.proquest.com/docview/1894587744
https://www.proquest.com/docview/1885952955
https://pubmed.ncbi.nlm.nih.gov/PMC5383945
https://doaj.org/article/00aa1bc5a75e4fd6bf1b4117613b2ec4
Volume 8
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3db9MwELfG9sILYnyMbKMyEuIBKRAnduwgIdShlanaqopRqW-RHdtrpTYpTSvYf8_ZTQNFFeIlkWLHdnx3vt_ZlzuEXkstWKS0c3giFAwUYUPFqAqzRGgO6phZ_2_VzSC9GtH-mI0P0Da9VTOB9V7TzuWTGi1n735-v_8EAv_RC7xI39dgZ8XOLnZOlCQNYUU-AsXEnZzeNGjfL8wJYBkfmDOOKAnBENiec-5tZUdT-YD--1Do386Uf2in3mP0qIGVuLvhg2N0YMon6OS62Yys8Rt83cZPrp-iH7eD4XBiyu4HLDHYn4t1jQG9Ylip_V9T5R12ydyNxvI3-WpcWey2FmYmLF0Y5Go11QYvqtn9HJqYTOt5jWWpsXMbq9zeLjS6rOZ41nb9DI16l98-X4VNCoawoFmyCpnWysqUyMgWBQEoIZkumFFWSVNQzQH_kcRBtCiJpVQm0yTWllsNMM45VSXP0WFZleaF86Ei1CRMwrQDSrRpZgy3wgrjjk4lsQGKttOdF018cpcmY5Z7O0Wk-YZCOVAodxTK4wC9bV9ZbIJz_KvyhaNhW9HF1fYPquVd3ohpHkVSElUwyZmhVqfKEkUJ4QB6VAxfHKDzLQfkW17NicgoE4CjofhVWwxi6s5eZGmqtavjAskB77MAnWwYph0JIDwhslgEiO-w0s5Qd0vK6cSHAgd1lUDnp__R7xl6GHvmpmHEz9Hhark2LwFQrVQHPeBjDlfR-9JBR91u_7YP94vLwfBrx29SdLwg_QI6nCPt
linkProvider Scholars Portal
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3db9MwELfGeIAXxOcIDDAS8IAULU7sxEFCaHxMHeuqSWxS34wd22ulNilNq2n_FH8jd2maUYT2tlfbcZzcne935_MdIW-0lSIyFgOeGAcDRfrQCG7CPJE2A3UsfHO36niQ9s7496EYbpHf67swGFa53hObjdpWBfrI95jMuZAAVvin2a8Qq0bh6eq6hMaKLY7c5QWYbPXHw69A37dxfPDt9EsvbKsKhAXPk0UorDVep0xHvigYaEctbCGc8Ua7gtsMIA1LEHVESay1cbllsfWZt4BMME4ogXlvkdugeCOUqGyYdT6dKIFGmbSHp0ymezWYgDGa7BjfydIw3lB_TZWA_0HbfyM0_1J5B_fJvRar0v0Vcz0gW658SHb6rYezpu9ov0vKXD8iFz8GJycjV-5_oJqCUTtb1hQgMYXtv7mKVZ5TrBDvLNVXPFHTylP0V0xcWGJu5Woxto7OqsnlFKYYjetpTXVpKcaiVegwhknn1ZROulc_Jmc3QoEnZLusSvcUA7MYd4nQYNQB9PRp7lzmpZcOz2M18wGJ1r9bFW3Sc6y9MVGN8SNTtaKQAgoppJCKA_K-e2S2yvhx3eDPSMNuICbrbhqq-blqZV9FkdbMFEJnwnFvU-OZ4YxlgKRMDF8ckN01B6h2B6nVFb8H5HXXDbKPBzq6dNUSx2B2OhAoEZCdFcN0KwHYKGUey4BkG6y0sdTNnnI8avKLgw5M4OXPrl_WK3Knd3rcV_3DwdFzcjdueJyHUbZLthfzpXsBYG1hXjYSQsnPmxbJP37jVYg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=SNPPhenA%3A+a+corpus+for+extracting+ranked+associations+of+single-nucleotide+polymorphisms+and+phenotypes+from+literature&rft.jtitle=Journal+of+biomedical+semantics&rft.au=Bokharaeian%2C+Behrouz&rft.au=Diaz%2C+Alberto&rft.au=Taghizadeh%2C+Nasrin&rft.au=Chitsaz%2C+Hamidreza&rft.date=2017-04-07&rft.issn=2041-1480&rft.eissn=2041-1480&rft.volume=8&rft.issue=1&rft.spage=14&rft_id=info:doi/10.1186%2Fs13326-017-0116-2&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2041-1480&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2041-1480&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2041-1480&client=summon