SNPPhenA: a corpus for extracting ranked associations of single-nucleotide polymorphisms and phenotypes from literature

Single Nucleotide Polymorphisms (SNPs) are among the most important types of genetic variations influencing common diseases and phenotypes. Recently, some corpora and methods have been developed with the purpose of extracting mutations and diseases from texts. However, there is no available corpus,...

Bibliographic Details
Published in	Journal of biomedical semantics Vol. 8; no. 1; article 14 (13 pages)
Main Authors	Bokharaeian, Behrouz, Diaz, Alberto, Taghizadeh, Nasrin, Chitsaz, Hamidreza, Chavoshinejad, Ramyar
Format	Journal Article
Language	English
Published	England BioMed Central 07.04.2017 BMC
Subjects	Degree of confidence; Gene Ontology; Information Storage and Retrieval - methods; Modality; Mutation; Negation; Phenotype; Polymorphism; Polymorphism, Single Nucleotide; Relation extraction; Semantics; Single-nucleotide polymorphism; SNP
Abstract	Background: Single nucleotide polymorphisms (SNPs) are among the most important types of genetic variation influencing common diseases and phenotypes. Several corpora and methods have recently been developed for extracting mutations and diseases from text. However, no available corpus for extracting associations from text is annotated with linguistic-based negation, modality markers, neutral candidates, and the confidence level of associations. Methods: This work presents the steps taken to produce the SNPPhenA corpus: automatic named entity recognition (NER) followed by manual annotation of SNP and phenotype names, annotation of SNP-phenotype associations and their level of confidence, and annotation of modality markers. The corpus was also annotated with negation scopes and cues, as well as neutral candidates, which play a crucial role in handling negation and modality in relation extraction tasks. Results: Agreement between annotators was measured with Cohen's Kappa coefficient, and the resulting scores indicate that the corpus is reliable: 0.79 for annotating the associations and 0.80 for the confidence degree of associations. The basic statistics of the annotated features of the corpus are also presented, together with the results of first experiments on extracting ranked SNP-phenotype associations. The accompanying guideline documents make the corpus easier to use. The corpus, guidelines, and inter-annotator agreement analysis are available on the corpus website: http://nil.fdi.ucm.es/?q=node/639. Conclusion: Specifying the confidence degree of SNP-phenotype associations reported in articles helps identify the strength of those associations, which could in turn assist genomics scientists in determining phenotypic plasticity and the importance of environmental factors. The first experiments with the corpus also show that linguistic-based confidence, alongside other non-linguistic features, can be used to estimate the strength of the observed SNP-phenotype associations. Trial registration: Not applicable.
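The abstract reports inter-annotator agreement as Cohen's Kappa. As an illustration only, the following is a minimal sketch of how that coefficient is computed for two annotators who label the same items. The function name `cohens_kappa` and the toy labels are hypothetical and are not taken from the SNPPhenA corpus or its guidelines.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: for each label, the product of the two annotators'
    # marginal frequencies, summed over all labels.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    all_labels = count_a.keys() | count_b.keys()
    p_e = sum(count_a[c] * count_b[c] for c in all_labels) / (n * n)

    if p_e == 1.0:
        # Both annotators used one and the same label for every item.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


# Hypothetical toy annotations (not real corpus data).
annotator_1 = ["positive", "positive", "negative", "neutral", "positive", "negative"]
annotator_2 = ["positive", "negative", "negative", "neutral", "positive", "negative"]
kappa = cohens_kappa(annotator_1, annotator_2)
print(f"kappa = {kappa:.2f}")
```

Kappa corrects raw agreement for the agreement expected by chance, which is why it is usually preferred over simple percent agreement when label frequencies are unbalanced.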
ArticleNumber	14
Author	Chitsaz, Hamidreza Diaz, Alberto Chavoshinejad, Ramyar Taghizadeh, Nasrin Bokharaeian, Behrouz
Author_xml	– sequence: 1 givenname: Behrouz surname: Bokharaeian fullname: Bokharaeian, Behrouz – sequence: 2 givenname: Alberto surname: Diaz fullname: Diaz, Alberto – sequence: 3 givenname: Nasrin surname: Taghizadeh fullname: Taghizadeh, Nasrin – sequence: 4 givenname: Hamidreza surname: Chitsaz fullname: Chitsaz, Hamidreza – sequence: 5 givenname: Ramyar surname: Chavoshinejad fullname: Chavoshinejad, Ramyar
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/28388928$$D View this record in MEDLINE/PubMed
CitedBy_id	crossref_primary_10_1186_s13326_021_00248_y crossref_primary_10_1016_j_procs_2018_10_475 crossref_primary_10_1016_j_procs_2024_09_637 crossref_primary_10_1038_s41597_019_0342_9 crossref_primary_10_1093_database_bay020 crossref_primary_10_1186_s12859_021_04421_z crossref_primary_10_1186_s12859_023_05236_w crossref_primary_10_1016_j_jtbi_2019_110112 crossref_primary_10_1145_3448251 crossref_primary_10_1186_s13326_017_0163_8
Cites_doi	10.1016/j.febslet.2008.02.073 10.1186/gb-2008-9-s2-s2 10.1093/nar/28.1.352 10.1038/nature09298 10.1093/nar/gki470 10.1186/1471-2164-13-S4-S10 10.1186/s12911-016-0276-5 10.1093/bioinformatics/btg449 10.1098/rspb.2003.2372 10.1075/li.30.1.03nad 10.1093/nar/30.1.163 10.1075/tsl.32.22byb 10.1093/bib/6.4.357 10.1371/journal.pone.0152725 10.1093/bioinformatics/btw234 10.1086/383092 10.1186/2041-1480-5-11 10.1371/journal.pcbi.1000837 10.1177/0741088396013002004 10.1093/bioinformatics/btm235 10.1038/clpt.2012.96 10.1093/acref/9780198714378.001.0001 10.1371/journal.pone.0163480 10.1186/s12864-015-1497-1 10.1093/nar/gkr798 10.1186/2041-1480-3-S3-S2 10.1093/bioinformatics/btt156 10.1093/nar/gkj151 10.1016/j.sbspro.2015.07.200 10.1093/bioinformatics/btq667 10.1093/database/baw043 10.1038/ng0208-124 10.1093/nar/gkn580 10.1038/70570 10.1093/database/bat019 10.1002/gepi.20377 10.1145/1656274.1656278
ContentType	Journal Article
Copyright	Copyright BioMed Central 2017 The Author(s). 2017
Copyright_xml	– notice: Copyright BioMed Central 2017 – notice: The Author(s). 2017
DOI	10.1186/s13326-017-0116-2
DeliveryMethod	fulltext_linktorsrc
Discipline	Languages & Literatures
EISSN	2041-1480
EndPage	13
ExternalDocumentID	oai_doaj_org_article_00aa1bc5a75e4fd6bf1b4117613b2ec4 PMC5383945 28388928 10_1186_s13326_017_0116_2
Genre	Journal Article
ISSN	2041-1480
IngestDate	Wed Aug 27 01:27:16 EDT 2025 Thu Aug 21 18:15:58 EDT 2025 Sun Aug 24 03:05:10 EDT 2025 Fri Jul 25 11:59:43 EDT 2025 Thu Jan 02 22:23:01 EST 2025 Thu Apr 24 22:59:41 EDT 2025 Tue Jul 01 03:54:47 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	1
Keywords	Relation extraction Degree of confidence Phenotype Negation SNP Modality
Language	English
License	Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c493t-5ddbfa61a0fcc1223a5dc5ebfbaec4d7519131059032aabe9d12df7fd11219913
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
OpenAccessLink	https://www.proquest.com/docview/1894587744?pq-origsite=%requestingapplication%
PMID	28388928
PQID	1894587744
PQPubID	2040220
PageCount	13
ParticipantIDs	doaj_primary_oai_doaj_org_article_00aa1bc5a75e4fd6bf1b4117613b2ec4 pubmedcentral_primary_oai_pubmedcentral_nih_gov_5383945 proquest_miscellaneous_1885952955 proquest_journals_1894587744 pubmed_primary_28388928 crossref_citationtrail_10_1186_s13326_017_0116_2 crossref_primary_10_1186_s13326_017_0116_2
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2017-04-07
PublicationDateYYYYMMDD	2017-04-07
PublicationDate_xml	– month: 04 year: 2017 text: 2017-04-07 day: 07
PublicationDecade	2010
PublicationPlace	England
PublicationPlace_xml	– name: England – name: London
PublicationTitle	Journal of biomedical semantics
PublicationTitleAlternate	J Biomed Semantics
PublicationYear	2017
Publisher	BioMed Central BMC
Publisher_xml	– name: BioMed Central – name: BMC
StartPage	14
SubjectTerms	Degree of confidence; Gene Ontology; Information Storage and Retrieval - methods; Modality; Mutation; Negation; Phenotype; Polymorphism; Polymorphism, Single Nucleotide; Relation extraction; Semantics; Single-nucleotide polymorphism; SNP
Title	SNPPhenA: a corpus for extracting ranked associations of single-nucleotide polymorphisms and phenotypes from literature
URI	https://www.ncbi.nlm.nih.gov/pubmed/28388928 https://www.proquest.com/docview/1894587744 https://www.proquest.com/docview/1885952955 https://pubmed.ncbi.nlm.nih.gov/PMC5383945 https://doaj.org/article/00aa1bc5a75e4fd6bf1b4117613b2ec4
Volume	8