VAIV bio-discovery service using transformer model and retrieval augmented generation
Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search...
Saved in:
Published in | BMC bioinformatics Vol. 25; no. 1; pp. 273 - 25 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
London
BioMed Central
21.08.2024
BioMed Central Ltd BMC |
Subjects | |
Online Access | Get full text |
ISSN | 1471-2105 1471-2105 |
DOI | 10.1186/s12859-024-05903-6 |
Cover
Abstract | Background
There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.
Main body
We propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25.
Conclusion
As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge. |
---|---|
AbstractList | There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge. Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We propose a novel biomedical neural search service called 'VAIV Bio-Discovery', which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25. Conclusion As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge. Keywords: Natural language processing, Text mining, LLM, Transformer, RAG, Biomedical interaction extraction, Neural search, T5, Embedding Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25. Conclusion As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge. There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.BACKGROUNDThere has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.We propose a novel biomedical neural search service called 'VAIV Bio-Discovery', which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25.MAIN BODYWe propose a novel biomedical neural search service called 'VAIV Bio-Discovery', which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25.As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge.CONCLUSIONAs a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge. BackgroundThere has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.Main bodyWe propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25.ConclusionAs a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge. Abstract Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25. Conclusion As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge. There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. We propose a novel biomedical neural search service called 'VAIV Bio-Discovery', which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25. As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge. |
ArticleNumber | 273 |
Audience | Academic |
Author | Kim, Seonho Yoon, Juntae |
Author_xml | – sequence: 1 givenname: Seonho surname: Kim fullname: Kim, Seonho email: shkim.lex@gmail.com organization: Department of Computer Science, Sogang University – sequence: 2 givenname: Juntae surname: Yoon fullname: Yoon, Juntae email: jtyoon@vaiv.kr organization: VAIV Company Inc |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/39169321$$D View this record in MEDLINE/PubMed |
BookMark | eNp9kktv1DAUhSNURB_wB1igSGxgkWLHsWOv0KjiMVIlJKDdWjf2TXCV2MVORvTf4-mUtoMQyiKJ851zc4_OcXHgg8eieEnJKaVSvEu0llxVpG4qwhVhlXhSHNGmpVVNCT949HxYHKd0RQhtJeHPikOmqFCspkfFxeVqfVl2LlTWJRM2GG_KhHHjDJZLcn4o5wg-9SFOGMspWBxL8LaMOEeHG8hvyzChn9GWA3qMMLvgnxdPexgTvri7nxQXHz98P_tcnX_5tD5bnVeGKzZXtWqUYfn_ZSelVK0wBmxNe4um7ZrWGgBrhLW14iBRCGZt23QEOyBdx0Gxk2K987UBrvR1dBPEGx3A6duDEAcNcXZmRF0b3kIrG6kIaaxgYLGWLUEBzBIgPHu933ldL92E1uSdIox7pvtfvPuhh7DRlLKG0IZkhzd3DjH8XDDNesqZ4jiCx7AkzYjiom1qvh32-i_0KizR56w0ozkQIahgD9QAeQPn-5AHm62pXkki81jF20yd_oPKl8XJmVyZ3uXzPcHbPUFmZvw1D7CkpNffvu6zrx6nch_HnwZloN4BJoaUIvb3CCV6W1O9q6nOW-nbmmqRRWwnShn2A8aH9f-j-g04qehY |
Cites_doi | 10.3390/bioengineering10050586 10.18653/v1/2021.emnlp-main.98 10.1093/nar/gkad751 10.1093/bioinformatics/btad557 10.1093/nar/gkx1037 10.1093/bioinformatics/btac793 10.18653/v1/2022.acl-long.551 10.3115/1572340.1572343 10.1038/s41597-023-02068-4 10.18653/v1/2022.bionlp-1.37 10.1093/nar/gkac1085 10.1038/s41597-019-0342-9 10.1093/database/baw032 10.48550/arXiv.2303.08774 10.1093/nar/gkt531 10.1093/nar/gkac833 10.1093/toxsci/kfad069 10.1093/bioinformatics/btz682 10.1016/j.ins.2018.12.041 |
ContentType | Journal Article |
Copyright | The Author(s) 2024 2024. The Author(s). COPYRIGHT 2024 BioMed Central Ltd. 2024. This work is licensed under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. The Author(s) 2024 2024 |
Copyright_xml | – notice: The Author(s) 2024 – notice: 2024. The Author(s). – notice: COPYRIGHT 2024 BioMed Central Ltd. – notice: 2024. This work is licensed under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: The Author(s) 2024 2024 |
DBID | C6C AAYXX CITATION CGR CUY CVF ECM EIF NPM ISR 3V. 7QO 7SC 7X7 7XB 88E 8AL 8AO 8FD 8FE 8FG 8FH 8FI 8FJ 8FK ABUWG AEUYN AFKRA ARAPS AZQEC BBNVY BENPR BGLVJ BHPHI CCPQU DWQXO FR3 FYUFA GHDGH GNUQQ HCIFZ JQ2 K7- K9. L7M LK8 L~C L~D M0N M0S M1P M7P P5Z P62 P64 PHGZM PHGZT PIMPY PJZUB PKEHL PPXIY PQEST PQGLB PQQKQ PQUKI Q9U 7X8 5PM DOA |
DOI | 10.1186/s12859-024-05903-6 |
DatabaseName | Springer Nature OA Free Journals CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed Gale In Context: Science ProQuest Central (Corporate) Biotechnology Research Abstracts Computer and Information Systems Abstracts Health & Medical Collection ProQuest Central (purchase pre-March 2016) Medical Database (Alumni Edition) Computing Database (Alumni Edition) ProQuest Pharma Collection Technology Research Database ProQuest SciTech Collection ProQuest Technology Collection ProQuest Natural Science Collection Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest One Sustainability ProQuest Central UK/Ireland Advanced Technologies & Aerospace Collection ProQuest Central Essentials - QC Biological Science Collection ProQuest Central Technology Collection Natural Science Collection ProQuest One Community College ProQuest Central Korea Engineering Research Database Proquest Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Central Student SciTech Premium Collection ProQuest Computer Science Collection Computer Science Database ProQuest Health & Medical Complete (Alumni) Advanced Technologies Database with Aerospace Biological Sciences Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional Computing Database ProQuest Health & Medical Collection Medical Database Biological Science Database Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection Biotechnology and BioEngineering Abstracts ProQuest Central Premium ProQuest One Academic (New) Publicly Available Content Database ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central Basic MEDLINE - Academic PubMed Central (Full Participant titles) DOAJ Directory of Open Access Journals |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Publicly Available Content Database Computer Science Database ProQuest Central Student ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection Computer and Information Systems Abstracts SciTech Premium Collection ProQuest One Applied & Life Sciences ProQuest One Sustainability Health Research Premium Collection Natural Science Collection Health & Medical Research Collection Biological Science Collection ProQuest Central (New) ProQuest Medical Library (Alumni) Advanced Technologies & Aerospace Collection ProQuest Biological Science Collection ProQuest One Academic Eastern Edition ProQuest Hospital Collection ProQuest Technology Collection Health Research Premium Collection (Alumni) Biological Science Database ProQuest Hospital Collection (Alumni) Biotechnology and BioEngineering Abstracts ProQuest Health & Medical Complete ProQuest One Academic UKI Edition Engineering Research Database ProQuest One Academic ProQuest One Academic (New) Technology Collection Technology Research Database Computer and Information Systems Abstracts – Academic ProQuest One Academic Middle East (New) ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Natural Science Collection ProQuest Pharma Collection ProQuest Central ProQuest Health & Medical Research Collection Biotechnology Research Abstracts Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Advanced Technologies Database with Aerospace ProQuest Computing ProQuest Central Basic ProQuest Computing (Alumni Edition) ProQuest SciTech Collection Computer and Information Systems Abstracts Professional Advanced Technologies & Aerospace Database ProQuest Medical Library ProQuest Central (Alumni) MEDLINE - Academic |
DatabaseTitleList | MEDLINE - Academic Publicly Available Content Database MEDLINE |
Database_xml | – sequence: 1 dbid: C6C name: Springer Nature OA Free Journals url: http://www.springeropen.com/ sourceTypes: Publisher – sequence: 2 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 3 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 4 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 5 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EISSN | 1471-2105 |
EndPage | 25 |
ExternalDocumentID | oai_doaj_org_article_2c57a78489004d63ade2870e6a3d0a05 PMC11340140 A808340957 39169321 10_1186_s12859_024_05903_6 |
Genre | Journal Article |
GeographicLocations | South Korea |
GeographicLocations_xml | – name: South Korea |
GrantInformation_xml | – fundername: National Research Foundation of Korea (NRF) grantid: NRF-2020R1I1A1A01073125 |
GroupedDBID | --- 0R~ 23N 2WC 53G 5VS 6J9 7X7 88E 8AO 8FE 8FG 8FH 8FI 8FJ AAFWJ AAJSJ AAKPC AASML ABDBF ABUWG ACGFO ACGFS ACIHN ACIWK ACPRK ACUHS ADBBV ADMLS ADUKV AEAQA AENEX AEUYN AFKRA AFPKN AFRAH AHBYD AHMBA AHYZX ALMA_UNASSIGNED_HOLDINGS AMKLP AMTXH AOIJS ARAPS AZQEC BAPOH BAWUL BBNVY BCNDV BENPR BFQNJ BGLVJ BHPHI BMC BPHCQ BVXVI C6C CCPQU CS3 DIK DU5 DWQXO E3Z EAD EAP EAS EBD EBLON EBS EMB EMK EMOBN ESX F5P FYUFA GNUQQ GROUPED_DOAJ GX1 HCIFZ HMCUK HYE IAO ICD IHR INH INR ISR ITC K6V K7- KQ8 LK8 M1P M48 M7P MK~ ML0 M~E O5R O5S OK1 OVT P2P P62 PGMZT PHGZM PHGZT PIMPY PJZUB PPXIY PQGLB PQQKQ PROAC PSQYO PUEGO RBZ RNS ROL RPM RSV SBL SOJ SV3 TR2 TUS UKHRP W2D WOQ WOW XH6 XSB AAYXX ALIPV CITATION CGR CUY CVF ECM EIF NPM PMFND 3V. 7QO 7SC 7XB 8AL 8FD 8FK FR3 JQ2 K9. L7M L~C L~D M0N P64 PKEHL PQEST PQUKI Q9U 7X8 5PM |
ID | FETCH-LOGICAL-c593t-2949c30248b888976ccad21fdec7b47dcaadc6dd295a8e663dd74b0eba0bb5a93 |
IEDL.DBID | M48 |
ISSN | 1471-2105 |
IngestDate | Wed Aug 27 01:30:29 EDT 2025 Thu Aug 21 18:34:35 EDT 2025 Thu Sep 04 19:18:02 EDT 2025 Fri Jul 25 19:11:15 EDT 2025 Tue Jun 17 22:04:17 EDT 2025 Tue Jun 10 20:58:05 EDT 2025 Fri Jun 27 05:25:09 EDT 2025 Mon Jul 21 05:55:54 EDT 2025 Tue Jul 01 03:38:41 EDT 2025 Sat Sep 06 07:27:28 EDT 2025 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Keywords | Text mining T5 Neural search Transformer Biomedical interaction extraction RAG Embedding Natural language processing LLM |
Language | English |
License | 2024. The Author(s). Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/. |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c593t-2949c30248b888976ccad21fdec7b47dcaadc6dd295a8e663dd74b0eba0bb5a93 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 |
OpenAccessLink | http://journals.scholarsportal.info/openUrl.xqy?doi=10.1186/s12859-024-05903-6 |
PMID | 39169321 |
PQID | 3102466163 |
PQPubID | 44065 |
PageCount | 25 |
ParticipantIDs | doaj_primary_oai_doaj_org_article_2c57a78489004d63ade2870e6a3d0a05 pubmedcentral_primary_oai_pubmedcentral_nih_gov_11340140 proquest_miscellaneous_3095674255 proquest_journals_3102466163 gale_infotracmisc_A808340957 gale_infotracacademiconefile_A808340957 gale_incontextgauss_ISR_A808340957 pubmed_primary_39169321 crossref_primary_10_1186_s12859_024_05903_6 springer_journals_10_1186_s12859_024_05903_6 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2024-08-21 |
PublicationDateYYYYMMDD | 2024-08-21 |
PublicationDate_xml | – month: 08 year: 2024 text: 2024-08-21 day: 21 |
PublicationDecade | 2020 |
PublicationPlace | London |
PublicationPlace_xml | – name: London – name: England |
PublicationTitle | BMC bioinformatics |
PublicationTitleAbbrev | BMC Bioinformatics |
PublicationTitleAlternate | BMC Bioinformatics |
PublicationYear | 2024 |
Publisher | BioMed Central BioMed Central Ltd BMC |
Publisher_xml | – name: BioMed Central – name: BioMed Central Ltd – name: BMC |
References | Q Chen (5903_CR13) 2023; 39 C Raffel (5903_CR4) 2020; 21 E Coudert (5903_CR23) 2023; 39 5903_CR3 J Legrand (5903_CR21) 2020 5903_CR1 AP Davis (5903_CR18) 2023; 51 5903_CR2 5903_CR25 5903_CR26 5903_CR24 5903_CR7 5903_CR29 5903_CR8 5903_CR28 J Kim (5903_CR31) 2013; 41 5903_CR20 TA Nakamura (5903_CR15) 2018; 480 Y Zhou (5903_CR19) 2024; 52 DS Wishart (5903_CR22) 2018; D1 J Lee (5903_CR5) 2020; 36 CH Wei (5903_CR27) 2016; 2016 A Nentidis (5903_CR34) 2023; 10 5903_CR12 SH Kim (5903_CR9) 2023; 10 AP Davis (5903_CR17) 2023; 195 Y Gu (5903_CR6) 2021; 3 5903_CR16 S Avram (5903_CR30) 2023; 51 5903_CR10 5903_CR32 5903_CR11 5903_CR33 P Liu (5903_CR14) 2023; 55 |
References_xml | – volume: 10 start-page: 586 issue: 5 year: 2023 ident: 5903_CR9 publication-title: Bioengineering (Basel) doi: 10.3390/bioengineering10050586 – ident: 5903_CR11 doi: 10.18653/v1/2021.emnlp-main.98 – volume: 52 start-page: D1465 issue: D1 year: 2024 ident: 5903_CR19 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkad751 – ident: 5903_CR32 – volume: 39 start-page: btad557 issue: 9 year: 2023 ident: 5903_CR13 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btad557 – volume: D1 start-page: D1074 issue: 46 year: 2018 ident: 5903_CR22 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkx1037 – volume: 39 start-page: btac793 issue: 1 year: 2023 ident: 5903_CR23 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btac793 – ident: 5903_CR7 doi: 10.18653/v1/2022.acl-long.551 – ident: 5903_CR33 doi: 10.3115/1572340.1572343 – ident: 5903_CR25 – volume: 10 start-page: 170 year: 2023 ident: 5903_CR34 publication-title: Sci Data. doi: 10.1038/s41597-023-02068-4 – ident: 5903_CR10 doi: 10.18653/v1/2022.bionlp-1.37 – volume: 51 start-page: D1276 issue: D1 year: 2023 ident: 5903_CR30 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkac1085 – ident: 5903_CR29 – volume: 3 start-page: 1 issue: 1–2 year: 2021 ident: 5903_CR6 publication-title: ACM Trans Comput Healthcare – ident: 5903_CR16 – year: 2020 ident: 5903_CR21 publication-title: Sci Data doi: 10.1038/s41597-019-0342-9 – volume: 2016 start-page: baw032 year: 2016 ident: 5903_CR27 publication-title: Database doi: 10.1093/database/baw032 – ident: 5903_CR20 – ident: 5903_CR2 doi: 10.48550/arXiv.2303.08774 – ident: 5903_CR12 – volume: 41 start-page: 510 year: 2013 ident: 5903_CR31 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkt531 – volume: 51 start-page: D1257 issue: D1 year: 2023 ident: 5903_CR18 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkac833 – ident: 5903_CR8 – volume: 21 start-page: 1 issue: 140 year: 2020 ident: 5903_CR4 publication-title: JMLR – ident: 5903_CR1 – ident: 5903_CR3 – volume: 55 start-page: 1 year: 2023 ident: 5903_CR14 publication-title: ACM Comput Surv – ident: 5903_CR28 – volume: 195 start-page: 155 issue: 2 year: 2023 ident: 5903_CR17 publication-title: Toxicol Sci doi: 10.1093/toxsci/kfad069 – ident: 5903_CR24 – volume: 36 start-page: 1234 issue: 4 year: 2020 ident: 5903_CR5 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btz682 – ident: 5903_CR26 – volume: 480 start-page: 339 year: 2018 ident: 5903_CR15 publication-title: J Inf Sci doi: 10.1016/j.ins.2018.12.041 |
SSID | ssj0017805 |
Score | 2.475747 |
Snippet | Background
There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.
Main body
We... There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. We propose a novel... There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. As a result, our system... Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We... BackgroundThere has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.Main bodyWe... There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.BACKGROUNDThere has been a... Abstract Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main... |
SourceID | doaj pubmedcentral proquest gale pubmed crossref springer |
SourceType | Open Website Open Access Repository Aggregation Database Index Database Publisher |
StartPage | 273 |
SubjectTerms | Algorithms Attention task Big data management in biological domains Bioinformatics Biomedical and Life Sciences Biomedical interaction extraction Chatbots Chemical compounds Computational Biology/Bioinformatics Computational linguistics Computer Appl. in Life Sciences Data mining Data Mining - methods Deep learning Documents Drug discovery Drug interaction Drugs Engine blocks Genes Information Storage and Retrieval - methods Keywords Knowledge discovery Knowledge Discovery - methods Language Language processing Large language models Life Sciences LLM Machine Learning Medical research Medical Subject Headings-MeSH Medicine, Experimental Methods Microarrays Natural language Natural language interfaces Natural Language Processing Neural Networks, Computer Proteins PubMed Queries Query languages RAG Retrieval Search Engine Search engines Semantics Speech recognition Technology application Text mining Transformer Transformers Trends Unstructured data |
SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwEB6hSkhcEG8CLTIIiQNETeL4keOCqFoOHICterP8ytJLFm12D_33nXGSpSlCXLiuvdHkm7E9E898A_BWWhdjKxTGJtLmddnq3Imgci8aG2LrRYgp2-KrPF3WXy7ExY1WX5QTNtADD8AdV14oq3StG1RnkByfQHdzUVoeCjuwlxZNMQVT4_0BMfVPJTJaHvcl8bTleB7lVGzJczk7hhJb_5978o1D6XbC5K1b03QYnTyA-6MXyRaD9A_hTuwewd2hr-TVY1ieL87Ombtc51R0S0maV6wfNgVGie4rtp381bhhqRkOs11gm9ReC22P2d0qsXUGtkq81KS-J7A8-fzj02k-9k8gpPk2r5q68ZxIyxzGueh3oLZCVbYheuVqFby1wcsQqkZYHdH1CEHVrojOFs4J2_CncNCtu_gcGKcvDxictboIde0aXXLRaoVPb0IrXMjg_QSn-TXQZJgUXmhpBvANymES-EZm8JEQ388kiuv0AyrejIo3_1J8Bm9IX4ZILDrKklnZXd-bs-_fzEKjY4mBq1AZvBsntWtE1tux6ADfinivZjMPZzNxlfn58GQWZlzlvUHXuKrRwZE8g9f7YfonZa51cb3DOcT0qHBnRIGfDVa0f28qekb_ucxAz-xrBsx8pLv8mTjAyxLlwuA4gw-TKf6W6-_Iv_gfyL-Ee1VaSrjHlodwsN3s4hG6Zlv3Kq3Ca-FENAo priority: 102 providerName: Directory of Open Access Journals – databaseName: ProQuest Technology Collection dbid: 8FG link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1Lb9QwELagCIkL4k1oQQYhcQCrefmRE1oQS8uBA7BVb5ZfSXtJymb30H_fGSfZkiK4rr2RPZ6nPfMNIW-FsSHUXEJsIgwrs1oxy71kjlfGh9pxH2K2xXdxtCq_nfLT8cKtH9MqJ50YFbXvHN6RH4IbkpdgTETx8eI3w65R-Lo6ttC4Te5kYGmQz9Xy6-4VAfH6p0IZJQ77DNHaGHyHYcllwcTMGEXM_r818x-m6Wba5I2302iSlg_I_dGXpIvh8B-SW6F9RO4O3SUvH5PVyeL4hNrzjmHpLaZqXtJ-UA0U090bupm81rCmsSUONa2n69hkCziQmm0TMTs9bSI6NR7iE7Jafvn1-YiNXRSQ3sWG5VVZuQKhyyxEu-B9wJn5PKt9cNKW0jtjvBPe5xU3KgARvZelTYM1qbXcVMVTstd2bXhOaIH3DxCi1Sr1ZWkrlRW8VhK-XvmaW5-Q9xM59cUAlqFjkKGEHoivYR06El-LhHxCiu9mItB1_KFbN3qUG507Lo1UpapAmr0ogIHwaTYIU_jUpDwhb_C8NEJZtJgr05ht3-vjnz_0QoF7CeErlwl5N06qO6CsM2PpAewK0a9mMw9mM0HW3Hx4Ygs9ynqvrzkzIa93w_hPzF9rQ7eFOYj3KEE_woKfDVy02zeWPoMXnSVEzfhrRpj5SHt-FpHAswzWBSFyQj5MrHi9rn9T_sX_t7FP7uVRSECHZgdkb7Pehpfgem3sqyhfV-dGLCg priority: 102 providerName: ProQuest – databaseName: Springer Nature OA Free Journals dbid: C6C link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Lb9QwEB5BERIXxJtAQQYhcYCIOI4fOS4VVcuBA7BVb5YdO0svWbTZPfTfM-MkS1PgwDUeR8487JnMzGeAN8r5GFupMTZRLq94a3Ivg84bWbsQ20aGmKotvqiTZfX5XJ6PMDnUC3M1f8-N-tBzQljL8STJqU1S5Oom3JJcqJSYVUf7jAFh809NMX-dNzt4Ej7_n7vwlWPoeonktTxpOn6O78Hd0W9ki0HQ9-FG7B7A7eEmycuHsDxbnJ4xf7HOqc2WyjIvWT9sA4xK21dsO3moccPS9TfMdYFt0oVaqG3M7VYJnzOwVUKiJoE9guXxp-9HJ_l4YwLxVmzzsq7qRhBMmcfIFj0NlE8oeRtio32lQ-NcaFQIZS2diehshKArX0TvCu-lq8VjOOjWXXwKTNC_BgzHWlOEqvK14UK2RuPb69BKHzJ4N7HT_hyAMWwKKIyyA_MtrsMm5luVwUfi-J6SQK3TA5S1HW3Elo3UTpvK1Gi5QQlUFkrDRuVEKFwhM3hN8rIEW9FRXczK7frenn77ahcGXUkMVaXO4O1I1K6Rs40b2wzwqwjpakZ5OKNEu2rmw5Na2NGue4vOcFmhS6NEBq_2wzSTatW6uN4hDWE7atwLccFPBi3afze1OaPHzDMwM_2aMWY-0l38SKjfnOO6MBzO4P2kir_X9W_OP_s_8udwp0xGg_snP4SD7WYXX6DbtfUvk739AuyCJSw priority: 102 providerName: Springer Nature |
Title | VAIV bio-discovery service using transformer model and retrieval augmented generation |
URI | https://link.springer.com/article/10.1186/s12859-024-05903-6 https://www.ncbi.nlm.nih.gov/pubmed/39169321 https://www.proquest.com/docview/3102466163 https://www.proquest.com/docview/3095674255 https://pubmed.ncbi.nlm.nih.gov/PMC11340140 https://doaj.org/article/2c57a78489004d63ade2870e6a3d0a05 |
Volume | 25 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Zb9NAEB71EKgviBtDiQxC4gEM8bGHHxBKo4YWiQoVUuVttetdh0rIaXNI5N8zs7ZTXMoLL5GSHcfrOXZmvLPfALzi2jhXMoG5CddRFpcyMsyKqGC5tq4smHW-2uKEH42zzxM22YK23VHDwMWNqR31kxrPf777dbn-iAb_wRu85O8XMaGwRehtIjpKmUZ8G3bRM3FKxr5kV7sKhN_fHpy58bo9uE0HUTGmiTt-ysP5_71o_-G1rldUXttW9d5qdBfuNGFmOKj14h5sueo-3KobT64fwPhscHwWmvNZRKdyqYpzHS7qVSOkSvhpuGwDWjcPfbecUFc2nPv-W6icoV5NPZynDaceuJrk-xDGo8Pvw6OoabBAokiXUZJneZESqpnBRBgDExSnTeLSukKYTNhCa1twa5OcaekwNrFWZKbvjO4bw3SePoKdala5JxCm9GoCs7dS9m2WmVzGKSulwH_PbcmMDeBNy051UeNoKJ9_SK5qOSich_JyUDyAA-L4hpIwsP0Ps_lUNSalkoIJLWQmczR0y1PULdq1dVyntq_7LICXJC9FKBcVldFM9WqxUMffTtVAYuSJmS0TAbxuiMoZcrbQzakEfCoCxupQ7nco0QyL7nCrFqrVYoWxc5JhBMTTAF5shulKKm2r3GyFNAQFKXDpxAk_rrVo89ytMgYgO_rVYUx3pDr_4UHC4xjnhdlzAG9bVbya1785__T_7_QM9hJvS7j0xvuws5yv3HOM2JamB9tiIvBTjj71YPfg8OTrKX4b8mHPvwPpeTP9DXh9QiU |
linkProvider | Scholars Portal |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELaqIgQXxJtAAYNAHCBqXn7kgNDyqHZp6QG61d5cO3aWXpKy2RXaP8VvZMZJtqQIbr2unY09mac98w0hL7g2zpVMQGzCdZjFpQwNsyIsWK6tKwtmnc-2OOTjafZ5xmZb5FdfC4Nplb1O9Ira1gWeke-CG5JkYEx4-u7sR4hdo_B2tW-h0bLFvlv_hJCteTv5CN_3ZZLsfTr6MA67rgL4_nQZJnmWFylCeRmI_sAawx5sEpfWFcJkwhZa24Jbm-RMSwcG2VqRmcgZHRnDNIIvgcq_kuHJOMiPmG0CvBj7A_SFOZLvNjGiw4XwqhBLPNOQD4yf7xHwtyX4wxReTNO8cFfrTeDeTXKj813pqGW2W2TLVbfJ1bab5foOmR6PJsfUnNYhlvpiauiaNq0qopheP6fL3kt2C-pb8FBdWbrwTb2A46lezT1GqKVzj4aNTHOXTC-FvvfIdlVX7gGhKZ53QEhYyshmmcllnLJSCvj33JbM2IC87smpzlpwDuWDGslVS3wF61Ce-IoH5D1SfDMTgbX9D_Virjo5VUnBhBYykzloD8tTYFi8CnZcpzbSEQvIc_xeCqEzKszNmetV06jJt69qJMGdhXCZiYC86iaVNVC20F2pA-wK0bYGM3cGM0G2i-Fwzxaq0y2NOpeEgDzbDOOTmC9XuXoFcxBfUoA-hgXfb7los28stQavPQ6IHPDXgDDDker0u0cej2NYF4TkAXnTs-L5uv5N-Yf_38ZTcm189OVAHUwO9x-R64kXGNDf8Q7ZXi5W7jG4fUvzxMsaJSeXLdy_AX3Sax8 |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Lb9QwELagCNQL4llSChiExAGi5uFXjkth1QVUIWCr3iw7dkIv2WqTPfTfM-MkS1PgwDUeR854xjOTmflMyGthrPcVlxCbCBOztFKx5U7GJS-M81XJnQ_VFifieMk-nfGzK138odp9TEn2PQ2I0tR0hxeu6lVcicM2Rdy1GOxLjM2TeSxuklsMTR-ma8XRNo-AiP1jq8xf503MUUDt__NsvmKcrhdOXsueBqM0v0fuDt4knfXbf5_c8M0Dcru_X_LyIVmezhan1J6vYmy-xWLNS9r2hwPFgveadqPf6tc0XIpDTePoOlyzBTJIzaYOqJ2O1gGfGrfxEVnOP_44Oo6HexSQ43kXZwUryhzByyzEu-B_wK65LK2cL6Vl0pXGuFI4lxXcKA8uiHOS2cRbk1jLTZE_JjvNqvFPCM3xDwQEaZVKHGO2UGnOKyXh7YWruHUReTuyU1_0cBk6hBlK6J75GtahA_O1iMh75PiWEqGuw4PVutaD5uis5NJIxVQB-uxEDiKEyVkvTO4Sk_CIvML90ghm0WC1TG02basX37_pmQIHE8SCy4i8GYiqFXC2NEPzAXwV4l9NKA8mlKBt5XR4FAs9aHurwUXOGDg6Io_Iy-0wzsQKtsavNkCDiI8STkhY8F4vRdvvxuZn8KPTiKiJfE0YMx1pzn8GLPA0hXVBkByRd6Mo_l7Xvzm__3_kL8idrx_m-svi5PNTspsF_YEDNj0gO91645-BX9bZ50H1fgEpXzBg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=VAIV+bio-discovery+service+using+transformer+model+and+retrieval+augmented+generation&rft.jtitle=BMC+bioinformatics&rft.au=Kim%2C+Seonho&rft.au=Yoon%2C+Juntae&rft.date=2024-08-21&rft.pub=BioMed+Central&rft.eissn=1471-2105&rft.volume=25&rft_id=info:doi/10.1186%2Fs12859-024-05903-6&rft_id=info%3Apmid%2F39169321&rft.externalDocID=PMC11340140 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon |