VAIV bio-discovery service using transformer model and retrieval augmented generation

Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search...

Full description

Saved in:
Bibliographic Details
Published inBMC bioinformatics Vol. 25; no. 1; pp. 273 - 25
Main Authors Kim, Seonho, Yoon, Juntae
Format Journal Article
LanguageEnglish
Published London BioMed Central 21.08.2024
BioMed Central Ltd
BMC
Subjects
Online AccessGet full text
ISSN1471-2105
1471-2105
DOI10.1186/s12859-024-05903-6

Cover

Abstract Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25. Conclusion As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge.
AbstractList There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge.
Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We propose a novel biomedical neural search service called 'VAIV Bio-Discovery', which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25. Conclusion As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge. Keywords: Natural language processing, Text mining, LLM, Transformer, RAG, Biomedical interaction extraction, Neural search, T5, Embedding
Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25. Conclusion As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge.
There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.BACKGROUNDThere has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.We propose a novel biomedical neural search service called 'VAIV Bio-Discovery', which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25.MAIN BODYWe propose a novel biomedical neural search service called 'VAIV Bio-Discovery', which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25.As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge.CONCLUSIONAs a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge.
BackgroundThere has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.Main bodyWe propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25.ConclusionAs a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge.
Abstract Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We propose a novel biomedical neural search service called ‘VAIV Bio-Discovery’, which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25. Conclusion As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge.
There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. We propose a novel biomedical neural search service called 'VAIV Bio-Discovery', which supports enhanced knowledge discovery and document search on unstructured text such as PubMed. It mainly handles with information related to chemical compound/drugs, gene/proteins, diseases, and their interactions (chemical compounds/drugs-proteins/gene including drugs-targets, drug-drug, and drug-disease). To provide comprehensive knowledge, the system offers four search options: basic search, entity and interaction search, and natural language search. We employ T5slim_dec, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the interaction extraction task by removing the self-attention layer in the decoder block. It also assists in interpreting research findings by summarizing the retrieved search results for a given natural language query with Retrieval Augmented Generation (RAG). The search engine is built with a hybrid method that combines neural search with the probabilistic search, BM25. As a result, our system can better understand the context, semantics and relationships between terms within the document, enhancing search accuracy. This research contributes to the rapidly evolving biomedical field by introducing a new service to access and discover relevant knowledge.
ArticleNumber 273
Audience Academic
Author Kim, Seonho
Yoon, Juntae
Author_xml – sequence: 1
  givenname: Seonho
  surname: Kim
  fullname: Kim, Seonho
  email: shkim.lex@gmail.com
  organization: Department of Computer Science, Sogang University
– sequence: 2
  givenname: Juntae
  surname: Yoon
  fullname: Yoon, Juntae
  email: jtyoon@vaiv.kr
  organization: VAIV Company Inc
BackLink https://www.ncbi.nlm.nih.gov/pubmed/39169321$$D View this record in MEDLINE/PubMed
BookMark eNp9kktv1DAUhSNURB_wB1igSGxgkWLHsWOv0KjiMVIlJKDdWjf2TXCV2MVORvTf4-mUtoMQyiKJ851zc4_OcXHgg8eieEnJKaVSvEu0llxVpG4qwhVhlXhSHNGmpVVNCT949HxYHKd0RQhtJeHPikOmqFCspkfFxeVqfVl2LlTWJRM2GG_KhHHjDJZLcn4o5wg-9SFOGMspWBxL8LaMOEeHG8hvyzChn9GWA3qMMLvgnxdPexgTvri7nxQXHz98P_tcnX_5tD5bnVeGKzZXtWqUYfn_ZSelVK0wBmxNe4um7ZrWGgBrhLW14iBRCGZt23QEOyBdx0Gxk2K987UBrvR1dBPEGx3A6duDEAcNcXZmRF0b3kIrG6kIaaxgYLGWLUEBzBIgPHu933ldL92E1uSdIox7pvtfvPuhh7DRlLKG0IZkhzd3DjH8XDDNesqZ4jiCx7AkzYjiom1qvh32-i_0KizR56w0ozkQIahgD9QAeQPn-5AHm62pXkki81jF20yd_oPKl8XJmVyZ3uXzPcHbPUFmZvw1D7CkpNffvu6zrx6nch_HnwZloN4BJoaUIvb3CCV6W1O9q6nOW-nbmmqRRWwnShn2A8aH9f-j-g04qehY
Cites_doi 10.3390/bioengineering10050586
10.18653/v1/2021.emnlp-main.98
10.1093/nar/gkad751
10.1093/bioinformatics/btad557
10.1093/nar/gkx1037
10.1093/bioinformatics/btac793
10.18653/v1/2022.acl-long.551
10.3115/1572340.1572343
10.1038/s41597-023-02068-4
10.18653/v1/2022.bionlp-1.37
10.1093/nar/gkac1085
10.1038/s41597-019-0342-9
10.1093/database/baw032
10.48550/arXiv.2303.08774
10.1093/nar/gkt531
10.1093/nar/gkac833
10.1093/toxsci/kfad069
10.1093/bioinformatics/btz682
10.1016/j.ins.2018.12.041
ContentType Journal Article
Copyright The Author(s) 2024
2024. The Author(s).
COPYRIGHT 2024 BioMed Central Ltd.
2024. This work is licensed under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
The Author(s) 2024 2024
Copyright_xml – notice: The Author(s) 2024
– notice: 2024. The Author(s).
– notice: COPYRIGHT 2024 BioMed Central Ltd.
– notice: 2024. This work is licensed under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
– notice: The Author(s) 2024 2024
DBID C6C
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
ISR
3V.
7QO
7SC
7X7
7XB
88E
8AL
8AO
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABUWG
AEUYN
AFKRA
ARAPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
JQ2
K7-
K9.
L7M
LK8
L~C
L~D
M0N
M0S
M1P
M7P
P5Z
P62
P64
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
Q9U
7X8
5PM
DOA
DOI 10.1186/s12859-024-05903-6
DatabaseName Springer Nature OA Free Journals
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Gale In Context: Science
ProQuest Central (Corporate)
Biotechnology Research Abstracts
Computer and Information Systems Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
Computing Database (Alumni Edition)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Collection
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest One Sustainability
ProQuest Central UK/Ireland
Advanced Technologies & Aerospace Collection
ProQuest Central Essentials - QC
Biological Science Collection
ProQuest Central
Technology Collection
Natural Science Collection
ProQuest One Community College
ProQuest Central Korea
Engineering Research Database
Proquest Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database
ProQuest Health & Medical Complete (Alumni)
Advanced Technologies Database with Aerospace
Biological Sciences
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Computing Database
ProQuest Health & Medical Collection
Medical Database
Biological Science Database
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
ProQuest Central Premium
ProQuest One Academic (New)
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central Basic
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
Computer Science Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
SciTech Premium Collection
ProQuest One Applied & Life Sciences
ProQuest One Sustainability
Health Research Premium Collection
Natural Science Collection
Health & Medical Research Collection
Biological Science Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
Advanced Technologies & Aerospace Collection
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
Engineering Research Database
ProQuest One Academic
ProQuest One Academic (New)
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest One Academic Middle East (New)
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central
ProQuest Health & Medical Research Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Advanced Technologies Database with Aerospace
ProQuest Computing
ProQuest Central Basic
ProQuest Computing (Alumni Edition)
ProQuest SciTech Collection
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
ProQuest Medical Library
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList


MEDLINE - Academic
Publicly Available Content Database

MEDLINE

Database_xml – sequence: 1
  dbid: C6C
  name: Springer Nature OA Free Journals
  url: http://www.springeropen.com/
  sourceTypes: Publisher
– sequence: 2
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 3
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 4
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 5
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1471-2105
EndPage 25
ExternalDocumentID oai_doaj_org_article_2c57a78489004d63ade2870e6a3d0a05
PMC11340140
A808340957
39169321
10_1186_s12859_024_05903_6
Genre Journal Article
GeographicLocations South Korea
GeographicLocations_xml – name: South Korea
GrantInformation_xml – fundername: National Research Foundation of Korea (NRF)
  grantid: NRF-2020R1I1A1A01073125
GroupedDBID ---
0R~
23N
2WC
53G
5VS
6J9
7X7
88E
8AO
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AAKPC
AASML
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADMLS
ADUKV
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
ARAPS
AZQEC
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CS3
DIK
DU5
DWQXO
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
HYE
IAO
ICD
IHR
INH
INR
ISR
ITC
K6V
K7-
KQ8
LK8
M1P
M48
M7P
MK~
ML0
M~E
O5R
O5S
OK1
OVT
P2P
P62
PGMZT
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PQGLB
PQQKQ
PROAC
PSQYO
PUEGO
RBZ
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
UKHRP
W2D
WOQ
WOW
XH6
XSB
AAYXX
ALIPV
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
PMFND
3V.
7QO
7SC
7XB
8AL
8FD
8FK
FR3
JQ2
K9.
L7M
L~C
L~D
M0N
P64
PKEHL
PQEST
PQUKI
Q9U
7X8
5PM
ID FETCH-LOGICAL-c593t-2949c30248b888976ccad21fdec7b47dcaadc6dd295a8e663dd74b0eba0bb5a93
IEDL.DBID M48
ISSN 1471-2105
IngestDate Wed Aug 27 01:30:29 EDT 2025
Thu Aug 21 18:34:35 EDT 2025
Thu Sep 04 19:18:02 EDT 2025
Fri Jul 25 19:11:15 EDT 2025
Tue Jun 17 22:04:17 EDT 2025
Tue Jun 10 20:58:05 EDT 2025
Fri Jun 27 05:25:09 EDT 2025
Mon Jul 21 05:55:54 EDT 2025
Tue Jul 01 03:38:41 EDT 2025
Sat Sep 06 07:27:28 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Text mining
T5
Neural search
Transformer
Biomedical interaction extraction
RAG
Embedding
Natural language processing
LLM
Language English
License 2024. The Author(s).
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c593t-2949c30248b888976ccad21fdec7b47dcaadc6dd295a8e663dd74b0eba0bb5a93
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
OpenAccessLink http://journals.scholarsportal.info/openUrl.xqy?doi=10.1186/s12859-024-05903-6
PMID 39169321
PQID 3102466163
PQPubID 44065
PageCount 25
ParticipantIDs doaj_primary_oai_doaj_org_article_2c57a78489004d63ade2870e6a3d0a05
pubmedcentral_primary_oai_pubmedcentral_nih_gov_11340140
proquest_miscellaneous_3095674255
proquest_journals_3102466163
gale_infotracmisc_A808340957
gale_infotracacademiconefile_A808340957
gale_incontextgauss_ISR_A808340957
pubmed_primary_39169321
crossref_primary_10_1186_s12859_024_05903_6
springer_journals_10_1186_s12859_024_05903_6
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2024-08-21
PublicationDateYYYYMMDD 2024-08-21
PublicationDate_xml – month: 08
  year: 2024
  text: 2024-08-21
  day: 21
PublicationDecade 2020
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle BMC bioinformatics
PublicationTitleAbbrev BMC Bioinformatics
PublicationTitleAlternate BMC Bioinformatics
PublicationYear 2024
Publisher BioMed Central
BioMed Central Ltd
BMC
Publisher_xml – name: BioMed Central
– name: BioMed Central Ltd
– name: BMC
References Q Chen (5903_CR13) 2023; 39
C Raffel (5903_CR4) 2020; 21
E Coudert (5903_CR23) 2023; 39
5903_CR3
J Legrand (5903_CR21) 2020
5903_CR1
AP Davis (5903_CR18) 2023; 51
5903_CR2
5903_CR25
5903_CR26
5903_CR24
5903_CR7
5903_CR29
5903_CR8
5903_CR28
J Kim (5903_CR31) 2013; 41
5903_CR20
TA Nakamura (5903_CR15) 2018; 480
Y Zhou (5903_CR19) 2024; 52
DS Wishart (5903_CR22) 2018; D1
J Lee (5903_CR5) 2020; 36
CH Wei (5903_CR27) 2016; 2016
A Nentidis (5903_CR34) 2023; 10
5903_CR12
SH Kim (5903_CR9) 2023; 10
AP Davis (5903_CR17) 2023; 195
Y Gu (5903_CR6) 2021; 3
5903_CR16
S Avram (5903_CR30) 2023; 51
5903_CR10
5903_CR32
5903_CR11
5903_CR33
P Liu (5903_CR14) 2023; 55
References_xml – volume: 10
  start-page: 586
  issue: 5
  year: 2023
  ident: 5903_CR9
  publication-title: Bioengineering (Basel)
  doi: 10.3390/bioengineering10050586
– ident: 5903_CR11
  doi: 10.18653/v1/2021.emnlp-main.98
– volume: 52
  start-page: D1465
  issue: D1
  year: 2024
  ident: 5903_CR19
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkad751
– ident: 5903_CR32
– volume: 39
  start-page: btad557
  issue: 9
  year: 2023
  ident: 5903_CR13
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btad557
– volume: D1
  start-page: D1074
  issue: 46
  year: 2018
  ident: 5903_CR22
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkx1037
– volume: 39
  start-page: btac793
  issue: 1
  year: 2023
  ident: 5903_CR23
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btac793
– ident: 5903_CR7
  doi: 10.18653/v1/2022.acl-long.551
– ident: 5903_CR33
  doi: 10.3115/1572340.1572343
– ident: 5903_CR25
– volume: 10
  start-page: 170
  year: 2023
  ident: 5903_CR34
  publication-title: Sci Data.
  doi: 10.1038/s41597-023-02068-4
– ident: 5903_CR10
  doi: 10.18653/v1/2022.bionlp-1.37
– volume: 51
  start-page: D1276
  issue: D1
  year: 2023
  ident: 5903_CR30
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkac1085
– ident: 5903_CR29
– volume: 3
  start-page: 1
  issue: 1–2
  year: 2021
  ident: 5903_CR6
  publication-title: ACM Trans Comput Healthcare
– ident: 5903_CR16
– year: 2020
  ident: 5903_CR21
  publication-title: Sci Data
  doi: 10.1038/s41597-019-0342-9
– volume: 2016
  start-page: baw032
  year: 2016
  ident: 5903_CR27
  publication-title: Database
  doi: 10.1093/database/baw032
– ident: 5903_CR20
– ident: 5903_CR2
  doi: 10.48550/arXiv.2303.08774
– ident: 5903_CR12
– volume: 41
  start-page: 510
  year: 2013
  ident: 5903_CR31
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkt531
– volume: 51
  start-page: D1257
  issue: D1
  year: 2023
  ident: 5903_CR18
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkac833
– ident: 5903_CR8
– volume: 21
  start-page: 1
  issue: 140
  year: 2020
  ident: 5903_CR4
  publication-title: JMLR
– ident: 5903_CR1
– ident: 5903_CR3
– volume: 55
  start-page: 1
  year: 2023
  ident: 5903_CR14
  publication-title: ACM Comput Surv
– ident: 5903_CR28
– volume: 195
  start-page: 155
  issue: 2
  year: 2023
  ident: 5903_CR17
  publication-title: Toxicol Sci
  doi: 10.1093/toxsci/kfad069
– ident: 5903_CR24
– volume: 36
  start-page: 1234
  issue: 4
  year: 2020
  ident: 5903_CR5
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btz682
– ident: 5903_CR26
– volume: 480
  start-page: 339
  year: 2018
  ident: 5903_CR15
  publication-title: J Inf Sci
  doi: 10.1016/j.ins.2018.12.041
SSID ssj0017805
Score 2.475747
Snippet Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We...
There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. We propose a novel...
There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. As a result, our system...
Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main body We...
BackgroundThere has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.Main bodyWe...
There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery.BACKGROUNDThere has been a...
Abstract Background There has been a considerable advancement in AI technologies like LLM and machine learning to support biomedical knowledge discovery. Main...
SourceID doaj
pubmedcentral
proquest
gale
pubmed
crossref
springer
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Publisher
StartPage 273
SubjectTerms Algorithms
Attention task
Big data management in biological domains
Bioinformatics
Biomedical and Life Sciences
Biomedical interaction extraction
Chatbots
Chemical compounds
Computational Biology/Bioinformatics
Computational linguistics
Computer Appl. in Life Sciences
Data mining
Data Mining - methods
Deep learning
Documents
Drug discovery
Drug interaction
Drugs
Engine blocks
Genes
Information Storage and Retrieval - methods
Keywords
Knowledge discovery
Knowledge Discovery - methods
Language
Language processing
Large language models
Life Sciences
LLM
Machine Learning
Medical research
Medical Subject Headings-MeSH
Medicine, Experimental
Methods
Microarrays
Natural language
Natural language interfaces
Natural Language Processing
Neural Networks, Computer
Proteins
PubMed
Queries
Query languages
RAG
Retrieval
Search Engine
Search engines
Semantics
Speech recognition
Technology application
Text mining
Transformer
Transformers
Trends
Unstructured data
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwEB6hSkhcEG8CLTIIiQNETeL4keOCqFoOHICterP8ytJLFm12D_33nXGSpSlCXLiuvdHkm7E9E898A_BWWhdjKxTGJtLmddnq3Imgci8aG2LrRYgp2-KrPF3WXy7ExY1WX5QTNtADD8AdV14oq3StG1RnkByfQHdzUVoeCjuwlxZNMQVT4_0BMfVPJTJaHvcl8bTleB7lVGzJczk7hhJb_5978o1D6XbC5K1b03QYnTyA-6MXyRaD9A_hTuwewd2hr-TVY1ieL87Ombtc51R0S0maV6wfNgVGie4rtp381bhhqRkOs11gm9ReC22P2d0qsXUGtkq81KS-J7A8-fzj02k-9k8gpPk2r5q68ZxIyxzGueh3oLZCVbYheuVqFby1wcsQqkZYHdH1CEHVrojOFs4J2_CncNCtu_gcGKcvDxictboIde0aXXLRaoVPb0IrXMjg_QSn-TXQZJgUXmhpBvANymES-EZm8JEQ388kiuv0AyrejIo3_1J8Bm9IX4ZILDrKklnZXd-bs-_fzEKjY4mBq1AZvBsntWtE1tux6ADfinivZjMPZzNxlfn58GQWZlzlvUHXuKrRwZE8g9f7YfonZa51cb3DOcT0qHBnRIGfDVa0f28qekb_ucxAz-xrBsx8pLv8mTjAyxLlwuA4gw-TKf6W6-_Iv_gfyL-Ee1VaSrjHlodwsN3s4hG6Zlv3Kq3Ca-FENAo
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: ProQuest Technology Collection
  dbid: 8FG
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1Lb9QwELagCIkL4k1oQQYhcQCrefmRE1oQS8uBA7BVb5ZfSXtJymb30H_fGSfZkiK4rr2RPZ6nPfMNIW-FsSHUXEJsIgwrs1oxy71kjlfGh9pxH2K2xXdxtCq_nfLT8cKtH9MqJ50YFbXvHN6RH4IbkpdgTETx8eI3w65R-Lo6ttC4Te5kYGmQz9Xy6-4VAfH6p0IZJQ77DNHaGHyHYcllwcTMGEXM_r818x-m6Wba5I2302iSlg_I_dGXpIvh8B-SW6F9RO4O3SUvH5PVyeL4hNrzjmHpLaZqXtJ-UA0U090bupm81rCmsSUONa2n69hkCziQmm0TMTs9bSI6NR7iE7Jafvn1-YiNXRSQ3sWG5VVZuQKhyyxEu-B9wJn5PKt9cNKW0jtjvBPe5xU3KgARvZelTYM1qbXcVMVTstd2bXhOaIH3DxCi1Sr1ZWkrlRW8VhK-XvmaW5-Q9xM59cUAlqFjkKGEHoivYR06El-LhHxCiu9mItB1_KFbN3qUG507Lo1UpapAmr0ogIHwaTYIU_jUpDwhb_C8NEJZtJgr05ht3-vjnz_0QoF7CeErlwl5N06qO6CsM2PpAewK0a9mMw9mM0HW3Hx4Ygs9ynqvrzkzIa93w_hPzF9rQ7eFOYj3KEE_woKfDVy02zeWPoMXnSVEzfhrRpj5SHt-FpHAswzWBSFyQj5MrHi9rn9T_sX_t7FP7uVRSECHZgdkb7Pehpfgem3sqyhfV-dGLCg
  priority: 102
  providerName: ProQuest
– databaseName: Springer Nature OA Free Journals
  dbid: C6C
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Lb9QwEB5BERIXxJtAQQYhcYCIOI4fOS4VVcuBA7BVb5YdO0svWbTZPfTfM-MkS1PgwDUeR8487JnMzGeAN8r5GFupMTZRLq94a3Ivg84bWbsQ20aGmKotvqiTZfX5XJ6PMDnUC3M1f8-N-tBzQljL8STJqU1S5Oom3JJcqJSYVUf7jAFh809NMX-dNzt4Ej7_n7vwlWPoeonktTxpOn6O78Hd0W9ki0HQ9-FG7B7A7eEmycuHsDxbnJ4xf7HOqc2WyjIvWT9sA4xK21dsO3moccPS9TfMdYFt0oVaqG3M7VYJnzOwVUKiJoE9guXxp-9HJ_l4YwLxVmzzsq7qRhBMmcfIFj0NlE8oeRtio32lQ-NcaFQIZS2diehshKArX0TvCu-lq8VjOOjWXXwKTNC_BgzHWlOEqvK14UK2RuPb69BKHzJ4N7HT_hyAMWwKKIyyA_MtrsMm5luVwUfi-J6SQK3TA5S1HW3Elo3UTpvK1Gi5QQlUFkrDRuVEKFwhM3hN8rIEW9FRXczK7frenn77ahcGXUkMVaXO4O1I1K6Rs40b2wzwqwjpakZ5OKNEu2rmw5Na2NGue4vOcFmhS6NEBq_2wzSTatW6uN4hDWE7atwLccFPBi3afze1OaPHzDMwM_2aMWY-0l38SKjfnOO6MBzO4P2kir_X9W_OP_s_8udwp0xGg_snP4SD7WYXX6DbtfUvk739AuyCJSw
  priority: 102
  providerName: Springer Nature
Title VAIV bio-discovery service using transformer model and retrieval augmented generation
URI https://link.springer.com/article/10.1186/s12859-024-05903-6
https://www.ncbi.nlm.nih.gov/pubmed/39169321
https://www.proquest.com/docview/3102466163
https://www.proquest.com/docview/3095674255
https://pubmed.ncbi.nlm.nih.gov/PMC11340140
https://doaj.org/article/2c57a78489004d63ade2870e6a3d0a05
Volume 25
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Zb9NAEB71EKgviBtDiQxC4gEM8bGHHxBKo4YWiQoVUuVttetdh0rIaXNI5N8zs7ZTXMoLL5GSHcfrOXZmvLPfALzi2jhXMoG5CddRFpcyMsyKqGC5tq4smHW-2uKEH42zzxM22YK23VHDwMWNqR31kxrPf777dbn-iAb_wRu85O8XMaGwRehtIjpKmUZ8G3bRM3FKxr5kV7sKhN_fHpy58bo9uE0HUTGmiTt-ysP5_71o_-G1rldUXttW9d5qdBfuNGFmOKj14h5sueo-3KobT64fwPhscHwWmvNZRKdyqYpzHS7qVSOkSvhpuGwDWjcPfbecUFc2nPv-W6icoV5NPZynDaceuJrk-xDGo8Pvw6OoabBAokiXUZJneZESqpnBRBgDExSnTeLSukKYTNhCa1twa5OcaekwNrFWZKbvjO4bw3SePoKdala5JxCm9GoCs7dS9m2WmVzGKSulwH_PbcmMDeBNy051UeNoKJ9_SK5qOSich_JyUDyAA-L4hpIwsP0Ps_lUNSalkoIJLWQmczR0y1PULdq1dVyntq_7LICXJC9FKBcVldFM9WqxUMffTtVAYuSJmS0TAbxuiMoZcrbQzakEfCoCxupQ7nco0QyL7nCrFqrVYoWxc5JhBMTTAF5shulKKm2r3GyFNAQFKXDpxAk_rrVo89ytMgYgO_rVYUx3pDr_4UHC4xjnhdlzAG9bVbya1785__T_7_QM9hJvS7j0xvuws5yv3HOM2JamB9tiIvBTjj71YPfg8OTrKX4b8mHPvwPpeTP9DXh9QiU
linkProvider Scholars Portal
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELaqIgQXxJtAAYNAHCBqXn7kgNDyqHZp6QG61d5cO3aWXpKy2RXaP8VvZMZJtqQIbr2unY09mac98w0hL7g2zpVMQGzCdZjFpQwNsyIsWK6tKwtmnc-2OOTjafZ5xmZb5FdfC4Nplb1O9Ira1gWeke-CG5JkYEx4-u7sR4hdo_B2tW-h0bLFvlv_hJCteTv5CN_3ZZLsfTr6MA67rgL4_nQZJnmWFylCeRmI_sAawx5sEpfWFcJkwhZa24Jbm-RMSwcG2VqRmcgZHRnDNIIvgcq_kuHJOMiPmG0CvBj7A_SFOZLvNjGiw4XwqhBLPNOQD4yf7xHwtyX4wxReTNO8cFfrTeDeTXKj813pqGW2W2TLVbfJ1bab5foOmR6PJsfUnNYhlvpiauiaNq0qopheP6fL3kt2C-pb8FBdWbrwTb2A46lezT1GqKVzj4aNTHOXTC-FvvfIdlVX7gGhKZ53QEhYyshmmcllnLJSCvj33JbM2IC87smpzlpwDuWDGslVS3wF61Ce-IoH5D1SfDMTgbX9D_Virjo5VUnBhBYykzloD8tTYFi8CnZcpzbSEQvIc_xeCqEzKszNmetV06jJt69qJMGdhXCZiYC86iaVNVC20F2pA-wK0bYGM3cGM0G2i-Fwzxaq0y2NOpeEgDzbDOOTmC9XuXoFcxBfUoA-hgXfb7los28stQavPQ6IHPDXgDDDker0u0cej2NYF4TkAXnTs-L5uv5N-Yf_38ZTcm189OVAHUwO9x-R64kXGNDf8Q7ZXi5W7jG4fUvzxMsaJSeXLdy_AX3Sax8
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Lb9QwELagCNQL4llSChiExAGi5uFXjkth1QVUIWCr3iw7dkIv2WqTPfTfM-MkS1PgwDUeR854xjOTmflMyGthrPcVlxCbCBOztFKx5U7GJS-M81XJnQ_VFifieMk-nfGzK138odp9TEn2PQ2I0tR0hxeu6lVcicM2Rdy1GOxLjM2TeSxuklsMTR-ma8XRNo-AiP1jq8xf503MUUDt__NsvmKcrhdOXsueBqM0v0fuDt4knfXbf5_c8M0Dcru_X_LyIVmezhan1J6vYmy-xWLNS9r2hwPFgveadqPf6tc0XIpDTePoOlyzBTJIzaYOqJ2O1gGfGrfxEVnOP_44Oo6HexSQ43kXZwUryhzByyzEu-B_wK65LK2cL6Vl0pXGuFI4lxXcKA8uiHOS2cRbk1jLTZE_JjvNqvFPCM3xDwQEaZVKHGO2UGnOKyXh7YWruHUReTuyU1_0cBk6hBlK6J75GtahA_O1iMh75PiWEqGuw4PVutaD5uis5NJIxVQB-uxEDiKEyVkvTO4Sk_CIvML90ghm0WC1TG02basX37_pmQIHE8SCy4i8GYiqFXC2NEPzAXwV4l9NKA8mlKBt5XR4FAs9aHurwUXOGDg6Io_Iy-0wzsQKtsavNkCDiI8STkhY8F4vRdvvxuZn8KPTiKiJfE0YMx1pzn8GLPA0hXVBkByRd6Mo_l7Xvzm__3_kL8idrx_m-svi5PNTspsF_YEDNj0gO91645-BX9bZ50H1fgEpXzBg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=VAIV+bio-discovery+service+using+transformer+model+and+retrieval+augmented+generation&rft.jtitle=BMC+bioinformatics&rft.au=Kim%2C+Seonho&rft.au=Yoon%2C+Juntae&rft.date=2024-08-21&rft.pub=BioMed+Central&rft.eissn=1471-2105&rft.volume=25&rft_id=info:doi/10.1186%2Fs12859-024-05903-6&rft_id=info%3Apmid%2F39169321&rft.externalDocID=PMC11340140
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon