AVN: A Deep Learning Approach for the Analysis of Birdsong
Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a...
Saved in:
Published in | bioRxiv |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
United States
Cold Spring Harbor Laboratory
24.08.2024
|
Online Access | Get full text |
Cover
Loading…
Abstract | Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline,
(AVN), for the learned vocalizations of the most extensively studied vocal learning model species - the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird's stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes. |
---|---|
AbstractList | Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline,
Avian Vocalization Network
(AVN), for the learned vocalizations of the most extensively studied vocal learning model species – the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird’s stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes. Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline, Avian Vocalization Network (AVN), for the learned vocalizations of the most extensively studied vocal learning model species - the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird's stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes.Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline, Avian Vocalization Network (AVN), for the learned vocalizations of the most extensively studied vocal learning model species - the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird's stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes. Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline, (AVN), for the learned vocalizations of the most extensively studied vocal learning model species - the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird's stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes. |
Author | Marks, Ethan S Roberts, Todd F Koch, Therese M I |
Author_xml | – sequence: 1 givenname: Therese M I orcidid: 0000-0002-5327-3219 surname: Koch fullname: Koch, Therese M I – sequence: 2 givenname: Ethan S surname: Marks fullname: Marks, Ethan S – sequence: 3 givenname: Todd F orcidid: 0000-0002-0967-6598 surname: Roberts fullname: Roberts, Todd F |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/39229184$$D View this record in MEDLINE/PubMed |
BookMark | eNpVkEtPwzAQhC1URKH0B3BBPnJJWT9j94JCeUoVXICr5aROG5TawW6R-u-poKByml3N6hvtnKCeD94hdEZgRAiQSwqUj0CMtrvQTEhygI6p1DRTFERvb-6jYUrvAEC1JCznR6jPNKWaKH6MxsXb0xgX-Ma5Dk-djb7xc1x0XQy2WuA6RLxaOFx4225Sk3Co8XUTZyn4-Sk6rG2b3HCnA_R6d_syecimz_ePk2KadUQIkhE2s0IKJTmDWrO8klwTKWkOVNjacsEtK0EpoqUrqzJnVlakUpwxELlQwAbo6ofbrculm1XOr6JtTRebpY0bE2xj_ju-WZh5-DRk-y3wb8LFjhDDx9qllVk2qXJta70L62QYARCSaq62p-f7YX8pv42xL4tkbPg |
ContentType | Journal Article |
DBID | NPM 7X8 5PM |
DOI | 10.1101/2024.05.10.593561 |
DatabaseName | PubMed MEDLINE - Academic PubMed Central (Full Participant titles) |
DatabaseTitle | PubMed MEDLINE - Academic |
DatabaseTitleList | MEDLINE - Academic PubMed |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EISSN | 2692-8205 |
ExternalDocumentID | 39229184 |
Genre | Journal Article Preprint |
GroupedDBID | 8FE 8FH AFKRA ALMA_UNASSIGNED_HOLDINGS BBNVY BENPR BHPHI HCIFZ LK8 M7P NPM NQS PIMPY PROAC RHI 7X8 5PM |
ID | FETCH-LOGICAL-p1551-13da56586430f937c64916627025afa454a3b088196ebcb73a6c1c84330575803 |
ISSN | 2692-8205 |
IngestDate | Thu Sep 05 06:27:17 EDT 2024 Thu Oct 24 02:23:18 EDT 2024 Sat Nov 02 12:31:02 EDT 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
License | This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License, which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use. |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-p1551-13da56586430f937c64916627025afa454a3b088196ebcb73a6c1c84330575803 |
Notes | ObjectType-Working Paper/Pre-Print-3 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ORCID | 0000-0002-0967-6598 0000-0002-5327-3219 |
OpenAccessLink | https://pubmed.ncbi.nlm.nih.gov/PMC11370480 |
PMID | 39229184 |
PQID | 3100562948 |
PQPubID | 23479 |
ParticipantIDs | pubmedcentral_primary_oai_pubmedcentral_nih_gov_11370480 proquest_miscellaneous_3100562948 pubmed_primary_39229184 |
PublicationCentury | 2000 |
PublicationDate | 2024-Aug-24 20240824 |
PublicationDateYYYYMMDD | 2024-08-24 |
PublicationDate_xml | – month: 08 year: 2024 text: 2024-Aug-24 day: 24 |
PublicationDecade | 2020 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States |
PublicationTitle | bioRxiv |
PublicationTitleAlternate | bioRxiv |
PublicationYear | 2024 |
Publisher | Cold Spring Harbor Laboratory |
Publisher_xml | – name: Cold Spring Harbor Laboratory |
SSID | ssj0002961374 |
Score | 1.9309995 |
Snippet | Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and... |
SourceID | pubmedcentral proquest pubmed |
SourceType | Open Access Repository Aggregation Database Index Database |
Title | AVN: A Deep Learning Approach for the Analysis of Birdsong |
URI | https://www.ncbi.nlm.nih.gov/pubmed/39229184 https://www.proquest.com/docview/3100562948 https://pubmed.ncbi.nlm.nih.gov/PMC11370480 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1dT9swFLU20CReEPuC7gN50t5QWBrbIeGtjCI0sQ6hFvUtshMH-pJUUKZtv55z4yRtKQ-wlyhyqljycW-Or-85ZuxrbKxVkaEqcYkFSp4ZT-PD7GVCZdqo0KSmqrYYhKcj-WOsxnMz_0pdMjP76b9HdSX_gyragCupZJ-BbPtSNOAe-OIKhHF9Esa9y4FTlh9bO22sUq-IWTqdVFNCuOg8cjSpVFZXi6zUTMqLP5PfbfAt3flQmEIkTtr7OU-tkrTHCRgo5T7Pm7oCbVdyVGZZXS9cZxMCSelRJ2J2QScIY0TIwHc7zXa1bTXkVlb_9C6yQEWLioVyBuvL9taDX8nJ6OwsGfbHw5dsPUBkQEhaP-oPzi_atFgQg19U3tltr_VeNPr5ttLLY-uCh-WtC3xhuMU2a6LPew611-yFLd6wV-7oz79v2SGwO-Q9TsjxBjneIMeBHAdyvEGOlzlvkHvHRif94fdTrz7HwpsSIfW6ItPgzRHIn5-DDqahBCkPSQmodK6lkloYRHsEQ4v_xoHQYdpNIykEkenIF-_ZWlEWdodxcrAMUiX8zORSI9r6JpS50LmJ89C3usO-NOORIE7Q5o8ubHl3m9BGDrhuLKMO23bjk0ydoUkCjhzEWOp3WLQ0cu0PyIN8-Ukxua68yLtAi2wJPjyh449sYz7lPrG12c2d_QxKNzO79TS4BwoARv4 |
link.rule.ids | 230,315,783,787,888,27936,27937,33757 |
linkProvider | ProQuest |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=AVN%3A+A+Deep+Learning+Approach+for+the+Analysis+of+Birdsong&rft.jtitle=bioRxiv&rft.au=Koch%2C+Therese+M+I&rft.au=Marks%2C+Ethan+S&rft.au=Roberts%2C+Todd+F&rft.date=2024-08-24&rft.issn=2692-8205&rft.eissn=2692-8205&rft_id=info:doi/10.1101%2F2024.05.10.593561&rft.externalDBID=NO_FULL_TEXT |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2692-8205&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2692-8205&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2692-8205&client=summon |