AVN: A Deep Learning Approach for the Analysis of Birdsong

Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a...

Full description

Saved in:
Bibliographic Details
Published inbioRxiv
Main Authors Koch, Therese M I, Marks, Ethan S, Roberts, Todd F
Format Journal Article
LanguageEnglish
Published United States Cold Spring Harbor Laboratory 24.08.2024
Online AccessGet full text

Cover

Loading…
Abstract Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline, (AVN), for the learned vocalizations of the most extensively studied vocal learning model species - the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird's stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes.
AbstractList Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline, Avian Vocalization Network (AVN), for the learned vocalizations of the most extensively studied vocal learning model species – the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird’s stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes.
Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline, Avian Vocalization Network (AVN), for the learned vocalizations of the most extensively studied vocal learning model species - the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird's stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes.Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline, Avian Vocalization Network (AVN), for the learned vocalizations of the most extensively studied vocal learning model species - the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird's stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes.
Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and generalizability for performance, making it difficult to quantitively compare phenotypes across datasets and research groups. We developed a novel deep learning-based behavior analysis pipeline, (AVN), for the learned vocalizations of the most extensively studied vocal learning model species - the zebra finch. AVN annotates songs with high accuracy across multiple animal colonies without the need for any additional training data and generates a comprehensive set of interpretable features to describe the syntax, timing, and acoustic properties of song. We use this feature set to compare song phenotypes across multiple research groups and experiments, and to predict a bird's stage in song development. Additionally, we have developed a novel method to measure song imitation that requires no additional training data for new comparisons or recording environments, and outperforms existing similarity scoring methods in its sensitivity and agreement with expert human judgements of song similarity. These tools are available through the open-source AVN python package and graphical application, which makes them accessible to researchers without any prior coding experience. Altogether, this behavior analysis toolkit stands to facilitate and accelerate the study of vocal behavior by enabling a standardized mapping of phenotypes and learning outcomes, thus helping scientists better link behavior to the underlying neural processes.
Author Marks, Ethan S
Roberts, Todd F
Koch, Therese M I
Author_xml – sequence: 1
  givenname: Therese M I
  orcidid: 0000-0002-5327-3219
  surname: Koch
  fullname: Koch, Therese M I
– sequence: 2
  givenname: Ethan S
  surname: Marks
  fullname: Marks, Ethan S
– sequence: 3
  givenname: Todd F
  orcidid: 0000-0002-0967-6598
  surname: Roberts
  fullname: Roberts, Todd F
BackLink https://www.ncbi.nlm.nih.gov/pubmed/39229184$$D View this record in MEDLINE/PubMed
BookMark eNpVkEtPwzAQhC1URKH0B3BBPnJJWT9j94JCeUoVXICr5aROG5TawW6R-u-poKByml3N6hvtnKCeD94hdEZgRAiQSwqUj0CMtrvQTEhygI6p1DRTFERvb-6jYUrvAEC1JCznR6jPNKWaKH6MxsXb0xgX-Ma5Dk-djb7xc1x0XQy2WuA6RLxaOFx4225Sk3Co8XUTZyn4-Sk6rG2b3HCnA_R6d_syecimz_ePk2KadUQIkhE2s0IKJTmDWrO8klwTKWkOVNjacsEtK0EpoqUrqzJnVlakUpwxELlQwAbo6ofbrculm1XOr6JtTRebpY0bE2xj_ju-WZh5-DRk-y3wb8LFjhDDx9qllVk2qXJta70L62QYARCSaq62p-f7YX8pv42xL4tkbPg
ContentType Journal Article
DBID NPM
7X8
5PM
DOI 10.1101/2024.05.10.593561
DatabaseName PubMed
MEDLINE - Academic
PubMed Central (Full Participant titles)
DatabaseTitle PubMed
MEDLINE - Academic
DatabaseTitleList
MEDLINE - Academic
PubMed
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 2692-8205
ExternalDocumentID 39229184
Genre Journal Article
Preprint
GroupedDBID 8FE
8FH
AFKRA
ALMA_UNASSIGNED_HOLDINGS
BBNVY
BENPR
BHPHI
HCIFZ
LK8
M7P
NPM
NQS
PIMPY
PROAC
RHI
7X8
5PM
ID FETCH-LOGICAL-p1551-13da56586430f937c64916627025afa454a3b088196ebcb73a6c1c84330575803
ISSN 2692-8205
IngestDate Thu Sep 05 06:27:17 EDT 2024
Thu Oct 24 02:23:18 EDT 2024
Sat Nov 02 12:31:02 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
License This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License, which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use.
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-p1551-13da56586430f937c64916627025afa454a3b088196ebcb73a6c1c84330575803
Notes ObjectType-Working Paper/Pre-Print-3
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0002-0967-6598
0000-0002-5327-3219
OpenAccessLink https://pubmed.ncbi.nlm.nih.gov/PMC11370480
PMID 39229184
PQID 3100562948
PQPubID 23479
ParticipantIDs pubmedcentral_primary_oai_pubmedcentral_nih_gov_11370480
proquest_miscellaneous_3100562948
pubmed_primary_39229184
PublicationCentury 2000
PublicationDate 2024-Aug-24
20240824
PublicationDateYYYYMMDD 2024-08-24
PublicationDate_xml – month: 08
  year: 2024
  text: 2024-Aug-24
  day: 24
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle bioRxiv
PublicationTitleAlternate bioRxiv
PublicationYear 2024
Publisher Cold Spring Harbor Laboratory
Publisher_xml – name: Cold Spring Harbor Laboratory
SSID ssj0002961374
Score 1.9309995
Snippet Deep learning tools for behavior analysis have enabled important new insights and discoveries in neuroscience. Yet, they often compromise interpretability and...
SourceID pubmedcentral
proquest
pubmed
SourceType Open Access Repository
Aggregation Database
Index Database
Title AVN: A Deep Learning Approach for the Analysis of Birdsong
URI https://www.ncbi.nlm.nih.gov/pubmed/39229184
https://www.proquest.com/docview/3100562948
https://pubmed.ncbi.nlm.nih.gov/PMC11370480
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1dT9swFLU20CReEPuC7gN50t5QWBrbIeGtjCI0sQ6hFvUtshMH-pJUUKZtv55z4yRtKQ-wlyhyqljycW-Or-85ZuxrbKxVkaEqcYkFSp4ZT-PD7GVCZdqo0KSmqrYYhKcj-WOsxnMz_0pdMjP76b9HdSX_gyragCupZJ-BbPtSNOAe-OIKhHF9Esa9y4FTlh9bO22sUq-IWTqdVFNCuOg8cjSpVFZXi6zUTMqLP5PfbfAt3flQmEIkTtr7OU-tkrTHCRgo5T7Pm7oCbVdyVGZZXS9cZxMCSelRJ2J2QScIY0TIwHc7zXa1bTXkVlb_9C6yQEWLioVyBuvL9taDX8nJ6OwsGfbHw5dsPUBkQEhaP-oPzi_atFgQg19U3tltr_VeNPr5ttLLY-uCh-WtC3xhuMU2a6LPew611-yFLd6wV-7oz79v2SGwO-Q9TsjxBjneIMeBHAdyvEGOlzlvkHvHRif94fdTrz7HwpsSIfW6ItPgzRHIn5-DDqahBCkPSQmodK6lkloYRHsEQ4v_xoHQYdpNIykEkenIF-_ZWlEWdodxcrAMUiX8zORSI9r6JpS50LmJ89C3usO-NOORIE7Q5o8ubHl3m9BGDrhuLKMO23bjk0ydoUkCjhzEWOp3WLQ0cu0PyIN8-Ukxua68yLtAi2wJPjyh449sYz7lPrG12c2d_QxKNzO79TS4BwoARv4
link.rule.ids 230,315,783,787,888,27936,27937,33757
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=AVN%3A+A+Deep+Learning+Approach+for+the+Analysis+of+Birdsong&rft.jtitle=bioRxiv&rft.au=Koch%2C+Therese+M+I&rft.au=Marks%2C+Ethan+S&rft.au=Roberts%2C+Todd+F&rft.date=2024-08-24&rft.issn=2692-8205&rft.eissn=2692-8205&rft_id=info:doi/10.1101%2F2024.05.10.593561&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2692-8205&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2692-8205&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2692-8205&client=summon