Speech processing and retrieval in a personal memory aid system for the elderly

The paper presents a new application of automatic speech processing in the Ambient Assisted Living area, developed in the course of a three year research project. Recording and automatic processing of spoken conversations plays a major role in this solution enabling effective search in a personal au...

Full description

Saved in:
Bibliographic Details
Published in2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pp. 1749 - 1752
Main Authors Sorin, Alexander, Aronowitz, Hagai, Mamou, Jonathan, Toledo-Ronen, Orith, Hoory, Ron, Kuritzky, Michael, Erez, Yael, Ramabhadran, Bhuvana, Sethy, Abhinav
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.05.2011
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The paper presents a new application of automatic speech processing in the Ambient Assisted Living area, developed in the course of a three year research project. Recording and automatic processing of spoken conversations plays a major role in this solution enabling effective search in a personal audio archive and fast browsing of conversations. Processing of elderly conversational speech recorded by a distant PDA microphone poses a great challenge. The speech processing flow includes transcription, speaker tracking and combined indexing and search of spoken terms and participating speakers identity extracted from the audio. We present the entire application and individual speech processing components as well as evaluation results of the individual components and of the end-to-end spoken information retrieval solution.
AbstractList The paper presents a new application of automatic speech processing in the Ambient Assisted Living area, developed in the course of a three year research project. Recording and automatic processing of spoken conversations plays a major role in this solution enabling effective search in a personal audio archive and fast browsing of conversations. Processing of elderly conversational speech recorded by a distant PDA microphone poses a great challenge. The speech processing flow includes transcription, speaker tracking and combined indexing and search of spoken terms and participating speakers identity extracted from the audio. We present the entire application and individual speech processing components as well as evaluation results of the individual components and of the end-to-end spoken information retrieval solution.
Author Erez, Yael
Ramabhadran, Bhuvana
Sethy, Abhinav
Sorin, Alexander
Mamou, Jonathan
Aronowitz, Hagai
Kuritzky, Michael
Toledo-Ronen, Orith
Hoory, Ron
Author_xml – sequence: 1
  givenname: Alexander
  surname: Sorin
  fullname: Sorin, Alexander
  email: sorin@il.ibm.com
  organization: IBM Haifa Res. Lab., Haifa, Israel
– sequence: 2
  givenname: Hagai
  surname: Aronowitz
  fullname: Aronowitz, Hagai
  email: hagaia@il.ibm.com
  organization: IBM Haifa Res. Lab., Haifa, Israel
– sequence: 3
  givenname: Jonathan
  surname: Mamou
  fullname: Mamou, Jonathan
  email: mamou@il.ibm.com
  organization: IBM Haifa Res. Lab., Haifa, Israel
– sequence: 4
  givenname: Orith
  surname: Toledo-Ronen
  fullname: Toledo-Ronen, Orith
  email: oritht@il.ibm.com
  organization: IBM Haifa Res. Lab., Haifa, Israel
– sequence: 5
  givenname: Ron
  surname: Hoory
  fullname: Hoory, Ron
  email: hoory@il.ibm.com
  organization: IBM Haifa Res. Lab., Haifa, Israel
– sequence: 6
  givenname: Michael
  surname: Kuritzky
  fullname: Kuritzky, Michael
  email: kmichael@il.ibm.com
  organization: IBM Haifa Res. Lab., Haifa, Israel
– sequence: 7
  givenname: Yael
  surname: Erez
  fullname: Erez, Yael
  email: yaele@il.ibm.com
  organization: IBM Haifa Res. Lab., Haifa, Israel
– sequence: 8
  givenname: Bhuvana
  surname: Ramabhadran
  fullname: Ramabhadran, Bhuvana
  email: bhuvana@us.ibm.com
– sequence: 9
  givenname: Abhinav
  surname: Sethy
  fullname: Sethy, Abhinav
  email: asethy@us.ibm.com
BookMark eNo1UNtKAzEUjFrBbe0X9CU_sDX37HmUolUoVKiCbyXdPbGRvZEswv69C9aBYRgGBmbmZNZ2LRKy4mzNOYOH183j4fC2FozztQZlCsWuyJwrbS3TEuw1yYS0kHNgnzdkCbb4zwo2IxnXguWGK7gjy5S-2QQjrNWQkf2hRyzPtI9diSmF9ou6tqIRhxjwx9U0tNTRHmPq2sk12HRxpC5UNI1pwIb6LtLhjBTrCmM93pNb7-qEy4suyMfz0_vmJd_tt9OKXR641UPuKlHogqmTnIjGaiuMA8Gq0svTCbCsrC9lobxEoxVwZrxTBjxIVHxaLBdk9dcbEPHYx9C4OB4v38hfmHxV6w
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/ICASSP.2011.5946840
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library Online
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library Online
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 1457705397
9781457705373
9781457705397
1457705370
EISSN 2379-190X
EndPage 1752
ExternalDocumentID 5946840
Genre orig-research
GroupedDBID 23M
29P
6IE
6IF
6IH
6IK
6IL
6IM
6IN
AAJGR
ABLEC
ACGFS
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IJVOP
IPLJI
JC5
M43
OCL
RIE
RIL
RIO
RNS
ID FETCH-LOGICAL-i175t-ad285804b304be675726a920dcf3bb9ecd7fc384f3e6549106fa469f93e415393
IEDL.DBID RIE
ISBN 9781457705380
1457705389
ISSN 1520-6149
IngestDate Wed Jun 26 19:20:07 EDT 2024
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i175t-ad285804b304be675726a920dcf3bb9ecd7fc384f3e6549106fa469f93e415393
PageCount 4
ParticipantIDs ieee_primary_5946840
PublicationCentury 2000
PublicationDate 2011-May
PublicationDateYYYYMMDD 2011-05-01
PublicationDate_xml – month: 05
  year: 2011
  text: 2011-May
PublicationDecade 2010
PublicationTitle 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PublicationTitleAbbrev ICASSP
PublicationYear 2011
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0000627759
ssj0008748
Score 1.8108269
Snippet The paper presents a new application of automatic speech processing in the Ambient Assisted Living area, developed in the course of a three year research...
SourceID ieee
SourceType Publisher
StartPage 1749
SubjectTerms Ambient Assisted Living
Decision support systems
speaker diarization
speaker identification
Speech transcription
spoken information retrieval
Title Speech processing and retrieval in a personal memory aid system for the elderly
URI https://ieeexplore.ieee.org/document/5946840
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELZKJ1h4tIi3PDCSNo2d2B5RRVWQCpVKpW6VY59FBaRR1Q7l13NO0vIQA0Ok2EuSyyn33eW-7wi5dogiQu104BIlAo4BKZCyYwIbRzYKpQYder7z4DHpj_nDJJ7UyM2WCwMARfMZtPxp8S_fzs3Kl8raseJem2SH7AilSq7Wtp7i5XYL6brqKyxFMTkLw5NPj7gqSF2xEOh0Um20nqp1WMkRdULVvu_ejkbDUtuzut6PwStF3Ontk8Hmjst2k9fWapm2zMcvMcf_PtIBaX4x_OhwG7sOSQ2yI7L3TZywQZ5GOYB5oXlJJcA9qjNLF8UILvRPOsuopnkF5um7b9ldUz2ztFSHpgiHKcJLCn4Q-Nu6Sca9u-duP6jGLwQzxBTLQNtIxjLkKcMDMLEQUaJVFFrjWJoqMFY4wyR3DJLYv-XEaUy2nWKAqIApdkzq2TyDE0J5pBPBODDHBNe4Ao62N0ogugpdR56ShjfNNC8VNqaVVc7-3j4nu2Vl17cdXpD6crGCS4QGy_Sq8IlPg1GwQg
link.rule.ids 310,311,783,787,792,793,799,27939,55088
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT8JAEN4gHtSLDzC-3YNHC6W77e4eDZGgApIACTeybWcjUUtD4IC_3tm24CMePDTp7qXtdNL5ZjrfN4TcGEQRrjbaMYESDseA5EjZiJzY92LPlRq0a_nO3V7QHvHHsT8ukdsNFwYAsuYzqNnT7F9-PIuWtlRW9xW32iRbZNu3uCJna20qKlZwNxOvK77DUmSzszBA2QSJq4zW5QuBbifVWu2pWLuFIFHDVfWH5t1g0M_VPYsr_hi9kkWe1j7pru85bzh5rS0XYS36-CXn-N-HOiDVL44f7W-i1yEpQXJE9r7JE1bI8yAFiF5ompMJcI_qJKbzbAgXeiidJlTTtIDz9N027a6onsY014emCIgpAkwKdhT426pKRq37YbPtFAMYnCmiioWjY0_60uUhwwMwtRBeoJXnxpFhYaggioWJmOSGQeDb9xwYjem2UQwQFzDFjkk5mSVwQij3dCAYB2aY4BpXwNH2kRKIr1zTkKekYk0zSXONjUlhlbO_t6_JTnvY7Uw6D72nc7Kb13ltE-IFKS_mS7hEoLAIrzL_-AT2c7OP
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2011+IEEE+International+Conference+on+Acoustics%2C+Speech+and+Signal+Processing+%28ICASSP%29&rft.atitle=Speech+processing+and+retrieval+in+a+personal+memory+aid+system+for+the+elderly&rft.au=Sorin%2C+Alexander&rft.au=Aronowitz%2C+Hagai&rft.au=Mamou%2C+Jonathan&rft.au=Toledo-Ronen%2C+Orith&rft.date=2011-05-01&rft.pub=IEEE&rft.isbn=9781457705380&rft.issn=1520-6149&rft.eissn=2379-190X&rft.spage=1749&rft.epage=1752&rft_id=info:doi/10.1109%2FICASSP.2011.5946840&rft.externalDocID=5946840
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1520-6149&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1520-6149&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1520-6149&client=summon