Deep clustering: Discriminative embeddings for segmentation and separation
Hershey, John R., Zhuo Chen, Le Roux, Jonathan, Watanabe, Shinji
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Get full text
Conference Proceeding
Journal Article
Deep clustering and conventional networks for music separation: Stronger together
Luo, Yi, Chen, Zhuo, Hershey, John R., Le Roux, Jonathan, Mesgarani, Nima
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Get full text
Conference Proceeding
Journal Article
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks
Erdogan, Hakan, Hershey, John R., Watanabe, Shinji, Le Roux, Jonathan
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Get full text
Conference Proceeding
Improving Universal Sound Separation Using Sound Classification
Tzinis, Efthymios, Wisdom, Scott, Hershey, John R., Jansen, Aren, Ellis, Daniel P. W.
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement
Koizumi, Yuma, Karita, Shigeki, Wisdom, Scott, Erdogan, Hakan, Hershey, John R., Jones, Llion, Bacchiani, Michiel
Published in 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.01.2021)
Published in 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.01.2021)
Get full text
Conference Proceeding
Hybrid CTC/Attention Architecture for End-to-End Speech Recognition
Watanabe, Shinji, Hori, Takaaki, Kim, Suyoun, Hershey, John R., Hayashi, Tomoki
Published in IEEE journal of selected topics in signal processing (01.12.2017)
Published in IEEE journal of selected topics in signal processing (01.12.2017)
Get full text
Journal Article
Universal Sound Separation
Kavalerov, Ilya, Wisdom, Scott, Erdogan, Hakan, Patton, Brian, Wilson, Kevin, Le Roux, Jonathan, Hershey, John R.
Published in 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.10.2019)
Published in 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.10.2019)
Get full text
Conference Proceeding
Minimum word error training of long short-term memory recurrent neural network language models for speech recognition
Hori, Takaaki, Hori, Chiori, Watanabe, Shinji, Hershey, John R.
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Get full text
Conference Proceeding
Journal Article
Multi-Channel Deep Clustering: Discriminative Spectral and Spatial Embeddings for Speaker-Independent Speech Separation
Wang, Zhong-Qiu, Le Roux, Jonathan, Hershey, John R.
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Get full text
Conference Proceeding
Attention-Based Multimodal Fusion for Video Description
Hori, Chiori, Hori, Takaaki, Teng-Yok Lee, Ziming Zhang, Harsham, Bret, Hershey, John R., Marks, Tim K., Sumi, Kazuhiko
Published in 2017 IEEE International Conference on Computer Vision (ICCV) (01.10.2017)
Published in 2017 IEEE International Conference on Computer Vision (ICCV) (01.10.2017)
Get full text
Conference Proceeding
Deep long short-term memory adaptive beamforming networks for multichannel robust speech recognition
Zhong Meng, Watanabe, Shinji, Hershey, John R., Erdogan, Hakan
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Get full text
Conference Proceeding
Differentiable Consistency Constraints for Improved Deep Speech Enhancement
Wisdom, Scott, Hershey, John R., Wilson, Kevin, Thorpe, Jeremy, Chinen, Michael, Patton, Brian, Saurous, Rif A.
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Get full text
Conference Proceeding
Deep NMF for speech separation
Le Roux, Jonathan, Hershey, John R., Weninger, Felix
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Get full text
Conference Proceeding
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation
Wisdom, Scott, Jansen, Aren, Weiss, Ron J., Erdogan, Hakan, Hershey, John R.
Published in 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (17.10.2021)
Published in 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (17.10.2021)
Get full text
Conference Proceeding
What's all the Fuss about Free Universal Sound Separation Data?
Wisdom, Scott, Erdogan, Hakan, Ellis, Daniel P. W., Serizel, Romain, Turpault, Nicolas, Fonseca, Eduardo, Salamon, Justin, Seetharaman, Prem, Hershey, John R.
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Get full text
Conference Proceeding
End-To-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings
Maiti, Soumi, Erdogan, Hakan, Wilson, Kevin, Wisdom, Scott, Watanabe, Shinji, Hershey, John R.
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Get full text
Conference Proceeding
Sound Event Detection and Separation: A Benchmark on Desed Synthetic Soundscapes
Turpault, Nicolas, Serizel, Romain, Wisdom, Scott, Erdogan, Hakan, Hershey, John R., Fonseca, Eduardo, Seetharaman, Prem, Salamon, Justin
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Get full text
Conference Proceeding
Adapting Speech Separation to Real-World Meetings using Mixture Invariant Training
Sivaraman, Aswin, Wisdom, Scott, Erdogan, Hakan, Hershey, John R.
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding