Follow
Ryo Masumura
Ryo Masumura
Distinguished Research Scientist, NTT Computer and Data Science Laboratories, NTT Corporation
Verified email at lab.ntt.co.jp - Homepage
Title
Cited by
Cited by
Year
Domain adaptation of dnn acoustic models using knowledge distillation
T Asami, R Masumura, Y Yamaguchi, H Masataki, Y Aono
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
1022017
A transformer-based audio captioning model with keyword estimation
Y Koizumi, R Masumura, K Nishida, M Yasuda, S Saito
arXiv preprint arXiv:2007.00222, 2020
742020
Soft-target training with ambiguous emotional utterances for dnn-based speech emotion classification
A Ando, S Kobashikawa, H Kamiyama, R Masumura, Y Ijima, Y Aono
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
512018
Online end-of-turn detection from speech based on stacked time-asynchronous sequential networks.
R Masumura, T Asami, H Masataki, R Ishii, R Higashinaka
Interspeech 2017, 1661-1665, 2017
462017
Neural Dialogue Context Online End-of-Turn Detection
R Masumura, T Tanaka, A Ando, R Ishii, R Higashinaka, Y Aono
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue …, 2018
382018
Hierarchical transformer-based large-context end-to-end asr with large-context knowledge distillation
R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
352021
Large context end-to-end automatic speech recognition via extension of hierarchical recurrent encoder-decoder models
R Masumura, T Tanaka, T Moriya, Y Shinohara, T Oba, Y Aono
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
332019
Neural Error Corrective Language Models for Automatic Speech Recognition.
T Tanaka, R Masumura, H Masataki, Y Aono
INTERSPEECH, 401-405, 2018
332018
Customer satisfaction estimation in contact center calls based on a hierarchical multi-task model
A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 715-728, 2020
312020
Neural confnet classification: Fully neural network based spoken utterance classification using word confusion networks
R Masumura, Y Ijima, T Asami, H Masataki, R Higashinaka
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
302018
Hierarchical LSTMs with Joint Learning for Estimating Customer Satisfaction from Contact Center Calls.
A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono
INTERSPEECH, 1716-1720, 2017
282017
Improving neural text normalization with data augmentation at character-and morphological levels
I Saito, J Suzuki, K Nishida, K Sadamitsu, S Kobashikawa, R Masumura, ...
Proceedings of the Eighth International Joint Conference on Natural Language …, 2017
262017
Self-Distillation for Improving CTC-Transformer-Based ASR Systems.
T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ...
INTERSPEECH, 546-550, 2020
242020
End-to-end japanese multi-dialect speech recognition and dialect identification with multi-task learning
R Imaizumi, R Masumura, S Shiota, H Kiya
APSIPA Transactions on Signal and Information Processing 11 (1), 2022
222022
Sequence-level consistency training for semi-supervised end-to-end automatic speech recognition
R Masumura, M Ihori, A Takashima, T Moriya, A Ando, Y Shinohara
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
222020
Adversarial training for multi-task and multi-lingual joint modeling of utterance intent classification
R Masumura, Y Shinohara, R Higashinaka, Y Aono
Proceedings of the 2018 Conference on Empirical Methods in Natural Language …, 2018
222018
Speech Emotion Recognition Based on Multi-Label Emotion Existence Model.
A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono
INTERSPEECH, 2818-2822, 2019
212019
A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge.
T Tanaka, R Masumura, T Moriya, T Oba, Y Aono
INTERSPEECH, 2210-2214, 2019
202019
Parallel phonetically aware DNNs and LSTM-RNNs for frame-by-frame discriminative modeling of spoken language identification
R Masumura, T Asami, H Masataki, Y Aono
Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017
202017
Training a Language Model Using Webdata for Large Vocabulary Japanese Spontaneous Speech Recognition.
R Masumura, S Hahm, A Ito
INTERSPEECH, 1465-1468, 2011
172011
The system can't perform the operation now. Try again later.
Articles 1–20