Publications
71 total
Multi-View Learning for Speech Emotion Recognition with Categorical Emotion, Categorical Sentiment, and Dimensional Scores OA
Cited by 8
Pengi: An Audio Language Model for Audio Tasks OA
Cited by 274
Training Audio Captioning Models without Audio OA
Cited by 29
CLAP: Learning Audio Concepts from Natural Language Supervision OA
Cited by 845
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session OA
Cited by 11
Describing emotions with acoustic property prompts for speech emotion recognition OA
Cited by 12
Audio Retrieval with WavText5K and CLAP Training OA
Cited by 69
Identifying Actions for Sound Event Classification OA
Cited by 6
Research Topics
Music and Audio Processing (57)
Speech and Audio Processing (44)
Speech Recognition and Synthesis (19)
Music Technology and Sound Studies (19)
Video Analysis and Summarization (11)
Affiliations
National Institute of Technology Karnataka
Microsoft (United States)
International Computer Science Institute
Laboratoire d'Informatique de Paris-Nord
Microsoft Research (United Kingdom)