An updated summary of publications is accessible at the Google Scholar page: click here.
This paper introduces Project Boli, a multi-lingual stuttered speech dataset designed to advance scientific understanding and technology development for individuals who stutter, particularly in India.
Ashita Batra, Mannas Narang, Neeraj Kumar Sharma, Pradip K Das
This paper presents the Coswara dataset, a dataset containing diverse set of respiratory sounds and rich meta-data, recorded between April-2020 and February-2022 from 2635 individuals (1819 SARS-CoV-2 negative, 674 positive, and 142 recovered subjects).
Debarpan Bhattacharya, Neeraj Kumar Sharma, Debottam Dutta, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, C Chandrakiran, Sahiti Nori, KK Suhail, Sadhana Gonuguntla, Murali Alagesan
Scientific Data, Nature (2023)
We present a reaction time analysis of a sound lateralization test. Stimuli from the sides yielded quicker reactions and better class accuracy than from the front. Congruent ITD–ILD cues significantly improved both metrics.
Neeraj Kumar Sharma, Ünal Ege Gaznepoglu, Thomas Robotham, Emanuël AP Habets
While listening to speech, does our ability to understand a spoken language impact our attention to acoustic attributes of the talker? In this paper, we explore answer this question by analyzing data obtained from a behavioral listening test.
Neeraj Kumar Sharma, Venkat Krishnamohan, Sriram Ganapathy, Ahana Gangopadhayay, Lauren Fink
How good are humans at detecting change in talker while listening to spoken converstations? How do machine compare with humans in this task? We explored this topic here.
Neeraj Kumar Sharma, Shobhana Ganesh, Sriram Ganapathy, Lori L Holt
Modulation Filtering and Attention-Based Model for Enhanced Audio Classification
Aditya Suryawanshi
B.Tech. (DSAI) Term Project Thesis (2025)
Audio Abstractor for Diarization, Summarization and Embedding Based Conversation Analysis
Nishchay Nilabh
B.Tech. (DSAI) Term Project Thesis (2025)
Prediction of Strength and Deformation Behaviour of Jointed Rocks Using Machine Learning Algorithms
Aadarsh Thakur
M.Tech. (Civil) Term Project Thesis (2024); Co-sup Vivek Padamanabha
Spoken language diversity quantification in campus population: A multimodal dataset creation and analysis study
Saksham Kumar, Samarth Hegde
B.Tech. (ECE) Term Project Thesis (2024)
Listening to the lungs: Respiratory sound visualization, annotation, and analysis
Askshaj Padmakar, Sanskar Kejriwal
B.Tech. (ECE) Term Project Thesis (2024)
Auditory attention decoding using listening-state EEG signals in multi-speaker scenarios
Soham Karak
B.Tech. (ECE) Term Project Thesis (2024)
A study on neural network learning dynamics
Partham Kulkarni
B.Tech. (Engg. Physics) Term Project Thesis (2024)
Sentiment analysis of video: Howe we perceive audio and video
Suhayl Mahek
B.Tech. (Engg. Physics) Term Project Thesis (2024)