Publications

An updated summary of publications is accessible at the Google Scholar page: click here.

Group highlights

Boli: A dataset for understanding stuttering experience and analyzing stuttered speech

This paper introduces Project Boli, a multi-lingual stuttered speech dataset designed to advance scientific understanding and technology development for individuals who stutter, particularly in India.

Ashita Batra, Mannas Narang, Neeraj Kumar Sharma, Pradip K Das

IEEE ICASSP (2025)

Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection

This paper presents the Coswara dataset, a dataset containing diverse set of respiratory sounds and rich meta-data, recorded between April-2020 and February-2022 from 2635 individuals (1819 SARS-CoV-2 negative, 674 positive, and 142 recovered subjects).

Debarpan Bhattacharya, Neeraj Kumar Sharma, Debottam Dutta, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, C Chandrakiran, Sahiti Nori, KK Suhail, Sadhana Gonuguntla, Murali Alagesan

Scientific Data, Nature (2023)

Two congruent cues are better than one: Impact of ITD–ILD combinations on reaction time for sound lateralization

We present a reaction time analysis of a sound lateralization test. Stimuli from the sides yielded quicker reactions and better class accuracy than from the front. Congruent ITD–ILD cues significantly improved both metrics.

Neeraj Kumar Sharma, Ünal Ege Gaznepoglu, Thomas Robotham, Emanuël AP Habets

JASA Express Letters (2023)

Acoustic and linguistic features influence talker change detection

While listening to speech, does our ability to understand a spoken language impact our attention to acoustic attributes of the talker? In this paper, we explore answer this question by analyzing data obtained from a behavioral listening test.

Neeraj Kumar Sharma, Venkat Krishnamohan, Sriram Ganapathy, Ahana Gangopadhayay, Lauren Fink

JASA Express Letters (2021)

Talker change detection: A comparison of human and machine performance

How good are humans at detecting change in talker while listening to spoken converstations? How do machine compare with humans in this task? We explored this topic here.

Neeraj Kumar Sharma, Shobhana Ganesh, Sriram Ganapathy, Lori L Holt

JASA (2019)

Inside Science, IISc Pick.

 

MTech/BTech Thesis Projects

Modulation Filtering and Attention-Based Model for Enhanced Audio Classification
Aditya Suryawanshi
B.Tech. (DSAI) Term Project Thesis (2025)

Audio Abstractor for Diarization, Summarization and Embedding Based Conversation Analysis
Nishchay Nilabh
B.Tech. (DSAI) Term Project Thesis (2025)

Prediction of Strength and Deformation Behaviour of Jointed Rocks Using Machine Learning Algorithms
Aadarsh Thakur
M.Tech. (Civil) Term Project Thesis (2024); Co-sup Vivek Padamanabha

Spoken language diversity quantification in campus population: A multimodal dataset creation and analysis study
Saksham Kumar, Samarth Hegde
B.Tech. (ECE) Term Project Thesis (2024)

Listening to the lungs: Respiratory sound visualization, annotation, and analysis
Askshaj Padmakar, Sanskar Kejriwal
B.Tech. (ECE) Term Project Thesis (2024)

Auditory attention decoding using listening-state EEG signals in multi-speaker scenarios
Soham Karak
B.Tech. (ECE) Term Project Thesis (2024)

A study on neural network learning dynamics
Partham Kulkarni
B.Tech. (Engg. Physics) Term Project Thesis (2024)

Sentiment analysis of video: Howe we perceive audio and video
Suhayl Mahek
B.Tech. (Engg. Physics) Term Project Thesis (2024)