Vipul Arora

Vipul Arora
Associate Professor, Department of Electrical Engineering
PK Kelkar Fellow
IIT Kanpur
Email: vipul ar (AT) iitk (.) ac (.) in
Office location: Sustainable Energy Engineering Building, Near Media Labs (map)
Postal Address: 305-D, ACES building, IIT Kanpur - 208016 (map)

MADHAV lab: For latest info on my research, team, publications and open positions.
My calendar (office hours: MW 5-6 PM)
GitaSTEM

About me:

I work on developing learning based methods mostly for audio processing (music, speech, and other sounds). I have interest in Physics, so I work with Physicists to develop learning based methods for problems in computational Physics too.

My research contributes to (i) audio representation learning for tasks such as audio search, audio event detection and audio annotation, (ii) human-machine learning for accelerating data annotation and for trustworthy machine learning (such as uncertainty estimation) with application to speech, music and audio analysis, (iii) generative machine learning for enhancing Monte Carlo simulations for studying lattices in statistical and particle Physics, and (iv) developing advanced learning-based tools to enhance air quality monitoring at scale.

I received my B.Tech. and Ph.D. degrees in Electrical Engineering from the Indian Institute of Technology (IIT) Kanpur, India. During postdoc at Oxford University (UK), I developed speech recognition systems using linguistic principles, with applications in automatic language teacher and speech recognition for low-resource languages. At Amazon in Boston (USA), I worked on audio classification for developing Alexa home security system, with research focusing on classification with imbalanced data.

Current Research interests:

Speech, Music, Audio processing

Audio LLMs (audio tokenization, speech recognition, audio search and music analysis)
Generative Machine Learning (Flow-models, diffusion, Monte carlo sampling) (application in computational Physics)
Trustworthy AI (uncertainty estimation and confidence calibration)
Human-machine learning (active learning and education)
Efficient machine learning (applications in audio and air quality monitoring)