website picture

Vipul Arora
Associate Professor, Department of Electrical Engineering
PK Kelkar Fellow
IIT Kanpur
Email: vipul ar (AT) iitk (.) ac (.) in
Office location: Sustainable Energy Engineering Building, Near Media Labs (map)
Postal Address: 305-D, ACES building, IIT Kanpur - 208016 (map)

About me:

I work on developing learning based methods mostly for audio processing (music, speech, and other sounds). I have interest in Physics, so I work with Physicists to develop learning based methods for problems in computational Physics too.

My research has immensely contributed to (i) audio representation learning for tasks such as audio search, audio event detection and audio annotation, (ii) human-machine learning for accelerating data annotation and for trustworthy machine learning with application to speech, music and audio analysis, (iii) generative machine learning for enhancing Monte Carlo simulations for studying lattices in statistical and particle Physics, and (iv) developing advanced learning-based tools to enhance air quality monitoring at scale.

I received my B.Tech. and Ph.D. degrees in Electrical Engineering from the Indian Institute of Technology (IIT) Kanpur, India. My Ph.D. thesis was titled “Analysis of Pitched Polyphonic Music for Source Transcription”, where I worked on analyzing music audio to identify and transcribe different instruments/voices playing simultaneously. During postdoc at Oxford University (UK), I developed speech recognition systems using linguistic principles, with applications in automatic language teacher and speech recognition for low-resource languages. At Amazon in Boston (USA), I worked on audio classification for developing Alexa home security system, with research focusing on classification with imbalanced data.

Current Research interests:

Speech, Music, Audio processing

Machine learning:

  • Human-machine learning
  • Audio LLMs
  • Generative Machine Learning
  • Trustworthy AI


  • Audio retrieval
  • Speech recognition
  • Computational Physics (Monte Carlo simulations of lattices in QCD)
  • Image generation
  • Music education
  • Sensor calibration