MVA Material for the course on Audio Signal Processing
Slides of the course
- Part II : Analogic signal/Digital signal (download)
- Part IV : Stochastic signal processing (download)
- Part VII : Time-frequency analysis (download)
Registration to the course :
- Send me an email at emmanuel.bacry@cnrs.fr before January 31st to notify your registration
For the validation of the course :
- A report on a paper (to choose among the list below or choose your own and send me an email for validation)
- Send me an email at least at least 2 weeks before the oral exam indicating the paper you chose
- Send me the report before 8pm the day before the oral exam
- This is an individual work
- The report should be structured in the following way :
- Part I : corresponding to a summary of the paper
- Part II : (more personal) corresponding to your critics (positive/negative), potential numerical experiments and/or extensions
- Part III : A bibliography (you are encouraged to talk about other papers than the reference paper in Part II)
- A 15min oral exam
- 10min : Presentation of your report
- 5min : Discussion around your work and around the course (that you should know !) in relation with your report
- Warning : simply reading the slides of the course is NOT sufficient to understand/know the course
Possible Papers for the report :
Denoising
Sound transformation or synthesis
- Improved Phase Vocoder Time-Scale Modification of Audio (1999)
(download)
- Time contraction/dilation
- Some more improvements (2023) (1 more paper : download)
- A comparison of recent neural vocoders (2019) (study in detail one of them in another paper to be found) (download)
- Singing voice synthesis : download)
- Percussive sound synthesis
(download)
- Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders (2017)
(download)
Pitch detection
- Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle (2010) : download
- A pitch salience function derived from harmonic frequency deviations for polyphonic music analysis (2014) : download
- Combining Spectral and Temporal Representations
for Multipitch Estimation of Polyphonic Music (2015) : download
- A Discriminative Model for Polyphonic Piano Transcription (2007) : download
- Chord detection using deep learning (2015) (download)
- Pitch recognition using NMF (2021) (arXiv:2107.11250)
Source separation
- Multichannel Nonnegative Matrix Factorization in
Convolutive Mixtures for Audio Source Separation (2010) (download)
- A Source/Filter Model with Adaptive Constraints for NMF-based Speech Separation (2016) (download)
- Unsupervised Source Separation via Self-Supervised Training (2022) (arXiv:2202.03875)
Other
- Multi-Feature Beat Tracking (2014) (download)
- Sigma/Delta (2001) (download)
- Perceptual coding (focus on psycho-acoustic modelization ) (an example of paper)
- Summarization of music (2016) ( download)
- The Application of Hidden Markov Models in Speech Recognition (2008) (download)