KAMoulox

Online unmixing of large historical archives

Executive summary

Funded by ANR The KAMoulox project (ANR-15-CE38-0003) runs from January 2016 to September 2019. It has been funded by ANR, the French national research agency, as a Junior Researcher project with principal investigator Antoine Liutkus. Its name stands for Kernel additive models for online unmixing of large historical arxives. KAMoulox has two main objectives: The main assumption of the project is that audio separation and restoration can help shape new practices regarding our immaterial audio heritage. It can notably benefit to the scientific and educational exploitation of this heritage through the concept of crowd processing.

Assets

The archives of the CNRS - Musée de l’Homme

Description

Countries with items in the archives

Value

The mixed research unit CREM: Centre de Recherche en Ethno-Musicologie is responsible for these archives.

Countries with items in the archives

Challenges

Although they are an invaluable source for information and wealth for our immaterial cultural heritage, the archives also come with some specific challenges, which are the focus of the project.

Objectives for the project

Audio signal separation

Description

  • A. Liutkus et al. “Gaussian processes for underdetermined source separation.” IEEE TSP, 59.7 (2011): 3155-3167.
  • A. Liutkus et al. “Kernel additive models for source separation.” IEEE TSP, 62.16 (2014): 4298-4310.
  • A. Liutkus et al. “Generalized Wiener filtering with fractional power spectrograms.” IEEE ICASSP, 2015
  • A. Nugraha et al. "Multichannel audio source separation with deep neural networks." IEEE TASLP, 2016.
DNN for source separation

Challenges

Our current research on probabilistic models for audio processing comprise several important challenges, that need to be addressed befor the methods may be used in real applications.

Objectives for the project

Work packages

Workpackages for KAMoulox

KAMoulox is decomposed into 4 work-packages (WPs) of equal importance, that closely reflect its numerous scientific and technical objectives. From a transversal application perspective, all WPs revolve around incrementally improving a web-based source separation software architecture.

WP1 Robust separation methods

This WP builds on recent advances I proposed in probabilistic signal processing, that are based on the alpha-stable formalism and which notably extend the classical Gaussian methodology to heavy-tailed signals. This part of the project feature two main challenges. The first one is the estimation of parameters in models that do not come with likelihood functions. Ideas include moment fitting and Monte Carlo methods. Second, the case of multichannel signals brings interesting research directions for probabilistic signal processing.

WP2 Extensions of KAM

This WP introduces new nonparametric ways of estimating the parameters of sources, notably their time-frequency energy profiles. This also comes with new ways to format the feedback displayed to the user for interacting with him. This work-package is divided into two main parts: first, the design of new non-parametric spectrogram models, such as deep-nets or nonnegative alpha-stable. Second, the iterative processing achieved through an interaction between the filtering machine and a user.

WP3 Separation web-service

This WP provides computationally efficient implementations to separation and parameter estimation procedures. It will also be critical in embedding the whole architecture on the existing platform of the CNRS - Musée de l'Homme.

WP4 Design and science outreach

This WP will first focus on the design of the client-side application using a resolutely user-centered methodology. It will then lead to the actual realization of this multimedia client, but also to the promotion of this technology through the realization of multimedia content and through the participation of strategical social events.