Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport - Laboratoire Traitement et Communication de l'Information
Conference Papers Year : 2024

Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport

Abstract

In neural audio signal processing, pitch conditioning has been used to enhance the performance of synthesizers. However, jointly training pitch estimators and synthesizers is a challenge when using standard audio-to-audio reconstruction loss, leading to reliance on external pitch trackers. To address this issue, we propose using a spectral loss function inspired by optimal transportation theory that minimizes the displacement of spectral energy. We validate this approach through an unsupervised autoencoding task that fits a harmonic template to harmonic signals. We jointly estimate the fundamental frequency and amplitudes of harmonics using a lightweight encoder and reconstruct the signals using a differentiable harmonic synthesizer. The proposed approach offers a promising direction for improving unsupervised parameter estimation in neural audio applications.
Fichier principal
Vignette du fichier
Torres_Peeters_Richard_Spectral_OT_arxiv.pdf (524.39 Ko) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-04358467 , version 1 (21-12-2023)
hal-04358467 , version 2 (22-12-2023)
hal-04358467 , version 3 (11-01-2024)

Identifiers

  • HAL Id : hal-04358467 , version 2

Cite

Bernardo Torres, Geoffroy Peeters, Gaël Richard. Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport. IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2024, Seoul, South Korea. ⟨hal-04358467v2⟩
900 View
241 Download

Share

More