Audio manipulations with Singular Value Decomposition

Analysis and synthesis with SVD

Ansatz
, in 06 June 2017
SVDaudiofiltering

Audio manipulations with Singular Value Decomposition

1976 Film about SVD

Lets apply Singular Value Decomposition to an audio complex spectrogram $X \in \mathbb{C}^{n \times k}$, where rows $n$ are time bins, and columns $k$ are frequency bins.

$$ USV^* = X$$

$U$,$S$ and $V$ dimensions could be arranged to be $U \in \mathbb{C}^{n \times k}$, $S \in \mathbb{C}^{k \times k}$ and $V \in \mathbb{C}^{k \times k}$.

This decomposition could be analyzed as $V$ representing a spectrum basis, $S$ as scaling values, and $U$ as the score with the weight for the linear combinations of basis vectors.

This is how the log magnitude of $V^*$ looks like for this audio

Original