Example-based Cross-Modal Denoising


Supplementary Material:

General Digits Bartender Xylophone

Here are the details and links of the attached videos



Speech (Digits): Noisy Input.


 



Speech (Digits): Cross-Modal Audio Visual Denoising
(Our algorithm)

Corresponding to the noisy sequences above.

 



Speech (Digits): Unimodal (Audio-only) denoising
(Cohen and Bardugo's algorithm)

Corresponding to the noisy sequences above.

 



Speech (Digits): Unimodal (Audio-only) denoising
(example-based algorithm, run only on the audio channel)

Corresponding to the noisy sequences above.