SoylentNews
SoylentNews is people
https://soylentnews.org/

Title    The Sound of Pixels
Date    Tuesday September 25 2018, @07:43PM
Author    chromas
Topic   
from the musical-chairs dept.
https://soylentnews.org/article.pl?sid=18/09/25/1927205

DannyB writes:

In this article the authors introduce . . .

PixelPlayer, a system that, by watching large amounts of unlabeled videos, learns to locate image regions which produce sounds and separate the input sounds into a set of components that represents the sound from each pixel. Our approach capitalizes on the natural synchronization of the visual and audio modalities to learn models that jointly parse sounds and images, without requiring additional manual supervision.

The system is trained with a large number of videos containing people playing instruments in different combinations, including solos and duets. No supervision is provided on what instruments are present on each video, where they are located, or how they sound. During test time, the input to the system is a video showing people playing different instruments, and the mono auditory input. Our system performs audio-visual source separation and localization, splitting the input sound signal into N sound channels, each one corresponding to a different instrument category. In addition, the system can localize the sounds and assign a different audio wave to each pixel in the input video.

A video is included along with an explanation of several interesting demos, such as pointing at any pixel to hear the sound from that pixel. Or remixing the volume levels of different musical instruments in the video.

The paper is included along with the data set. It says the code is coming soon.


Original Submission

Links

  1. "DannyB" - https://soylentnews.org/~DannyB/
  2. "this article" - http://sound-of-pixels.csail.mit.edu/
  3. "video" - https://www.youtube.com/watch?time_continue=2&v=2eVDLEQlKD0
  4. "paper" - https://arxiv.org/abs/1804.03160
  5. "data set" - https://github.com/roudimit/MUSIC_dataset
  6. "Original Submission" - https://soylentnews.org/submit.pl?op=viewsub&subid=29192

© Copyright 2026 - SoylentNews, All Rights Reserved

printed from SoylentNews, The Sound of Pixels on 2026-03-13 08:03:53