Why Nostr? What is Njump?
2024-05-23 07:42:26

ava on Nostr: This tech is sick 👀 **Images that Sound: Composing Images and Sounds on a Single ...

This tech is sick 👀

**Images that Sound: Composing Images and Sounds on a Single Canvas**

Play with sound on 🔊



"tl;dr: We use diffusion models to generate spectrograms that look like images but can also be played as sound."

"Spectrograms are 2D representations of sound that look very different from the images found in our visual world. And natural images, when played as spectrograms, make unnatural sounds. In this paper, we show that it is possible to synthesize spectrograms that simultaneously look like natural images and sound like natural audio. We call these spectrograms images that sound."

"Various musicians have inserted images into spectrograms of their music, including Aphex Twin (go to 5:27), Nine Inch Nails, Venetian Snares, and in Doom's OST. Our work differs from these examples in that our spectrograms both look and sound natural."

https://ificl.github.io/images-that-sound/

#cybersecgirl
Author Public Key
npub1f6ugxyxkknket3kkdgu4k0fu74vmshawermkj8d06sz6jts9t4kslazcka