hugo flores garcía

Sketch2Sound

2024-12-11T00:00:00+00:00

Sketch2Sound is a generative audio model capable of creating high-quality sounds from a set of interpretable time-varying control signals: loudness, brightness, and pitch, as well as text prompts.

Sketch2Sound can synthesize arbitrary sounds from sonic imitations (i.e., a vocal imitation or a reference sound-shape).

Check out our demo video, paper and website: sketch2sound website

The Rhythm In Anything (TRIA)

2024-09-14T00:00:00+00:00

Your browser does not support the video tag.

Led by my labmate Patrick O’Reilly, TRIA (The Rhythm In Anything), takes as input two audio prompts – one specifying the desired drum timbre, and one specifying the desired rhythm – and synthesizes drum beats that follow the rhythm prompt, while keeping the timbre prompt (i.e. playing the desired rhythm with the desired timbre).

Chicago Creative Machines

2024-02-25T00:00:00+00:00

I had the joy of giving the inaugural talk + performance for the Chicago Creative Machines series at ESS Chicago on Feb. 25, 2024. I talked about my compositional work with vampnet, using the mouth as the interface for a generative model, showed an 8ch composition for voice and vampnet, and played a solo set of instrumental music with.

VampNet - Music Generation Via Masked Transfomers

2023-07-09T00:00:00+00:00

VampNet is a generative model for music that uses a masked token modeling technique to perform music generation and compression. We proposed new ways to prompt a generative model with music by masking out parts of some input with a meaningful mask structure, and have VampNet fill in the missing parts with new musical content. Check out the paper and listen to audio samples!

with the jack sundstrom quintet at Que4 Radio, Chicago, IL (2023)

2023-05-30T00:00:00+00:00

I played a set of Jack’s original music with his quintet at Que4 Radio in Chicago, IL.

con el colectivo “los homies” en santé, teguciagalpa, honduras (2022)

2022-12-18T00:00:00+00:00

el jam del año! con los homies: Michael Pineda, Fernando Orellana, Joyce Pineda, Daniel Nuñez, y Guillermo Arturo.

foto por Jumbo Producciones (Lizzie Diaz).

ISMIR 2022 Tutorial on Few-Shot and Zero-Shot Learning for MIR

2022-12-01T00:00:00+00:00

Yu Wang, Jeong Choi and I gave a tutorial during ISMIR 2022 on few-shot and zero-shot learning centered around music information retrieval tasks. In this tutorial, we cover the foundations of few-shot//zero-shot learning, build standalone coding examples, and discuss the state-of-the-art in the field, as well as future directions.

The tutorial is available as a jupyter book online.

a solo improvised, electronic set at the crowdpleaser (evanston, IL, may 2022)

2022-05-13T00:00:00+00:00

I played a quick set of improvised music with drum loops, synthesizers and electric guitar at the crowdpleaser in Evanston, IL. one of the best tunes was my own rendition of “I love parmesan cheese”, a sound that was trending on tiktok at the time.

audacitorch (Audacity with Deep Learning)

2021-12-03T00:00:00+00:00

I contributed a deep learning framework and a deep model manager that connects to HuggingFace to Audacity. This project was funded by a Google Summer of Code grant. Read the Work Product Summary.

You can download Audacity with Deep Learning here.

Take a look at the code.

Deep Learning Tools for Audacity

2021-11-02T00:00:00+00:00

Aldo Aguilar, Ethan Manilow and I made a software framework that lets deep learning practitioners easily integrate their own PyTorch models into Audacity. This lets ML audio researchers put tools in the hands of sound artists without doing DAW-specific development work, which is often a long and arduous process in itself.

Learn more about it in our project page :).