Chord Recognition in Beatles Songs

While a graduate student at MIT’s Media Lab, I collaborated with office-mate Victor Adán to explore how we might train a machine to recognize chord changes in music. We tried several models, including Support Vector Machines, Neural Networks, Hidden Markov Models, and a few variations of Maximum Likelihood systems.

We chose Beatles tunes as a subset of the larger problem and trained our systems with 16 songs from three of their albums. Our systems processed 2700 training samples, 150 validation samples, and 246 testing samples. Our most successful system, a Support Vector Machine, achieved 68% accuracy in testing.
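This post does not describe the feature representation or the internals of the models, so the following is only an illustrative sketch, not the system we built. It assumes each audio frame has already been reduced to a 12-bin chroma vector (energy per pitch class), and it uses simple template matching with cosine similarity in place of the Support Vector Machine mentioned above. The names `chord_template`, `classify`, and `accuracy` are hypothetical, as is the choice of 24 major and minor triads as the chord classes.

```python
import math

PITCH_CLASSES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def chord_template(root, minor=False):
    """Return a 12-bin binary template for a major or minor triad."""
    third = 3 if minor else 4
    template = [0.0] * 12
    for interval in (0, third, 7):
        template[(root + interval) % 12] = 1.0
    return template


# Assumed label set: 12 major and 12 minor triads.
TEMPLATES = {}
for index, name in enumerate(PITCH_CLASSES):
    TEMPLATES[name] = chord_template(index)
    TEMPLATES[name + "m"] = chord_template(index, minor=True)


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def classify(chroma):
    """Pick the chord whose template is most similar to the chroma frame."""
    return max(TEMPLATES, key=lambda name: cosine(chroma, TEMPLATES[name]))


def accuracy(frames, labels):
    """Fraction of frames whose predicted chord matches the reference label."""
    correct = sum(classify(f) == label for f, label in zip(frames, labels))
    return correct / len(labels)


# Made-up chroma frames with energy concentrated on C-E-G and on A-C-E.
c_major_frame = [1.0, 0, 0, 0, 0.8, 0, 0, 0.9, 0, 0, 0, 0]
a_minor_frame = [0.9, 0, 0, 0, 0.8, 0, 0, 0, 0, 1.0, 0, 0]
print(classify(c_major_frame), classify(a_minor_frame))
```

An accuracy figure such as the 68% reported above is the kind of number `accuracy` would produce when run over a labeled test set, though the real evaluation used a trained classifier rather than fixed templates.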

Our intention was to further research that could lead to applications such as automatic transcription, live tracking for improvisation, and computer-assisted (synthetic) performers. Our models extended the research presented in the following papers:

Musical Key Extraction from Audio, Steffen Pauws
Chord Segmentation and Recognition using EM-Trained Hidden Markov Models, Alexander Sheh and Daniel P.W. Ellis
SmartMusicKIOSK: Music Listening Station with Chorus-Search Function, Masataka Goto
A Chorus-Section Detecting Method for Musical Audio Signals, Masataka Goto


Gameboy Hardware Interfacing

I built a connector to access the Gameboy circuitry using a solderless breadboard and used it to interface flash memory and a digital-to-analog converter (DAC) to the Gameboy.

pyPortMidi

Send and receive MIDI data in realtime from Python. Supports Win32, OS X, and Linux.

TechArtICT: Whispering Woodlands

Whispering Woodlands was an outdoor installation created by TechArtICT. It was installed at Exploration Place in Wichita, Kansas from November 2023 through January 2024. The work featured 24 independently controlled sets of speakers and LEDs, all synchronized to create an immersive sound and lightscape. Using eclectic audio ranging from thunder and rain to spaceships and…

DoubleTalk

DoubleTalk, a two-player audio-manipulation game, was my first serious endeavor with the Gameboy. The game used the Pocketvoice, a Gameboy cartridge with a built-in amplified speaker and microphone. In DoubleTalk, players record themselves, reverse their recordings, then try to guess what each other is saying.

Ghost in the Machine

Originally conceived in 2008, Ghost in the Machine (GITM) consists of a webcam and a display that mix and crossfade events in realtime with motion-activated video recorded previously. It continually shifts between three states: individual, community, and the world. GITM has been shown in many venues and contexts.

SoundScratch

SoundScratch is a set of extensions I wrote to manipulate audio in a children’s programming language called Scratch. The environment emphasizes the expressive capabilities of sound through the act of creation and design.

