SoundSpotter Computer Music Software
Professor Michael Casey


Goldsmiths Digital Studios
Department of Computing
Goldsmiths, University of London
25 St. James, New Cross
London, SE14 6NW
+44 (0)20 7919 7867
Office: Ben Pimlott Building 2.06

Dartmouth College Home Page

Bio

Michael Casey (PhD MIT 1998) conducted his doctoral research at the MIT Media Lab's Music-Mind-Machine group. His research explores new approaches to computing as a creative medium and advanced computational methods for organising large multimedia collections to support digital humanities research. He is an editor of the MPEG-7 International Standard for Multimedia Content Description (ISO/IEC 15938-4 Audio, 2002), a standard for automatic organisation of multimedia databases. Michael is also a composer and artist, and has received a number of international awards for his works in digital media.

Research

Software

Teaching

Course Director: BSc Creative Computing
CC112 Creative Computing I image, sound, motion
CC227 Creative Computing II interactive multimedia
CC342 Advanced Audio-Visual Processing
CIS320 Undergraduate Final Year Projects 2005/2006

Talks

Publications

Journals (bold=PI)

Casey, M., Rhodes, C. and Slaney, M., "Analysis of Minimum Distances in High Dimensional Musical Spaces", IEEE Transactions on Audio, Speech and Language Processing, July, 2008.

Casey, M., Veltkamp, R., Goto, M., Leman, M., Rhodes, C. and Slaney, M., "Content-Based Music Information Retrieval: Current Directions and Future Challenges", Proceedings of the IEEE, April 2008.

Slaney, M. and Casey, M., "Locality Sensitive Hashing for Finding Nearest Neighbours", IEEE Signal Processing Magazine, May 2008.

Abdallah, S., Sandler, M., Rhodes, C. and Casey, M., "Using Duration Models To Reduce Fragmentation in Audio Segmentation", Machine Learning, special issue on Machine Learning for Music, November 2006

Casey, M., "Acoustic Lexemes for Organizing Internet Audio", Contemporary Music Review special issue on Internet Music, A. Marsden and A. Hugill (Eds.), October 2005.

Hershey, J. and Casey, M., "Audiovisual Source Separation Using Hidden Markov Models", Advances in Neural Information Processing Systems, 14, MIT Press, 2002.

Casey, M., "Musical Applications of MPEG-7 Audio", Organised Sound, 6:2, Cambridge University Press, 2002.

Casey, M., "MPEG-7 Sound Recognition", IEEE Transactions on Circuits and Systems for Video Technology, special issue on MPEG-7, IEEE, May/June 2001.

Waters, R.C., Anderson, D.B., Barrus, J.W., Brogan, D.C., Casey, M.A., McKeown, S.G., Nitta, T., Sterns, I.B. and Yerazunis, W.S., "Diamond Park and Spline: Social Virtual Reality with 3D Animation, Spoken Interaction, and Runtime Extendability", Presence, MIT Press, August 1997.

Anderson, D.B. and Casey, M., "The Sound Dimension: Audio for Distributed Virtual Environments", IEEE Spectrum special issue on Distributed Virtual Environments, April 1997.

Casey, M. "Understanding Musical Sound with Forward Models and Physical Models", Connection Science 6:2, 1995.

Refereed Conference Papers

Rhodes, C. and Casey, M., "Algorithms for Determining and Labelling Approximate Hierarchical Self-Similarity", in Proceedings of the International Conference on Music Information Retrieval, Vienna, Austria, 2007.

Mauch, M., Dixon, S., Harte, C., Casey, M. and Fields, B. "Discovering Chord Idioms Through Beatles and Real Book Songs", in Proceedings of the International Conference on Music Information Retrieval, Vienna, Austria, 2007.

Casey, M. and Grierson, M., "Soundspotter and Remix-TV: Fast Approximate Matching for Audio-Visual Performance", in Proceedings of the International Computer Music Conference, Copenhagen, Denmark, 2007.

Casey, M. and Slaney, M. "Fast Recognition of Remixed Music Audio", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), HI, USA, May 2007

Casey, M. and Slaney, M. "Song Intersection by Approximate Nearest Neighbour Retrieval", Proc. International Conference on Music Information Retrieval (ISMIR), Victoria, BC, Oct. 2006

Casey, M. and Slaney, M. "The Importance of Sequences in Music Similarity", Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toulouse, France, May 2006

Rhodes, C., Casey, M., Abdallah, S. and Sandler, M., "A Markov-Chain Monte-Carlo Approach to Musical Audio Segmentation", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toulouse, 2006.

Levy, M., Sandler, M. and Casey, M. "Extraction of High Level Musical Structure from Audio Data", Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toulouse, France, May 2006

Abdallah, S., Noland, K., Sandler, M., Casey, M. and Rhodes, C., "Theory and Evaluation of a Bayesian Music Structure Extractor", Proc. International Conference on Music Information Retrieval (ISMIR), London, UK, Sept. 2005.

Casey, M. and Crawford, T., "Automatic Location and Measurement of Ornaments in Audio Recordings", Proc. International Conference on Music Information Retrieval, Barcelona, October 2004.

Casey, M., "Integrating Low Level Metadata in Multimedia Database Management Systems", presented at the Audio Engineering Society 25th International Conference, London, 2004.

Divakaran, A., Regunathan, R., Xiong, Z., and Casey, M., "Procedure for audio-assisted browsing of news video using generalized sound recognition", Proc. SPIE Vol. 5021, p. 160-166, Storage and Retrieval for Media Databases 2003; Minerva M. Yeung, Rainer W. Lienhart, Chung-Sheng Li; Eds. Jan 2003

Casey, M. "Computational Creativity with Acoustic Bayesian Models", Proceedings of AISB '03 Symposium on Artificial Intelligence and Creativity in Arts and Science, Aberystwyth, April, 2003.

Casey, M. "Musical Structure and Content Repurposing with Bayesian Models", Proceedings of the Cambridge Music Processing Colloquium, University of Cambridge, April, 2003.

Smaragdis, P. and Casey, M. "Audiovisual Independent Component Analysis", Proceedings of ICA03 International Conference on Independent Component Analysis, Tokyo, March 2003.

Casey, M., "Reduced-Rank Spectra and Minimum Entropy Priors for Generalized Sound Recognition", Proceedings of the Workshop on Consistent and Reliable Cues for Sound Analysis, EUROSPEECH 2001, Aalborg, Denmark, September 2001.

Casey, M. and Westner, A., "Separation of Mixed Audio Sources by Independent Subspace Analysis", in Proceedings of the International Computer Music Conference, ICMA, Berlin, August, 2000.

Casey, M. "Representation of Musical Signals by Independent Component Cross Entropies", presented at Advances in Neural Information Processing Systems, workshop on Neural Models of Music Processing. Denver, CO 1999.

Casey, M., "Independent Component Analysis and Sound Synthesis", presented at the International Conference on Auditory Display, Palo Alto, CA, November 1997.

Casey, M., Gardner, W. and Basu, S., "Vision-Steered Beam Forming and Transaural Rendering for the Artificial Life Interactive Video Environment (ALIVE)", Proceedings of the Audio Engineering Society 99th Conference, New York, November 1996.

Casey, M. and Wachman, J., "Unsupervised Cross-Modal Analysis of Discourse", Proceedings of the Workshop on the Integration of Gesture and Language in Speech, Delaware, October 1996.

Casey, M. and Smaragdis, P., "Netsound: Structured Audio Encoding and Rendering", Proceedings of the International Computer Music Conference, ICMA, Hong Kong, September, 1996

Casey, M., "Multi-Model Classification as Basis for Computational Timbre Understanding", in Proceedings of the International Conference on Music Perception and Cognition, Montreal, August 1996.

Casey, M., "Practice Makes Perfect: Distal Learning of Musical Instrument Control Parameters", International Conference on Music Perception and Cognition, Philadelphia, July 1993.

Book Chapters

Casey, M., "General Audio Information Retrieval", in MMIR MultiMedia Information Retrieval, Roberto Raieli and Perla Innocenti (Eds.), AIDA, Rome, 2004. ISBN 88-901144-9-5

Casey, M., "Sound Classification and Similarity Tools", in B.S. Manjunath, P. Salembier and T. Sikora, (Eds), Introduction to MPEG-7: Multimedia Content Description Language, J. Wiley, 2002

Casey, M. "NetSound", chapter in R. Boulanger and B. Vercoe, (Eds.), The Csound Book. MIT Press, Cambridge, 2000.

Casey, M., "Understanding Musical Sound with Forward Models and Physical Models", in Musical Networks: Parallel Distributed Perception and Performance, Niall Griffith and Peter M. Todd (Eds.), Cambridge, MIT Press, 1998.

Patents


Casey, M. (2001) United States Patent 6,321,200 Method for extracting features from a mixture of signals (Granted 2003)

Smaragdis, P. and Casey, M. (2003) United States Patent 7,218,755 Detecting temporally related components of multi-modal signals (Granted 2007)

Casey, M. (2001) United States Patent Application 20010044719 Method and system for recognizing, indexing, and searching acoustic signals

Wolf, P. and Casey, M., (2002) United States Patent Application 20040064306 Voice activated music playback system

Divakaran, A., Radhakrishnan, R. and Casey, M. (2002) United States Patent Application 20040008789 Audio-assisted video segmentation and summarization