From CSWiki
Revision as of 04:00, 24 April 2006 by Amisra (talk | contribs)

Jump to: navigation, search


  • The smartest sound editor ever built
  • (Feature-based Sound Design Framework/Workbench/System/null)
  • (Feature-aware TAPESTREA: A Integrated/Comprehensive/Smart/Interactive Approach to Sound Design Workbench)
  • (TAPESTREA: Augmenting Interactive Sound Design with Feature-based Audio Analysis)
  • Interactive Content Retrieval for Intelligent/Template-aware Sound Design
  • Interactive Sound Design by Example
  • FAT-APE-STREAT: Sound Design by Querying
  • Sound Design-by-Querying and by-Example
  • Finding New Examples to Sound Design By
  • Extending Sound Scene Modeling By Example with Examples
  • Integrating Sound Scene Modeling and Query-by-example
  • Sound Scene Modeling by Example with Integrated Audio Retrieval
  • Facilitating Sound Design using Query-by-example
  • Enriching/Extending/Expanding Sound Scene Modeling By Examples using Audio Information Retrieval
  • Enhancing the Palette: Querying in the Service of Interactive Sound Design
  • Expanding the Palette: Audio Information Retrieval for Intelligent Sound Design
  • Expanding the Palette: Audio Information Retrieval for Intelligent Data-driven Sound Design
  • Enhancing the Palette: Audio Information Retrieval for TAPESTREA
  • Expanding the Palette: Audio Information Retrieval for Sound Scene Modeling by Example
  • Enhancing the Palette: Template-based Retrieval for Intelligent Sound Design
  • Enhancing the Palette: Using Audio Information Retrieval to Expand the Transformative Power of TAPESTREA

AUTHORS (order ok?):

Ananya Misra, Matt Hoffman, Perry R. Cook, Ge Wang


(no. down with order.)


We integrate music information retrieval technologies with TAPESTREA techniques to facilitate and enhance sound design, providing a new class of "intelligent" sound design workbench.

I. Introduction + Motivation


  • Large corpus of unstructured and largely unlabled audio

(sound effects, field recordings, soundtracks from movies and TV, etc.)

  • leverage audio analysis in interactive sound design (via TAPS)
  • vice versa


  • To aid sound designers in creating varied and interesting scenes (standard TAPS stuff)
  • To enable a human operator to quickly identify similar sounds in a large collection of sounds.
  • can be also useful for forensic audio applications, watermarking

II. Previous Work

Related Work

  • see references
  • Marsyas, Taps (sine+noise, transient, wavelet), feature-based synthesis
  • related systems generally falls into one of two categories, (1) "intelligent" audio editors,

which generally extracted musical information, or (2) sonic browsers for search and retrieval.


  • Sound-scene modeling by example / Content-aware tapestrea analysis interface
  • Re-composing natural sounds

III. Feature-based Sound Design


(big figure?)

Interactive template-based similarity search (database)

quering/marking recorded sounds for template discovery

IV. Results

V. Conclusion and Future Work

VI. References

Bregman, A. Auditory Scene Analysis. MIT Press, Cambridge, 1990.

   * NOT what taps does.

Chafe, C., B. Mont-Reynaud, and L. Rush. (1982). "Towards an intelligent editor of digital audio: Recognition of musical constructs," Computer Music Journal 6(1): .

   * 1st paper to deal with transcription without dealing with identifying notes

Dubnov, S., Z. Bar-Joseph, R. El-Yaniv, D. Lischinski, and M. Werman (2002). "Synthesizing sound textures through wavelet tree learning,". IEEE Computer Graphics and Applications 22(4).

Fernstrom, M. and E. Brazil. (2001)."Sonic Browsing: an auditory tool for multimedia asset management," In Proceedings of the International Conference on Auditory Display.

   * deals more with musical structures and notes

Foote, J. (1999). "An overview of audio information retrieval," ACM Multimedia Systems, 7:2(10).

Jolliffe, L. (1986). Principal Component Analysis. Springer-Verlag, New York.

Kang, H. and B. Shneiderman. (2000). "Visualization Methods for Personal Photo Collections: Browsing and Searching in the PhotoFinder," In Proceedings of the International Conference on Multimedia and Expo, New York, IEEE.

Kashino, Tanaka. (1993). "A sound source separation system with the ability of automatic tone modeling," International Computer Music Conference.

   * uses of clustering techniques for identifying sound sources

Misra, A., P. Cook, and G. Wang. (2006). "Musical Tapestry: Re-composing Natural Sounds," International Computer Music Conference. Submitted.

Misra, A., P. Cook, and G. Wang. (2006). "TAPESTREA: Sound Scene Modeling By Example," International Conference on Digital Audio Effects. Submitted.

Serra, X. (1989). "A System for Sound Analysis Transformation Synthesis based on a Deterministic plus Stochastic Decomposition," PhD thesis, Stanford University.

Shneiderman, B. (1998). Designing the User Interface: Strategies for Effective Human- Computer Interaction. Addison-Wesley, 3rd edition.

Tzanetakis G. and P. Cook. (2000). "MARSYAS: A Framework for Audio Analysis" Organized Sound, Cambridge University Press 4(3).

Tzanetakis, G. and P. Cook. (2001). "MARSYA3D: A prototype audio browser-editor using a large scale immersive visual and audio display," In Proceedings of the International Conference on Auditory Display.