Features for segmenting and classifying long-duration recordings of "personal" audio

Daniel P.W. Ellis and Keansub Lee


Physical principles driven joint evaluation of multiple F0 hypotheses

Chunghsin Yeh and Axel Röbel


MAP Estimation of Speech Spectral Component Under GGD a Priori

Rajkishore Prasad, Hiroshi Saruwatari and Kiyohiro Shikano


Specmurt Anasylis: A Piano-Roll-Visualization of Polyphonic Music Signal by Deconvolution of Log-Frequency Spectrum

Shigeki Sagayama, Keigo Takahashi, Hirokazu Kameoka and Takuya Nishimoto


PLP-squared: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns

Marios Athineos, Hynek Hermansky and Daniel P.W. Ellis


Stochastic techniques in deriving perceptual knowledge

Hynek Hermansky


Towards single-channel unsupervised source separation of speech mixtures: The layered harmonics/formants separation-tracking model

Manuel Reyes-Gomez, Nebojsa Jojic and Daniel P.W. Ellis


Model-Based Fusion of Bone and Air Sensors for Speech Enhancement and Robust Speech Recognition

John Hershey, Trausti Kristjansson and Zhengyou Zhang


Soft Mask Estimation for Single Channel Speaker Separation

Aarthi M. Reddy and Bhiksha Raj


Discovering Auditory Objects Through Non-Negativity Constraints

Paris Smaragdis


Sound Source Localization and Separation Based on the EM Algorithm

Futoshi Asano and Hideki Asoh


Modelling of Note Events for Singing Transcription

Matti P. Ryynänen and Anssi P. Klapuri


Hierarchical clustering applied to overcomplete BSS for convolutive mixtures

Stefan Winter, Hiroshi Sawada, Shoko Araki and Shoji Makino


Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Kazuyoshi Yoshii, Masataka Goto and Hiroshi G. Okuno


Multiple-Microphone Robust Speech Recognition Using Decoder-Based Channel Selection

Yasunari Obuchi


Harmonicity Based Blind Dereverberation with Time Warping

Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi and Parham S. Zolfaghari


Separation of Sound Sources by Convolutive Sparse Coding

Tuomas Virtanen


Auditory Segmentation Based on Event Detection

Guoning Hu and DeLiang Wang


Bayesian Networks for Error Handling through Multimodality Fusion in Spoken Dialogues with Mobile Robots

Plamen Prodanov and Andrzej Drygajlo


Auditory-based automatic speech recognition

Werner Hemmert, Marcus Holmberg and David Gelbart


Representation and Classification of the Timbre Space of a Single Musical Instrument

Hugo de Paula, Mauricio Loureiro and Hani Yehia


A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays

Guillaume Lathoud and Iain A. McCowan