Conference Paper (published)

A biologically inspired onset and offset speech segmentation approach

Details

Citation

Abel A, Hunter D & Smith L (2015) A biologically inspired onset and offset speech segmentation approach. In: 2015 International Joint Conference on Neural Networks (IJCNN). International Joint Conference on Neural Networks, Killarney, Ireland, 12.07.2015-17.07.2105. Washington DC, USA: IEEE Computer Society. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7280347&tag=1; https://doi.org/10.1109/IJCNN.2015.7280347

Abstract
A key component in the processing of speech is the division of longer input sounds into a number of smaller sections. For speech interpretation it is generally easier to classify single sections. Similarly, when processing speech for other purposes (e.g. speech filtering), it can be easier and more relevant to process individual phonemes. Here, we propose a biologically inspired speech segmentation technique that filters the speech into multiple bandpassed channels using a Gammatone filterbank, and then uses an essentially energy-based spike coding technique in order to find the onsets and offsets present in an audio signal. These onsets and offsets are then processed using leaky integrate-and-fire neurons, and the spikes from these used to determine the speech segmentation. We evaluate this new system using a quantitative evaluation metric, and the promising results of segmentation of both clean speech and speech in noise demonstrate the effectiveness of this technique.

Status	Published
Funders	Engineering and Physical Sciences Research Council
Publication date	30/09/2015
Publication date online	31/07/2015
URL	http://hdl.handle.net/1893/22525
Publisher	IEEE Computer Society
Publisher URL	http://ieeexplore.ieee.org/…er=7280347&tag=1
Place of publication	Washington DC, USA
Conference	International Joint Conference on Neural Networks
Conference location	Killarney, Ireland
Dates	31/07/2015–31/07/2105

People (1)

Professor Leslie Smith

Emeritus Professor, Computing Science

Projects (1)

A multichannel adaptive integrated MEMS/CMOS microphone.
PI:
08/03/2010–27/01/2015

Files (1)

PID3679099.pdf