Please use this identifier to cite or link to this item:
For citation please use:
Main Title: Using the beat histogram for speech rhythm description and language identification
Author(s): Lykartsis, Athanasios
Weinzierl, Weinzierl
Type: Conference Object
Is Part Of: 10.14279/depositonce-9530
Language Code: en
Abstract: In this paper we present a novel approach for the description of speech rhythm and the extraction of rhythm-related features for automatic language identification (LID). Previous methods have extracted speech rhythm through the calculation of features based on salient elements of speech such as consonants, vowels and syllables. We present how an automatic rhythm extraction method borrowed from music information retrieval, the beat histogram, can be adapted for the analysis of speech rhythm by defining the most relevant novelty functions in the speech signal and extracting features describing their periodicities. We have evaluated those features in a rhythm-based LID task for two multilingual speech corpora using support vector machines, including feature selection methods to identify the most informative descriptors. Results suggest that the method is successful in describing speech rhythm and provides LID classification accuracy comparable to or better than that of other approaches, without the need for a preceding segmentation or annotation of the speech signal. Concerning rhythm typology, the rhythm class hypothesis in its original form seems to be only partly confirmed by our results.
Issue Date: 2015
Date Available: 24-Feb-2020
DDC Class: 620 Ingenieurwissenschaften und zugeordnete Tätigkeiten
780 Musik
Subject(s): speech rhythm
beat histogram
language identification
novelty functions
rhythm typology
Proceedings Title: INTERSPEECH 2015 - 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015
Publisher: International Speech Communication Association
Publisher Place: [s.l.]
Page Start: 1007
Page End: 1011
ISSN: 1990-9770
Appears in Collections:FG Audiokommunikation » Publications

Files in This Item:

Accepted manuscript

Format: Adobe PDF | Size: 223.84 kB
DownloadShow Preview

Item Export Bar

Items in DepositOnce are protected by copyright, with all rights reserved, unless otherwise indicated.