Using the beat histogram for speech rhythm description and language identification

dc.contributor.authorLykartsis, Athanasios
dc.contributor.authorWeinzierl, Stefan
dc.date.accessioned2020-02-24T17:20:13Z
dc.date.available2020-02-24T17:20:13Z
dc.date.issued2015
dc.description.abstractIn this paper we present a novel approach for the description of speech rhythm and the extraction of rhythm-related features for automatic language identification (LID). Previous methods have extracted speech rhythm through the calculation of features based on salient elements of speech such as consonants, vowels and syllables. We present how an automatic rhythm extraction method borrowed from music information retrieval, the beat histogram, can be adapted for the analysis of speech rhythm by defining the most relevant novelty functions in the speech signal and extracting features describing their periodicities. We have evaluated those features in a rhythm-based LID task for two multilingual speech corpora using support vector machines, including feature selection methods to identify the most informative descriptors. Results suggest that the method is successful in describing speech rhythm and provides LID classification accuracy comparable to or better than that of other approaches, without the need for a preceding segmentation or annotation of the speech signal. Concerning rhythm typology, the rhythm class hypothesis in its original form seems to be only partly confirmed by our results.en
dc.identifier.issn1990-9770
dc.identifier.urihttps://depositonce.tu-berlin.de/handle/11303/10819
dc.identifier.urihttp://dx.doi.org/10.14279/depositonce-9714
dc.language.isoenen
dc.relation.ispartof10.14279/depositonce-9530
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subject.ddc620 Ingenieurwissenschaften und zugeordnete Tätigkeitenen
dc.subject.ddc780 Musikde
dc.subject.otherspeech rhythmen
dc.subject.otherbeat histogramen
dc.subject.otherlanguage identificationen
dc.subject.othernovelty functionsen
dc.subject.otherrhythm typologyen
dc.titleUsing the beat histogram for speech rhythm description and language identificationen
dc.typeConference Objecten
dc.type.versionacceptedVersionen
dcterms.bibliographicCitation.originalpublishernameInternational Speech Communication Associationen
dcterms.bibliographicCitation.originalpublisherplace[s.l.]en
dcterms.bibliographicCitation.pageend1011en
dcterms.bibliographicCitation.pagestart1007en
dcterms.bibliographicCitation.proceedingstitleINTERSPEECH 2015 - 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015en
tub.accessrights.dnbfreeen
tub.affiliationFak. 1 Geistes- und Bildungswissenschaften::Inst. Sprache und Kommunikation::FG Audiokommunikationde
tub.affiliation.facultyFak. 1 Geistes- und Bildungswissenschaftende
tub.affiliation.groupFG Audiokommunikationde
tub.affiliation.instituteInst. Sprache und Kommunikationde
tub.publisher.universityorinstitutionTechnische Universität Berlinen

Files

Original bundle
Now showing 1 - 1 of 1
Loading…
Thumbnail Image
Name:
lykartsis_weinzierl_2015.pdf
Size:
223.84 KB
Format:
Adobe Portable Document Format
Description:
Accepted manuscript
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.9 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections