Speaker Identification for Swiss German with Spectral and Rhythm Features

dc.contributor.authorLykartsis, Athanasios
dc.contributor.authorWeinzierl, Stefan
dc.contributor.authorDellwo, Volker
dc.date.accessioned2020-02-24T17:33:58Z
dc.date.available2020-02-24T17:33:58Z
dc.date.issued2017-06-13
dc.description.abstractWe present results of speech rhythm analysis for automatic speaker identification. We expand previous experiments using similar methods for language identification. Features describing the rhythmic properties of salient changes in signal components are extracted and used in an speaker identification task to determine to which extent they are descriptive of speaker variability. We also test the performance of state-of-the-art but simple-to-extract frame-based features. The paper focus is the evaluation on one corpus (swiss german, TEVOID) using support vector machines. Results suggest that the general spectral features can provide very good performance on this dataset, whereas the rhythm features are not as successful in the task, indicating either the lack of suitability for this task or the dataset specificity.en
dc.identifier.urihttps://depositonce.tu-berlin.de/handle/11303/10821
dc.identifier.urihttp://dx.doi.org/10.14279/depositonce-9716
dc.language.isoenen
dc.relation.ispartof10.14279/depositonce-9530
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subject.ddc620 Ingenieurwissenschaften und zugeordnete Tätigkeitende
dc.subject.ddc780 Musikde
dc.subject.otherspeech rhythmen
dc.subject.otherspeaker identificationen
dc.subject.otherlanguage identificationen
dc.subject.otherbeat histogramen
dc.subject.otherswiss germanen
dc.titleSpeaker Identification for Swiss German with Spectral and Rhythm Featuresen
dc.typeConference Objecten
dc.type.versionacceptedVersionen
dcterms.bibliographicCitation.articlenumber2-1en
dcterms.bibliographicCitation.originalpublishernameAudio Engineering Societyen
dcterms.bibliographicCitation.originalpublisherplaceNew York, NYen
dcterms.bibliographicCitation.proceedingstitle2017 AES International Conference on Semantic Audioen
tub.accessrights.dnbfreeen
tub.affiliationFak. 1 Geistes- und Bildungswissenschaften::Inst. Sprache und Kommunikation::FG Audiokommunikationde
tub.affiliation.facultyFak. 1 Geistes- und Bildungswissenschaftende
tub.affiliation.groupFG Audiokommunikationde
tub.affiliation.instituteInst. Sprache und Kommunikationde
tub.publisher.universityorinstitutionTechnische Universität Berlinen

Files

Original bundle
Now showing 1 - 1 of 1
Loading…
Thumbnail Image
Name:
lykartsis_etal_2017.pdf
Size:
622.04 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.9 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections