Detecting violent content in Hollywood movies by mid-level audio representations

dc.contributor.authorAcar, Esra
dc.contributor.authorHopfgartner, Frank
dc.contributor.authorAlyabrak, Sahin
dc.date.accessioned2018-04-17T09:26:00Z
dc.date.available2018-04-17T09:26:00Z
dc.date.issued2013
dc.description.abstractMovie violent content detection e.g., for providing automated youth protection services is a valuable video content analysis functionality. Choosing discriminative features for the representation of video segments is a key issue in designing violence detection algorithms. In this paper, we employ mid-level audio features which are based on a Bag-of-Audio Words (BoAW) method using Mel-Frequency Cepstral Coefficients (MFCC). BoAW representations are constructed with two different meth- ods, namely the vector quantization-based (VQ-based) method and the sparse coding-based (SC-based) method. We choose two- class support vector machines (SVMs) for classifying video shots as (non-)violent. Our experimental results on detecting violent video shots in Hollywood movies show that the mid-level audio features provide promising results. Additionally, we establish that the SC-based method outperforms the VQ-based one. More importantly, the SC-based method outperforms the unimodal submissions in the MediaEval Violent Scenes Detection (VSD) task except one visual-based method in terms of average precision.en
dc.identifier.isbn978-1-4799-0956-8
dc.identifier.issn1949-3991
dc.identifier.urihttps://depositonce.tu-berlin.de//handle/11303/7609
dc.identifier.urihttp://dx.doi.org/10.14279/depositonce-6799
dc.language.isoenen
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subject.ddc000 Informatik, Informationswissenschaft, allgemeine Werkede
dc.subject.othervideo codingen
dc.subject.otherfeature extractionen
dc.subject.othermel frequency cepstral coefficienten
dc.subject.othervisualizationen
dc.subject.othersupport vector machinesen
dc.subject.otherdictionariesen
dc.subject.othertrainingen
dc.titleDetecting violent content in Hollywood movies by mid-level audio representationsen
dc.typeConference Objecten
dc.type.versionacceptedVersionen
dcterms.bibliographicCitation.doi10.1109/CBMI.2013.6576556en
dcterms.bibliographicCitation.editorCzúni, László
dcterms.bibliographicCitation.editorSchöffmann, Klaus
dcterms.bibliographicCitation.editorSzirányi, Tamás
dcterms.bibliographicCitation.originalpublishernameIEEEen
dcterms.bibliographicCitation.originalpublisherplaceVeszprem, Hungaryen
dcterms.bibliographicCitation.pageend78en
dcterms.bibliographicCitation.pagestart73en
dcterms.bibliographicCitation.proceedingstitle2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI)en
dcterms.bibliographicCitation.volume2013en
tub.accessrights.dnbdomainen
tub.affiliationFak. 4 Elektrotechnik und Informatik>Inst. Wirtschaftsinformatik und Quantitative Methoden>FG Agententechnologien in betrieblichen Anwendungen und der Telekommunikation (AOT)de
tub.affiliation.facultyFak. 4 Elektrotechnik und Informatikde
tub.affiliation.groupFG Agententechnologien in betrieblichen Anwendungen und der Telekommunikation (AOT)de
tub.affiliation.instituteInst. Wirtschaftsinformatik und Quantitative Methodende
tub.publisher.universityorinstitutionTechnische Universität Berlinen
Files
Original bundle
Now showing 1 - 1 of 1
Loading…
Thumbnail Image
Name:
2013_acar_etal.pdf
Size:
2.92 MB
Format:
Adobe Portable Document Format
Description:
Collections