Publisher: Academy & Industry Research Collaboration Center (AIRCC)
Abstract: This work proposes a novel system for Violent Scenes Detection based on combining visual and audio features with machine learning at the segment level. Multiple Kernel Learning is applied to make the most of the multimodal nature of videos. In particular, Mid-level Violence Clustering is proposed so that mid-level concepts are learned implicitly, without manually tagged annotations. Finally, a violence score is calculated for each shot. The whole system is trained on a dataset from the MediaEval 2013 Affect Task and evaluated with its official metric. The results obtained outperform the best score reported for that task.
Keywords: Multimedia Analysis; Video Processing; Machine Learning
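
The idea of combining visual and audio features through kernels can be illustrated with a minimal sketch. It is not the paper's implementation. Several things here are assumptions made for illustration: the kernel weights are fixed by hand (true Multiple Kernel Learning would learn them), the classifier is a simple mean-similarity rule rather than whatever learner the authors used, and the feature vectors, modality names, and function names are hypothetical toy values.

```python
import math

def rbf_kernel(x, y, gamma=1.0):
    """Gaussian (RBF) kernel between two feature vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

def combined_kernel(seg_a, seg_b, weights):
    """Weighted sum of one RBF kernel per modality.

    Each segment is a dict mapping a modality name to a feature vector.
    ASSUMPTION: the weights are fixed by hand; real MKL learns them.
    """
    return sum(w * rbf_kernel(seg_a[m], seg_b[m]) for m, w in weights.items())

def violence_score(segment, train_segments, train_labels, weights):
    """Mean similarity to violent examples minus mean similarity
    to non-violent ones (a stand-in for a trained classifier)."""
    pos = [combined_kernel(segment, s, weights)
           for s, y in zip(train_segments, train_labels) if y == 1]
    neg = [combined_kernel(segment, s, weights)
           for s, y in zip(train_segments, train_labels) if y == 0]
    return sum(pos) / len(pos) - sum(neg) / len(neg)

# Hypothetical toy training segments: label 1 = violent, 0 = non-violent.
train_segments = [
    {"visual": [0.9, 0.8], "audio": [0.7]},
    {"visual": [0.8, 0.9], "audio": [0.9]},
    {"visual": [0.1, 0.2], "audio": [0.1]},
    {"visual": [0.2, 0.1], "audio": [0.2]},
]
train_labels = [1, 1, 0, 0]
weights = {"visual": 0.5, "audio": 0.5}  # hand-picked, sums to 1

loud_fight = {"visual": [0.85, 0.85], "audio": [0.8]}
quiet_talk = {"visual": [0.15, 0.15], "audio": [0.15]}

score_violent = violence_score(loud_fight, train_segments, train_labels, weights)
score_calm = violence_score(quiet_talk, train_segments, train_labels, weights)
print(score_violent > score_calm)
```

In this toy setting a segment close to the violent examples in both modalities receives a positive score and one close to the non-violent examples receives a negative score. How segment scores are aggregated into a per-shot score is not specified in the abstract and is left out of the sketch.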