Please use this identifier to cite or link to this item: https://idr.l3.nitk.ac.in/jspui/handle/123456789/7915
Full metadata record
DC FieldValueLanguage
dc.contributor.authorRaghuram, M.A.
dc.contributor.authorChavan, N.R.
dc.contributor.authorKoolagudi, S.G.
dc.contributor.authorRamteke, P.B.
dc.date.accessioned2020-03-30T10:03:05Z-
dc.date.available2020-03-30T10:03:05Z-
dc.date.issued2016
dc.identifier.citationCanadian Conference on Electrical and Computer Engineering, 2016, Vol.2016-October, , pp.-en_US
dc.identifier.urihttp://idr.nitk.ac.in/jspui/handle/123456789/7915-
dc.description.abstractIdentifying different audio segments in videos is the first step for many important tasks such as event detection and speech transcription. Approaches using Mel-Frequency Cepstral coefficients (MFCCs) with Gaussian mixture models (GMMs) and hidden Markov models (HMMs) perform reasonably well in stationary conditions but do not scale to a broad range of environmental conditions. This paper focuses on the audio segmentation in broadcast soccer videos into audio classes such as Silence, Speech Only, Speech Over Crowd, Crowd Only and Excited, with an alternative feature set which is simplistic as well as robust to changes in the environment conditions. Support Vector Machines (SVMs), Neural Networks and Random Forest are used for the classification. The accuracy achieved with SVMs, Neural Networks and Random Forest are 83.80%, 86.07%, and 88.35% respectively. The proposed features and Random Forest classifier are found to achieve better accuracy compared to the other classifiers. � 2016 IEEE.en_US
dc.titleEfficient audio segmentation in soccer videosen_US
dc.typeBook chapteren_US
Appears in Collections:2. Conference Papers

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.