In this paper, we propose an end-to-end learning framework called deep ensemble machine (DEM) for video classification. Video classification has been extensively researched in computer vision due to its wide spread applications.
To achieve efficient classification, we propose ensemble learning based on random projections aiming to transform high-dimensional features into a set of lower dimensional compact features in subspaces; an ensemble of classifiers is trained on the subspaces and combined with a weighting layer during the back propagation.
To further enhance the performance, we introduce rectified linear encoding (RLE) inspired from error-correcting output coding to encode the initial outputs of classifiers, followed by a softmax layer to produce the final classification results.
We show the great effectiveness of DEM by extensive experiments on four data sets for diverse video classification tasks including action recognition and dynamic scene classification. Results have shown that DEM achieves high performance on all tasks with an improvement of up to 13% on CIFAR10 data set over the baseline model.
NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

