Browse by author
Lookup NU author(s): Dr Ali Alameer, Ghazal Ghazaei, Professor Patrick Degenaar, Professor Kianoush Nazarpour
The hierarchical MAX (HMAX) model of human visual system has been used in robotics and autonomous systems widely. However, there is still a stark gap between human and robotic vision in observing the environment and intelligently categorising the objects. Therefore, improving models such as the HMAX is still topical. In this work, in order to enhance the performance of HMAX in an object recognition task, we augmented it using an elastic net-regularised dictionary learning approach. We used the notion of sparse coding in the S layers of the HMAX model to extract mid-and high-level, i.e. abstract, features from input images. In addition, we used spatial pyramid pooling (SPP) at the output of higher layers to create a fixed feature vectors before feeding them into a softmax classifier. In our model, the sparse coefficients calculated by the elastic net-regularised dictionary learning algorithm were used to train and test the model. With this setup, we achieved a classification accuracy of 82.6387% 3.7183% averaged across 5-folds which is significantly better than that achieved with the original HMAX.
Author(s): Alameer A, Ghazaei G, Degenaar P, Nazarpour K
Publication type: Conference Proceedings (inc. Abstract)
Publication status: Published
Conference Name: 2nd IET International Conference on Intelligent Signal Processing 2015 (ISP)
Year of Conference: 2015
Online publication date: 17/11/2016
Acceptance date: 01/12/2015
Date deposited: 29/01/2018
Publisher: Institution of Engineering and Technology
URL: http://doi.org/10.1049/cp.2015.1753
DOI: 10.1049/cp.2015.1753