Limits...
Computational Model of Primary Visual Cortex Combining Visual Attention for Action Recognition.

Shu N, Gao Z, Chen X, Liu H - PLoS ONE (2015)

Bottom Line: Based on the inhibitory effect of stimuli outside the classical receptive field caused by lateral connections of spiking neuron networks in V1, we propose surround suppressive operator to further process spatiotemporal information.Moreover, in order to represent the human action, we consider the characteristic of the neural code: mean motion map based on analysis of spike trains generated by spiking neurons.The experimental evaluation on some publicly available action datasets and comparison with the state-of-the-art approaches demonstrate the superior performance of the proposed model.

View Article: PubMed Central - PubMed

Affiliation: School of Biomedical Engineering, South-Central University for Nationalities, Wuhan 430074, China.

ABSTRACT
Humans can easily understand other people's actions through visual systems, while computers cannot. Therefore, a new bio-inspired computational model is proposed in this paper aiming for automatic action recognition. The model focuses on dynamic properties of neurons and neural networks in the primary visual cortex (V1), and simulates the procedure of information processing in V1, which consists of visual perception, visual attention and representation of human action. In our model, a family of the three-dimensional spatial-temporal correlative Gabor filters is used to model the dynamic properties of the classical receptive field of V1 simple cell tuned to different speeds and orientations in time for detection of spatiotemporal information from video sequences. Based on the inhibitory effect of stimuli outside the classical receptive field caused by lateral connections of spiking neuron networks in V1, we propose surround suppressive operator to further process spatiotemporal information. Visual attention model based on perceptual grouping is integrated into our model to filter and group different regions. Moreover, in order to represent the human action, we consider the characteristic of the neural code: mean motion map based on analysis of spike trains generated by spiking neurons. The experimental evaluation on some publicly available action datasets and comparison with the state-of-the-art approaches demonstrate the superior performance of the proposed model.

No MeSH data available.


Related in: MedlinePlus

Confusion matrices on KTH dataset.From left to right: s1, s2, s3 and s4.
© Copyright Policy
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC4489578&req=5

pone.0130569.g016: Confusion matrices on KTH dataset.From left to right: s1, s2, s3 and s4.

Mentions: Fig 16 represents the confusion matrices of the classification on the KTH dataset using our approach. The column of the confusion matrix represents the instances to be classified, while each row represents the corresponding classification results. The main confusion occurs between jogging and running in four different scenarios. It is a difficult challenge to distinguish the jogging and running because the two actions performed by some subjects are very similar.


Computational Model of Primary Visual Cortex Combining Visual Attention for Action Recognition.

Shu N, Gao Z, Chen X, Liu H - PLoS ONE (2015)

Confusion matrices on KTH dataset.From left to right: s1, s2, s3 and s4.
© Copyright Policy
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC4489578&req=5

pone.0130569.g016: Confusion matrices on KTH dataset.From left to right: s1, s2, s3 and s4.
Mentions: Fig 16 represents the confusion matrices of the classification on the KTH dataset using our approach. The column of the confusion matrix represents the instances to be classified, while each row represents the corresponding classification results. The main confusion occurs between jogging and running in four different scenarios. It is a difficult challenge to distinguish the jogging and running because the two actions performed by some subjects are very similar.

Bottom Line: Based on the inhibitory effect of stimuli outside the classical receptive field caused by lateral connections of spiking neuron networks in V1, we propose surround suppressive operator to further process spatiotemporal information.Moreover, in order to represent the human action, we consider the characteristic of the neural code: mean motion map based on analysis of spike trains generated by spiking neurons.The experimental evaluation on some publicly available action datasets and comparison with the state-of-the-art approaches demonstrate the superior performance of the proposed model.

View Article: PubMed Central - PubMed

Affiliation: School of Biomedical Engineering, South-Central University for Nationalities, Wuhan 430074, China.

ABSTRACT
Humans can easily understand other people's actions through visual systems, while computers cannot. Therefore, a new bio-inspired computational model is proposed in this paper aiming for automatic action recognition. The model focuses on dynamic properties of neurons and neural networks in the primary visual cortex (V1), and simulates the procedure of information processing in V1, which consists of visual perception, visual attention and representation of human action. In our model, a family of the three-dimensional spatial-temporal correlative Gabor filters is used to model the dynamic properties of the classical receptive field of V1 simple cell tuned to different speeds and orientations in time for detection of spatiotemporal information from video sequences. Based on the inhibitory effect of stimuli outside the classical receptive field caused by lateral connections of spiking neuron networks in V1, we propose surround suppressive operator to further process spatiotemporal information. Visual attention model based on perceptual grouping is integrated into our model to filter and group different regions. Moreover, in order to represent the human action, we consider the characteristic of the neural code: mean motion map based on analysis of spike trains generated by spiking neurons. The experimental evaluation on some publicly available action datasets and comparison with the state-of-the-art approaches demonstrate the superior performance of the proposed model.

No MeSH data available.


Related in: MedlinePlus