Limits...
Neural computation of visual imaging based on Kronecker product in the primary visual cortex.

Songnian Z, Qi Z, Zhen J, Guozheng Y, Li Y - BMC Neurosci (2010)

Bottom Line: Our results are: 1.Fourier Transformation is replaced with Kronecker product, which greatly reduces the computational complexity.Clearly, the above new perspective has some reference significance to exploring the neural mechanisms on the human visual information processing.

View Article: PubMed Central - HTML - PubMed

Affiliation: LAPC, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China.

ABSTRACT

Background: What kind of neural computation is actually performed by the primary visual cortex and how is this represented mathematically at the system level? It is an important problem in the visual information processing, but has not been well answered. In this paper, according to our understanding of retinal organization and parallel multi-channel topographical mapping between retina and primary visual cortex V1, we divide an image into orthogonal and orderly array of image primitives (or patches), in which each patch will evoke activities of simple cells in V1. From viewpoint of information processing, this activated process, essentially, involves optimal detection and optimal matching of receptive fields of simple cells with features contained in image patches. For the reconstruction of the visual image in the visual cortex V1 based on the principle of minimum mean squares error, it is natural to use the inner product expression in neural computation, which then is transformed into matrix form.

Results: The inner product is carried out by using Kronecker product between patches and function architecture (or functional column) in localized and oriented neural computing. Compared with Fourier Transform, the mathematical description of Kronecker product is simple and intuitive, so is the algorithm more suitable for neural computation of visual cortex V1. Results of computer simulation based on two-dimensional Gabor pyramid wavelets show that the theoretical analysis and the proposed model are reasonable.

Conclusions: Our results are: 1. The neural computation of the retinal image in cortex V1 can be expressed to Kronecker product operation and its matrix form, this algorithm is implemented by the inner operation between retinal image primitives and primary visual cortex's column. It has simple, efficient and robust features, which is, therefore, such a neural algorithm, which can be completed by biological vision. 2. It is more suitable that the function of cortical column in cortex V1 is considered as the basic unit of visual image processing (such unit can implement basic multiplication of visual primitives, such as contour, line, and edge), rather than a set of tiled array filter. Fourier Transformation is replaced with Kronecker product, which greatly reduces the computational complexity. The neurobiological basis of this idea is that a visual image can be represented as a linear combination of orderly orthogonal primitive image containing some local feature. In the visual pathway, the image patches are topographically mapped onto cortex V1 through parallel multi-channels and then are processed independently by functional columns. Clearly, the above new perspective has some reference significance to exploring the neural mechanisms on the human visual information processing.

Show MeSH

Related in: MedlinePlus

Optimal matching between a patch (upper right corner of the hat) and receptive fields of specific orientations in cortical modules [Bk,l (s)]K × L.
© Copyright Policy - open-access
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC2865487&req=5

Figure 5: Optimal matching between a patch (upper right corner of the hat) and receptive fields of specific orientations in cortical modules [Bk,l (s)]K × L.

Mentions: Therefore, some edge or contour located at (i, j) in retinal image [Ri,j(a)]M × N with area a will find a best match with the receptive field gi,j(b) of the same orientation and shape. When ρrg = 1, it means the patch at (i, j) in [Ri,j(a)]M × N completely matches the cortical module [Bk,l(s)]K × L with the specific orientation. Then the neuron is activated with the strongest response. The process of this image reconstruction is shown in figure 5.


Neural computation of visual imaging based on Kronecker product in the primary visual cortex.

Songnian Z, Qi Z, Zhen J, Guozheng Y, Li Y - BMC Neurosci (2010)

Optimal matching between a patch (upper right corner of the hat) and receptive fields of specific orientations in cortical modules [Bk,l (s)]K × L.
© Copyright Policy - open-access
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC2865487&req=5

Figure 5: Optimal matching between a patch (upper right corner of the hat) and receptive fields of specific orientations in cortical modules [Bk,l (s)]K × L.
Mentions: Therefore, some edge or contour located at (i, j) in retinal image [Ri,j(a)]M × N with area a will find a best match with the receptive field gi,j(b) of the same orientation and shape. When ρrg = 1, it means the patch at (i, j) in [Ri,j(a)]M × N completely matches the cortical module [Bk,l(s)]K × L with the specific orientation. Then the neuron is activated with the strongest response. The process of this image reconstruction is shown in figure 5.

Bottom Line: Our results are: 1.Fourier Transformation is replaced with Kronecker product, which greatly reduces the computational complexity.Clearly, the above new perspective has some reference significance to exploring the neural mechanisms on the human visual information processing.

View Article: PubMed Central - HTML - PubMed

Affiliation: LAPC, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China.

ABSTRACT

Background: What kind of neural computation is actually performed by the primary visual cortex and how is this represented mathematically at the system level? It is an important problem in the visual information processing, but has not been well answered. In this paper, according to our understanding of retinal organization and parallel multi-channel topographical mapping between retina and primary visual cortex V1, we divide an image into orthogonal and orderly array of image primitives (or patches), in which each patch will evoke activities of simple cells in V1. From viewpoint of information processing, this activated process, essentially, involves optimal detection and optimal matching of receptive fields of simple cells with features contained in image patches. For the reconstruction of the visual image in the visual cortex V1 based on the principle of minimum mean squares error, it is natural to use the inner product expression in neural computation, which then is transformed into matrix form.

Results: The inner product is carried out by using Kronecker product between patches and function architecture (or functional column) in localized and oriented neural computing. Compared with Fourier Transform, the mathematical description of Kronecker product is simple and intuitive, so is the algorithm more suitable for neural computation of visual cortex V1. Results of computer simulation based on two-dimensional Gabor pyramid wavelets show that the theoretical analysis and the proposed model are reasonable.

Conclusions: Our results are: 1. The neural computation of the retinal image in cortex V1 can be expressed to Kronecker product operation and its matrix form, this algorithm is implemented by the inner operation between retinal image primitives and primary visual cortex's column. It has simple, efficient and robust features, which is, therefore, such a neural algorithm, which can be completed by biological vision. 2. It is more suitable that the function of cortical column in cortex V1 is considered as the basic unit of visual image processing (such unit can implement basic multiplication of visual primitives, such as contour, line, and edge), rather than a set of tiled array filter. Fourier Transformation is replaced with Kronecker product, which greatly reduces the computational complexity. The neurobiological basis of this idea is that a visual image can be represented as a linear combination of orderly orthogonal primitive image containing some local feature. In the visual pathway, the image patches are topographically mapped onto cortex V1 through parallel multi-channels and then are processed independently by functional columns. Clearly, the above new perspective has some reference significance to exploring the neural mechanisms on the human visual information processing.

Show MeSH
Related in: MedlinePlus