The Role and Utilization of CNN in Automatic Logo Based Document Image Retrieval Methods

  • Authors

    • Raveendra K
    • R Vinoth Kanna
    2018-08-04
    https://doi.org/10.14419/ijet.v7i3.1.16786
  • Keywords: CNN, Computer Vision, Deep Learning, Image Classification, Machine Learning, Pattern Recognition.
  • Abstract

    Automatic logo-based document image retrieval is an essential and widely used method in feature extraction applications. This paper explains the architecture of the Convolutional Neural Network (CNN) in detail, with pictorial representations, so that the complex CNN process can be understood in a simplified way. The main objective of the paper is to utilize the CNN effectively in automatic logo-based document image retrieval methods.

  • How to Cite

    K, R., & Vinoth Kanna, R. (2018). The Role and Utilization of CNN in Automatic Logo Based Document Image Retrieval Methods. International Journal of Engineering & Technology, 7(3.1), 13-16. https://doi.org/10.14419/ijet.v7i3.1.16786

    Received date: 2018-08-03

    Accepted date: 2018-08-03

    Published date: 2018-08-04