5th Workshop on Document Analysis and Recognition

In Conjunction with ICVGIP 2021

December 19th 2021


The ability to understand the content and structure of documents as well as text that appears in the form of handwriting or scene text is critical for intelligent systems to operate and interact with humans. With advances from research in the past few decades, the area has moved from solutions that work with highly restricted data to real-world degraded noisy document images and those acquired using hand-held cameras or handwriting data captured by mobile devices. The fifth workshop on document analysis and recognition aims to provide a platform for researchers to present techniques for solving such problems and interact with their peers in academia and industry.

Click HERE to view a video recording of the workshop. Password: Zt9pDAGq


All times in IST

2:00pm - 2:10pm Welcome and Introduction by workshop chairs
2:10pm - 3:00pm Invited Talk 1 (Lluis Gomez)
Title: Scene Text Beyond Reading: from simple text reading to reasoning with text
3:00pm - 3:20pm Paper Presentation - 1
Title: Model Compression Based Lightweight Online Signature Verification Framework
3:20pm - 3:40pm Paper Presentation - 2
Title: End to End Transformer based Architecture for Text Recognition from Document Images
3:40pm - 4:30pm Invited Talk 2 (Ganesh Ramakrishnan)
Title: Data efficient Document Analysis
4:30pm - 4:50pm Paper Presentation - 3
Title: A Hybrid Approach for Table Detection in Document Images
4:50pm - 5:00pm Closing Remarks

Accepted Papers

  • Model Compression Based Lightweight Online Signature Verification Framework, Authors: Chandra V sekhar (IIIT SriCity); Balasubramanian S (SSSIHL); Viswanath P (IIIT Sri City); Avinash Gautam (BITS Pilani)
  • End to End Transformer based Architecture for Text Recognition from Document Images, Authors: Dipankar Ganguly (CDAC); Akkshita Trivedi (IIT Jodhpur); Bhupendra Kumar (CDAC Noida); Tushar Patnaik (C-DAC); Santanu Chaudhury (IIT Delhi)
  • A Hybrid Approach for Table Detection in Document Images, Authors: Sunil Kumar Vengalil (IIIT Bangalore); Kevin Xavier (Razorthink); Amith Konda Sai (IIIT Bangalore); Sree Koyi Harsha (IIIT Banglore); Ganesh Barma (IIIT, Bangalore); Neelam Sinha (IIIT Bangalore)

  • Invited Speakers

    Prof. Ganesh Ramakrishnan is currently serving as an Institute Chair Professor at the Department of Computer Science and Engineering, IIT Bombay. He is also the Professor-in-charge of Koita Centre for Digital Health at IIT Bombay. His areas of research include human assisted AI/ML, AI/ML in resource constrained environments, learning with symbolic encoding of domain knowledge in ML and NLP, etc. More recently, he has been focusing his energy on organizing relevant machine learning modules for resource constrained environments into DECILE . He has demonstrated the impact of such data efficient machine learning in applications such as Video Analytics , machine translation end-to-end eco-system , OCR and in multi-modal analytics. He has received awards such as IBM Faculty Award, Dr. P.K. Patwardhan Technology Development Award, and awards from Qualcomm, Microsoft as well as IIT Bombay Impactful Research Award. He also held the J.R. Isaac Chair at IIT Bombay. Ganesh is very passionate about boosting the AI research eco-system for India and towards that, the research by him and his students as well as collaborators has resulted in startups that he has either jointly founded, has transferred technology to, or is mentoring.

    Dr. Lluís Gómez is a Postdoctoral Researcher at the Computer Vision Center - Universitat Autonoma de Barcelona. He is a member of the Intelligent Reading Systems Group, where he works in developing algorithms to make computers read and understand textual information in unconstrained scenarios. He has done research stays at the Media Integration and Communication Center (MICC) - University of Florence, and the Intelligent Media Processing Group - Osaka Prefecture University, Japan. He has also collaborated with other prominent research groups in the organization of the ICDAR Robust Reading Competitions. His research interests include a variety of different topics in machine learning and computer vision. Currently he works on deep embedding models, multi modal self-supervised learning, joint modeling of textual and visual information, and CNN architectures for scene text understanding.

    Call for Papers

    Paper topics may include but are not limited to:
    • Text Recognition, including from video sources
    • Document Layout Analysis and Understanding
    • Historical and Degraded Document Analysis
    • Multilingual Document Recognition
    • Document Image Compression
    • Camera-captured Document Analysis
    • Handwriting Recognition Technologies
    • Writer/style Identification, Adaptation
    • Document Image Retrieval
    • Datasets, Performance evaluation and Metrics
    Submission: Please refer to manuscript preparation and submission instructions as provided HERE . IMPORTANT: Do NOT submit your paper at ICVGIP main conference portal. Submit your paper at Workshop's CMT Submission Portal

    Important Dates

    Paper Submission Deadline October 21 2021, 23:59 Indian Standard Time (IST) - EXTENDED
    Notification to Authors November 10 2021
    Camera-Ready Deadline November 15 2021
    Workshop Date December 19 2021


    Program Committee