Ayan Kumar Bhunia

I am a Doctor of Philosophy (PhD) student, focusing on Computer Vision and Deep Learning, at SketchX Lab. of Centre for Vision, Speech and Signal Processing (CVSSP), University of Surrey, England, United Kingdom. My primary supervisor is Prof. Yi-Zhe Song, and co-supervisors are Prof. Tao(Tony) Xiang and Dr. Yongxin Yang.

Prior to that, I worked as a full-time research assistant at the Institute for Media Innovation (IMI) Lab of Nanyang Technological University (NTU), Singapore.

Top-venue Conference publications (July 2021): 6xCVPR, 3xICCV, 1xECCV, 1xSiggraph Asia.

Google Scholar  /  GitHub  /  LinkedIn  /  DBLP

profile photo
Research Interests

My research focus is broadly centered around Computer Vision and Deep Learning. In particular, I work on reasoning from Sparse Image Data (e.g. sketch or handwriting) and on how visual drawing or sketch could be used for visual understanding. In context of sketch, I have developed novel algorithms that can handle partial data for on-the-fly cross-modal retrieval. I intend to explore deep model under low-resource training data scenario using semi-supervised, self-supervised and few-shot adaptive paradigm. Furthermore, I earlier worked (occasionally work) on Document Image Analysis and Text Recognition related problems.

Notes: If you are interested in some potential research collaboration, feel free to contact me by Email or LinkedIn . Most importantly, I would be happy to collaborate with some really self-motivated and enthusiastic undergraduate or post-graduate students who have intention to pursue MS/Ph.D. in future.

Recent Updates

  • New!! [July 2021]: Three papers got accepted in ICCV 2021!!
  • [June 2021]: Talk on 'Beyond Supervised Sketch Representation Learning' YouTube
  • [March 2021] : Four papers got accepted in CVPR 2021.
  • [Aug 2020] : One paper got accepted in Siggraph Asia 2020. Check Online Demo
  • [Aug 2020] : One paper got accepted in BMVC 2020 for oral presentation.
  • [July 2020] : One paper got accepted in ECCV 2020.
  • [March 2020] : One paper got accepted in CVPR 2020 for oral presentation.
  • Selected Publications

    2021
    Text is Text, No Matter What: Unifying Text Recognition using Knowledge Distillation

    Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Yi-Zhe Song .
    IEEE International Conference on Computer Vision (ICCV), 2021 (New!)

    Abstract / arXiv / BibTex

    Towards the Unseen: Iterative Text Recognition by Distilling from Errors

    Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song .
    IEEE International Conference on Computer Vision (ICCV), 2021 (New!)

    Abstract / arXiv / BibTex

    Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition

    Ayan Kumar Bhunia, Aneeshan Sain, Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Yi-Zhe Song .
    IEEE International Conference on Computer Vision (ICCV), 2021 (New!)

    Abstract / arXiv / BibTex

    Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting

    Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song .
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

    Abstract / Code / arXiv / BibTex

    More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval

    Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song .
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

    Abstract / Code / arXiv / BibTex

    StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval

    Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang and , Tao Xiang, Yi-Zhe Song .
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

    Abstract / arXiv / BibTex

    MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition

    Ayan Kumar Bhunia, Shuvozit Ghose, Amandeep Kumar, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song .
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

    Abstract / arXiv / BibTex

    2020
    Pixelor: A Competitive Sketching AI Agent. So you think you can beat me?

    Ayan Kumar Bhunia*, Ayan Das*, Umar Riaz Muhammad*, Yongxin Yang, Timothy M. Hospedalis, Tao Xiang, Yulia Gryaditskaya, Yi-Zhe Song .
    SIGGRAPH Asia, 2020.

    Abstract / Code / arXiv / BibTex/ Try Online Demo (*equal contribution)

    Cross-Modal Hierarchical Modelling for Fine-Grained Sketch Based Image Retrieval

    Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song .
    British Machine Vision Conference (BMVC ), 2020.

    Abstract / arXiv / BibTex (Oral Presentation)

    Fine-grained visual classification via progressive multi-granularity training of jigsaw patches

    Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song , Jun Guo .
    European Conference on Computer Vision (ECCV ), 2020.

    Abstract / Code/ arXiv / BibTex

    Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval

    Ayan Kumar Bhunia, Yongxin Yang, Timothy M. Hospedalis, Tao Xiang, Yi-Zhe Song.
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

    Abstract / Code / arXiv / BibTex (Oral Presentation)

    2019
    Handwriting Recognition in Low-Resource Scripts Using Adversarial Learning

    Ayan Kumar Bhunia , Abhirup Das, Ankan Kumar Bhunia, Perla Sai Raj Kishore, Partha Pratim Roy.
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019

    Abstract / Code / arXiv / BibTex

    Improving Document Binarization via Adversarial Noise-Texture Augmentation

    Ankan Kumar Bhunia, Ayan Kumar Bhunia , Aneeshan Sain, Partha Pratim Roy.
    IEEE Conference on Image Processing (ICIP), 2019

    Abstract / Code / arXiv / BibTex (Top 10% Papers)

    A Deep One-Shot Network for Query-based Logo Retrieval

    Ayan Kumar Bhunia , Ankan Kumar Bhunia, Shuvozit Ghose, Abhirup Das, Partha Pratim Roy, Umapada Pal
    Pattern Recognition (PR), 2019

    Abstract / Code / Third Party Implementation / arXiv / BibTex

    User Constrained Thumbnail Generation Using Adaptive Convolutions

    Perla Sai Raj Kishore, Ayan Kumar Bhunia, Shovozit Ghose, Partha Pratim Roy
    International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019

    Abstract / Code / arXiv / BibTex (Oral Presentation)

    Texture Synthesis Guided Deep Hashing for Texture Image Retrieval

    Ayan Kumar Bhunia, Perla Sai Raj Kishore, Pranay Mukherjee, Abhirup Das, Partha Pratim Roy
    IEEE Winter Conference on Applications of Computer Vision (WACV), 2019

    Abstract / arXiv / BibTex/ Video Presentation

    Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network

    Ankan Kumar Bhunia, Aishik Konwer, Ayan Kumar Bhunia , Abir Bhowmick, Partha Pratim Roy, Umapada Pal
    Pattern Recognition (PR), 2019

    Abstract / Code / arXiv / BibTex



    Template credits : Dr. Jon Barron