Ayan Kumar Bhunia

I am a Doctor of Philosophy (PhD) student focusing on Computer Vision and Deep Learning at the SketchX Lab, Centre for Vision, Speech and Signal Processing (CVSSP), University of Surrey, England, United Kingdom. My primary supervisor is Dr. Yi-Zhe Song, and my co-supervisors are Prof. Tao (Tony) Xiang and Dr. Yongxin Yang.

Prior to that, I worked as a full-time research assistant at the Institute for Media Innovation (IMI) Lab of Nanyang Technological University (NTU), Singapore.

Top-venue conference publications (as of May 2021): 6xCVPR, 1xECCV, 1xSIGGRAPH Asia.

Google Scholar  /  GitHub  /  LinkedIn  /  DBLP

Research Interests

My research is broadly centered on Computer Vision and Deep Learning. In particular, I work on reasoning from sparse image data (e.g. sketches or handwriting) and on how drawings or sketches can be used for visual understanding. In the context of sketches, I have developed novel algorithms that can handle partial data for on-the-fly cross-modal retrieval. Furthermore, I explore deep models in low-resource training-data scenarios using semi-supervised, self-supervised, and few-shot adaptive paradigms.

Note: If you are interested in a potential research collaboration, feel free to contact me by Email or LinkedIn. In particular, I would be happy to collaborate with self-motivated and enthusiastic undergraduate or postgraduate students who intend to pursue an MS/Ph.D. in the future.

Selected Publications

2021
Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting

Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 (New!)

Abstract / Code / arXiv / BibTex

More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval

Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 (New!)

Abstract / Code / arXiv / BibTex

StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval

Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 (New!)

Abstract / arXiv / BibTex

MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition

Ayan Kumar Bhunia, Shuvozit Ghose, Amandeep Kumar, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 (New!)

Abstract / arXiv / BibTex

2020
Pixelor: A Competitive Sketching AI Agent. So you think you can beat me?

Ayan Kumar Bhunia*, Ayan Das*, Umar Riaz Muhammad*, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yulia Gryaditskaya, Yi-Zhe Song.
SIGGRAPH Asia, 2020.

Abstract / Code / arXiv / BibTex / Try Online Demo (*equal contribution)

Cross-Modal Hierarchical Modelling for Fine-Grained Sketch Based Image Retrieval

Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song.
British Machine Vision Conference (BMVC), 2020.

Abstract / arXiv / BibTex (Oral Presentation)

Fine-grained visual classification via progressive multi-granularity training of jigsaw patches

Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song, Jun Guo.
European Conference on Computer Vision (ECCV), 2020.

Abstract / Code / arXiv / BibTex

Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval

Ayan Kumar Bhunia, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

Abstract / Code / arXiv / BibTex (Oral Presentation)

2019
Handwriting Recognition in Low-Resource Scripts Using Adversarial Learning

Ayan Kumar Bhunia, Abhirup Das, Ankan Kumar Bhunia, Perla Sai Raj Kishore, Partha Pratim Roy.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.

Abstract / Code / arXiv / BibTex

Improving Document Binarization via Adversarial Noise-Texture Augmentation

Ankan Kumar Bhunia, Ayan Kumar Bhunia, Aneeshan Sain, Partha Pratim Roy.
IEEE International Conference on Image Processing (ICIP), 2019.

Abstract / Code / arXiv / BibTex (Top 10% Papers)

A Deep One-Shot Network for Query-based Logo Retrieval

Ayan Kumar Bhunia, Ankan Kumar Bhunia, Shuvozit Ghose, Abhirup Das, Partha Pratim Roy, Umapada Pal.
Pattern Recognition (PR), 2019.

Abstract / Code / Third Party Implementation / arXiv / BibTex

User Constrained Thumbnail Generation Using Adaptive Convolutions

Perla Sai Raj Kishore, Ayan Kumar Bhunia, Shuvozit Ghose, Partha Pratim Roy.
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019.

Abstract / Code / arXiv / BibTex (Oral Presentation)

Texture Synthesis Guided Deep Hashing for Texture Image Retrieval

Ayan Kumar Bhunia, Perla Sai Raj Kishore, Pranay Mukherjee, Abhirup Das, Partha Pratim Roy.
IEEE Winter Conference on Applications of Computer Vision (WACV), 2019.

Abstract / arXiv / BibTex / Video Presentation

Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network

Ankan Kumar Bhunia, Aishik Konwer, Ayan Kumar Bhunia, Abir Bhowmick, Partha Pratim Roy, Umapada Pal.
Pattern Recognition (PR), 2019.

Abstract / Code / arXiv / BibTex



Template credits: Dr. Jon Barron