Aishik Konwer


						Aishik Konwer, PhD Advanced AI Research Scientist Center of Advanced AI, Accenture Email: akonwer@cs.stonybrook.edu, Mobile: +1-6317471244 Resume • Google Scholar • Github • Linkedin

At the Center of Advanced AI, Accenture, I build multimodal generative agents for video synthesis, develop LLM post-training pipelines with RL-based optimization (GRPO), and engineer spatially grounded VLM pipelines for anomaly detection and zero-shot object recognition. I contribute to multimodal agentic capabilities for AI Refinery and am currently exploring multi-view VLA representations for embodied AI and robotics.

I completed my PhD in Computer Science from Stony Brook University (April 2025). My thesis focused on designing algorithms that learn efficiently from imperfect 2D/3D/multi-stained medical imaging datasets, spanning segmentation, caption generation, and future timepoint prediction. My work includes enhancing multimodal foundational models (VLMs) with efficient prompts and preference optimization, applying GANs and conditional diffusion models, and customizing meta-learning and few-shot algorithms for clinical tasks.

Research Interests: Multimodal LLMs (VLM/VLA), LLM Agents, Generative AI (Diffusion Models, Video Generation), RLHF & Post-Training (SFT, DPO, GRPO), Embodied AI, Video Understanding, Meta-learning, Few-shot Learning, Domain Generalization.

I am currently on the job market for Research Scientist roles.

Recent News

^NEW 04/2026: 1 paper submitted to ECCV 2026

^NEW 03/2026: 1 paper accepted to ISBI 2026 [ORAL]

^NEW 04/2026: 1 preprint: Physical AI: The Next Frontier in AI and Robotics

^NEW 02/2025: 1 paper accepted to CVPR 2025 [ORAL]

06/2024: 1 paper accepted to MICCAI

08/2023: 1 paper accepted to MICCAI workshop

07/2023: 1 paper accepted to ICCV

07/2022: 1 abstract accepted to RSNA [ORAL]

03/2022: 1 paper accepted to CVPR [ORAL]

10/2021: 1 paper accepted to SPIE Medical Imaging [ORAL]

04/2021: 1 paper accepted to MICCAI [EARLY ACCEPT]

03/2021: 1 paper accepted to MIDL

10/2019: 1 paper accepted to ICASSP

04/2018: 3 papers accepted to ICPR

08/2018: 1 paper accepted to Pattern Recognition

05/2017: 1 paper accepted to ACPR

Education


						Stony Brook University, USA PhD in Computer Science & Engineering Aug 2019 - Apr 2025


						Institute of Engineering & Management, India Btech in Electronics and Communication Engineering Aug 2013 - June 2017

Research Experiences


						Indian Statistical Institute, Kolkata, India Under Prof. Umapada Pal Sept 2016 - Sept 2017 [Certificate]


						Indian Institute of Technology Roorkee, India Under Prof. Partha Pratim Roy Sept 2017 - March 2018 [Certificate]

Industry


						Accenture, USA Advanced AI Research Scientist May 2025 - Present


						GE Healthcare, USA AI Scientist Intern May 2024 - August 2024


						SRI International, USA Deep Learning Research Intern May 2023 - August 2023


						Roche, USA Advanced ML Research Intern May 2022 - August 2022


						Cognizant, India Datawarehouse developer Dec 2017 - July 2019

Selected Research

	Reasoning-Guided Grounding: Elevating Video Anomaly Detection through Multimodal Large Language Models Sakshi Agarwal, Aishik Konwer, Ankit Parag Shah Submitted to European Conference on Computer Vision (ECCV), 2026.
	Gaze2Report: Radiology Report Generation via Visual-Gaze Prompt Tuning of LLMs Aishik Konwer, Moinak Bhattacharya, Prateek Prasanna IEEE International Symposium on Biomedical Imaging (ISBI), 2026. [ORAL]
	Physical AI: The Next Frontier in AI and Robotics to Build Truly Autonomous Machines Aishik Konwer et al. Preprint, 2026. [paper]
	Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation Aishik Konwer, Zhijian Yang, Erhan Bas, Cao Xiao, Prateek Prasanna, Parminder Bhatia, Taha Kass-Hout IEEE Conference on Computer Vision and Pattern Recognition Conference (CVPR), 2025. [ORAL] [paper]
	MetaStain: Stain-generalizable Meta-learning for Cell Segmentation and Classification with Limited Exemplars Aishik Konwer, Prateek Prasanna International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2024. [paper]
	Enhancing Modality-Agnostic Representations via Meta-Learning for Brain Tumor Segmentation Aishik Konwer, Xiaoling Hu, Joseph Bae, Xuan Xu, Chao Chen, Prateek Prasanna IEEE International Conference on Computer Vision (ICCV), 2023. [paper]
	Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations Aishik Konwer, Xuan Xu, Joseph Bae, Chao Chen, Prateek Prasanna IEEE Conference on Computer Vision and Pattern Recognition Conference (CVPR), 2022. [ORAL] [paper]
	Attention-based Multi-scale Gated Recurrent Encoder with Novel Correlation Loss for COVID-19 Progression Prediction Aishik Konwer, Joseph Bae, Gagandeep Singh, Rishabh Gattu, Syed Ali, Jeremy Green, Tej Phatak, Prateek Prasanna International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2021. [EARLY ACCEPT] [paper]
	Predicting COVID-19 Lung Infiltrate Progression on Chest Radiographs Using Spatio-temporal LSTM based Encoder-Decoder Network Aishik Konwer, Joseph Bae, Gagandeep Singh, Rishabh Gattu, Syed Ali, Jeremy Green, Tej Phatak, Amit Gupta, Chao Chen, Joel Saltz, Prateek Prasanna Medical Imaging with Deep Learning (MIDL), 2021. [paper]
	Facial Micro-Expression Spotting and Recognition using Time Contrasted Feature with Visual Memory Sauradip Nag , Ayan Kumar Bhunia , Aishik Konwer, Partha Pratim Roy International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019. [paper], [arXiv]
	Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network Ankan Kumar Bhunia, Aishik Konwer, Abir Bhowmick, Ayan Kumar Bhunia, Partha Pratim Roy Pattern Recognition, 2019. [I.F. 3.962] [paper], [arXiv], [GitHub]
	Word Level Font-to-Font Image Translation using Convolutional Recurrent Generative Adversarial Networks Ankan Kumar Bhunia, Ayan Kumar Bhunia, Prithaj Banerjee, Aishik Konwer, Abir Bhowmick, Partha Pratim Roy, Umapada Pal *International Conference on Pattern Recognition (ICPR), 2018* [paper], [arXiv]
	Staff line Removal using Generative Adversarial Networks Aishik Konwer , Ayan Kumar Bhunia, Abir Bhowmick, Ankan Kumar Bhunia, Prithaj Banerjee, Partha Pratim Roy, Umapada Pal International Conference on Pattern Recognition (ICPR), 2018 (Oral) [paper], [arXiv]
	Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network Ayan Kumar Bhunia, Abir Bhowmick, Ankan Kumar Bhunia, Aishik Konwer, Prithaj Banerjee, Partha Pratim Roy, Umapada Pal International Conference on Pattern Recognition (ICPR), 2018 [paper], [arXiv]
	A New GVF Arrow Pattern for Character Segmentation from Double Line License Plate Images Palaiahnakote Shivakumara, Aishik Konwer, Abir Bhowmick, Vijeta Khare, Umapada Pal, Tong Lu 4th Asian Conference on Pattern Recognition (ACPR),, 2017. [paper]

Aishik Konwer, PhD

Recent News

Education

Research Experiences

Industry

Selected Research