web statisticsweb statistics Aishik Konwer

Aishik Konwer, PhD


Advanced AI Research Scientist
Center of Advanced AI, Accenture
Email: akonwer@cs.stonybrook.edu, Mobile: +1-6317471244
Resume Google Scholar GithubLinkedin

At the Center of Advanced AI, Accenture, I build multimodal generative agents for video synthesis, develop LLM post-training pipelines with RL-based optimization (GRPO), and engineer spatially grounded VLM pipelines for anomaly detection and zero-shot object recognition. I contribute to multimodal agentic capabilities for AI Refinery and am currently exploring multi-view VLA representations for embodied AI and robotics.

I completed my PhD in Computer Science from Stony Brook University (April 2025). My thesis focused on designing algorithms that learn efficiently from imperfect 2D/3D/multi-stained medical imaging datasets, spanning segmentation, caption generation, and future timepoint prediction. My work includes enhancing multimodal foundational models (VLMs) with efficient prompts and preference optimization, applying GANs and conditional diffusion models, and customizing meta-learning and few-shot algorithms for clinical tasks.

Research Interests: Multimodal LLMs (VLM/VLA), LLM Agents, Generative AI (Diffusion Models, Video Generation), RLHF & Post-Training (SFT, DPO, GRPO), Embodied AI, Video Understanding, Meta-learning, Few-shot Learning, Domain Generalization.

          I am currently on the job market for Research Scientist roles.

 

Recent News

  • NEW 04/2026: 1 paper submitted to ECCV 2026
  • NEW 03/2026: 1 paper accepted to ISBI 2026 [ORAL]
  • NEW 04/2026: 1 preprint: Physical AI: The Next Frontier in AI and Robotics
  • NEW 02/2025: 1 paper accepted to CVPR 2025 [ORAL]
  • 06/2024: 1 paper accepted to MICCAI
  • 08/2023: 1 paper accepted to MICCAI workshop
  • 07/2023: 1 paper accepted to ICCV
  • 07/2022: 1 abstract accepted to RSNA [ORAL]
  • 03/2022: 1 paper accepted to CVPR [ORAL]
  • 10/2021: 1 paper accepted to SPIE Medical Imaging [ORAL]
  • 04/2021: 1 paper accepted to MICCAI [EARLY ACCEPT]
  • 03/2021: 1 paper accepted to MIDL
  • 10/2019: 1 paper accepted to ICASSP
  • 04/2018: 3 papers accepted to ICPR
  • 08/2018: 1 paper accepted to Pattern Recognition
  • 05/2017: 1 paper accepted to ACPR
  •  

    Education

    Stony Brook University, USA
    PhD in Computer Science & Engineering
    Aug 2019 - Apr 2025

    Institute of Engineering & Management, India
    Btech in Electronics and Communication Engineering
    Aug 2013 - June 2017

    Research Experiences

    Indian Statistical Institute, Kolkata, India
    Under Prof. Umapada Pal
    Sept 2016 - Sept 2017 [Certificate]

    Indian Institute of Technology Roorkee, India
    Under Prof. Partha Pratim Roy
    Sept 2017 - March 2018 [Certificate]

    Industry

    Accenture, USA
    Advanced AI Research Scientist
    May 2025 - Present
    GE Healthcare, USA
    AI Scientist Intern
    May 2024 - August 2024
    SRI International, USA
    Deep Learning Research Intern
    May 2023 - August 2023
    Roche, USA
    Advanced ML Research Intern
    May 2022 - August 2022
    Cognizant, India
    Datawarehouse developer
    Dec 2017 - July 2019

     

    Selected Research

    Reasoning-Guided Grounding: Elevating Video Anomaly Detection through Multimodal Large Language Models
    Sakshi Agarwal, Aishik Konwer, Ankit Parag Shah
    Submitted to European Conference on Computer Vision (ECCV), 2026.

    Gaze2Report: Radiology Report Generation via Visual-Gaze Prompt Tuning of LLMs
    Aishik Konwer, Moinak Bhattacharya, Prateek Prasanna
    IEEE International Symposium on Biomedical Imaging (ISBI), 2026. [ORAL]

    Physical AI: The Next Frontier in AI and Robotics to Build Truly Autonomous Machines
    Aishik Konwer et al.
    Preprint, 2026.
    [paper]

    Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation
    Aishik Konwer, Zhijian Yang, Erhan Bas, Cao Xiao, Prateek Prasanna, Parminder Bhatia, Taha Kass-Hout
    IEEE Conference on Computer Vision and Pattern Recognition Conference (CVPR), 2025. [ORAL]
    [paper]

    MetaStain: Stain-generalizable Meta-learning for Cell Segmentation and Classification with Limited Exemplars
    Aishik Konwer, Prateek Prasanna
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2024.
    [paper]

    Enhancing Modality-Agnostic Representations via Meta-Learning for Brain Tumor Segmentation
    Aishik Konwer, Xiaoling Hu, Joseph Bae, Xuan Xu, Chao Chen, Prateek Prasanna
    IEEE International Conference on Computer Vision (ICCV), 2023.
    [paper]

    Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations
    Aishik Konwer, Xuan Xu, Joseph Bae, Chao Chen, Prateek Prasanna
    IEEE Conference on Computer Vision and Pattern Recognition Conference (CVPR), 2022. [ORAL]
    [paper]

    Attention-based Multi-scale Gated Recurrent Encoder with Novel Correlation Loss for COVID-19 Progression Prediction
    Aishik Konwer, Joseph Bae, Gagandeep Singh, Rishabh Gattu, Syed Ali, Jeremy Green, Tej Phatak, Prateek Prasanna
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2021. [EARLY ACCEPT]
    [paper]

    Predicting COVID-19 Lung Infiltrate Progression on Chest Radiographs Using Spatio-temporal LSTM based Encoder-Decoder Network
    Aishik Konwer, Joseph Bae, Gagandeep Singh, Rishabh Gattu, Syed Ali, Jeremy Green, Tej Phatak, Amit Gupta, Chao Chen, Joel Saltz, Prateek Prasanna
    Medical Imaging with Deep Learning (MIDL), 2021.
    [paper]

    Facial Micro-Expression Spotting and Recognition using Time Contrasted Feature with Visual Memory
    Sauradip Nag , Ayan Kumar Bhunia , Aishik Konwer, Partha Pratim Roy
    International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019.
    [paper], [arXiv]

    Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network
    Ankan Kumar Bhunia, Aishik Konwer, Abir Bhowmick, Ayan Kumar Bhunia, Partha Pratim Roy
    Pattern Recognition, 2019. [I.F. 3.962]
    [paper], [arXiv], [GitHub]

    Word Level Font-to-Font Image Translation using Convolutional Recurrent Generative Adversarial Networks
    Ankan Kumar Bhunia, Ayan Kumar Bhunia, Prithaj Banerjee, Aishik Konwer, Abir Bhowmick, Partha Pratim Roy, Umapada Pal
    International Conference on Pattern Recognition (ICPR), 2018
    [paper], [arXiv]

    Staff line Removal using Generative Adversarial Networks
    Aishik Konwer , Ayan Kumar Bhunia, Abir Bhowmick, Ankan Kumar Bhunia, Prithaj Banerjee, Partha Pratim Roy, Umapada Pal
    International Conference on Pattern Recognition (ICPR), 2018 (Oral)
    [paper], [arXiv]

    Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network
    Ayan Kumar Bhunia, Abir Bhowmick, Ankan Kumar Bhunia, Aishik Konwer, Prithaj Banerjee, Partha Pratim Roy, Umapada Pal
    International Conference on Pattern Recognition (ICPR), 2018
    [paper], [arXiv]

    A New GVF Arrow Pattern for Character Segmentation from Double Line License Plate Images
    Palaiahnakote Shivakumara, Aishik Konwer, Abir Bhowmick, Vijeta Khare, Umapada Pal, Tong Lu
    4th Asian Conference on Pattern Recognition (ACPR),, 2017.
    [paper]