My research interests lie in generative AI, including GANs, diffusion models, and Vision Transformers (ViTs), with the aim of advancing computer vision and image processing techniques. My past work centered on improving the CycleGAN model, and my current focus is on developing enhanced ViT models utilizing local and global attention to achieve more robust image understanding.
Search for Mohammad Mahmoudabadi's papers on the Research page