Girish G Hegde
Girish G Hegde
4th Year · BS–MS Data Science
IISER Thiruvananthapuram
Download CV ↓

Hello. I'm a 4th-year BS–MS Data Science student at IISER Thiruvananthapuram, working at the intersection of deep learning, computer vision, medical imaging, and trustworthy AI.

My current interests are frequency-aware deepfake detection, physics-guided MRI reconstruction and segmentation, vision transformers, multimodal foundation models, and interpretable learning systems. I like building models from first principles and turning rough research ideas into reproducible experiments.

Computer Vision & Trustworthy Deep Learning
Deepfake detection, visual robustness, explainability, saliency, and model reliability under distribution shift
Physics-Guided Medical Imaging
Accelerated MRI reconstruction, tumor segmentation, k-space consistency, Fourier priors, and clinically meaningful evaluation
Transformers, VLMs & Multimodal Learning
Vision transformers, cross-attention, image-text alignment, model distillation, and domain-adapted foundation models
Generative Models & 3D Scene Understanding
Diffusion models, GANs, representation learning, semantic 3D reconstruction, Gaussian splatting, and geometry-aware perception
Deep Intelligence Lab, IISER Thiruvananthapuram (2024–Present)
Supervisor: Dr. Alwin Poulose · Frequency-aware deep learning for facial deepfake detection. Designed Fourier & DCT pipelines for spectral feature learning; integrated CNN and Transformer-based modules; evaluated robustness against GAN, diffusion, and edited-image artifacts.
Physics-Guided Dual-Head MRI Learning
Research direction on accelerated MRI: joint image reconstruction and tumor segmentation using k-space data consistency, Fourier positional encoding, attention U-Nets, and ViT-style token fusion across image, mask, and sampling information.
Emerging Research Ideas
Building original hybrid architectures for vision-language reasoning, semantic scene reconstruction, and efficient learning systems where geometry, physics, and representation learning guide each other instead of being treated as isolated modules.
Frequency-Aware Facial Deepfake Detection
Multi-stream framework using RGB, FFT, and DCT representations for detecting GAN-generated, diffusion-generated, edited, and real face images. Built spectral feature pipelines, CNN/ViT fusion modules, ablation studies, Grad-CAM style interpretability, and robustness evaluation.
Physics-Guided MRI Reconstruction and Tumor Segmentation
Dual-head attention U-Net and hybrid ViT direction for accelerated MRI. Simulated Cartesian and radial undersampling; optimized reconstruction with PSNR, SSIM, NMSE, and segmentation with Dice, IoU, Recall, Specificity, and HD95.
Transformer Models and Vision Transformers
Implemented sequence-to-sequence Transformers, attention mechanisms, positional encoding, and Vision Transformers from scratch. Extended these ideas toward token-based image, k-space, mask, and optional text-conditioning frameworks.
Semantic 3D Reconstruction and Scene Understanding
Exploring geometry-aware pipelines that combine visual encoders, point-cloud reasoning, semantic priors, and Gaussian-splatting-style scene representations for interpretable 3D reconstruction from visual data.
Big Data Search and ML Systems
Built Hadoop-oriented search and analytics prototypes using TF-IDF, MapReduce, approximate search, Bloom filters, sketching ideas, and lightweight interfaces for querying retrieved documents and model outputs.
Languages
Python · R · C · Bash · SQL
Frameworks
PyTorch · TensorFlow/Keras · scikit-learn · Hugging Face · Streamlit
Tools
Git · Jupyter · Kaggle · Colab · LaTeX · Overleaf · Hadoop · OpenCV
Methods
Deep Learning · Computer Vision · Medical Imaging · Transformers · Diffusion Models · Explainable AI · Big Data Systems
Best Poster Award (2026)
FS Data Science Conference, IISER Thiruvananthapuram