Computer Vision Engineer @ OpenCV University
Writing exciting blog posts on AI & Computer Vision, covering Diffusion Models, Vision-Language Models, and JEPA models, as well as fixing bugs and creating Python notebooks for courses on Transformers and VLMs.
- Graduated from IIIT Jabalpur (2023) in Mechanical Engineering.
- Worked as a Research Assistant at IIT Hyderabad, where I explored Knowledge Distillation across Vision, Language, and Audio modalities.
- Now working my way through generative AI & multimodal learning while building cool projects and sharing knowledge.
- Writing blog posts on Computer Vision topics while creating and managing courses on applications of Transformers and Vision-Language Models.
- Experimenting with Diffusion + Transformers for real-world applications.
- Building a solid understanding of VLMs (Vision-Language Models) and their practical uses.
- Languages: Python
- Deep Learning: PyTorch, TensorFlow
- Computer Vision: OpenCV, YOLO, ViT, Hugging Face
- Other Interests: Generative AI, Multimodal ML, MLOps
✨ “Always learning, always experimenting because AI is just getting started.”

