MSc HPC with Data Science at Edinburgh. Previously built ML research pipeline systems at ISRO on real meteorological data, real users, real constraints.
Most of my work lives somewhere between ML and systems: training pipelines that actually run fast, parallel code that doesn't embarrass itself on a supercomputer, APIs that hold up under load. I care about the stuff underneath the model — memory, scheduling, latency.
Looking for graduate roles in HPC, Data Engineering, and MLOps in the UK. UK Graduate Route Visa from August 2026.
Languages
ML / AI
HPC
Infra & Data
Optimized-Molecular-Dynamics-Simulation — N-body MD simulation optimised from 872s → 27.4s on AMD EPYC 7742 (Zen 2). Cache-aware programming, loop fusion, and compiler tuning. No parallelism — pure single-core performance engineering. C, HPC.
HPC-MPI-Prey-Predator-simulation — High-performance parallel predator-prey simulation using a custom MPI-based Actor Framework. Distributed grid decomposition, real-time telemetry, WebGL topology visualiser for rank partitioning. C, MPI, C++.
hpx-matmul-demo — Matrix multiplication benchmarks using HPX, a C++ standard library for concurrency and parallelism. Demonstrates task-based parallelism and async execution for HPC workloads. C++.
hpc-sandpile-model — Parallel Abelian Sandpile simulation in C. 2D domain decomposition on ARCHER2, non-blocking halo exchange, 85% parallel efficiency at 64 cores. MPI + OpenMP.
c-dynamic-affinity-scheduler — Custom OpenMP scheduler in C. Dynamic affinity scheduling to minimise lock contention in HPC workloads.
ScholarLens — Search and analysis tool for academic literature. Semantic search over large paper corpora, LLM-powered trend extraction, topic evolution over time. Python, Docker, CI pipeline.
Timeline-Explorer — Takes academic PDFs and turns them into structured knowledge timelines. NER, hybrid semantic search, automated metadata extraction.
LLM Inference Server — REST API for transformer inference. KV-cache, SSE streaming, containerised. FastAPI + PyTorch + Docker.
AutomationB-Showcase — EtsyBot: fully autonomous pipeline that researches trending niches via Google Trends, generates AI art, writes SEO-optimised listings, and publishes to Etsy via v3 API. 5 stages, 3 AI integrations, 0 manual steps. Python, Gemini, Leonardo.ai, Docker. [Live Demo]
Career — Daily career intelligence dashboard: GitHub profile analysis, live job market matching, and skill gap tracking powered by Gemini Flash. Python, Next.js, GitHub API.
Sentinel-Scan — Security scanning tool. Automated vulnerability and misconfiguration detection across network targets. Python.
Distributed ETL Pipeline — Ingests 1 GB financial datasets into a hybrid SQL/NoSQL schema. Redis caching layer, 10 ms query latency. Python, PostgreSQL, MongoDB, Redis.
ISRO National Remote Sensing Centre — ML Engineer Intern (Oct 2023 – Jan 2024) Built a ConvLSTM lightning prediction system end-to-end in PyTorch. 92% accuracy on live data, deployed to active meteorologist workflows. Cut training time 35% by optimising on 500 GB WRF datasets with VAE. Also built the React dashboard they used to monitor it.
Clustor Computing — Data Science Intern (Jun 2023 – Sep 2023) Stock price models, 94% directional accuracy. Data pipelines and Tableau dashboards for investment teams.
Develoscope Software Solutions — Software Engineer Intern (Jun 2023 – Sep 2023) Four Java web apps shipped from scratch to production. JSP, Tomcat.
| Certification | Issuer | Valid |
|---|---|---|
| Azure DevOps Engineer Expert | Microsoft | Jun 2026 |
| Azure Developer Associate | Microsoft | Jun 2026 |
| Fabric Analytics Engineer Associate | Microsoft | Jun 2026 |
| OCI Generative AI Professional | Oracle | Jul 2026 |
| Google Cybersecurity Specialization | Google / Coursera | — |
| McKinsey Forward Program | McKinsey & Company | — |


