Skip to content
View yranjan06's full-sized avatar
🇮🇳
WFH
🇮🇳
WFH

Highlights

  • Pro

Block or report yranjan06

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yranjan06/README.md

I'm a Data Engineer building Scalable Pipelines and Cloud-Native Architectures that turn complex data into Meaningful Insights. I work with Python, Spark, Kafka, and cloud platforms like Azure and GCP to design systems that handle large-scale data processing efficiently. Beyond core engineering, I'm deeply into Mathematics, MLOps, Deep Learning, and Optimization Problems — always experimenting with new technologies, contributing to open source, and collaborating on projects that push boundaries of what's possible with data.

Ranjan's Logo Typing SVG
GitHub Stars

GitHub Commits

Pinned Loading

  1. mini_kafka mini_kafka Public

    A from scratch Python implementation of Apache Kafka concepts including producers, brokers, topics, consumers, and offset management, built to learn distributed messaging without external dependenc…

    Python 23 1

  2. airflow airflow Public

    Forked from apache/airflow

    My fork of Apache Airflow — working on documentation and RBAC fixes.

    Python 1

  3. beam beam Public

    Forked from apache/beam

    Apache Beam is a unified programming model for Batch and Streaming data processing.

    Java

  4. datafusion datafusion Public

    Forked from apache/datafusion

    Apache DataFusion SQL Query Engine

    Rust

  5. apache/airflow apache/airflow Public

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

    Python 43k 15.9k

  6. airbyte airbyte Public

    Forked from airbytehq/airbyte

    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

    Python