Experience
Senior Software Engineer
@ Teikametrics
Aug 2025 - Present
Boston, US (Remote)
- Optimized compute costs by refactoring ETL pipelines supporting AI feature-engineering workflows.
- Migrated legacy bid-strategy optimization jobs for Ads using industry best practices, improving overall efficiency by ~40%.
- Built a standardized data-consumption layer within a medallion architecture to unify access for AI and BI teams.
- Leveraged Databricks Unity Catalog for centralized data governance and eliminated reliance on DBFS-backed tables.
SDE-2 Data
@ Cars24
Jun 2024 - Aug 2025
Gurugram
- Designed and developed an in-house no-code middleware tool to configure and automate ETL from Emails, APIs, SAP, PubSub, and Databases.
- Implemented SAP S/4HANA accounting integration covering 97% of accounting volume for real-time financial reporting.
- Integrated third-party APIs to automatically generate and trigger Airflow DAGs.
- Utilized Airflow, Spanner, GCS, GKS, FastAPI, Temporal, Redis, and Snowflake.
- Automated tasks with finance team saving 500 hrs/month.
Data Engineer Consultant
@ Sanius Health
Jan 2024 - Jul 2024
Contract (6 months)
- Wrote Python scripts from scratch to ingest data from different wearable sources and devices to Data Warehouse which resulted in overall time reduction from 24hrs to 2hrs.
- Utilized Apache Kafka, Apache Druid & Apache Superset to get customized health reports in real time.
- Built trigger based data ingestion pipeline using Azure Function to ingest data from health devices.
Data Engineer
@ Tata Consultancy Services
Aug 2021 - Feb 2024
Delhi
- Migrated complex on-premise contact center solution (100+ tables) to Cloud Environment for Commonwealth Bank of Australia.
- Developed robust ETL pipelines processing data from 5000+ agents, 6M+ inbound calls, and 1.5M+ digital chats monthly.
- Automated stream job monitoring reducing manual effort by 80%.
- Optimized deployed pipelines to improve daily processing time.
Skills
Data & Cloud
Spark
Kafka
Airflow
Hadoop
Superset
Snowflake
Hive
AWS Glue
Redshift
S3
GCP
Databricks
Terraform
Development & Tools
Python
SQL
C++
Docker
FastAPI
Git
Unix
Redis
Kubernetes
MLflow
Projects
Know Your Personality
Predicting personality by reducing 50+ trait questions using Factor Analysis & clustering algorithms. Achieved 81% accuracy.
Stack: Python, REST, Flask, ML
State Fiscal Data Explorer
Automated ETL pipeline to scrape, ingest, transform & orchestrate public financial data of states into a DWH for reporting.
Stack: Airflow, Selenium
Wearable Data Insights
Analyzed student wearable data (EDA, heart rate, temperature, etc.) to correlate exam performance with physiological metrics.
Stack: Python Libraries
Neural Style Transfer
Used pre-trained VGG-19 CNN to impose artistic styles onto content images.
Stack: Python, VGG-19 CNN