Pushpneet Singh

Senior Software Engineer specializing in Data Engineering, Scalable Pipelines, and Cloud Architecture.

Experience

Senior Software Engineer @ Teikametrics
Aug 2025 - Present

Boston, US (Remote)

  • Optimized compute costs by refactoring ETL pipelines supporting AI feature-engineering workflows.
  • Migrated legacy bid-strategy optimization jobs for Ads using industry best practices, improving overall efficiency by ~40%.
  • Built a standardized data-consumption layer within a medallion architecture to unify access for AI and BI teams.
  • Leveraged Databricks Unity Catalog for centralized data governance and eliminated reliance on DBFS-backed tables.
SDE-2 Data @ Cars24
Jun 2024 - Aug 2025

Gurugram

  • Designed and developed an in-house no-code middleware tool to configure and automate ETL from Emails, APIs, SAP, PubSub, and Databases.
  • Implemented SAP S/4HANA accounting integration covering 97% of accounting volume for real-time financial reporting.
  • Integrated third-party APIs to automatically generate and trigger Airflow DAGs.
  • Utilized Airflow, Spanner, GCS, GKS, FastAPI, Temporal, Redis, and Snowflake.
  • Automated tasks with finance team saving 500 hrs/month.
Data Engineer Consultant @ Sanius Health
Jan 2024 - Jul 2024

Contract (6 months)

  • Wrote Python scripts from scratch to ingest data from different wearable sources and devices to Data Warehouse which resulted in overall time reduction from 24hrs to 2hrs.
  • Utilized Apache Kafka, Apache Druid & Apache Superset to get customized health reports in real time.
  • Built trigger based data ingestion pipeline using Azure Function to ingest data from health devices.
Data Engineer @ Tata Consultancy Services
Aug 2021 - Feb 2024

Delhi

  • Migrated complex on-premise contact center solution (100+ tables) to Cloud Environment for Commonwealth Bank of Australia.
  • Developed robust ETL pipelines processing data from 5000+ agents, 6M+ inbound calls, and 1.5M+ digital chats monthly.
  • Automated stream job monitoring reducing manual effort by 80%.
  • Optimized deployed pipelines to improve daily processing time.

Skills

Data & Cloud

Spark Kafka Airflow Hadoop Superset Snowflake Hive AWS Glue Redshift S3 GCP Databricks Terraform

Development & Tools

Python SQL C++ Docker FastAPI Git Unix Redis Kubernetes MLflow

Projects

Know Your Personality

Predicting personality by reducing 50+ trait questions using Factor Analysis & clustering algorithms. Achieved 81% accuracy.

Stack: Python, REST, Flask, ML

State Fiscal Data Explorer

Automated ETL pipeline to scrape, ingest, transform & orchestrate public financial data of states into a DWH for reporting.

Stack: Airflow, Selenium

Wearable Data Insights

Analyzed student wearable data (EDA, heart rate, temperature, etc.) to correlate exam performance with physiological metrics.

Stack: Python Libraries

Neural Style Transfer

Used pre-trained VGG-19 CNN to impose artistic styles onto content images.

Stack: Python, VGG-19 CNN

Connect With Me