Summary
Overview
Work History
Education
Skills
Personal Information
Websites
Certification
Timeline
Generic

Suraj Mhetre

Naperville

Summary

Results-driven Data Engineer delivering cloud solutions using GCP and AWS, Designing scalable ETL pipelines, data visualization and modeling. I enjoy problem-solving, collaboration, and adaptability to leverage technical skills in developing innovative data solutions across diverse environments.Proven ability to lead cross‑functional teams, automate workflows, and deliver 99.9% reliable data products.

Overview

10
10
years of professional experience
1
1
Certification

Work History

Data Engineer

Egen Solutions
Naperville
11.2020 - Current
  • Partnered with clients, product managers, UX designers, and cross-functional engineers to capture software requirements and design approaches.
  • Define project architecture and specifications, mentor engineers through complex troubleshooting, and govern software quality, testing, and release cycles.
  • Orchestrated and maintained 40+ automated ETL/ELT pipelines, ingesting over 3 TB of daily clinical data into AWS and GCP environments, improving data availability SLA from 48 hours to 12 hours.
  • Implemented a dbt-based data analytics framework.
  • Led end-to-end high-quality, real-world data deliveries to pharma and research partners.
  • Defined sprint scope, wrote Epics and Stories, and performed code reviews; mentored junior engineers.
  • Delivered on-call support for production environments, diagnosing and resolving incidents to maintain high system uptime.

Software Engineer Intern

CodersData
Remote
07.2020 - 11.2020
  • Designed REST APIs using Python and a responsive UI using JavaScript and React.js.
  • Added unit and integration tests (pytest, Jest), achieving 95% code coverage.

Machine Learning Intern

Truevim Inc
Lewisville
08.2019 - 11.2019
  • Created an Airflow pipeline to ingest and transform over 1 million real estate records into PostgreSQL.
  • Designed the pricing model for predicting home prices within ±5%.
  • Exposed the model via REST API to the iOS app, enabling near real-time valuation.

Student Engineer

The University of Texas at Dallas
Dallas
09.2018 - 08.2019
  • I built a classification model for IT support tickets with 94% accuracy, cutting resolution time by 80%.
  • Enhanced help-desk web app (Angular, C# .NET), adding alerts, and queue monitoring.

Software Engineer

Capgemini
Mumbai
06.2015 - 07.2018
  • Delivered web modules (ASP.NET MVC, AngularJS) for Fortune 500 clients; won 'Project Star.'
  • Authored complex SQL Server stored procedures, improving query performance by 40%.
  • Automated troubleshooting tasks with shell scripts save 10 or more engineer hours per week.
  • Delivered on-call production support, diagnosing and resolving incidents to maintain high system uptime.

Education

Master of Science - Information Technology Management

University of Texas At Dallas
Dallas, TX
05-2020

Bachelor of Science - Electronics Engineering

University of Mumbai
Mumbai, IN
05-2015

Skills

  • Typescript
  • Python
  • SQL
  • C#, JavaScript, Nodejs, Nestjs and React
  • Amazon Web Services - Redshift, S3, EC2, EMR, Lambda, Step Functions / State Machines, CloudWatch, CodeBuild, Simple Notification Service (SNS), Simple Queue Service (SQS)
  • Google Cloud Platform - BigQuery, Cloud Storage, Batch, Nextflow, Logs Explorer, APIs and Services, Compute Engine, Pub/Sub, Cloud Composer, Dataproc, Healthcare API, Life Sciences API, Artifact Registry
  • Docker
  • Kubernetes
  • Apache Airflow
  • Dbt
  • PostgreSQL
  • Tableau
  • Git
  • Jira
  • Terraform
  • Concourse
  • CI/CD
  • Unit Testing
  • Integration Testing
  • Data pipeline orchestration
  • Data modeling
  • Data analysis
  • Big data processing

Personal Information

Title: Software Engineer | Data Engineer

Certification

Google Cloud Professional Data Engineer, 12/20, 31369084, https://www.credential.net/24a66c5f-50a8-4b61-be84-cd18a694ce1f#acc.ALV7ai60

Timeline

Data Engineer

Egen Solutions
11.2020 - Current

Software Engineer Intern

CodersData
07.2020 - 11.2020

Machine Learning Intern

Truevim Inc
08.2019 - 11.2019

Student Engineer

The University of Texas at Dallas
09.2018 - 08.2019

Software Engineer

Capgemini
06.2015 - 07.2018

Master of Science - Information Technology Management

University of Texas At Dallas

Bachelor of Science - Electronics Engineering

University of Mumbai
Suraj Mhetre