About me

Are you looking for a Data Engineer who bridges deep technical skill with real-world business value? Let’s connect!

I’m a results-driven Data Engineer with a Master’s in Computer Science from Syracuse University (May 2024) and over 3 years of experience building scalable data pipelines, cloud-native platforms, and real-time analytics systems. From Morgan Stanley to Intuit, I’ve architected data solutions that power smarter decisions across finance, retail, and healthcare domains.

My technical toolkit includes Python, SQL, PySpark, and tools like Apache Airflow, Kafka, and dbt for building robust, modular ETL pipelines. I’m proficient with AWS and Azure cloud platforms, leveraging services like S3, Glue, Redshift, Lambda, and Databricks to process petabyte-scale data and ensure high availability and performance.

Whether it’s migrating critical workloads from Hadoop to Snowflake, optimizing real-time ingestion pipelines for tick data, or developing interactive dashboards using Power BI and Tableau, I’m passionate about building systems that are not only efficient but also drive business outcomes.

I thrive in fast-paced, cross-functional environments where collaboration meets innovation. My ability to break down complex data into actionable insights has helped optimize workflows, improve customer experience, and drive smarter business decisions across finance, tech, retail, and healthcare domains.

Outside of work, I’m constantly exploring the latest trends in AI and cloud technologies, while nurturing a lifelong love for reading and music. This balance fuels my creativity and helps me think outside the box.

Let’s connect! I’d love to help your team turn complex data into actionable systems and insights.

Technical Skills

  • Prog icon

    Programming Languages & Tools

    Python, R, SQL, Docker, Kubernetes, CI/CD Pipelines, Git, Jenkins, Agile, Scrum, JIRA, OpenAI

  • software icon

    Data Engineering & Cloud

    AWS (EC2, S3, Glue, Athena, Redshift), Azure Blob Storage, Azure SQL Database, Hadoop, Hive, MapReduce, Apache Airflow, Data Warehousing, ETL, Data Integration, CloudWatch, Snowflake, Redshift, Data Lakes

  • machine icon

    Data Science

    Statistical Modeling, Predictive Modeling, Time Series Analysis, Hypothesis Testing, ANOVA, Regression Analysis, Exploratory Data Analysis, KPI Reporting, A/B Testing

  • mobile app icon

    Machine Learning & AI

    Supervised Learning, Unsupervised Learning, Recommender Systems, Anomaly Detection, Computer Vision, Natural Language Processing (NLP), Deep Learning, TensorFlow, Keras, PyTorch, SciPy, Large Language Models (LLM)

  • software icon

    Data Visualization & Analytics

    NumPy, Pandas, Matplotlib, Seaborn, Tableau, Looker, Power BI, DAX

  • software icon

    Database Technologies

    MySQL, SQL Server, Oracle, MongoDB, Database Design, Query Optimization, Data Migration, NoSQL Databases, Relational Databases, Data Modeling, Database Administration, Data Governance

Testimonials

  • Mahek Sota

    Mahek Sota

    I had the privilege of collaborating with Rupa on numerous projects during our time at college, spanning Machine Learning and Human-Machine Interaction, and beyond. Rupa consistently brought forth unwavering dedication and effort, contributing wholeheartedly to the success of our team endeavours. Rupa's novel approach and quick thinking on her feet were instrumental in our project's success. Her ability to think outside the box and come up with innovative solutions to complex problems significantly elevated our team's performance. Additionally, her collaborative skills fostered an environment of open communication and idea exchange, allowing us to work together seamlessly towards our goals. Notably, Rupa possesses outstanding time management skills, always punctual and reliable. Her ability to maintain composure and foster productivity during high-pressure situations is truly commendable, making her an invaluable asset to any team. In my opinion, any team would be fortunate to have Rupa onboard. Her combination of talent, work ethic, quick thinking, and collaborative spirit makes her a standout individual, and I wholeheartedly recommend her to any organization seeking new talent.

  • Mahek Chhabria

    Mahek Chhabria

    Rupa and I had worked on a number of projects in our 2nd and 3rd year of Engineering. We had worked on projects in the subjects of Python, Data Structures, Data Warehouse Management, Web Development Lab, Database Management System, Mini project (based on AI). Working with Rupa on so many projects was a great learning experience. We used to do all the work equally. We had a perfect team of two people where there was an understanding between each other, we used to hear each other's opinion and choose the one that was best for our project. She was surely great as a team member, used to brainstorm and come up with ideas that would help us to make our project more interactive and interesting.

  • Mahek Khathurani

    Mahek Khathurani

    I had the opportunity of working with Rupa on several projects based on Artificial intelligence, Machine Learning, Human Machine Interaction and many more in college. She gave 100 percent effort consistently to the team. She is punctual having excellent time management skills and has a knack for keeping everyone calm and productive during intense crunch periods. Any team would be lucky to have Rupa, and I couldn't recommend her more for any business looking for new talent.

  • Shivani Mokashi

    Shivani Mokashi

    Rupa is an extremely hard-working, diligent individual. She is driven by the passion to make things work. In my tenure as Editorial Director of Rotaract club of Deonar 2021-22, I have mentored her for 4-5 Projects under my Avenue. She has Chaired amazing projects which have had a good impact and boosted learning of the members. She plans her work in an amazing way and has a solution based approach towards things.

  • Simran Tawar

    Simran Tawar

    I've had the opportunity to collaborate with Rupa on multiple projects at Syracuse University, where she demonstrated exceptional skills in Machine Learning, Android Development, Database Management, Social Media and Data Mining, and Java programming. Rupa is a talented and cooperative team player with a strong dedication to her work. I highly recommend Rupa for positions that require technical expertise in software development and problem-solving abilities.

Resume

Education

  1. Syracuse University College of Engineering and Computer Science - Master of Science in Computer Science

    Aug 2022 — May 2024

    Cumulative GPA: 3.6/4.0
    Design Analysis of Algorithms, Machine Learning, Data Science, Operating Systems, Object Oriented Design, DBMS, Cryptography , Android
    Achievement: Awarded 30% merit scholarship for outstanding academic performance at Syracuse University.

  2. University of Mumbai - Bachelor of Engineering in Computer Engineering

    Aug 2018 — May 2022

    Cumulative GPA: 9.1/10.0
    Data Structures, Artificial Intelligence, Web Development, User Interaction, Data Warehouse & Mining, Computer Networks
    Achievement: Academic proficiency by earning a perfect 10 pointer in semesters 5, 6, and 7 among a batch of 165 students.

Experience

  1. Morgan Stanley | Data Engineer

    July 2024 — Present

    • Engineered a cloud-native data ingestion framework using Apache Kafka, AWS Glue, S3, and Spark, unifying real-time and batch data sources across teams, improving data availability by 30%, and enabling scalable ingestion for tick data and regulatory inputs.
    • Automated processing of 0.5 TB+ financial data using Python, Spark, SQL and Redshift; integrated Delta Lake on AWS S3 with ACID compliance and time-travel features, improving data reliability, pipeline throughput, and report delivery times.
    • Built modular, metadata-driven ETL pipelines with Apache Airflow (DAGs), reducing daily job failures by 30% and cutting failure recovery times by 60%, while ensuring regulatory compliance (GDPR, CCPA).
    • Executed a petabyte-scale migration from Hadoop to Snowflake, moving 1.2 TB+ of tick-level trade data via AWS S3, increasing analytical throughput by 25% and cutting on maintenance costs by 30%.
    • Standardized 40+ transformation models using dbt, implementing testable, reusable SQL patterns across 10+ financial datasets, reducing redundancy and improving data reliability across financial analytics workflow.

  2. Intuit | Business Intelligence Intern

    Jan 2024 - Jun 2024

    • Engineered end-to-end data pipelines leveraging Azure Databricks (Python, Pandas) and SQL to optimize inventory integration and management for 50,000+ SKUs across multiple warehouses, reducing stockouts by 18% and lowering inventory holding costs by 12%.
    • Automated complex transportation route optimization by designing scalable data pipelines with Azure Data Factory and Azure Functions, improving delivery efficiency by 17% and minimizing bottlenecks in high-demand distribution centers.
    • Created interactive Power BI dashboards integrated with SQL Server Analysis Services (SSAS) and Delta Lake, enabling real-time visualization of operational and financial KPIs, increasing actionable insights by 22% and enhancing decision-making across teams.

  3. Infinite Infolab | Senior Data Engineer

    May 2021 - Jul 2022

    • Architected and implemented a cloud-native AWS data platform (S3, Redshift, Glue, Lambda) reducing storage costs by 20% and boosting query performance by 25%, enabling faster and more cost-effective financial mortgage data analysis.
    • Automated data quality monitoring and event-driven workflows with AWS Lambda, integrating OLAP cube processing cutting manual data validation efforts by 1,000+ man-hours annually and reducing data quality issues by 30%.
    • Built interactive dashboards in AWS QuickSight and Tableau, in driving a 20% increase in data-driven decisions by visualizing key portfolio KPIs and performance metrics.

  4. Vivma Software Inc | Data Engineer

    Feb 2020 - Apr 2021

    • Streamlined ETL workflows using Python, SQL, and SSIS, improving data accuracy by 20% and reducing manual efforts by 30%.
    • Built Power BI dashboards and Excel tools integrating RDBMS data, cutting decision time from 2 days to 12 hours and boosting inventory turnover 1.5x.
    • Implemented CI/CD pipelines with Jenkins and Git, cutting deployment errors by 40% and speeding releases by 25%, while monitoring KPIs to reduce production bottlenecks by 30%.

My skills

  • Data Engineering
    90%
  • Data Science
    80%
  • Data Analytics
    90%
  • Database Management Systems
    85%
  • Design & Analysis of Algorithms
    85%
  • Machine Learning
    80%
  • Web Development
    90%

Rupa Bhatia Resume

My Projects

Contact

Contact Form