About Me

I'm Mohit Sai Gutha, recently graduated with a Master's degree in Computer Science at Boston University. My background includes a Bachelor’s in Computer Science and Engineering. I have a passion for data-driven solutions, well designed and neatly implemented impactful software. I am passionate about developing scalable software, solving real-world data challenges, and optimizing systems to drive business impact. My expertise spans backend development, cloud computing, and data engineering, and I love staying at the cutting edge of emerging technologies.

Mohit Sai Gutha

Projects

Project Image

Future of Hiring

This project automates job market analysis using Azure. Azure Functions fetch job listings from the Adzuna API, storing raw data in Data Lake Storage. Databricks (PySpark) processes and structures the data in Delta Lake, while Data Factory orchestrates the ETL pipeline. Power BI dashboards provide insights into hiring trends, salaries, and demand, with Terraform ensuring scalable, secure deployment.

Azure Terraform PowerBI Azure Databricks Data Lake
Project Image

Epidemic Engine

A real-time ETL pipeline designed for health event monitoring and predictive analytics. It ingests over 1 million records using Apache Kafka for streaming and Hadoop MapReduce for batch processing. Docker ensures modular, containerized execution, while Apache Airflow automates workflow orchestration. Spark ML (PySpark) powers predictive models using KMeans clustering, achieving 93.7% accuracy in epidemic outbreak detection, enhancing public health monitoring.

Apache Kafka Apache Airflow Hadoop Spark Docker
Project Image

Google Analytics Simulation

A cloud-native web analytics system deployed on Google Cloud Platform (GCP), serving 10,000 webpages via a Virtual Machine while tracking 100,000+ user requests in Cloud SQL. Google Cloud Dataflow computes PageRank for real-time streaming analytics, optimizing web performance. An ML model trained to predict user demographics based on webpage requests achieved 99.7% accuracy and was deployed on Google Kubernetes Engine (GKE) with Google Deployment Manager for scalable orchestration.

Google Cloud Storage Google Cloud Dataflow Cloud SQL Google Kubernetes Engine Google Deployment Manager
Project Image

Impact of Remodeling and Renovation on Housing Availability in Boston

For the City of Boston Government

A data-driven analysis of how renovation trends, permit approvals, and urban policies influence housing availability in Boston. Using Python, SQL, and Power BI, we identified housing unit losses due to gentrification and high-income influx. The project explores multi-unit to single-family conversions, income-restricted housing trends, and neighborhood shifts.

Python (Pandas, NumPy) SQL Power BI Matplotlib Geospatial Analysis

Experience

Software Development Consultant

Genpact | India | Oct 2022 - June 2023

  • Architected and deployed Java-based Spring Boot microservices, modernizing legacy monolithic architecture for a top healthcare client, integrating with AWS SQS for reduced latency, driving 60% faster transaction processing
  • Collaborated with a team of 12 on migrating legacy mainframe systems to AWS, leveraging AWS DMS, Amazon Aurora, and EC2 Spot Instances, achieving a 60% reduction in compute costs
  • AAutomated unit and integration testing with JUnit, Selenium, Jira, and Cucumber to enhance code reliability and reduce defects
  • Built and optimized RESTful APIs, integrating with enterprise systems to support customer data management
  • Utilized CI/CD pipelines with Jenkins and GitHub Actions to automate build, test, and deployment processes, reducing deployment time and improving release cycles
  • Worked in an Agile Scrum environment, participating in daily stand-ups, sprint planning, and retrospectives to ensure iterative development and timely delivery

Intern

Johnson Controls Inc | India | Jan 2022 - July 2022

  • Executed transition from legacy WAN to SD-WAN across all Bengaluru locations, for a company-wide network restructuring project boosting cost savings by 65% and operational efficiency by 50%
  • Designed a React based single page application with a Power BI dashboard to visualize the company's network infrastructure transition, integrating real-time analytics on progress
  • Improved overall project efficiency by 20% through data analysis and visualization
  • Led a team of 6 interns in the Future Leaders Internship Program to develop innovative alternatives for public green grants and loans, reaching the finals of the Sustainable Innovation Competition

Skills

  • Programming Languages

    Java, Python (Pandas, PyTorch, NumPy), C++, SQL, Git

  • Languages

    English (Fluent), Spanish (Beginner)

Resume

Click the buttons below to view my Data Science or Software Development resume:

Contact Me

Feel free to get in touch with me for collaborations or opportunities!