Hi, I am Puja Ghimire

Welcome to my personal profile! I am passionate about big-data technology, and my interest in technology began with my first computer course in grade 7. I strive to bring unique perspectives and ideas to every project I undertake. Through my work, I aim to make a positive impact and inspire others. Get to know me better by exploring my portfolio and projects.

Professional History
Amazon

Software Development Engineer | Seattle, WA, USA

September 2022 - December 2023

  • Launched a feature to support the Amazon Bulk Services program, which could potentially save Amazon Business 161MM every year.

  • Designed and launched a feature that handles large payloads shared between services over AWS SQS, which has a hard limit of 256KB, by using an S3 bucket.

  • Reduced team costs by 20% by leveraging an internal tool and optimizing the provisioning and utilization of AWS resources.

  • Added automated integration tests to our services pipeline to catch bugs within 90 minutes of deploying code to production, with automated rollback on failure, to maintain high code quality.

  • Created, updated, and maintained a DynamoDB data store of 3MM products that serves as the single source for Amazon fulfillment-related data, accessed by internal customers worldwide and providing the needed data within SLA.

  • Created an AWS SQS queue subscribed to a cross-functional team's AWS SNS topic to receive their data and onboard new identifiers into the database store.

  • Constructed monitors and alarms using Amazon Carnaval, and logged processes and maintained dashboards using AWS CloudWatch, to observe the health of services.

  • Used Python to analyze data arriving at our services from various sources at 40,000 TPS to populate business metrics, and automated graph creation in Monitor Portal for Weekly Business Reviews, which helped identify future projects.

  • Won an internal hackathon by creating a service that helps teams set alarm thresholds for each of their services by analyzing historical data with the Isolation Forest technique.

  • Estimated the following year's budget for our team’s services with 95% accuracy by analyzing the previous year’s capacity using Python.

Impact

Data Science Intern | Bengaluru, KA, India

February 2018 - June 2018

  • Used machine learning algorithms to discover the top 10 KPIs and KPCs that uplifted revenue for a client

  • Documented client requirements in client meetings for team reference

  • Measured similarity between products using cosine similarity in R to identify similar products

Impact

Data Scientist | Bengaluru, KA, India

August 2018 - July 2020

  • Recommended promotions for ~2,400 items for a retail client by analyzing historical data and applying machine learning algorithms to seasonal data, which helped the client generate 12% more revenue than the previous year

  • Implemented a cannibalization analysis in R to identify cannibalized products, which increased sales by 10%

  • Performed competitor analysis by web scraping competitor data and analyzing it with R and SQL, which helped generate 5% more revenue

  • Cleaned client data and performed exploratory data analysis on a marketing campaign using the Python libraries NumPy and pandas to measure its efficiency

  • Derived insights using R and SQL to identify potentially fraudulent transactions for a client, which saved ~3MM every year

  • Developed an allocation tool that let the client allocate multiple products at a time, which saved ~5 hours of manual work

  • Used Apache Airflow to extract, transform, and load weekly transaction data into an internal database for sales forecasting

  • Used Tableau to create a visualization dashboard for weekly meetings showing YoY and MoM improvement in sales for a retail client

Northeastern University

Graduate Teaching Assistant | Boston, MA, USA

January 2022 - April 2022

  • Collaborated with faculty to deliver lectures and provide clarifications on course material for Data Management and Database Design

  • Evaluated student assignments, exams, and projects, and provided constructive feedback to help students improve their performance in the course

Boston, MA, United States

January 2021 - August 2022

Northeastern University

Master's in Information Systems

  • Application Engineering Development

  • Data Management and Database Design

  • Web Design and User Experience Engineering

  • Web Development Methods and Tools

  • User Experience Design and Testing

  • Design Patterns

  • Agile Methodologies

GPA: 3.92/4

Bengaluru, KA, India

August 2014 - June 2018

Ramaiah Institute University

Bachelor of Engineering in Information Science

  • Data Structures

  • Natural Language Processing

  • Data Science

  • Data Mining

  • Distributed Systems

  • Operating Systems

  • Java & J2EE

  • Finite Automata and Formal Languages

GPA: 8.72/10

Academic Experience

Technologies I have worked with

  • Python

  • Java

  • Databricks

  • PowerPoint

  • PySpark

  • R

  • AWS

  • SQL

  • Apache Airflow

  • Scala

  • Tableau

  • Excel

  • MERN stack

Want to know more about me?

  • Facebook
  • Twitter
  • LinkedIn

©2022 by pujaghimire. Proudly created with Wix.com
