Miaosha is a flash-sale (product-selling) website that can handle traffic from a million users. My backend system can process 20,000 QPS with 8 t3.nano instances and maintain 300,000 WebSocket connections with 6 t3a.small instances.
- Website Link: https://miaosha.click/
- Explanation Video: https://drive.google.com/file/d/1Y3m75dhT6n5ikO_NZSxNYeaZeiIZEPPO/view?usp=sharing
- email: [email protected]
- password: test1234
- Main Features
- Backend Technique
- System design
- Architecture
- Workflow
- Database Schema
- Demo
- How to prevent overselling
- Send email asynchronously
- How to prevent robot attack
- Asynchronously process requests
- How to ensure the stability of other APIs
- Queuing System
- Turn on EC2 instances and ElastiCache
- Continuous Deployment
- Load Test
- How to start my project
- Future Features
- Actively informed users with WebSocket instead of short polling to decrease API requests
- Improved API efficiency by processing asynchronously with RabbitMQ
- Achieved high concurrency with distributed system including Publisher, Consumer, MySQL read replica, and Redis cluster
- Routed high-traffic APIs and general APIs to different EC2 target groups with Elastic Load Balancer to ensure the stability of other APIs
- Prevented overselling with atomic operations in Redis and short-TTL JWTs
- Sent emails asynchronously with SQS and Lambda
- Built a complete queuing system with RabbitMQ (dead letter exchange)
- Used EventBridge to schedule Lambda to start EC2 before each event
- Prevented malicious attacks using Nginx’s rate limiter
- Applied CloudFront as CDN to reduce bandwidth loading and latency
- Packaged Miaosha system in Docker Compose as development environment, including Node.js, MySQL, Redis cluster, RabbitMQ, and phpMyAdmin
- Continuously deployed with GitHub Actions and Docker Hub
- Performed unit and integration tests with Jest and Supertest
- Node.js/Express
- WebSocket (Socket.IO)
- PM2
- EC2
- Elastic Load Balancer
- Auto Scaling
- Nginx
- Lambda
- EventBridge
- RDS (MySQL) with read replica
- phpMyAdmin
- ElastiCache (Redis cluster)
- RabbitMQ
- Docker
- Docker Compose
- GitHub Actions
- Docker Hub
- CloudFront
- S3
- Route53
- JWT
- K6
- Unit test: Jest, Supertest
My system design principle is to filter traffic layer by layer. The filter consists of six layers.
- CDN:
- Deploy static files (html, css, js) to CloudFront
- Ask users to answer a question
- Load Balancer:
- Route different APIs to different target groups and multiple instances
- Web server:
- Prevent malicious attacks with Nginx's rate limiter
- Application server:
- Use WebSocket instead of short polling to decrease API requests
- Process requests asynchronously to shorten processing time
- Redis:
- Store stock in cache to decrease read/write latency and increase concurrency
- Use cache to decrease DB loading
- There are three kinds of target groups (instances):
- Publisher: Responsible for the miaosha API and for notifying users of the start of the selling event via Socket.IO
- Consumer: Checks whether users successfully got the chance to buy and informs them of the results
- General: Processes general APIs (login, checkout, ...)
- The publisher, consumer, MySQL read replica, and Redis cluster are horizontally scalable
- The Application Load Balancer routes different APIs to different target groups
- Use WebSocket to actively inform users
- The user has to answer a question related to the product correctly
- Set up a rate limiter in Nginx to prevent malicious attacks
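A minimal sketch of such an Nginx rate limiter. The zone name, rate, burst values, and upstream address below are assumptions for illustration, not the project's actual config:

```nginx
# Limit each client IP to 10 requests/second, allowing a burst of 20.
# Zone name, rate, burst, and upstream are placeholders.
limit_req_zone $binary_remote_addr zone=miaosha_limit:10m rate=10r/s;

server {
    listen 80;
    location /api/ {
        limit_req zone=miaosha_limit burst=20 nodelay;
        proxy_pass http://127.0.0.1:3000;
    }
}
```

Requests beyond the burst are rejected with HTTP 503 (or a status set via `limit_req_status`), so a scripted attacker is throttled before reaching the application servers.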
Tools: Redis, JWT
- Use atomic operations in Redis to prevent race conditions
- When a user gets the chance to buy, the backend system gives that user a JWT with a short expiration time. The user must submit the JWT for verification at checkout. If the user doesn't check out within the 10-minute time limit, the JWT expires and checkout fails, so the stock is released and can be bought by another user.
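A minimal sketch of the atomic check-and-decrement idea, assuming the stock counter lives at a Redis key such as `stock:<productId>` (a placeholder name). In production the Lua script would run via `EVAL`, which Redis executes atomically, so two concurrent buyers can never both take the last item; the in-memory function mirrors its semantics for illustration:

```javascript
// Lua script for Redis EVAL: decrement only if stock is still positive.
// Redis runs the whole script atomically, preventing the race condition.
const DECR_IF_POSITIVE_LUA = `
local stock = tonumber(redis.call('GET', KEYS[1]) or '0')
if stock > 0 then
  return redis.call('DECR', KEYS[1])
end
return -1
`;

// In-memory stand-in mirroring the Lua script's semantics, so the logic
// can be demonstrated without a running Redis server.
function tryBuy(store, key) {
  const stock = store[key] || 0;
  if (stock > 0) {
    store[key] = stock - 1;
    return store[key]; // remaining stock
  }
  return -1; // sold out: reject this buyer
}

const store = { 'stock:42': 2 };
console.log(tryBuy(store, 'stock:42')); // 1
console.log(tryBuy(store, 'stock:42')); // 0
console.log(tryBuy(store, 'stock:42')); // -1 (sold out)
```

A buyer who gets a non-negative result would then receive the short-TTL JWT described above; a `-1` result is rejected immediately.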
- Check only the user's answer and timing in the publisher target group
- Send the user ID to RabbitMQ
- The consumer target group checks whether the user gets the chance to buy the product
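The publisher→queue→consumer handoff can be sketched as below. This is an in-process stand-in for illustration only: in the real system the publisher would call amqplib's `channel.sendToQueue` and the consumer `channel.consume`, and the function and queue names here are assumptions.

```javascript
// The publisher only validates the answer and timing, pushes the user id
// onto a queue (RabbitMQ in production), and responds immediately; the
// consumer later decides who actually gets to buy.
const queue = []; // stands in for a RabbitMQ queue

function publish(userId, answerCorrect, inTimeWindow) {
  if (!answerCorrect || !inTimeWindow) return 'rejected';
  queue.push(userId); // channel.sendToQueue(...) in production
  return 'queued';    // respond to the user right away
}

function consume(stockLeft) {
  const winners = [];
  while (queue.length > 0 && winners.length < stockLeft) {
    winners.push(queue.shift()); // first come, first served
  }
  return winners;
}

console.log(publish(1, true, true));  // 'queued'
console.log(publish(2, false, true)); // 'rejected'
console.log(publish(3, true, true));  // 'queued'
console.log(consume(1));              // [ 1 ]  -> user 3 stays queued as standby
```

Because the publisher does almost no work per request, it can absorb the traffic spike while the consumer drains the queue at its own pace.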
- When a user successfully checks out, the checkout API submits the user's email and user ID to SQS before responding to the user. SQS then triggers a Lambda that sends the email with Nodemailer.
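A hedged sketch of that Lambda. Each SQS record body is assumed to be JSON like `{ "email": ..., "userId": ... }`, and `sendMail` stands in for a Nodemailer transporter's `sendMail`; it is injected as a parameter so the sketch can run locally without SQS or an SMTP server:

```javascript
// SQS-triggered Lambda sketch: parse each record and hand it to a mailer.
async function handler(event, sendMail) {
  const sent = [];
  for (const record of event.Records) {
    const { email, userId } = JSON.parse(record.body);
    await sendMail({
      to: email,
      subject: 'Your Miaosha order is confirmed', // placeholder subject
      text: `Order confirmed for user ${userId}.`,
    });
    sent.push(email);
  }
  return sent;
}

// Local invocation with a fake SQS event and a stub mailer.
const fakeEvent = {
  Records: [{ body: JSON.stringify({ email: '[email protected]', userId: 7 }) }],
};
handler(fakeEvent, async () => {}).then((sent) => console.log(sent)); // [ '[email protected]' ]
```

Since the checkout API only enqueues the message, the user's response time is not tied to how long the email takes to send.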
- When a flash sale happens, a huge influx of traffic flows into the backend system and could affect other APIs. However, the Elastic Load Balancer routes different APIs to different target groups: the miaosha API is routed to the publisher target group, so the consumer and general target groups are not affected by it.
- Release stock when users fail to pay within the 10-minute time limit
- Actively inform standby users of successful results via Socket.IO
- Store list of standby users in Redis List
- Implement the waiting queue with a dead letter exchange
Tools: Redis List and RabbitMQ (dead letter exchange)
- If the consumer determines that a user succeeded, it sends the user ID to the waiting queue backed by the dead letter exchange.
- If the consumer determines that a user is on standby, it sends the user ID to the Redis list.
- After the 10-minute time limit, the waiting queue dead-letters the user ID to the payment consumer.
- The payment consumer checks whether the user has paid.
- If the user hasn't paid, the stock is released and given to a standby user.
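The steps above can be sketched as an in-process simulation. In the real system the waiting queue is a RabbitMQ queue declared via amqplib with roughly `channel.assertQueue('waiting', { arguments: { 'x-message-ttl': 600000, 'x-dead-letter-exchange': 'dlx' } })` (queue and exchange names are assumptions); no consumer reads it, so RabbitMQ dead-letters each message to the payment consumer when its TTL expires:

```javascript
// In-process stand-in for the TTL + dead-letter-exchange flow.
const TTL_MS = 10 * 60 * 1000; // 10-minute payment window

function makeWaitingQueue() {
  const messages = [];
  return {
    publish: (userId, now) => messages.push({ userId, deadline: now + TTL_MS }),
    // Stand-in for dead-lettering: return user ids whose TTL has expired.
    drainExpired: (now) =>
      messages.filter((m) => m.deadline <= now).map((m) => m.userId),
  };
}

// Payment consumer: stock of users who never paid is released to standby users.
function releaseUnpaid(expiredUserIds, paidUsers) {
  return expiredUserIds.filter((id) => !paidUsers.has(id));
}

const waiting = makeWaitingQueue();
waiting.publish(1, 0);
waiting.publish(2, 0);
const paid = new Set([1]); // user 1 paid within the window
console.log(releaseUnpaid(waiting.drainExpired(TTL_MS), paid)); // [ 2 ]
```

Released stock would then be offered to the first user popped from the standby Redis list, who is notified via Socket.IO.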
- Use EventBridge to schedule a Lambda that turns EC2 instances and ElastiCache on and off
- Use scheduled actions in the Auto Scaling group to scale instances out
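A hedged sketch of such a scheduled Lambda: EventBridge invokes it before the event, and it starts the instances listed in an environment variable. The env-var name and instance ids are assumptions, and the EC2 client is injected so the sketch runs without AWS credentials (in production it would be an aws-sdk EC2 client, whose `startInstances` call takes the same `{ InstanceIds }` shape, with `.promise()` in SDK v2):

```javascript
// EventBridge-scheduled Lambda sketch: start the configured EC2 instances.
async function handler(_event, ec2) {
  const instanceIds = (process.env.INSTANCE_IDS || '').split(',').filter(Boolean);
  if (instanceIds.length > 0) {
    await ec2.startInstances({ InstanceIds: instanceIds });
  }
  return instanceIds;
}

// Local invocation with a stub client.
process.env.INSTANCE_IDS = 'i-0abc,i-0def';
const stub = { startInstances: async (p) => console.log('starting', p.InstanceIds) };
handler({}, stub).then((ids) => console.log(ids));
```

A mirror-image Lambda on a second EventBridge schedule would call `stopInstances` after the event to cut idle cost.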
- Implement continuous deployment with GitHub Actions, Docker Hub, and Docker Compose to automatically update app versions on the general instances
The Miaosha website must be capable of handling high traffic, so I ran load tests to measure the maximum number of WebSocket connections and the maximum miaosha API QPS.
- Number of WebSocket connections
- Horizontal Scaling
- Vertical Scaling
- Miaosha API QPS
- Horizontal Scaling
- Vertical Scaling
Code: https://github.com/nghdavid/miaosha/tree/main/load-test
- Number of WebSocket connections
- Vertical Scaling
- The number of connections is positively correlated with RAM size
- t3a.micro has the best cost-performance ratio in terms of number of connections
- However, t3a.micro crashes with too many connections (>40,000)
- Therefore, I selected t3a.small for horizontal scaling
- Horizontal Scaling
- With 6 t3a.small instances, my backend system can maintain 300,000 WebSocket connections
I used K6 to perform the miaosha API load test.
- Load test pass criteria:
- Ratio of http_req_failed < 1%
- Median of http_req_duration < 1 sec
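As a sketch, these pass criteria map directly onto K6 thresholds. The target URL, virtual-user count, and duration below are placeholders, not the real test configuration; the script runs under `k6 run script.js`:

```javascript
import http from 'k6/http';

export const options = {
  vus: 100,          // placeholder virtual-user count
  duration: '30s',   // placeholder duration
  thresholds: {
    http_req_failed: ['rate<0.01'],  // failure ratio < 1%
    http_req_duration: ['med<1000'], // median latency < 1 s
  },
};

export default function () {
  http.get('https://example.com/'); // placeholder target, not the real miaosha endpoint
}
```

K6 exits with a non-zero status when a threshold is breached, so the same criteria can gate a CI pipeline.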
- Vertical Scaling
- QPS is not correlated with the number of CPUs or the RAM size
- Bandwidth is the bottleneck for raising QPS (compare t3a.medium and c5.large)
- t3.nano has the best cost-performance ratio in terms of QPS
- Therefore, I selected t3.nano for horizontal scaling
- Horizontal Scaling
- With 8 t3.nano instances, my backend system can reach 20,000 QPS for miaosha API
```shell
cd dockerfiles
sudo chmod u+x start.sh
sudo chmod u+x stop.sh
cp .env_docker_template .env
ipconfig getifaddr en0 # Get your IP (macOS)
# Modify .env (ip, YEAR, MONTH, DATE, HOUR, MINUTE, SECOND)
./start.sh
# Go to http://localhost:5000/main.html
# Account: [email protected] Password: 123456789
# Stop docker
./stop.sh
```
- If too many users send IDs to the miaosha API, my backend system can change the Elastic Load Balancer's response: the load balancer directly returns 'The activity is over' instead of routing requests to the publisher target group.
- Dynamic URL
- Use CodeDeploy to update multiple instances