ChatGPT says below 😅
When a load balancer reaches its peak traffic capacity, the system needs additional measures to handle the traffic and maintain scalability and availability. Here are the common strategies:
- Horizontal Scaling (Adding More Instances)
• How it works: Additional application server instances are deployed and registered with the load balancer. The load balancer then distributes the traffic among these new instances.
• Automation: Autoscaling groups can automatically scale out instances based on traffic metrics (e.g., CPU, memory usage, or request rate).
• Key Consideration: Ensure your load balancer itself has the capacity to handle more backend servers.
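For the automation point above, here is a minimal sketch, assuming AWS with boto3 and an existing Auto Scaling group already registered with the load balancer (the group name "web-asg" is a placeholder):

```python
# Sketch: attach a target-tracking scaling policy so new instances launch
# automatically when average CPU rises. Assumes AWS + boto3; "web-asg" is a
# placeholder Auto Scaling group already behind the load balancer.
import boto3

autoscaling = boto3.client("autoscaling")

autoscaling.put_scaling_policy(
    AutoScalingGroupName="web-asg",
    PolicyName="scale-on-cpu",
    PolicyType="TargetTrackingScaling",
    TargetTrackingConfiguration={
        "PredefinedMetricSpecification": {
            "PredefinedMetricType": "ASGAverageCPUUtilization"
        },
        "TargetValue": 60.0,  # keep average CPU around 60%; scale out above it
    },
)
```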
- Scaling the Load Balancer
• How it works: If the load balancer itself becomes a bottleneck, you may need to:
• Upgrade the load balancer (if you’re using a single instance, such as an NGINX or HAProxy server).
• Use a multi-tier load balancing approach, where one set of load balancers handles user requests and distributes traffic to a second tier of load balancers (a small sketch follows below).
• Switch to a cloud-based managed load balancer (such as AWS ALB, Azure Load Balancer, or GCP’s Cloud Load Balancing), which can scale automatically.
• Key Consideration: Cloud providers often allow elastic scaling of their load balancers, handling spikes in traffic dynamically.
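A purely illustrative sketch of the multi-tier idea, with placeholder host names: the first tier only picks a second-tier balancer, and that balancer picks the actual backend.

```python
# Conceptual sketch of two-tier load balancing with round-robin at each tier.
# All host names are placeholders, not a real deployment.
import itertools

class RoundRobin:
    def __init__(self, targets):
        self._cycle = itertools.cycle(targets)

    def next_target(self):
        return next(self._cycle)

# Second tier: each balancer fronts its own pool of app servers.
tier2 = [
    RoundRobin(["app-1a", "app-1b"]),
    RoundRobin(["app-2a", "app-2b"]),
]

# First tier: spreads incoming requests across the second-tier balancers.
tier1 = RoundRobin(tier2)

for _ in range(4):
    backend = tier1.next_target().next_target()
    print("route request to", backend)
```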
- Geographic Load Balancing
• How it works: Use global traffic management (GTM) or DNS-based load balancing to route traffic to different data centers or regions based on proximity or availability.
• Benefit: Distributes load across multiple locations, reducing the chance of bottlenecks in a single region.
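A rough sketch of the DNS-based approach, assuming Amazon Route 53 latency-based records; the hosted zone ID, domain, and IP address are placeholders, and you would create one such record per region:

```python
# Sketch: a latency-based DNS record that steers resolvers toward the
# closest region. Assumes Amazon Route 53; all identifiers are placeholders.
import boto3

route53 = boto3.client("route53")

route53.change_resource_record_sets(
    HostedZoneId="Z123EXAMPLE",
    ChangeBatch={
        "Changes": [{
            "Action": "UPSERT",
            "ResourceRecordSet": {
                "Name": "app.example.com",
                "Type": "A",
                "SetIdentifier": "us-east-1",   # one record per region
                "Region": "us-east-1",          # latency-based routing key
                "TTL": 60,
                "ResourceRecords": [{"Value": "203.0.113.10"}],
            },
        }],
    },
)
```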
- Content Delivery Networks (CDNs)
• How it works: Offload static or cacheable content (e.g., images, videos, CSS, JavaScript) to a CDN like Cloudflare, Akamai, or AWS CloudFront.
• Benefit: Reduces the direct load on your load balancer by handling requests closer to the user.
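For the CDN to absorb these requests, responses need to be marked as cacheable. A minimal sketch, assuming a Flask application sitting behind the CDN; the route and payload are placeholders:

```python
# Sketch: set Cache-Control so a CDN (e.g., CloudFront or Cloudflare) can
# serve this response from the edge instead of hitting the load balancer.
from flask import Flask, jsonify

app = Flask(__name__)

@app.route("/catalog")
def catalog():
    resp = jsonify(items=["a", "b", "c"])  # placeholder payload
    # "public" + max-age lets shared caches keep the response for an hour.
    resp.headers["Cache-Control"] = "public, max-age=3600"
    return resp
```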
- Queueing Systems
• How it works: When peak traffic exceeds server capacity, requests are added to a queue. A message broker (e.g., RabbitMQ, Kafka, or AWS SQS) can buffer requests for asynchronous processing.
• Benefit: Prevents system overload by smoothing traffic spikes.
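A minimal producer-side sketch, assuming AWS SQS; the queue URL is a placeholder, and a separate consumer drains the queue at its own pace:

```python
# Sketch: buffer work in a queue instead of processing it inline.
# Assumes AWS SQS via boto3; the queue URL is a placeholder.
import json
import boto3

sqs = boto3.client("sqs")
QUEUE_URL = "https://sqs.us-east-1.amazonaws.com/123456789012/orders"  # placeholder

def enqueue_order(order: dict) -> None:
    # The web tier returns quickly; a worker consumes the queue asynchronously.
    sqs.send_message(QueueUrl=QUEUE_URL, MessageBody=json.dumps(order))
```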
- Traffic Throttling or Rate Limiting
• How it works: Define limits for how much traffic any single user or application can generate (e.g., rate limits per IP or API key).
• Benefit: Prevents misuse or overloading by a few heavy users.
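A small in-memory token-bucket sketch of per-client rate limiting; a production setup would typically keep this state in a shared store such as Redis rather than in process memory:

```python
# Sketch: token-bucket rate limiter keyed by client ID (e.g., IP or API key).
import time
from collections import defaultdict

RATE = 5    # tokens refilled per second
BURST = 10  # maximum bucket size

_buckets = defaultdict(lambda: {"tokens": BURST, "last": time.monotonic()})

def allow(client_id: str) -> bool:
    bucket = _buckets[client_id]
    now = time.monotonic()
    # Refill proportionally to the time elapsed since the last request.
    bucket["tokens"] = min(BURST, bucket["tokens"] + (now - bucket["last"]) * RATE)
    bucket["last"] = now
    if bucket["tokens"] >= 1:
        bucket["tokens"] -= 1
        return True   # within the limit
    return False      # throttle this request
```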
- Failover and Redundancy
• How it works: Configure backup load balancers or failover regions that activate automatically if the primary system cannot handle traffic.
• Benefit: Ensures high availability during unexpected surges.
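A simplified client-side sketch of the failover idea; both health-check URLs are placeholders, and real deployments usually handle this in DNS (e.g., failover routing policies) or at the load-balancer layer rather than in application code:

```python
# Sketch: probe a primary and a backup endpoint, returning the first healthy one.
import urllib.request
import urllib.error

ENDPOINTS = [
    "https://lb-primary.example.com/health",  # placeholder primary
    "https://lb-backup.example.com/health",   # placeholder standby
]

def pick_healthy_endpoint() -> str:
    for url in ENDPOINTS:
        try:
            with urllib.request.urlopen(url, timeout=2) as resp:
                if resp.status == 200:
                    return url
        except (urllib.error.URLError, TimeoutError):
            continue  # try the next endpoint
    raise RuntimeError("no healthy endpoint available")
```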
- Caching Mechanisms
• How it works: Implement server-side caching (e.g., Redis, Memcached) to reduce the load on your application servers by serving repeated requests from a cache.
• Benefit: Reduces response time and application server load.
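A minimal cache-aside sketch using redis-py; fetch_from_db is a stand-in for the expensive database query being protected:

```python
# Sketch: cache-aside pattern with Redis. Serve repeated reads from the cache
# and only fall back to the database on a miss.
import json
import redis

cache = redis.Redis(host="localhost", port=6379)

def fetch_from_db(product_id: str) -> dict:
    # Stand-in for the real database query.
    return {"id": product_id, "name": "example"}

def get_product(product_id: str) -> dict:
    key = f"product:{product_id}"
    cached = cache.get(key)
    if cached is not None:
        return json.loads(cached)                # cache hit: no DB work
    product = fetch_from_db(product_id)          # cache miss: query the DB
    cache.setex(key, 300, json.dumps(product))   # keep it for 5 minutes
    return product
```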
- Optimize Application Performance
• Improve application code to handle requests faster.
• Use database optimizations or read replicas to reduce DB load.
• Minimize server-side computation for each request.
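To make the read-replica point concrete, a small sketch that routes reads to replicas and writes to the primary, assuming PostgreSQL with psycopg2; the connection strings are placeholders:

```python
# Sketch: simple read/write splitting. Writes go to the primary; reads are
# spread across replicas to take load off the primary database.
import random
import psycopg2

PRIMARY_DSN = "host=db-primary dbname=app"                       # placeholder
REPLICA_DSNS = ["host=db-replica-1 dbname=app",
                "host=db-replica-2 dbname=app"]                  # placeholders

def get_connection(readonly: bool):
    dsn = random.choice(REPLICA_DSNS) if readonly else PRIMARY_DSN
    return psycopg2.connect(dsn)
```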
Planning for Peak Traffic
To prepare for such scenarios:
• Use stress testing to understand system limits.
• Implement capacity planning based on historical traffic patterns.
• Leverage cloud-native services with elastic scaling capabilities.
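A very small stress-test sketch using only the standard library; the target URL is a placeholder, and dedicated tools (k6, Locust, JMeter) are the usual choice for serious load testing:

```python
# Sketch: fire N concurrent requests at a staging endpoint and report the
# success rate, to get a first feel for where the system starts to degrade.
import concurrent.futures
import urllib.request
import urllib.error

URL = "https://staging.example.com/"  # placeholder target, never production
REQUESTS = 200

def hit(_):
    try:
        with urllib.request.urlopen(URL, timeout=5) as resp:
            return resp.status == 200
    except (urllib.error.URLError, TimeoutError):
        return False

with concurrent.futures.ThreadPoolExecutor(max_workers=20) as pool:
    results = list(pool.map(hit, range(REQUESTS)))

print(f"success rate: {sum(results) / REQUESTS:.1%}")
```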
Would you like to dive deeper into any specific solution?