Containers

Category: Best Practices

Scaling StarRocks on Amazon EKS with KEDA and Karpenter for enterprise OLAP workloads

Financial analytics at enterprise scale is unforgiving. Queries must return in seconds, not minutes. Thousands of finance professionals need concurrent access during monthly close cycles. And when data volumes grow from hundreds of gigabytes to terabytes, spanning billions of records, the infrastructure underneath must scale without forcing engineers to choose between performance and cost. This […]

Maximizing value with Amazon EKS Auto Mode: Strategies for visibility, control, and optimization

In this post, we explore how to maximize Auto Mode’s value through comprehensive cost visibility, proactive governance, and continuous optimization strategies. We cover essential cost management dimensions: establishing spending visibility, forecasting resource needs, implementing governance controls, and measuring efficiency improvements. For both new and experienced Amazon EKS Auto Mode users, this guide offers actionable insights to balance performance, reliability, and cost-efficiency in Kubernetes deployments.

Implement centralized observability for multi-account Amazon EKS

This post shows you how to unify your existing Container Insights and CloudWatch data into a centralized monitoring hub using a hub-and-spoke architecture. You will unify fragmented observability data into a single pane of glass that maintains security boundaries while removing the need for account switching. The solution requires no changes to your existing monitoring infrastructure. It connects what you already have. You will reduce incident response time by removing context switching between accounts and Regions. From one console, you will identify clusters experiencing elevated error rates, spot pod CPU and memory spikes, and track which clusters require version upgrades organization wide. This visibility helps you add capacity before issues occur.

Building PCI DSS-Compliant Architectures on Amazon EKS

In this post, we explore key considerations, best practices, and architectural decisions hosting applications on EKS in shared tenancy environments while maintaining PCI DSS compliance. Please note this information is for reference purposes only and does not constitute legal or compliance advice—customers remain responsible for making their own independent assessment, and AWS products or services are provided ‘as is’ without warranties, representations, or conditions of any kind.

Data-driven Amazon EKS cost optimization: A practical guide to workload analysis

In this post, we introduce key considerations for optimizing Amazon EKS costs in production environments through detailed workload analysis and comprehensive monitoring. We demonstrate proven best practices to maximize cost savings while maintaining performance and resilience, supported by real-world examples showing how to eliminate resource waste from overprovisioned pods, excessive replica counts, and fragmented node pools.

SaaS deployment architectures with Amazon EKS

In this post, we explore patterns and practices for building and operating distributed Amazon Elastic Kubernetes Service (Amazon EKS)-based applications effectively. We examine three deployment models – SaaS Provider Hosted, Remote Application Plane, and Hybrid Nodes – each offering distinct advantages for specific use cases as companies scale their software as a service (SaaS) offerings.

Migrating from AWS CodeDeploy to Amazon ECS for blue/green deployments

In this post, we explore the migration path from AWS CodeDeploy to Amazon ECS for blue/green deployments, discussing key architectural differences and implementation considerations. We examine three different migration approaches – in-place update, new service with existing load balancer, and new service with new load balancer – along with their respective trade-offs in terms of complexity, risk, downtime, and cost.

Best practices for resilience and availability on Amazon ECS

In this post, we explore advanced implementation patterns for building highly available services on Amazon ECS, including idempotency, resilience to transient failures, static stability across Availability Zones, deployment safety, and chaos engineering techniques. The post provides detailed guidance on how these patterns can be implemented when deploying applications on Amazon ECS to ensure maximum resilience and availability.

UTH - Amazon EKS ultra scale clusters featured image

Under the hood: Amazon EKS ultra scale clusters

This post was co-authored by Shyam Jeedigunta, Principal Engineer, Amazon EKS; Apoorva Kulkarni, Sr. Specialist Solutions Architect, Containers and Raghav Tripathi, Sr. Software Dev Manager, Amazon EKS. Today, Amazon Elastic Kubernetes Service (Amazon EKS) announced support for clusters with up to 100,000 nodes. With Amazon EC2’s new generation accelerated computing instance types, this translates to […]