Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

You can scale your application horizontally by adding more pods to your deployment. The simplest way is to specify the number of replicas in your deployment manifest. This example will run three pods for a deployment, balancing requests between them:

Code Block
languageyaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: example
spec:
  replicas: 3

Autoscaling

Most applications experience fluctuations in traffic both seasonally and throughout the day. You can use the horizontal pod autoscaler to automatically add and remove replicas based on metrics observed in your cluster. Here as an example which automatically scales deployment based on the number of incoming requests per second:

apiVersion: autoscaling/v2beta2 kind: HorizontalPodAutoscaler metadata: name: example spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: example minReplicas: 2 maxReplicas: 10 metrics: - type: Pods pods: metric: name: http_requests_per_second target: type: AverageValue averageValue: 10
Code Block
languageyaml
Info

The AWS Platform Guide now lives at thoughtbot.com

Visit Horizontal Scaling to access this page's content