Overview

In a Kubernetes environment, application admins must ensure that sufficient resources are available for their users. Elastic scaling scales resources up or down in response to CPU and memory usage. Scaling happens automatically and is driven by the Kubernetes Horizontal Pod Autoscaler (HPA). This article walks through a basic scenario that shows how you can observe resources scaling in your Kubernetes environment.

Resources Scale Up

Additional resources are needed when a Pipeline processes a high volume of documents or when too many Pipelines run at the same time.

...

The Average CPU Usage has exceeded the HPA CPU Threshold, so additional resources are scaled up as the usage reaches the Request CPU metric.
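To make the threshold comparison concrete, the following is a minimal sketch of the replica calculation that the Kubernetes documentation describes for the HPA: the desired replica count is the current count multiplied by the ratio of the observed metric to the target, rounded up. The function name, the parameter names, the replica bounds, and the sample numbers are illustrative assumptions and are not taken from this article. The 0.1 tolerance is the documented Kubernetes default, which your cluster may override.

```python
import math


def desired_replicas(current_replicas, avg_cpu_pct, target_cpu_pct,
                     min_replicas=1, max_replicas=10, tolerance=0.1):
    """Sketch of the HPA replica formula (hypothetical helper, not a Kubernetes API).

    avg_cpu_pct:    average CPU usage as a percentage of the requested CPU
    target_cpu_pct: the HPA CPU threshold, in the same units
    """
    ratio = avg_cpu_pct / target_cpu_pct
    # Within the tolerance band the HPA leaves the replica count unchanged.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    wanted = math.ceil(current_replicas * ratio)
    # Clamp to the configured bounds (assumed values here).
    return max(min_replicas, min(max_replicas, wanted))


# Usage above the threshold: 2 replicas at 90% against a 60% target.
print(desired_replicas(2, 90, 60))
# Usage well below the threshold: 4 replicas at 30% against a 60% target.
print(desired_replicas(4, 30, 60))
```

With these assumed inputs, usage above the threshold yields a higher replica count and usage well below it yields a lower one, which matches the behavior observed in the scenario.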

Resources Scale Down

Using the same Pipeline, we can observe resources scaling down in our Kubernetes environment. As the Pipeline nears completion, Kubernetes scales the resources down because they are no longer required.

...
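Scale down is usually not instantaneous. By default, the HPA applies a scale-down stabilization window of 300 seconds and acts on the highest replica recommendation seen within that window, which prevents replicas from flapping when load fluctuates. The sketch below models that behavior. The class name, method name, and sample timestamps are hypothetical, and whether your deployment changes the default window is not stated in this article.

```python
from collections import deque


class ScaleDownStabilizer:
    """Sketch of HPA scale-down stabilization (hypothetical helper)."""

    def __init__(self, window_seconds=300):
        self.window_seconds = window_seconds
        self._history = deque()  # (timestamp_seconds, recommended_replicas)

    def recommend(self, now, desired):
        """Record a new recommendation and return the stabilized replica count."""
        self._history.append((now, desired))
        # Drop recommendations that have aged out of the window.
        while self._history[0][0] < now - self.window_seconds:
            self._history.popleft()
        # The highest recent recommendation wins, so scale up stays immediate
        # while scale down waits until the window no longer contains a higher value.
        return max(replicas for _, replicas in self._history)


stabilizer = ScaleDownStabilizer()
print(stabilizer.recommend(0, 5))    # Pipeline running: 5 replicas recommended
print(stabilizer.recommend(60, 2))   # load drops, but the earlier value is still in the window
print(stabilizer.recommend(301, 2))  # the earlier value has aged out
```

In this model the replica count stays at the earlier, higher value until that recommendation leaves the window, which is why the scale down appears some minutes after the Pipeline's load falls.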