使用Terraform的带有AWS Container Insights指标的EKS AutoScaling

如何解决使用Terraform的带有AWS Container Insights指标的EKS AutoScaling

我需要使用AWS CloudWatch容器见解 cluster_failed_node_count 下可用的自定义指标之一实施EKS自动缩放例如，如果我们观察到此指标有一些计数（> = 1），则ASG应相应地开始增加实例。

我已经在EKS集群上部署了CLoudWatch和FluentD代理，并且能够在仪表板上集成我的EKS CLuster和Metrics。

现在，如果考虑到需求，我们可以使用三个选项来执行自动缩放策略。

目标跟踪策略-在此类别中，我们可以选择实现自定义指标（但我们可以仅使用AWS CLI或Terraform来实现）

当前，我已经尝试过此操作，但是我不确定，它是否可以正常工作，因为没有使用下面实现的特定用例，而且在创建此策略时，在控制台上也显示“执行此策略”保持等于1“的值

resource "aws_autoscaling_policy" "eks-asg-policy" {
  count                     = var.create_eks_cluster ? 1 : 0
  name                      = "failed-node-count-step-scale-up"
  adjustment_type           = "ChangeInCapacity"
  policy_type               = "TargetTrackingScaling"
  autoscaling_group_name    = "SomeASGName"
  estimated_instance_warmup = 180

  target_tracking_configuration {
    customized_metric_specification {
      metric_dimension {
        name  = "ClusterName"
        value = "my-Cluster"
      }
      metric_name = "cluster_failed_node_count"
      namespace   = "AWS/ContainerInsights"
      statistic   = "Average"
    }

    target_value = 1
  }

另一种选择，如果我可以仅通过“步骤缩放”或“简单缩放”（而不是“目标跟踪自定义策略”）实现上述目标，并且如果是这种情况，我将如何为cluster_failed_node_count设置阈值，那么缩放比例又需要什么呢？在这种情况下要考虑。例如

step_adjustment {
    scaling_adjustment          = 1
    metric_interval_lower_bound = 2.0
    metric_interval_upper_bound = 3.0
  }

当前我正在实现此功能，但是它给我一个界限错误

resource "aws_autoscaling_policy" "eks-asg-step-scale-up-policy" {
  count                  = var.create_eks_cluster ? 1 : 0
  name                   = "failed-node-count-step-scale-up"
  autoscaling_group_name = "SomeASGName"
  adjustment_type        = "ChangeInCapacity"
  step_adjustment {
    scaling_adjustment = 1
  }
  policy_type = "StepScaling"
}

resource "aws_cloudwatch_metric_alarm" "cluster_failed_node_count-alarm" {
  count               = var.create_eks_cluster ? 1 : 0
  alarm_name          = "cluster_failed_node_count-alarm"
  alarm_description   = "alarm for failed node count"
  comparison_operator = "GreaterThanOrEqualToThreshold"
  evaluation_periods  = "2"
  metric_name         = "cluster_failed_node_count"
  period              = "60"
  threshold           = "1"
  tags                = local.required_tags
  dimensions = {
    "AutoScalingGroupName" = "SomeASGName"
  }
  actions_enabled = true
  alarm_actions   = ["${aws_autoscaling_policy.eks-asg-step-scale-up-policy[0].arn}"]
}

错误：放置缩放策略时发生错误：ValidationError：不能不指定StepAdjustment的上下限。

使用Terraform的带有AWS Container Insights指标的EKS AutoScaling

如何解决使用Terraform的带有AWS Container Insights指标的EKS AutoScaling

相关推荐