reduce ecs api call in running_essential_task? method #37

woshidan · 2018-08-09T12:24:27Z

Modify instance deregister logic to reduce ECS API calls.

woshidan · 2018-08-09T12:25:28Z

lib/ecs_deploy/auto_scaler.rb

-        task_groups = service_config.client.describe_tasks(cluster: service_config.cluster, tasks: task_arns).tasks.map(&:group)
-        task_groups.include?("service:#{service_config.name}")
+        container_arns = service_config.client.list_container_instances(cluster: service_config.cluster, filter: "task:group == service:#{service_config.name}")[0]
+        container_arns.include?(instance.container_instance_arn)


I confirmed that this diff return the same results.

The test environment like this:

1 cluster

1 autoscaling group for the cluster

2 services in the cluster

1 service is specified with services:name key in YAML config file for autoscaler

then I stop and run this specified service's task and check result

detail for test environment: https://gist.github.com/woshidan/5463dfeba59cb506c3ca2cc4d2106e4a

joker1007 · 2018-08-09T15:16:09Z

lib/ecs_deploy/auto_scaler.rb

-        task_arns = service_config.client.list_tasks(cluster: service_config.cluster, container_instance: instance.container_instance_arn).task_arns
-        task_groups = service_config.client.describe_tasks(cluster: service_config.cluster, tasks: task_arns).tasks.map(&:group)
-        task_groups.include?("service:#{service_config.name}")
+        container_arns = service_config.client.list_container_instances(cluster: service_config.cluster, filter: "task:group == service:#{service_config.name}")[0]


このAPI呼び出しをこのメソッドから分離できる。
これを分離して339行目のループの外側で、serviceに関連するコンテナインスタンスを事前に取得して、このメソッドでは引数として存在するinstanceのarnと比較するだけにすれば、API呼び出し回数は50分の1ぐらいになる。

多分メソッドの置き場所としては242行目にあるServiceConfig#fetch_container_instancesですね。
まず、このメソッドの名前をfetch_all_container_instancesに変える。
でServiceConfig#fetch_container_instancesでサービスが起動したタスクをホストしているcontainer_instancesを取れる様にする。
そうすれば割と自然になりますね。

8ecb03a と e6ccd81 のコミットで対応しました。

検証に利用したecs_autoscalerの設定

polling_interval: 30 auto_scaling_groups: - name: ag_woshidan_test region: ap-northeast-1 buffer: 1 # タスク数に対する余剰のインスタンス数 services: - name: woshidan-test-service cluster: woshidan-test-cluster region: ap-northeast-1 auto_scaling_group_name: ag_woshidan_test step: 3 idle_time: 120 max_task_count: [15] cooldown_time_for_reach_max: 600 min_task_count: 3 upscale_triggers: - alarm_name: "TEST ALARM TO TRIGGER UPSCALE" state: ALARM downscale_triggers: - alarm_name: "TEST ALARM TO TRIGGER DOWNSCALE" state: ALARM step: 6

検証内容・結果

アラートを

時刻アラートの状態変化

16:39 TEST ALARM TO TRIGGER UPSCALE: OK => ARARM

16:51 TEST ALARM TO TRIGGER UPSCALE: ARARM => OK

16:51 TEST ALARM TO TRIGGER DOWNSCALE: OK => ARARM

のように変化させて、

ecs_autoscalerで管理しているサービス( woshidan-test-service )のタスクの数

同じクラスタのコンテナインスタンスに乗っているecs_autoscalerで管理していないサービス( woshidan-test-service-2 )のタスクの数

AutoScalingグループのインスタンスのデタッチ、アタッチの履歴

は以下なのでよさそうです(環境構築の詳細は https://gist.github.com/woshidan/2235b3e6a194bd37d795379855d3f2be)。

…_instances

… and fetch the list out of the check if conatiner instance in cluster is deregisterable

joker1007 · 2018-08-16T05:41:02Z

lib/ecs_deploy/auto_scaler.rb

+          break unless resp.next_token
+        end
+
+        chunk_size = 50


これ、arnしか使ってないので、ここから下必要無くて、arnだけ返せば良いと思う。
他の部分は良さそうですね！

コメントありがとうございます！

そこはたしかにARNしかつかってないんですが、fetch_container_instances_in_service と fetch_container_instances_in_cluster で対照的なメソッドがあるとき、その両方で扱ってるオブジェクトのクラスが同じ方がわかりやすいかもなーってちょっと迷ってます。

直しておいたほうがいいでしょうか。

そもそもAPIリクエスト回数を減らすのが目的だし、そこが対照的であったとしても、そんなに分かり易さに寄与しないと私は思います。
まあ、これで減るのはserviceの数 * せいぜい1回か2回分ぐらいですが。

後、実質的に使ってないコードがあるより、無い方が良い。
使ってなさそうなコードが残ってると後になって読んだ時にそうである意図に悩むことになる。

では、直しちゃいます 💪

…ist because container instances in service is not used in running_essential_task?

joker1007

LGTM。
動作確認取れたらマージして良いです。

woshidan · 2018-08-16T07:08:04Z

アラートを

時刻	アラートの状態変化
15:37	TEST ALARM TO TRIGGER UPSCALE: OK => ARARM
15:53	TEST ALARM TO TRIGGER UPSCALE: ARARM => OK
15:53	TEST ALARM TO TRIGGER DOWNSCALE: OK => ARARM

のように変化させて、 #37 (comment) と同様の検証を行ったところ、動作確認の結果はよさそうです 🙆

woshidan · 2018-08-16T07:08:35Z

ということでマージします！

reduce ecs api call in running_essential_task? method

80f0335

woshidan commented Aug 9, 2018

View reviewed changes

joker1007 reviewed Aug 9, 2018

View reviewed changes

rename ServiceConfig#fetch_container_instances to fetch_all_container…

8ecb03a

…_instances

woshidan force-pushed the reduce-ecs-api-call-in-running-essential-task-method branch from 0d2ca6e to 1da8ccc Compare August 14, 2018 03:52

add method to container_instances list related with specified service…

e6ccd81

… and fetch the list out of the check if conatiner instance in cluster is deregisterable

woshidan force-pushed the reduce-ecs-api-call-in-running-essential-task-method branch from 1da8ccc to e6ccd81 Compare August 14, 2018 05:35

joker1007 reviewed Aug 16, 2018

View reviewed changes

return container instance arns instead of container instance object l…

cbca661

…ist because container instances in service is not used in running_essential_task?

joker1007 approved these changes Aug 16, 2018

View reviewed changes

woshidan merged commit 6ae37b7 into reproio:master Aug 16, 2018

woshidan deleted the reduce-ecs-api-call-in-running-essential-task-method branch August 16, 2018 07:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reduce ecs api call in running_essential_task? method #37

reduce ecs api call in running_essential_task? method #37

woshidan commented Aug 9, 2018

woshidan Aug 9, 2018

joker1007 Aug 9, 2018

woshidan Aug 14, 2018

woshidan Aug 14, 2018

joker1007 Aug 16, 2018

woshidan Aug 16, 2018

joker1007 Aug 16, 2018

woshidan Aug 16, 2018

joker1007 left a comment

woshidan commented Aug 16, 2018

woshidan commented Aug 16, 2018

時刻	アラートの状態変化
16:39	TEST ALARM TO TRIGGER UPSCALE: OK => ARARM
16:51	TEST ALARM TO TRIGGER UPSCALE: ARARM => OK
16:51	TEST ALARM TO TRIGGER DOWNSCALE: OK => ARARM

reduce ecs api call in running_essential_task? method #37

reduce ecs api call in running_essential_task? method #37

Conversation

woshidan commented Aug 9, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

検証に利用したecs_autoscalerの設定

検証内容・結果

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

joker1007 left a comment

Choose a reason for hiding this comment

woshidan commented Aug 16, 2018

woshidan commented Aug 16, 2018