Consider availability zone when picking an instance type #25

nmeierpolys · 2016-10-29T15:02:00Z

Our instances all run in the us-east-1d availability zone. When autospotting runs, it finds cg1.4xlarge to be the cheapest instance type and tries to request spot instances of that type. Unfortunately, cg1.4xlarge isn't available in us-east-1d, only in us-east-1c. The spot request fails with this error: "capacity-not-available: There is no Spot capacity available that matches your request."

For use cases like ours that are limited to a specific AZ, it would be really helpful to consider the AZ when retrieving spot pricing info, and to pick an instance type only if it's available in that AZ.
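A minimal sketch of the AZ-aware selection requested above. It assumes spot price records have already been fetched into a hypothetical `SpotPrice` shape (instance type, AZ, price); the record type, the function name `cheapest_in_az`, and the prices are illustrative and are not taken from autospotting's code.

```python
from typing import NamedTuple, Optional, Sequence


class SpotPrice(NamedTuple):
    """Hypothetical record: one spot price for an instance type in one AZ."""
    instance_type: str
    availability_zone: str
    price: float


def cheapest_in_az(prices: Sequence[SpotPrice], az: str) -> Optional[SpotPrice]:
    """Return the cheapest record offered in `az`, or None if nothing is offered there."""
    # Drop every record from other AZs before comparing prices, so a type
    # that only exists elsewhere can never win on price alone.
    candidates = [p for p in prices if p.availability_zone == az]
    return min(candidates, key=lambda p: p.price, default=None)


# Made-up prices, for illustration only.
prices = [
    SpotPrice("cg1.4xlarge", "us-east-1c", 0.05),
    SpotPrice("m3.large", "us-east-1d", 0.08),
    SpotPrice("c3.large", "us-east-1d", 0.11),
]
best = cheapest_in_az(prices, "us-east-1d")
```

With these sample records, cg1.4xlarge has the lowest price overall but is skipped because it has no record in us-east-1d.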

When this happens, autospotting requests a new spot instance each time the process runs, which fairly quickly uses up the maximum number of open spot instance requests that AWS allows and prevents other spot instance requests.

nmeierpolys · 2016-10-29T18:12:21Z

I was able to get past this for our instances by changing the code to skip that instance type and hosting our own binaries. I'll see if I can figure out why it's not handling the case better in general and hopefully put in a PR when I get a chance.

cristim · 2016-10-30T08:15:40Z

Good catch @nmeierpolys, thanks for reporting this!

Off-topic: I'd like to learn more about your use case for running in only a single AZ. For reliability reasons that's not something I would normally do, especially with spot instances, and I'm always curious to learn about such interesting use cases happening 'in the wild'.

Regarding the issue you noticed, that sounds like a real problem. As far as I can see in the spot instance pricing history in my AWS console, that instance type is indeed only available in a single AZ in us-east-1 (labeled us-east-1a in my account), so I guess the algorithm isn't handling this edge case.

I will try to reproduce this in my own AWS account and have a look at the source code to see why that unavailable instance type appears to be the cheapest.

Somewhat related to this issue, I think I could implement a way to track instance launch failures for a given instance type/AZ combination, and temporarily blacklist that combination for a few hours if the instance fails to launch over multiple autospotting runs in a row. I will create a new issue for implementing this kind of feature.
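A rough sketch of how such failure tracking could look, under stated assumptions: the class name `LaunchFailureTracker`, the threshold of three consecutive failures, and the three-hour cooldown are all hypothetical choices, not autospotting's implementation. State is kept in memory here; a process that runs periodically and exits would need to persist it somewhere between runs.

```python
import time
from typing import Dict, Optional, Tuple

Key = Tuple[str, str]  # (instance_type, availability_zone)


class LaunchFailureTracker:
    """Hypothetical tracker: blocks a type/AZ pair after repeated launch failures."""

    def __init__(self, max_failures: int = 3, cooldown_seconds: float = 3 * 3600):
        self.max_failures = max_failures
        self.cooldown_seconds = cooldown_seconds
        self._failures: Dict[Key, int] = {}
        self._blocked_until: Dict[Key, float] = {}

    def record_failure(self, instance_type: str, az: str, now: Optional[float] = None) -> None:
        now = time.time() if now is None else now
        key = (instance_type, az)
        self._failures[key] = self._failures.get(key, 0) + 1
        if self._failures[key] >= self.max_failures:
            # Threshold reached: block the pair and restart the count.
            self._blocked_until[key] = now + self.cooldown_seconds
            self._failures[key] = 0

    def record_success(self, instance_type: str, az: str) -> None:
        # A successful launch breaks the run of consecutive failures.
        self._failures.pop((instance_type, az), None)

    def is_blocked(self, instance_type: str, az: str, now: Optional[float] = None) -> bool:
        now = time.time() if now is None else now
        key = (instance_type, az)
        until = self._blocked_until.get(key)
        if until is None:
            return False
        if now >= until:
            # Cooldown elapsed: the pair may be tried again.
            del self._blocked_until[key]
            return False
        return True


tracker = LaunchFailureTracker(max_failures=2, cooldown_seconds=100)
tracker.record_failure("cg1.4xlarge", "us-east-1d", now=0)
tracker.record_failure("cg1.4xlarge", "us-east-1d", now=10)
blocked_during_cooldown = tracker.is_blocked("cg1.4xlarge", "us-east-1d", now=50)
```

Keying on the type/AZ pair rather than the instance type alone means a type that fails in one AZ can still be chosen in another AZ where it is offered.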

Since you are the first user to report self-hosting binaries, please consider submitting a PR documenting how you did it. I'd really like to have some documentation for this.

nmeierpolys · 2016-10-31T19:25:03Z

Thanks @cristim. Right now, we're running in a single AZ because our system and deploys expect everything to be in a single subnet. There's nothing technically preventing us from changing this; it just hasn't been a big enough priority to commit time to yet.

I'd be happy to add some documentation for self-hosting the binaries. Hopefully I can send a PR for that your way sometime this week.

cristim · 2017-08-05T15:22:23Z

@nmeierpolys, is this still happening in your environment with the latest version?

nmeierpolys · 2017-08-07T12:55:50Z

Sorry, but I'm no longer working on the project that used autospotting, so I'm not able to test it out with the latest version.

cristim · 2017-08-07T15:13:26Z

Thanks. In that case, I'm closing this issue.

This was referenced Oct 30, 2016

Temporarily blacklist spot instance types after multiple launch failures in a given AZ #26

Closed

Document self-hosting binaries #27

Closed

cristim added the Type: Bug label Nov 3, 2016

xlr-8 mentioned this issue Dec 24, 2016

Big refactoring #46

Merged

cristim closed this as completed Aug 7, 2017
