7

我经常在 EC2 上运行 Spot 实例(用于 Hadoop 任务作业、临时节点等)。其中一些是长时间运行的 Spot 实例。

计算按需或预留 EC2 实例的成本相当容易——但我如何计算作为 Spot 实例运行的特定节点(或多个节点)所产生的成本?

我知道现场实例的成本每小时都会根据市场价格变化 - 那么有没有办法计算正在运行的现场实例的累积总成本?通过 API 还是其他方式?

4

4 回答 4

6

好的,我在 Boto 库中找到了一种方法。这段代码并不完美 - Boto 似乎没有返回确切的时间范围,但它确实或多或少地在一个范围内获得了历史现货价格。以下代码似乎运行良好。如果有人可以改进它,那就太好了。

import boto, datetime, time

# Enter your AWS credentials
aws_key = "YOUR_AWS_KEY"
aws_secret = "YOUR_AWS_SECRET"

# Details of instance & time range you want to find spot prices for
instanceType = 'm1.xlarge'
startTime = '2012-07-01T21:14:45.000Z'
endTime = '2012-07-30T23:14:45.000Z'
aZ = 'us-east-1c'

# Some other variables
maxCost = 0.0
minTime = float("inf")
maxTime = 0.0
totalPrice = 0.0
oldTimee = 0.0

# Connect to EC2
conn = boto.connect_ec2(aws_key, aws_secret)

# Get prices for instance, AZ and time range
prices = conn.get_spot_price_history(instance_type=instanceType, 
  start_time=startTime, end_time=endTime, availability_zone=aZ)

# Output the prices
print "Historic prices"
for price in prices:
  timee = time.mktime(datetime.datetime.strptime(price.timestamp, 
    "%Y-%m-%dT%H:%M:%S.000Z" ).timetuple())
  print "\t" + price.timestamp + " => " + str(price.price)
  # Get max and min time from results
  if timee < minTime:
    minTime = timee
  if timee > maxTime:
    maxTime = timee
  # Get the max cost
  if price.price > maxCost:
    maxCost = price.price
  # Calculate total price
  if not (oldTimee == 0):
    totalPrice += (price.price * abs(timee - oldTimee)) / 3600
  oldTimee = timee

# Difference b/w first and last returned times
timeDiff = maxTime - minTime

# Output aggregate, average and max results
print "For: one %s in %s" % (instanceType, aZ)
print "From: %s to %s" % (startTime, endTime)
print "\tTotal cost = $" + str(totalPrice)
print "\tMax hourly cost = $" + str(maxCost)
print "\tAvg hourly cost = $" + str(totalPrice * 3600/ timeDiff)
于 2012-08-02T21:51:58.217 回答
4

我已经重写了 Suman 的解决方案来使用 boto3。确保将 utctime 与 tz 集一起使用!:

def get_spot_instance_pricing(ec2, instance_type, start_time, end_time, zone):
    result = ec2.describe_spot_price_history(InstanceTypes=[instance_type], StartTime=start_time, EndTime=end_time, AvailabilityZone=zone)
    assert 'NextToken' not in result or result['NextToken'] == ''

    total_cost = 0.0

    total_seconds = (end_time - start_time).total_seconds()
    total_hours = total_seconds / (60*60)
    computed_seconds = 0

    last_time = end_time
    for price in result["SpotPriceHistory"]:
        price["SpotPrice"] = float(price["SpotPrice"])

        available_seconds = (last_time - price["Timestamp"]).total_seconds()
        remaining_seconds = total_seconds - computed_seconds
        used_seconds = min(available_seconds, remaining_seconds)

        total_cost += (price["SpotPrice"] / (60 * 60)) * used_seconds
        computed_seconds += used_seconds

        last_time = price["Timestamp"]

    # Difference b/w first and last returned times
    avg_hourly_cost = total_cost / total_hours
    return avg_hourly_cost, total_cost, total_hours
于 2015-10-08T02:52:40.330 回答
3

您可以订阅 Spot 实例数据馈送,以获取转储到 S3 存储桶的正在运行的实例的费用。安装 ec2 工具集,然后运行:

ec2-create-spot-datafeed-subscription -b bucket-to-dump-in

注意:您的整个帐户只能订阅一个数据馈送订阅。

大约一个小时后,您应该会开始看到 gzipped 选项卡分隔文件出现在存储桶中,如下所示:

#Version: 1.0
#Fields: Timestamp UsageType Operation InstanceID MyBidID MyMaxPrice MarketPrice Charge Version
2013-05-20 14:21:07 UTC SpotUsage:m1.xlarge RunInstances:S0012  i-1870f27d  sir-b398b235    0.219 USD   0.052 USD   0.052 USD   1
于 2013-05-20T17:14:37.173 回答
1

我最近开发了一个小型 python 库,可以计算单个 EMR 集群或集群列表(给定几天)的成本。

它还考虑了 Spot 实例和任务节点(在集群仍在运行时可能会上下波动)。

为了计算成本,我使用出价,(在许多情况下)可能不是您最终为实例支付的确切价格。但是,根据您的出价政策,此价格可能足够准确。

你可以在这里找到代码:https ://github.com/memosstilvi/emr-cost-calculator

于 2015-05-27T09:59:56.697 回答