For reference: Lambda functions used to be billed at 100ms intervals. My Node.js function usually only takes 37-40ms to run, so this is a pretty good advancement for cost savings.
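To put numbers on it: a 40ms run billed at the old 100ms minimum paid for 2.5x the duration actually used, so per-millisecond billing should cut the duration portion of that bill by roughly 60% (40ms billed instead of 100ms).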
> ...we really want you to be able to pay for what you use.
Cloudflare Workers has the right pricing model. They only charge for CPU time and not wall time. They also do not charge for bandwidth.
> Lots of sub 100ms workloads...
AWS Lambda (or Lambda at Edge), as it stands, is 10x more expensive for sub 50ms workloads (Workers does allow up to 100ms for the 99.9th percentile) that can fit 128MB RAM.
That's because keeping track of request state is not free. Ask an edge router. If you have a request open, even though it's not doing CPU, that request has to be in a queue somewhere, tracked for a response that can be transmitted back.
I don't know the infra costs of operating lambda, but my guess is that it's far from CPU-dominated.
I would not be surprised if the Cloudflare pricing model is making a tradeoff to make CPU-bound workloads pay for more of the infra than the rest. It's a valid trade-off to make as a business offering, and it might be feasible given the mixture of workloads. Whether it's the right way is debatable. Whether this model can be tanked by an army of actors taking advantage of CPU-insensitive pricing remains to be seen, or is an acceptable risk that you can take (which you can observe and protect against).
Yet, if you're a Cloudflare user, all of your edges are there, so it doesn't matter. We use Workers extensively for "edge"-related things. Lambda, never - but for working with S3 buckets, sure. They feel similar, but differently specialized.
They're not easily comparable (I tried using Cloudflare Workers before going back to AWS). Lambda@Edge runs Node or Python. Cloudflare Workers runs V8 with "worker isolates" which has a few more caveats, an imperfect but improving dev experience, and doesn't work with a lot of npm packages.
What would be really useful for my use case (running browser tests on a schedule) is if Cloudflare workers actually supported running full headless chromium automation in addition to just V8 isolates. Right now I'm using puppeteer/playwright + Lambda, but would love to have more options.
Workers aren't the same as Lambdas; they are super slim JS environments. At a 50ms max runtime most browsers won't even start, let alone fetch and process a page.
No, to be clear, I'm saying you are comparing things that are way more different than our friends at Cloudflare would like you to think. They aren't brought up in any of the convos I have with customers.
It's a quick Google. 128MB max memory, six concurrent outgoing connections max, 1MB code size limit. The use case here is a subset of what AWS Lambda can handle. The supported languages also differ (only things that have a JS/wasm conversion work on Cloudflare Workers).
I haven't looked deeply, so please correct me if I'm wrong, but I understand there's also restrictions on the built-in APIs available [1] and npm packages supported for NodeJS.
I would assume some of the above contributes to the price difference.
It isn't about the products, it is about the pricing model in a similar market.
Second, for sub 50ms workloads [0], Workers is absolutely a superior solution to API Gateway + Lambda or CloudFront + Lambda@Edge if the workloads can fit in 128MB RAM and package/compile to 1MB JavaScript or WASM executables, in terms of cost, speed, latency, ease of development, etc.
[0] For Workers, 50ms is all CPU time. That is definitely not the case with Lambda, which may even charge you for the time it takes to set up the runtime to run the code, time spent doing network IO, bandwidth, RAM, vCPUs, and whatnot.
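A rough back-of-the-envelope sketch of that 10x claim (the rates below are my assumptions from published US-East list prices around the time of this announcement; real bills also depend on free tiers, data transfer, etc.):

    # Cost of 1M invocations of a 50ms, 128MB function (assumed rates).
    LAMBDA_REQUESTS = 0.20           # $ per 1M Lambda requests
    LAMBDA_GB_SECOND = 0.0000166667  # $ per GB-second of duration
    APIGW_REST = 3.50                # $ per 1M REST API Gateway requests
    WORKERS = 0.50                   # $ per 1M Workers requests (Bundled plan)

    gb_seconds = 0.125 * 0.050 * 1_000_000  # 128MB held for 50ms, 1M times
    lambda_total = LAMBDA_REQUESTS + APIGW_REST + gb_seconds * LAMBDA_GB_SECOND
    print(f"Lambda + API Gateway: ~${lambda_total:.2f} per 1M")  # ~$3.80
    print(f"Workers:              ~${WORKERS:.2f} per 1M")       # $0.50

On those assumed rates the gap is roughly 7-8x before bandwidth; the cheaper HTTP API Gateway narrows it, and bandwidth charges widen it again.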
Based. "That's just an edge case. Our customers love this service!"
It's like going to a restaurant that uses bottled water instead of tap water but won't give you an answer as to what the benefits of bottled water are.
But you're telling us that Lambda's prices are justifiably higher because of the strong vendor lock-in? AWS is starting to sound more like Oracle. Ironic. :)
Besides the fact that Cloudflare's part of the Bandwidth Alliance with GCP and other infrastructure providers from which AWS is conspicuously absent, Cloudflare's also slowly but surely building a portfolio of cloud services.
Lambda's pricing is indeed higher than Cloudflare Workers for sub 50ms workloads (that fit 128MB RAM).
Cloudflare's alliance with other infrastructure providers means Cloudflare's platform isn't really limited to "API" workloads. And that's not counting the fact that Cloudflare recently announced Workers Unbound for workloads that need to run longer (up to 30 mins), though then they do charge for bandwidth.
The question isn't the price change here (which is in some sense mainly about balancing short and long functions, removing the penalty for short ones); it's where the pricing is at overall vs Cloudflare.
This comment would be much more useful if you gave some clear examples of the difference (presumably something you get on Lambda that makes it worth more per ms than Cloudflare).
>> AWS Lambda (or Lambda at Edge), as it stands, is 10x more expensive for sub 50ms workloads
Not sure about this, most use cases of Lambda use other resources and do not exist in a vacuum. Comparison should be made using complete systems not only parts.
Not if you're actually taking up that much cache storage, but bandwidth has plenty of examples of high usage on low tiers. They usually allow it as long as you're not adversely affecting the rest of the network, since the lines are already paid for (which is the right approach IMO).
Chris, while I've seen the change in my accounts on regular Lambda, I don't yet see it on Lambda@Edge. I think Lambda@Edge is the place where we'd benefit from this change the most, because many L@E scenarios take single-digit milliseconds, and the cost of L@E is 3x regular Lambda.
Any word on whether we'll also see this change on L@E billing?
Yes, to be clear this change was just for Lambda. L@E is honestly a completely different service run by a different part of AWS that just happens to share parts of our core worker platform. I am not 100% aware of when they might adjust their own pricing on this, but also couldn't share any roadmap here (sorry).
How does that even work? Lambda seems like a challenge even with the entirety of the datacenter resources to work with. Running it in constrained edge environments with a VM per function seems like black magic.
The naming is a bit of a misnomer: today L@E doesn't run at the edge (in our PoPs). When you deploy, it copies to every region, and then CloudFront routes you to the lowest-latency region for your request.
Okay, nice. And what if I'd like 32 vCPUs? I have an application today with a huge degree of parallelism, currently on an external cloud provider that offers dedicated machines at very affordable pricing. Would really like to use Lambdas instead, though.
I would love to see this as well: having 96-vCPU Lambda instances (or instances that match the biggest C-family instance you have) would solve a lot of problems for me. The execution model of Lambda (start a runtime, handle requests, AWS handles pool management) feels much easier to use than managing a pool.
Someone from AWS once commented to me that "if you're ever having to manage a pool rather than letting us manage it, that's a gap in our services".
Just out of curiosity, what are you getting out of your 160 million CPU cycles? Are you mostly on the CPU, or mostly waiting for something (database call or whatever)?
I want to do something that a low level hacker could do in 100 clock cycles with hardcoded bit twiddling and some avx-512, but I want to use nodejs, so I'm gonna need at least 100 million clock cycles to parse all the npm modules...
Not sure why you need a whole bunch of npm modules to do bit twiddling in a performance-sensitive lambda function. Sounds like you just don't like Javascript.
Edit: You're spending like 80ms on cold-start of your lambda function, plus network overhead. If you can spare that, you can likely spare the half a millisecond for the 999,900 cycles you're complaining about.
So this confirms there is a lot of competition in the serverless space: AWS Lambda, Azure Functions, Google Cloud Functions, and serverless containers like Knative and Google Cloud Run...
Just out of curiosity, could you share what kind of things you use it for?
I've never used Lambda, but any time I have a function that I need to run in response to some event or periodically (that's what Lambda is, right?), it's set up in a background worker specifically because it's long and slow, as anything fast can be done synchronously without the overhead.
For longer tasks, spinning up an EC2 or Beanstalk instance is probably the way to go.
As for what to use it for, we used it in our application (deployed to Netlify which uses Lambda under the hood) where lambdas operated like a 'proxy' to various 3rd party API suppliers (Commercetools, Adyen, some age verification service), and those too would use a lambda function to ping back at us (e.g. when payment was confirmed). Worked pretty well, although in retrospect I would've preferred a 'normal', monolithic server to do the same thing.
Enough that in 2019 it was the most popular topic at re:Invent (our big user conference) and that today per our re:Invent announcement almost half of all new compute workloads in Amazon are based on it. Pretty heavily used across different industries and verticals.
I rarely use Lambda, but I use a lot of Google Firebase Functions for the majority of my server code. From my experience, Lambda/Firebase Functions/Azure Functions are very popular. One simple use case is the payment-successful return hook from payment servers like Stripe. It's a tiny task which just logs the payment-success info, triggers an email, etc.
Not necessarily. For low frequency workloads with reasonably long step times, Lambda can still make sense. (E.g. When videos appear in this S3 bucket, process them.)
You might only drop videos in once a week, but when you do you want to run some code against them. There are plenty of distributed workflow reasons to run long running Lambdas infrequently rather than spinning up and down an EC2 instance.
Lambdas are underpowered and often poor choices for compute-heavy workloads. Unless there's an urgency to processing infrequent videos, it might make more sense to backlog messages to the queue and use spot instances for draining the queue and processing videos, especially from a cost perspective. Though I acknowledge that this is a more complex setup.
As was mentioned by qvrjuec in a sibling comment, hardware is limited. I seem to remember CPU speeds listed alongside available memory for AWS Lambdas, but the pricing page seems to just list memory now[0]. At the highest end, you're still limited to ~10.2GB of memory, which is considerably lower than what's available via EC2. And while I have no personal experience with the EC2 finer-grained pricing that was announced[1], it sounds like that approach may be a better fit for the scenario described above. We can nitpick on these architectural details, but my response was largely that there are other architectural alternatives that could be more ideal, especially in response to a comment that seems to dismiss the value of pricing at finer time intervals.
> We can nitpick on these architectural details, but my response was largely that there are other architectural alternatives that could be more ideal, especially in response to a comment that seems to dismiss the value of pricing at finer time intervals.
Not trying to nitpick anything; just curious what was meant by "underpowered". Seems like there's still a breadth of compute-intensive use cases that are more appropriate for lambda--e.g., cost is more sensitive than latency and I have too low a volume of requests for a dedicated EC2 instance to make economic sense. This has been where I've spent most of my career, but no doubt there are many use cases where this doesn't hold.
Limitations on hardware one can run a lambda function on and constraints on execution time mean they are "underpowered" compared to other options, like ECS Fargate tasks.
Does Fargate allow you to run on beefier hardware? I know you can bring your own hardware with vanilla ECS. I’m aware of the execution time constraints (15 minutes), but I thought we were talking about 60s?
Right but the first thought I had was, couldn’t you fan out and run a lambda for each frame or a group of related frames? (E.g. Batch HLS processing would be really easy!) If so, you’re back to short lambdas again. It’s really the sweet spot for using Lambda after all: lots of big jobs can be broken down into lots of little jobs, etc.
Plausibly. But that might be more effort than just writing the code to ingest a video file (or some other big data blob) in the simplest, most straightforward way possible.
Lambda has a 15-minute limit. I'm not sure exactly how it compares to EC2, but for a low-duty-cycle application it still makes sense! It is also pretty easy to connect a Lambda to SNS or SQS.
Instead of processing 10 messages off of SQS per Lambda, we process 10, then keep polling for more using the same Lambda, and don't stop until the Lambda is just about to die (see the sketch below).
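A minimal sketch of that pattern in Python (the queue URL is hypothetical and process() is a placeholder for the real work); context.get_remaining_time_in_millis() is how the function knows it's about to die:

    import boto3

    sqs = boto3.client("sqs")
    QUEUE_URL = "https://sqs.us-east-1.amazonaws.com/123456789012/my-queue"  # hypothetical

    def handler(event, context):
        # Keep polling until we're near the function timeout, leaving a
        # safety margin to finish the in-flight batch.
        while context.get_remaining_time_in_millis() > 30_000:
            resp = sqs.receive_message(
                QueueUrl=QUEUE_URL,
                MaxNumberOfMessages=10,
                WaitTimeSeconds=5,
            )
            for msg in resp.get("Messages", []):
                process(msg["Body"])  # placeholder for the business logic
                sqs.delete_message(QueueUrl=QUEUE_URL, ReceiptHandle=msg["ReceiptHandle"])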
Forgive me if this is naive, but why not trigger a lambda for each message separately? I think they’ll automatically reuse lambdas instead of spinning down
If Amazon reduces costs behind the scenes, they can maintain the same revenue while lowering prices for everyone, by helping people deploy previously cost-prohibitive infrastructure.
(i.e., if people now use 1.5x as many Lambdas, Amazon can lower prices by 1/3 and keep the same revenue, and everyone wins).
The Jevons paradox applies to compute. In economics, the Jevons paradox occurs when technological progress increases the efficiency with which a resource is used, but the rate of consumption of that resource rises due to increasing demand.
The more efficient (or cheaper) compute gets, the more uses we find for it, raising its consumption.
People think about pricing as a zero sum game. In reality, there are very few things in the world that are zero sum games.
This is also a competitive market. AWS has to fight against GCP and Azure for new customers, and in a competitive market you don't get a guarantee to make up the money you lose somewhere else.
Necessary change. Now writing stuff in fast languages suddenly matters for cost, changing the landscape of when these solutions become viable for a given use case.
This should be at the top of this thread! This is huge and fixes my #1 gripe with Lambda -- that managing dependencies is "non-standard" and you can't use tools like Docker. Plus, the 250MB limit is brutal.
Oh that's quite big news if you run an app that has to be deployed cross cloud.
We use very little serverless at the moment, because the three clouds we need to deploy to have infuriating differences between their execution and deployment environments, e.g. how they manage dependencies, the runtimes, and how you describe and deploy each function.
Compare that to K8s, where the containers you build run just fine wherever you put them.
Wow, I haven't read it yet, but I'm very curious if this affects the size limits on Lambdas. I've tried to build Lambdas for a small Python use case, but by the time I imported pandas and a few other libraries I exceeded the 250MB limit and migrated to ECS/Fargate, where the startup times were much longer.
That's me. We've got some fun things we do behind the scenes to keep Lambda container image support snappy. So yes, up to 10GB artifacts with container image support.
I don't know how this is related to my comment. In my case we used a slow language (although Python's poor performance and its large bundle sizes are only indirectly related at best) and we had to spend a lot more by moving to ECS/Fargate. If we were using Go, our bundle sizes would've been 30x smaller (I checked) and would've fit in a Lambda easily. Not only would it have fit easily, it would have made a lot of progress before the Python version even finished importing its dependencies. And on top of all of that, it would have outperformed the Python version by a good order of magnitude. If anything, my anecdote supports the idea that Amazon wants you to use fast languages, especially now that they offer per-ms pricing for Lambdas.
I've been here for a while resisting the temptation to write a sarcastic comment. Speed has been the opposite of what matters for decades. Every change is always trading speed for something else. And suddenly some offer by Amazon is going to change that? Seems unlikely.
Speed has never not mattered; whoever told you that hand waved over a ton of nuance and did you a disservice. The reality is that for a lot of work loads an increase in speed is not worth the tradeoff (key word) of increased maintenance burden.
All else being equal, faster services are cheaper to run. Faster services can service more requests per compute/memory resource, which means you don't have to buy as many servers/containers/whatever. This is particularly important if you're being billed by the ms, which is the context we're talking about here.
Speed has always mattered, though I agree we are light-years away from optimization levels once considered standard. OTOH, so are we WRT complexity of applications.
Amazon is not going to change software development per se, but at least at some of their customers' sites, calculations will be done on how many hours can be allocated for an n% reduction in runtime. So, if you live in an Amazon universe, this is a real "game changer". Bystanders may chuckle ;)
It sounds like you are assuming that faster code means you need to sacrifice something which has negative consequences. If you know upfront you need faster code you may choose a statically compiled language and I don't see it as a sacrifice.
From the research I did, here's how languages stack up in Lambda runtime (lowest first):
1. Python & JS
2. Go
3. C# & Java
I couldn't find any data on Rust.
The understanding at the time was that the Python & JS runtimes are built-in, so the interpreter is "already running." Go is the fastest of the compiled languages, but just can't beat the built-in runtimes. C# and Java were poorest, as they're spinning up a larger runtime that's more optimized for long-running throughput.
This is very true. Just importing Django + Django REST Framework + some other minor libraries in Google App Engine (standard) leads to painfully slow response times when a new instance spins up. Like, more than 10s to spin up an instance. Although App Engine seems to be 3-4 times slower than my desktop computer from 2014 on this particular task. I wonder if AWS Lambda is better.
> In the real world you don't import 5 seconds worth of dependencies into a lambda
Laughs in data science.
> a 5 second boot time for a longer-lived service is acceptable.
Not every application can tolerate the occasional 5-second-long request. Just because Python can cold boot "hello world" 3 seconds faster than Go doesn't mean that's going to hold in the real world.
You're mixing arguments here. It's not the occasional 5-second long request, it's "the app doesn't start serving requests for 5 seconds".
Using data science tooling in a lambda seems iffy, especially ones that are not production ready. And good luck getting such libraries in Go.
Python cold booting an interpreter 3 seconds faster than Go is a big deal, especially if your target execution time is <50ms and you've got a large volume of invocations, and are not being silly and importing ridiculously heavy dependencies into a lambda for no reason other than to make a strange point about Python being unsuitable for something nobody should be doing.
> You're mixing arguments here. It's not the occasional 5-second long request, it's "the app doesn't start serving requests for 5 seconds".
Lambdas cold-start during requests. So the unlucky request that triggers a cold start eats that cold start.
> Using data science tooling in a lambda seems iffy, especially ones that are not production ready.
Nonsense, there are a lot of lambdas that just load, transform, and shovel data between services using pandas or whathaveyou. Anyway, don't get hung up on data science; it was just an example, but there are packages across the ecosystem that behave poorly at startup (usually it's not any individual package taking 1-2s but rather a whole bunch of them scattered across your dependency tree that take 100+ms).
> And good luck getting such libraries in Go.
Go doesn't have all of the specialty libraries that Python has, but it has enough for the use case I described above.
> Python cold booting an interpreter 3 seconds faster than Go is a big deal, especially if your target execution time is <50ms and you've got a large volume of invocations
According to https://mikhail.io/serverless/coldstarts/aws/languages/, Go takes ~450ms on average to cold start which is still up a bit from Python's ~250ms. To your point, if you're just slinging boto calls (and a lot of lambdas do just this!) and you care a lot about latency, then Python is the right tool for the job.
> not being silly and importing ridiculously heavy dependencies into a lambda for no reason other than to make a strange point about Python being unsuitable for something nobody should be doing.
Not every lambda is just slinging API requests--some of them actually have to do things with data. Maybe someone is transforming a bit of audio as part of a pipeline or doing some analysis on a CSV or something else. Latency probably matters to them, but they still have to import things to get their work done. And according to https://mikhail.io/serverless/coldstarts/aws/#does-package-s... (at least for JavaScript) just 35MB of dependencies (which will buy you half of a numpy iirc) causes cold start performance to go from ~250ms to 4000ms.
My rule of thumb (based on some profiling) is that for every 30MB of Python dependency, the equivalent Go binary grows by 1MB; moreover, it all gets loaded at once (as opposed to resolving each unique import to a location on disk, then parsing, compiling, and finally loading it). Lastly, Go programs are more likely to be "lazy"--that is, they only run the things they need in the main() part of the program, whereas Python packages are much more likely to do file or network I/O to initialize clients that may or may not be used by the program.
The way I'm using Lambda, I compile the Lambda build image beforehand with the Python packages already installed, and the only "time" constraint is that of the Lambda spinning up itself.
If you ran e.g. "pip install -r requirements.txt" inside the lambda, then yes it would take time to install the packages.
Installing packages onto the system (“pip install”) is different than the interpreter importing them (loading them when the interpreter hits an “import” statement). Not only is it resolving imports into file paths and loading them into memory, but it’s also executing module-level code which tends to be quite common in Python, so it’s not at all uncommon for imports to take 5s or more.
Meanwhile in Go, dependencies are baked into the executable, so there is no resolving of dependencies, and the analog to "module level code" (i.e., package init() functions) is discouraged and thus much less common, and where it occurs it doesn't do as much work as the average Python package.
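If you want to see this for yourself, a quick sketch (pandas here is just a stand-in for any heavy dependency; `python -X importtime` gives a per-module breakdown too):

    import time

    start = time.perf_counter()
    import pandas  # module-level code in pandas and all its dependencies runs here
    print(f"import pandas took {time.perf_counter() - start:.2f}s")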
Interesting, I see what you mean, but in my time working with python I've never seen that as an issue. Perhaps in different domains such as big data it might be a problem.
The numbers you link don’t support your ranking, unless you’re specifically ranking by cold start alone.
Even then it doesn't make sense to group Python and Node but not Go, as Node and Go are significantly closer to each other than Node is to Python.
A lot of this was based around the fact that we've seen languages become just so much more performant. This includes Go/Rust/etc, but a lot of Node.js workloads are also sub 100ms, or fast enough that they'd benefit from this pretty well.
I've had bad experiences with Go startup (i.e., cold runs). They're much more expensive than I would have expected. If Node can indeed run in 40ms (as https://news.ycombinator.com/item?id=25267211 says), then I'm surely going back to JS.
What is your experience? Go is an AOT-compiled language, so the only thing I could imagine you running into on startup is loading the binary into memory. There's not a cold-start issue with Go itself, as it's not an optimizing JIT.
My experience is that Go cold start takes around 900ms. Processing (parsing a JSON message, verifying it with one DynamoDB look-up, and storing it in DynamoDB) then takes between 11ms and 105ms. Go does use less memory than node, though, and that also counts in Lambda.
I hadn't expected it either, but it loads node faster. Perhaps via some VM trick?
> The function did nothing except emit ‘hello world’.
A more realistic benchmark would be parsing a 1kb protobuf blob and printing some random key from it.
(this would require importing a non-stdlib parser)
Without knowing how it's implemented, my guess is that they're conserving python/v8 processes, so that they're not cold-starting the interpreter on each lambda execution.
You can't [1] do the same thing for a Go binary, so they have to invoke a binary, which might involve running some scans against it first.
This leads to some pretty counterintuitive conclusions! If you want minimal latency (in Lambda!!), you really should be using JS/Python, I guess.
[1]: OK. Maybe you could. Go has a runtime after all, although it's compiled into the binary! I have never heard of anybody doing this, but I'd love to read something about it. :)
Same. I went through the trouble of implementing my function in Rust (Rocket), and it's actually quite slow because (a) startup is slow and (b) async/await is still pretty painful to use, so I'm blocking on IO.
JS is a great choice for Lambda thanks to great cold performance. I’m seeing runtimes in the 40ms to 100ms range.
Most of the time in Lambda is usually spent waiting for IO, which is slow in any language. If you’re using Lambda for heavy computation, that’s not a great choice.
That's the point- billed per ms, a Lambda that executes in 5ms is 10x cheaper than one that takes 50ms. Billed per 100ms interval, the total cost of the two is the same.
IME Lambda functions are mostly sitting around waiting on I/O, so I don't think it would make much of a difference for those workloads. The important technical factors for those workloads are startup time and I/O capabilities...JS is strong in both of those areas. For simple Lambda functions JS still seems like a great choice, along with Go. Rust would be overkill IMO unless you need to share a codebase or aren't I/O bound or have some other unique requirements.
I've learned that AWS pricing tends to improve over time, and I appreciate it. I just recently switched from a startup offering authorization to AWS Cognito because the startup kept raising their price(s).
It's nice to see this drop, though I'm sure Amazon does it due to competition as well.
FWIW I was informed by an AWS employee that their internal philosophy is to keep pricing at cost+ levels, which is a strategic play - it forces the operations to remain lean and discourages many competitors from trying to wedge themselves into the cost-price gap.
Fat profit margins attract competition. This is what happened when the Oracle/Unix combo was chewed up by Microsoft Windows/SQL Server from the bottom, and then Linux/MySQL started chewing up Microsoft from the bottom. It's a dog-eat-dog world.
You get customers by being nice to them. Being nice to customers means competitive pricing, high quality support, good documentation, easy integration, etc. It's all driving towards the same goal.
Naming good documentation and high quality support in conjunction with AWS is a bit weird to me. Though parent was talking about using their scale to improve prices. They might just reduce their margins at the moment.
It's not necessarily the same in outcome. Undercutting competitors can be a temporary thing: as soon as the competitors are eliminated, you jack the prices up. Doing it to be nice to customers can potentially last even after competitors go belly up. Then again, Google's motto used to be "don't be evil" (basically, be nice to customers). That obviously went the way of the dodo.
Eventually. But Amazon has the headroom to drop prices for as long as they need to kill the new competition. Only someone like Google or MS will be able to keep up, as long as they can automate a lot and use money from ads or software licenses to prop up their cloud business.
I used to own some internal services where we had a model very similar to AWS for cost recovery.
It’s an interesting model because apps either optimize for or happen to fall into “loopholes” where some customers end up getting more value than others or may turn into a financial liability at scale.
For example, think about authentication... charging per auth will mean that some use cases will be nearly free, as some external users may only sign in once per quarter. But charging a flat rate has the opposite effect. You have to design the service and tweak the metrics and rates to make it work.
The introduction of DynamoDB on-demand pricing was a huge price reduction for some workloads with the additional benefit of also reducing the complexity of scaling capacity as well.
They would probably be willing to do this if you let them evict your entire workload from memory during the period you were not paying for it, and then charge you for CPU time plus some additional charge to reload the workload into memory from hibernation.
Most workloads ALSO hold memory (which is a key constraint) over the entire wall clock time, and the delays and impacts/costs of hibernating out the memory and then bringing it back so you can just be charged for CPU time may not make sense.
They could also charge you some rate for CPU-seconds + GB-seconds of memory used? Sort of the ultimately flexible cloud platform. You could apply the same sort of thinking to other resources, but it works best for CPU/memory I think.
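As a rough formula (my framing, not any announced pricing model):

    cost = cpu_rate * vcpu_seconds + mem_rate * gb_seconds

which is roughly how Fargate already prices tasks, with separate per-vCPU and per-GB duration rates.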
That would be ideal, as it more closely fits consumption to billing. But potentially harder for end users to reason about. AWS's bills are already notorious.
If the profits exist there, then others might eventually find it advantageous to have this metric in the future.
Maybe even accounting for strong and weak nuclear forces (not sarcasm... we are engineering at the quantum level now; soon it will be a part of a business metric. Instead of 'equipment' being servers, it might be the domain-space-time used).
You pay for more than CPU cycles. You are paying for those cycles to occur at a particular wall time and for some segment of memory to be reserved during that time as well.
Exciting! As a primarily C & C++ programmer this makes me happy. Also, I see that there are now examples for C++ that don't involve "step 1, download Node.js". Progress!
Interesting: the German version (and other non-English versions, if I parse that correctly) of the page still mentions rounding up to 100ms, while the English version says 1ms.
Cache? Not yet translated? Different pricing model?
Oh, that absolutely changes the price calculation for Lambda. Historically the 100ms minimum billing interval made Lambda significantly more expensive than EC2 for large numbers of workloads.
That was my first thought. Over a lot of the curve it's almost free to run on a bigger slice. When you get down to just a few billing quanta the math didn't work out in your favor.
I find it annoying to have all these pricing per second, and now per millisecond. It's really hard for my mind to visualize what `$0.0000000021 per millisec` actually is.
Being billed by the millisecond does not mean that you should give a pricing per millisecond.
I prefer Digital Ocean or Heroku's approach of billing by the second but giving the price per month. How on earth is `$0.0000000021 per millisec` better than `$5/month, billed by the millisecond`? If I know that my workload will be about 20% of a dedicated CPU, I know that I'll end up paying about $1 per month.
There is simply an enormous number of assumptions that would go into estimating anything else, because ms is the only correct metric. Lambda billing for a month? What on earth does that say? 10 invocations running 15 minutes? 90,000 invocations running 100ms? (Those two are equivalent, btw.)
If I know my function takes around ~35ms ballpark, and I will probably invoke it 5,000 times per day, then I can calculate my monthly: 0.0000000021 $/ms * 35ms * 5,000 * 30 = 0.011 $/month.
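Or as a throwaway helper, using the rate and workload figures above:

    PRICE_PER_MS = 0.0000000021  # $/ms, from the comment above

    def monthly_cost(avg_ms, invocations_per_day, days=30):
        return PRICE_PER_MS * avg_ms * invocations_per_day * days

    print(monthly_cost(35, 5_000))  # ~0.011 dollars/month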
AWS usually shows a neat example of usecase and what the billing would be on their pricing pages.
If I had to imagine designing a system like Lambda, people running really long operations would really throw a wrench in things.
Maybe you could let users indicate the operation will take a long time... but if the user knows the operation is long running in advance, why not just guide them to a more suitable system?
One way around this limitation while still staying serverless (for Python only) is Glue Python shell jobs. They can run for hours if not days, and default to 1 vCPU and 1GB of memory for 2.75¢ an hour.
To limit the scope and type of applications people use Lambda for to the ones Lambda is best at running.
The 15-min max runtime simplifies resource management and avoids abuse. If you have long-running jobs, they should go to something like AWS EKS/Batch/SageMaker.
That being said, things can change if more and more people require long-running capacity from Lambda (though I am skeptical of that, as Lambda abstracts the underlying hardware away and is supposedly flexible to the requirements).
Now that someone's listening: I don't mind the 15min Lambda timeout, but it would be great to get rid of the Lambda + API Gateway 30s & ~6MB limits. Those always bite unaware devs in the butt and workarounds for them take quite a bit of effort.
But being real, we hear you on this one. I can't comment on API Gateway's roadmap here, but this is something both teams are aware of. It is the way it is today for a valid reason. But this is def something we hear pretty often.
My lambdas run indefinitely. It's a bit silly, but basically every lambda spins up a bunch of threads, pulls messages, and then pushes them into internal buffers to be processed. There are reasons for this.
What I care about is:
* Scale to 0, and automatic scaling up without configuring it
* Automatic patching of the OS
* Fault isolation
Lambda gives me that. So each one runs for 15 minutes, processing all data in an SQS queue.
I do wonder if Fargate would be cheaper per millisecond? Dunno.
If you want sequential processing of the data in the SQS queue, something which works really well today is to create a state machine in AWS Step Functions which triggers an AWS Lambda function which then pulls data from SQS and processes it. Using a condition in the state machine, this can be done in a loop, so when the AWS Lambda function reaches its timeout, another one gets triggered as long as there is still data in the SQS queue.
If data doesn't have to be processed sequentially an option is to configure the AWS Lambda function to get invoked for new data in the SQS queue [1], so you don't have to care about manually fetching data from SQS at all.
Yep, but I prefer to care about manually fetching data from SQS. It's a weird system, but due to our data model there are many benefits to processing as many messages in a given lambda as possible.
I'm going to assume since you're processing off a queue, the lambdas are not serving a waiting user. If you do the work to orchestrate onto Fargate, you might as well go the rest of the way and move to ECS. Then you can use spots, where real savings kick in.
Someone mentioned step functions above. All of our steps run on spots. We also have some tasks like you have that read off queues and do processing, which also all run on spots.
For raw compute, Fargate is ~1/2 the cost of Lambda. But you'd have to orchestrate the launch yourself. It could be worth it though depending on your workload
As long as possible. Our jobs usually finish well within the limit but the top 1% hit the limit.
One example we've been wrestling with is a merge operation. Usually it's merging about 1000 records which completes in a few seconds. But every once in a while someone kicks off a job that tries to merge 1,000,000 records and it times out.
We want the benefits of serverless (scale down to zero, up to infinity at the drop of a hat) but these edge cases mean we're having to evaluate other options.
An hour or two would be a good start; then it'd cover 99.9% of requests. With a few hours we could add more nines :)
If your Lambda function detects that it is going to hit the timeout, you could have it launch a Fargate container to handle the long merge (sketched below). Fargate is essentially a long-lived Lambda.
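A sketch of that handoff with boto3 (cluster, task definition, and subnet names are hypothetical): the Lambda bails out early and the merge continues as a one-off Fargate task:

    import boto3

    ecs = boto3.client("ecs")

    def hand_off_to_fargate(job_id):
        # Run the long merge as a one-off Fargate task instead of letting
        # the Lambda time out mid-merge.
        ecs.run_task(
            cluster="merge-jobs",          # hypothetical cluster
            launchType="FARGATE",
            taskDefinition="big-merge:1",  # hypothetical task definition
            overrides={"containerOverrides": [{
                "name": "merge",
                "environment": [{"name": "JOB_ID", "value": job_id}],
            }]},
            networkConfiguration={"awsvpcConfiguration": {
                "subnets": ["subnet-0123456789abcdef0"],  # hypothetical
                "assignPublicIp": "ENABLED",
            }},
        )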
Lambda isn't designed for long running processes. Keeping the runtime limit lower makes it a lot easier to operate the underlying metal because you can move thing around every N minutes where N is the runtime limit. For long-running processes, something like Fargate might be a better fit in the AWS side of things.
> Fargate works well especially after they fixed some of the pricing issues there.
Was this recent, should I take a look at this again? My issue with Fargate as recent as a year ago was that running the same workload on ECS (if you can use your cluster nodes efficiently) was twice as cheap (even without reserved instances).
Yup! In general, I prefer Fargate over Lambda because of cold starts. It's a little more management overhead than Lambda and it is a bit lower level. But I think it's worth it.
It's a new pattern, so it's only relevant for some use cases. I don't think it's meant to solve every case, including the case of a long-running job.
It's best suited for jobs that can be broken down into tons of small individual computations, or for responding directly to HTTP requests. If you can fit your pipeline/application into that model, it's usually beneficial: instant scaling, retries, reliability, etc. Mixed with other concepts like SQS, you can build pretty powerful things without having to pay when there's no load.
Is there a way to use Lambda where I can use ffmpeg to watermark and downscale a 4K video? Possibly some system where I can throw a lot of computing at the job and get it done quickly. Right now on my VPS it takes multiple minutes to get done, and scaling up the VPS for one feature or dedicating another server to it is overkill.
I don't see why not, assuming it finishes within the Lambda max run time (15 minutes, I believe). Using a Python script and an included binary of ffmpeg might work; see the sketch below. Note: you'll need to write to the Lambda's /tmp space, as that's the only writable place on the filesystem, and then you'll need to upload the result to S3 or elsewhere.
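A minimal sketch, assuming hypothetical bucket/key names and an ffmpeg binary shipped in a Lambda layer at /opt/bin/ffmpeg:

    import subprocess
    import boto3

    s3 = boto3.client("s3")

    def handler(event, context):
        # /tmp is the only writable path in the Lambda filesystem.
        s3.download_file("my-videos", "in/raw-4k.mp4", "/tmp/in.mp4")
        subprocess.run(
            ["/opt/bin/ffmpeg", "-y",
             "-i", "/tmp/in.mp4", "-i", "/opt/watermark.png",
             # Downscale to 1920px wide, then overlay the watermark.
             "-filter_complex", "[0:v]scale=1920:-2[s];[s][1:v]overlay=10:10",
             "-c:a", "copy", "/tmp/out.mp4"],
            check=True,
        )
        s3.upload_file("/tmp/out.mp4", "my-videos", "out/watermarked-1080p.mp4")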
Lambdas are good at batch jobs where you might need to kick off a few of them but don't want a dedicated system for it. I've used them to automate manual customer support tasks that come in sporadically.
I wonder if this change will push more people to consider rewriting code to more memory- and CPU-efficient programming platforms, for example from Java/C# to C or Rust?
It would be a surprise if the savings were smaller, but with the finer granularity there would still be savings on the tail. Perhaps keep your old budgets for a while and be glad at the surprise discount.
They might change their 128MB rounding-up structure but they already had per 1ms pricing with the minimum execution time and memory for a single function execution being 100ms and 128MB respectively.
(Google does per-100ms billing and has 6 pre-defined memory sizes to pick from.)
If you're running at 100% of the time, don't use a Lambda. If you're running for 10 seconds a day, a Lambda is cheaper than owning your own hardware.
Similarly, if a car rental costs $20/day, it would be silly to say that that's expensive because it will cost you $73,000 after 10 years. The needs of a car rental and car ownership are different.
If you're running it continuously for a year, then you should probably be using a reserved instance and not lambda. Lambda is for short bursts, not 1 year of continuous use.
Not necessarily, especially in a large company setting. Lambda allows you to scale and take care of spikey traffic. It also doesn't require server management. Servers are a pain in the butt.
True, but ECS and K8s both abstract away the hardware, offer cheaper 24/7 workloads, and get past the cold-start problem (at the expense of slower spike responses).
And it costs even more if you need that Lambda function to have access to the internet.
A possibly useful comparison:
A Raspberry Pi 3 (~6.5 watts) costs $6.83 per year to run full-time (at 12 cents per kWh). You get a full computer with much more I/O.
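(The arithmetic: 6.5 W × 8,760 h/yr ≈ 57 kWh/yr, and 57 kWh × $0.12/kWh ≈ $6.83.)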
However, there are a lot of other factors to consider:
- Initial cost of the hardware
- Time and energy spent maintaining/configuring the device
- Physical maintenance of the device - power/network/physical management/etc
- Lack of immediate access to I/O ramp-up and global replication
- Lack of direct integration into other AWS services/etc
AWS Lambda isn't a magic bullet, but it offers a lot of convenience to offset time and money spent on a DIY approach. I run a small static site/service that never breaks Lambda's free tier, but most of the cost goes to hosting a NAT gateway for it to have access to the internet. The benefit of hassle-free, global access to the service I built and the underlying services it runs on (Lambda/AWS in general) makes it worth the cost. I could set up the same service at home on a Raspberry Pi for pennies by comparison, but if my home internet goes down while I'm away, or a dog chews on an ethernet cable, it's a headache that I have to deal with personally - or I have to do remote tech support to whoever is home: "Okay, you should see a Raspberry Pi. No... it's not a food. It's a computer. Whatever... I know it's a strange name. Anyway, is the cable plugged in? Do you see a blinking light?"
I find it similar to being able to use a rental car while visiting a foreign city vs just driving yours across the country to save some money. You might save some money, but it comes with extra time, maintenance, potential roadblocks (literally and figuratively), breakdowns that you will have to deal with personally, etc. It's really up to personal preference.
And write in any language, and have no OS/patches/updates/hardware to look after, have built in logging, a complex but robust authentication system, etc etc.
The service bills in ms, and you are complaining about the price for a year? For $66/year and a 3-year commit, maybe get a T4g small with 2GB of RAM vs 0.1GB?
There are warm and cold runs. If it’s really important you can pre-heat by calling the lambda every so often. It would be nice if they had some option for this.
There's Provisioned Concurrency for Lambda, which keeps a number of Lambdas warm at a price, so you don't have to keep pinging your function yourself to pre-heat.
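For example, via boto3 (function name and version are placeholders):

    import boto3

    lam = boto3.client("lambda")
    # Keep 5 execution environments initialized for version 1 of my-fn.
    lam.put_provisioned_concurrency_config(
        FunctionName="my-fn",
        Qualifier="1",  # a published version or alias
        ProvisionedConcurrentExecutions=5,
    )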
All fine and dandy, until it is not. Lambda = massive vendor lock-in, which means once you start depending on it, it'll be hard to rip it out of your system and replace it with something else. My startup used to heavily rely on Lambda, and frankly I wish we never did - so much AWS-specific complexity that is just not worth the trouble.
Their own representative admits it[0].
We moved everything to containerized workflows and sleep much better (and at lower maintenance cost).
That is not at all what my words say and I won't reply to that thread which was started by a former competitor to troll this convo today.
The perceived lock-in is really no different from consuming other technologies. You make a trade-off on what you want to manage vs. hand off to a managed service. For many customers the benefits are well worth it.
It's fine that Lambda wasn't for you, but you aren't being clear here about what issues you saw, just waving the lock-in boogeyman so many misunderstand.
Cost, lack of debugging capabilities, terrible developer tools, cryptic documentation that misses some key scenarios, I can go on. Now, I get it, you’re at AWS so you would never openly and bluntly come out and say that what you -really- want is to lock in your users, because that brings AWS money. But that’s the reality. See my earlier comment as to why it made sense for our company to move away from Lambda.
What even is your point here and in this thread? Acquiring customers and giving them an incentive to stay is a cornerstone of any enterprise.
Saying Lambda is bad because of lock-in is like saying their VPC offering is bad because of lock-in, or IAM is bad because of lock-in. It's not a generic component that you can flip between providers; Lambdas usually respond to a specific event from an AWS service, and nobody really gives a rat's about deploying their specific Lambda to a cloud they don't use.
So yeah, it seems like a real awesome idea to avoid vendor lock-in for a small Python function that responds to S3 change events from an SQS queue and updates a DynamoDB table with some values.
If you want "generic" lambdas go check out serverless.com.
I'm pretty sure we've reached an inflection point with some "technical architects" where they spend more time worrying about vendor lock-in and doing technical gymnastics to reach portability nirvana instead of just shipping decent code and products that make money.
YMMV, of course. The problem with lock-in is that you start designing code for a given vendor, AWS in this case. Once you find that the vendor is prohibitive either cost- or functionality-wise, you will have to spend your development resources untangling the mess rather than shipping value.
After using lambda quite a lot recently I think the vendor lock-in argument is overstated, especially with things like Serverless framework.
The lock-in comes more with the other AWS products you end up using, such as CloudFormation and DynamoDB, but not Lambda itself, which in most cases can wrap an Express server or similar and could be hosted anywhere.
There's definitely a balance. I tend to be more concerned about things which store or update data durably — testing a Lambda function is relatively easy, and if you're calling other services but your code isn't a complete mess that's a relatively manageable problem, but something like DynamoDB poses both a migration challenge (especially for a running system) but also questions about correctness if you aren't really careful about how you handle things like concurrent access. That doesn't mean they're not worth using but it definitely tells me where you want robust validation, testing, etc. since it's a lot harder to recover from missing/corrupted data than it is to resolve a 500 error on a particular endpoint.
https://www.serverless.com/ abstracts a bit, so there's less lock in, but agree typical lambda workflows aren't just plain code you can move to a different vendor. A big perk is easy triggers to glue things together like new S3 file -> run lambda
There is no better option for temporally sparse compute. If your job can run all the time there is no benefit to these systems, but if it wastes money by being provisioned all the time when not in use, there is no alternative.
Lambda is neither "worse" nor "better" in any general sense. It's just another option that might apply given a particular scenario, and one that got significantly cheaper today.