This looks interesting (although I'm not in the target market, too small)...
But if I were looking at this, judging from the quality of people they've amassed in their engineering team, is there any chance they won't be acquired in 6 months?
To anyone looking to take a bet on this, what is the answer to "what's your plan for when your stellar team gets acquired?" And what answer will satisfy that buyer?
Update: Adding another question, does this "environment" (where any really great product with great talent in it can be acquired very quickly) have a chilling effect on purchases for products like this?
Hi! So, at every step -- from conception to funding to building the team and now building the product -- we have done so to build a big, successful public company. Not only do we (the founders) share that conviction, but it is shared by our investors and employees as well. For better or for ill, we are -- as we memorably declared to one investor -- ride or die.
Also, if it's of any solace, I really don't think any of the existing players would be terribly interested in buying a company that has so thoroughly and unequivocally rejected so many of their accrued decisions! ;) I'm pretty sure their due diligence would reveal that we have taken a first principles approach here that is anathema to the iterative one they have taken for decades -- and indeed, these companies have shown time and time again that they don't want to risk their existing product lines to a fresh approach, no matter how badly customers want it.
I read through your "Compensation as a Reflection of Values" article [1] and just wanted to say that I love it. It reflects and relates so much to my own values towards work, as a life philosophy, that I felt refreshed knowing others not only think this way but also have the power to implement such a culture. Thanks for trying that, I hope it becomes something more common to workers in general.
Your approach to pay is really refreshing and attractive as an engineer, and also seems like the exact type of thing most VC or larger tech firms would really hate. That alone feels like evidence of your conviction
Ha! Well, I think our investors think we're very idiosyncratic -- but they also can't help but admire the results: a singular team, drawn in part by not having to worry about the Game of Thrones that is most corporate comp structures. ;)
Smaller teams will always win the communication-overhead comparison, even before you think too hard about organizational trees and the indirection they create. Communication is one of the biggest problems in organizations and in society, so more direct and therefore clearer communication can make the organization more efficient and keep spirits high. It also doesn't hurt to have a team made up only of extremely senior engineers or other professionals at the top of their field, and better still if those engineers are great personalities too. There is only one caveat: you need a very capable driver to put this powerful engine to good use, so to speak. If you drive a powerful engine in the wrong direction, you are actually putting more, not less, distance between your current position and the destination. The goal for Oxide Computer seems clear, and I wholeheartedly wish you the best of luck.
I hope you are able to keep the investors convinced and stick with it! I'm a Swedish-American who has mostly lived in the US but has been working back in Sweden for the last 4 years. I'm culturally mostly Californian, but the work atmosphere in Sweden is just less cutthroat and much nicer. You pay for it in salary, sure, but it's definitely worth it. Your descriptions on the website feel a bit similar.
I presume you wouldn't consider European remote given your PST timezone requirement, but I guess I'll consider your company one of those dream places to work were I to make my return to the US!
I was literally remarking to a workmate 'this looks like Sun 2.0', then I see who's on the team :). Congrats, I'll be keeping an eye out if you ever start shipping to Australia.
We're getting there! My second shot was yesterday, and Steve and Jess are both completely done -- so we expect to get back to the garage soon! In the meantime, fellow Oxide engineer Adam Leventhal and I have been doing a Twitter Space every Monday at 5p Pacific; so far, it's been incredible with some amazing people dropping by -- come hang out!
Any thoughts on re-listening to your previous podcasts, finding interesting topics that were either skipped or digressed from and inviting back the guests to do a Q&A style podcast? I feel there are deep wells of interesting topics to be discussed.
See, my immediate reaction was that Oxide is by Sun people who are still scarred by that acquisition; they'll fight tooth and nail to avoid a repeat and if it did get forced through there would be an immediate and complete exodus.
Well, that one too. Honestly, it's one of the consequences of having a team consisting mostly (but not entirely!) of industry vets: we've collectively seen a lot of shit. In fact, a topic I would never want to bring up among Oxide employees: who has had an acquisition in their career go the worst? There are just too many contenders, which is itself a sad commentary on the industry!
Fortunately, some of those same DSSD folks have joined us at Oxide -- and let's just say that they are of like mind with respect to Oxide's approach. ;)
That requires them to be publicly traded (which I don't think Oxide is), or for a majority of the private shareholders to essentially give up on the company.
Now I don't know how Oxide is set up, but I'd assume the founders still retain the large majority of shares.
I agree... what I think they meant to say is something along the lines of: the software defaults are already tuned to take full advantage of the hardware's capabilities, so work is completed faster. The 'with the software baked in' phrasing should be changed to reflect the value proposition that Oxide is alluding to.
Going by that logic, you should never take a chance on a bad company because they are bad, and a good company because they are too good and might get acquired. So should you just never rely on a small company for anything?
That's the question I was genuinely asking. Do longer-term minded buyers think this way? Our company is too small and just uses AWS, so we're not prospective buyers. But I'm trying to understand the mindset of a CapEx-style buyer whose timelines are multiple years.
This team is, by all measures, going to hit it out of the park. There's just a solid amount of talent, experience and insight all-round.
And to be clear, I am not at all disparaging teams that get acquired – that would be silly. I'm just saying that we are in an environment these days where very few of these kinds of companies get a chance to grow before being acquired and WE are the ones that lose even though the people working at the company rightfully earn a nice payout.
I have the same "fear" about Tailscale, a company whose product we love and have started using, and are about to purchase.
But the fact that a member of the founding team themselves answered my message above in plain English (not surprising) is honestly refreshing.
No one is going to bet the farm on that solution. I'd be surprised if big SaaS vendors like Atlassian or DropBox go with it.
But on the other hand I can see F500s (oil & gas companies, big engineering and defense firms, etc.) getting a rack or two to run their cloud-like stuff. They would not be taking much risk; this would be one system among many others they have, and it will have a life of 5 to 7 years anyway (a few million dollars and 7 years is peanuts for an oil & gas or mining company whose CapEx goes into the billions, over 50+ years horizons).
I think the value is in having a cloud-like system that doesn't require an entire IT/Ops team to run.
Private companies can't just get bought out. They have to agree to be acquired. There is not some roaming force of Big Corp M&A people who forcefully acquihire companies.
But second, I'd love to understand the compute vs storage tradeoff chosen here. Looking at the (pretty!) picture [1], I was shocked to see "Wow, it's mostly storage?". Is that from going all flash?
Given how much of the rack is storage, I'm not sure which Milan was chosen (and so whether that's 2048 threads or 4096 [edit: real cores, 4096 threads]), but it seems like visually 4U is compute? [edit: nope] Is that a mistake on my part? Dual-socket Milan at 128 threads per socket is 256 threads per server, so you need at least 8 servers to hit 2048 "somethings" -- or do the storage nodes also have Milans [would make sense] and their compute is included [also fine!] -- and is that similarly how you get a funky 30 TiB of memory?
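For what it's worth, here's the back-of-the-envelope I'm doing, as a quick Python sketch; the sled counts are my guesses, nothing Oxide has published:

    # Milan tops out at 64 cores / 128 threads per socket; dual-socket sleds assumed.
    cores_per_socket, threads_per_socket, sockets_per_sled = 64, 128, 2

    for sleds in (8, 16):
        cores = sleds * sockets_per_sled * cores_per_socket
        threads = sleds * sockets_per_sled * threads_per_socket
        print(f"{sleds} sleds -> {cores} cores / {threads} threads")
    # 8 sleds -> 1024 cores / 2048 threads
    # 16 sleds -> 2048 cores / 4096 threads

So "2048" only works as a core count if all 16 sleds carry compute, which is why I'm asking whether the storage nodes have Milans too.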
[Top-level edit from below: the green stuff are the nodes, including the compute. The 4U near the middle is the fiber]
P.S.: the "NETWORK SPEED 100 GB/S" in all caps / CSS loses the presumably 100 Gbps (though the value in the HTML is 100 gb/s which is also unclear).
Leaving that RAM for ZFS L2 ARC perhaps? I don't think they would use Illumos as the hypervisor OS without also using OpenZFS with it. They also need some for management, the control UI, a DB for metrics and more.
Btw, if I count correctly, they have 20 SSD slots per node (if a node is full width) and 16 nodes. They would need roughly 3.2 TB drives to reach 1 PB of "raw" capacity, before the obvious redundancy overhead of ~20%.
It is also quite possible they don't use ZFS at all and use e.g. Ceph or something like it, but I don't think that is the case, because that wouldn't be Cantrillian. :-) E.g. using MinIO, they could provide something S3-like on top of a cluster of ZFS storage nodes too, but they most likely get better latency with local ZFS than with a distributed filesystem. Financial institutions especially seem to be part of the target here, and there, latency can be king.
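To put rough numbers on the drive math (slot counts are just what I can make out from the rendering, so treat this as a sketch):

    # Raw capacity assuming 16 full-width nodes x 20 slots (equivalently
    # 32 half-width sleds x 10) -- a guess, not a published spec.
    slots = 16 * 20
    for tb_per_drive in (2.0, 3.2, 3.84):
        print(f"{tb_per_drive} TB drives -> {slots * tb_per_drive / 1000:.2f} PB raw")
    # 2.0 TB drives -> 0.64 PB raw
    # 3.2 TB drives -> 1.02 PB raw
    # 3.84 TB drives -> 1.23 PB raw

So something in the ~3.2 TB-per-drive range is what gets you to the advertised 1 PB raw.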
I'm fairly confident the nodes are half width; if you look at the latches, it very much appears you can pull out half of every 2U at once, and if you look at the rear there are 2 network cables going into each side.
Good observation, it looks like it. That probably makes upgrades/maintenance easier since the unit of failure is smaller. Of course, it also means you can only tackle workloads that demand no more than 64 cores before you have to rearchitect your monolith into a distributed system, which has lots of overhead.
Duh! I got tricked by the things near the PDU as "oh, these must be the pure-compute nodes".
So maybe that's the better question: what are the 4U worth of stuff surrounding the power? More networking stuff? Management stuff? (There was some swivel to the back of the rack / with networking, but I can't find it now)
Edit: Ahh! The rotating view is on /product and so that ~4U is the fiber. (Hat tip to Jon Olson, too)
Control-plane most likely, and having a mid-centered PDU probably adds to heat on the upper stack, which shortens life over time.
As someone who has designed quite a few datacenters, what's more interesting to me in this evolution of computing is the reduction in cabling.
Cabling in a DC is a huge suck on all aspects - plastics, power, blah blah blah - the list is long....
But there are a LOT of cabling companies that do LV out there - so the point is that as these types of systems get more "obelisk"-like, are many of those companies going to die? (I'm looking at you, Cray and SGI.)
When I worked at Intel - I had a friend who was a proc designer at MIPS - and we talked about rack insertion and a global back-plane for the rack (which we all know to be common now) - but this was ~1997 or so... but when I built the Brocade HQ - cables were still massive and it was an art to properly dress them.
Lucas was the same - so many human work hours spent on just cable mgmt...
Their diagram of system resiliency is odd in my opinion:
That looks like a ton of failures that they can negotiate...
What's weird is that the SPOF isn't going to be in your DC/HQ/whatever - it's going to be outside - this is why we have always sought 2+ carrier ISPs or built private infra...
A freaking semi truck crashed into a telephone pole in Sacramento the other day and wiped comcast off the map to half the region.
That's ONE fiber line that brought down 100K+ connections...
---
EDIT: I guess what I am actually saying is that this entire marketing strat is to convince any companies that *"failure is imminent and please buy things that are going to fail, but don't worry because you bought plenty more things to live beyond the epic failure that these devices will have"*
---
Not to discredit anything this company has going for its product - but their name is literally "RUST" (*oxide*) --- which we all know is what kills metal.
On the topic of naming, there was thought put into it...
> With accelerating conviction that we would build a company to do this, we needed a name — and once we hit on Oxide, we knew it was us: oxides form much of the earth’s crust, giving a connotation of foundation; silicon, the element that is the foundation of all of computing, is found in nature in its oxide; and (yes!) iron oxide is also known as Rust, a programming language we see playing a substantial role for us. Were there any doubt, that Oxide can also be pseudo-written in hexadecimal — as 0x1de — pretty much sealed the deal!
Power footprint also confirms that the compute density is pretty low.
We built a few racks of Supermicro AMD servers (4 compute nodes in 2U), and we load tested them to 23 kVA peak usage (about half full with that type of node only; our DC would let us go further).
We're also over 1 PB of disk (unclear how much of this is redundancy), also in NVMe (15.36 TB x 24 in 2U is a lot of storage...).
Other than that, not a bad concept; not sure what premium they will charge or what will be comparable on price.
They basically reinvented mainframes.
Seems it has a lot in common with Z series.
Scalable locked-in hardware, virtualization, reliability, engineered for hardware swaps and upgrades.
A proprietary operating system (?) from what someone said - an offshoot of Solaris? By that I mean that most of it, or all of it, might be open-sourced forks, but it will be an OS only meant to run on their systems. (It would be fun to get it working at home, on a couple of PCs or a bunch of Pis.)
They lack specialized processors to offload some workloads to - in modern terms, shelves of GPUs, or a shelf of fast FPGAs or DSPs. The possibilities are huge. I didn't find any mention of that in what I read.
They also lack the gigantic legacy compatibility effort, which is a good thing.
Their approach to reliability isn't quite on par with mainframes, AIUI. At least, not yet. And the programming model is also quite different - a mainframe can seamlessly scale from lots of tiny VM workloads (what Oxide seems to be going for) to large vertically-scaled shared-everything SSI, and anything in between.
Ignoring hardware reliability, thanks to the integration, their solution should be more reliable than whatever byzantine solutions are currently used in their target market. I've worked in a shop (a well-known name that I won't mention) that had a mix of "chat ops" and Perl scripts integrated with JIRA where you could request a Linux VM through a JIRA ticket and get it automatically provisioned, I assume from some big chassis running VMWare, and then use git+Puppet to configure it. It works, but it's a lot of software from different sources and there is always one thing or the other failing. And the security of all that stuff is probably patchy, regardless of audits.
That being said, this solution is the mother of all lock-ins...
I could see it used for the non-critical part of a company's infrastructure. I would not run production stuff on it, but it could work for development systems, test boxes, etc. Basically give developers access and let them create and destroy as many VMs as they need, whenever they need.
Yeah, I noticed that too. The green wireframe looking stuff is actually text in spans/divs next to, or overlayed on pictures. The little "nodes" are this character, for example: ⎕. The effect is pretty unique.
You have to scroll all the way through the page to activate all the gimmicks. Then it never stops, and permanently loads 2.5 cores to 100%, which makes the CPU fan spin to the max.
Oh my god, that's hilarious. I was wondering why my lap was warm all of a sudden. Htop said firefox was the culprit so I closed out all my tabs except this one. Then I read your comment, opened the page, and scrolled all the way to the bottom—my cpu temp just steadily rose till it throttled. All the animations are smooth, though.
Note: I'm typing this from a 9 year old thin-and-light, so that's probably part of the problem.
It's all fun stories from people doing amazing things with computer hardware and low-level software, like ring-sub-zero and DRAM-driver-level software.
> Our firmware is open source. We will be transparent about bug fixes. No longer will you be gaslit by vendors about bugs being fixed but not see results or proof.
There are lots of reasons to be enthusiastic about Oxide but for me, this one takes the cake. I hope they are successful, and I hope this attitude spreads far and wide.
- vendor-locked at the rack - if you have hardware from someone else, it can't live in the same cabinet
I guess if you just want a pretty data center in a box and look like what they consider a 'normal' enterprise to be, it might appeal. But I'm not sure how many people asked for Apple-style hardware in the DC.
Why is it important what kind of virtualization? It works, and since it is built for this hardware it will likely be more reliable than anything you're putting together yourself.
The specs are damn good. When it is all top-of-the-line, inflexibility is kind of a moot point. Where else are you going to go?
> But I'm not sure how many people asked for Apple-style hardware in the DC.
Well-integrated, performant, and reliable hardware that runs VMs you can put anything on is pretty much all anyone running their own hardware is looking for.
Honestly I am surprised how many here completely misunderstand what their value proposition is.
> Why is it important what kind of virtualization?
Because if I ran this, I would have to manage it. Given that I have lots of virtualization to manage already, I would want it to use the same tooling, for rather obvious reasons.
> is pretty much all everyone running their own hardware is looking for.
I don't think you talk to many people who do this, but as someone who manages 8 figures worth of hardware, I can tell you that is absolutely not true.
> The specs are damn good. When it is all top-of-the-line, inflexibility is kind of a moot point. Where else are you going to go?
To some hardware that actually fits my use case and is manageable in an existing environment? Oh wait - I already have that. I mean, seriously - do you think they're the only shop selling nice machines?
The value-add is all wrong, unless you are a greenfield deployment willing to bet it all on this particular single vendor, and your needs match their offering.
> lots of virtualization to manage already, I would want it to use the same tooling
I'm not saying you would want to, but maybe their expectation is that you'd plan to transition everything to their system. Either gradually as part of the normal cycle of replacing old hardware or all at once if you want to be aggressive.
If their way is actually better, then it might make sense. You'd go through an annoying transition period but be better off in the end.
The hardware options do seem limited, but maybe that would change if their business takes off and they get enough customers to justify it. They're definitely saying simplicity is a good thing, but maybe that's just marketing spin that sounds better than the alternative of saying they're not yet in a position to offer that flexibility.
I don't see details on the API, but it seems likely you could write a libvirt provider for it and use existing virsh tooling (Cockpit / CloudStack / ...).
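To make the "existing virsh tooling" point concrete: if someone did write a libvirt driver for the Oxide API, the consumer side would barely change -- only the connection URI would. A minimal sketch using the real libvirt Python bindings (the idea of an Oxide driver is hypothetical; "qemu:///system" is what you'd use against a plain KVM host today):

    import libvirt

    # With a hypothetical Oxide driver, only this URI would change.
    conn = libvirt.open("qemu:///system")
    for dom in conn.listAllDomains():
        print(dom.name(), "running" if dom.isActive() else "stopped")
    conn.close()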
> - vendor-locked at the rack - if you have hardware from someone else, it can't live in the same cabinet
This describes legacy IBM platforms quite well. If they can leverage hyperscaling tech to be better and cheaper than what IBM is currently offering, that's enough to make it worthwhile.
This is a selling point - if it's actually better (which, why not? most of the existing virtualization management solutions either suck or are hugely expensive).
If it's not better, big deal? I'm assuming you could just throw Linux on these things and run on the metal or use something different, right? Given how much bcantrill (and other Oxide team members) have discussed loving open hardware, I seriously doubt they would intentionally try to lock down their own product!
> vendor-locked at the rack - if you have hardware from someone else, it can't live in the same cabinet
This is aimed at players so big that they want to buy at the rack level and have no desire to ever touch or carve up anything. It's a niche market, but for them this is actually a plus.
"But I'm not sure how many people asked for Apple-style hardware in the DC."
It's probably selling to the "Amazon-style hardware in your DC market", which I think should be fairly ripe. Building your own private cloud from parts defeats a lot of the purpose...avoiding your own plumbing.
As I understand it, Oxide is going to have deep software integration into their hardware. So the expectation isn't that the servers in this rack will be running Windows or a generic Linux distribution. In case anyone from Oxide is here, is my understanding correct? And if so, will there be a way to run a smaller version of an Oxide system, say for testing or development, without purchasing an entire rack at a time?
Anyway, glad to finally get a glimpse of what Oxide has to offer. Looking forward to seeing a lot more.
My understanding is you will use an API to provision virtual machines on top of the Oxide hypervisor/software stack, which is bhyve running on Illumos. So you can still just run your favorite Linux distro or Windows or a BSD if you want[1].
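The API itself doesn't seem to be documented publicly yet, so purely as a sketch of what "provision a VM over an API" usually looks like -- every endpoint and field name below is hypothetical, not something Oxide has published:

    import requests

    # Hypothetical control-plane endpoint and payload, for illustration only.
    API = "https://rack.example.internal/api/v1"
    resp = requests.post(
        f"{API}/instances",
        headers={"Authorization": "Bearer <token>"},
        json={
            "name": "build-runner-01",
            "vcpus": 8,
            "memory_gib": 32,
            "disk_gib": 200,
            "image": "debian-11",
        },
        timeout=30,
    )
    resp.raise_for_status()
    print(resp.json())  # presumably: instance id, state, assigned addresses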
Agreed, I would love to hear more about the management plane. I'm glad it's API-driven, but I still have some questions about things like which hypervisor they are using.
If it's a custom software stack, might be nice to get a miniature dev-kit!
They will use Illumos with bhyve; @bcantrill said it in a podcast just a few months ago. I have linked it somewhere in my comments (look at my profile).
Illumos is the project that multiple distributions build on, from what I understand. In a way, it could be likened to GNU/Linux, as Illumos probably contains not only the kernel but other tools and libraries as well. There is e.g. omniosce.org, perhaps Nexenta, and Joyent/Samsung SmartOS.
In reality I would never want this type of hardware... It reminds me of the old boat anchor bladecenter rigs we used to use. They were great, up until you had to replace one of the blades after the support was up. It's not always practical to replace hardware every 3 years like we're supposed to, so this type of stuff sticks around and gets some barnacles.
What would be fantastic would be if the entire industry committed to an open spec for large chassis like this with a standardized networking and storage overlay... But that would never happen because vendor lock-in is the big money maker in 'enterprise'.
> What would be fantastic would be if the entire industry committed to an open spec for large chassis like this with a standardized networking and storage overlay
Isn't the Open Compute Project supposed to be working on that kind of stuff?
It seems like a lot of Oxide information is currently hiding out in podcasts and other media - does anyone know how the AuthN, AuthZ, ACL system is going to work?
One of the most powerful elements of the trust-root system is auditability and access control, for both service-to-service and human-to-system aspects, and I'm really interested in seeing how this plays out.
For example, a service mesh where hosts can be identified securely and authorized in a specific role unlocks a lot of low-friction service-to-service security. I'm curious what Oxide plans to provide in this space, API- and SDK-wise.
I see some Zanzibar related projects on their GitHub, so it can be assumed the ACL system will be based on the principles there - but that's more a framework than an implementation.
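For anyone who hasn't read the Zanzibar paper: the core of it is relation tuples plus a recursive check, and the rest (namespace configs, rewrite rules, consistency tokens) is built around that. A toy in-memory sketch of just that core, to show the shape:

    # Toy Zanzibar-style check. Tuples are (object, relation, subject); a subject
    # may itself be a "userset" written as "object#relation".
    tuples = {
        ("rack:42", "admin", "group:sre#member"),
        ("group:sre", "member", "user:alice"),
    }

    def check(obj, rel, user):
        for (o, r, s) in tuples:
            if (o, r) != (obj, rel):
                continue
            if s == user:
                return True
            if "#" in s:  # userset: recurse into the referenced relation
                ref_obj, ref_rel = s.split("#")
                if check(ref_obj, ref_rel, user):
                    return True
        return False

    print(check("rack:42", "admin", "user:alice"))  # True, via group:sre#member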
The storage is always the difficult part in these architectures. Are you distributing across all nodes? It appears that each sled is an individual compute unit with 10 drives. Are the drives on a proverbial island and only accessible to that local node, or is there some distributed storage going on that you can talk about?
On paper with RDMA and NVMe-OF you could access any drive from any compute unit... but that's easier said than done :)
Soo... we’re switching back to blade servers again?
The problem with this model is it's no longer commodity hardware. You are kind of locked into their ecosystem of specialized network and server equipment.
And it of course introduces some unique failure modes to mitigate too.
Not to say it's not a cool idea; it's just interesting to see how hardware trends oscillate between commodity and highly specialized proprietary designs.
Congrats Oxide team! More competition in this space is always a good thing.
I'm curious about management. Can the rack operate completely standalone? I assume when you have multiple there will be some management abstraction above the rack layer?
The closest direct equivalent that I can think of to this is AWS outposts. Are there any others that I'm forgetting?
The density they're getting here is significantly higher than AWS Outposts, which is interesting. The top-end (~$600k) AWS Outposts seem to max out at around 1k CPUs and 4.5 TB RAM in a rack (e.g. 12x m5.24xlarge = 12x 384 GB), while this rack can house 2k CPUs and 30 TB (!) RAM.
Outposts seem like a solution to the problem, "for regulatory or compliance reasons we are required for data to reside and be processed within a physical space we control." For that problem, an organization that is otherwise on AWS might find Outposts appealing. I can imagine an engineering team's response to such a requirement as "Oh yeah? Fine, but it's going to cost you $600k per year per rack!"
I believe Oxide is attempting to capture a much broader market than that.
Yes, well it isn't that dense either. As I have written, it's 32 CPUs (16x 2 CPUs). 1 TB of RAM per CPU is not that huge a deal; it's perhaps 16x 64 GB (Milan uses 8 channels, and 2 DIMMs per channel is reasonable), which works out to 16 GB of RAM per core. In HPC, you would probably shrink it to 1/4 of the volume (half-width, 1U dual-socket servers). Oxide probably focuses on optimal thermal efficiency, since their limit isn't space so much as the power density / max power per rack in existing DCs, which they are already pushing hard. (Of course they have lower-power options too, but those probably will not have 2048 cores.)
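Spelled out (8 channels x 2 DIMMs per socket is standard Milan topology; the 64 GB DIMM size is my assumption):

    channels, dimms_per_channel, gib_per_dimm = 8, 2, 64       # per socket
    gib_per_socket = channels * dimms_per_channel * gib_per_dimm
    print(gib_per_socket)              # 1024 GiB, i.e. ~1 TiB per socket
    print(gib_per_socket // 64)        # 16 GiB per core on a 64-core Milan
    print(32 * gib_per_socket / 1024)  # 32 TiB across 32 sockets, vs ~30 TiB quoted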
The problem with pushing higher compute density is you're running into the limits of what most DCs can provide in terms of power and cooling for a single rack. Usually it's specialized HPC facilities or hyperscalers pushing the power and cooling to handle stuff like that. Those people aren't likely Oxide's customers - they've already got their own hardware solutions.
I'm also wondering if the website is unfinished? All the "Read More" links actually hide very little information - if so, why hide it? And it doesn't seem to explain the company very well. It seems like we need to listen to their podcast to find out what is going on. (Edit: Found a YouTube video about it https://www.youtube.com/watch?v=vvZA9n3e5pc )
>Get the most efficient power rating and network speeds of 100GBps without the pain of cable
100GBps would be impressive, 100Gbps would be ... not much?
An interesting thing is that all the terminal-like graphics are actually HTML/CSS and not images.
Literally every single server vendor, and almost all (if not all?) storage vendors on the planet are pushing HCI because that is what mid and large size companies want. This is the fastest growing market segment in hardware (because they now realize that hybrid-cloud is the preferred customer model, and most of their customers now are deploying or already have deployed their own internal cloud). Oxide appears to me to be HCI done correctly.
I currently work for one of their competitors and, for one, am keeping an eye on their careers page!
Also, 100Gb meets requirements for 99.999% of the customers out there.
It's interesting that the RAM/CPU ratio is about double the default shapes from AWS/GCP. In practice I have generally seen those shapes run on the low side of CPU utilization for most workloads, so I think the choice makes sense.
I'm curious if ARC will be running with primarycache=metadata to rely on low latency storage and in-VM cache, otherwise I could see ARC using a fair bit of that RAM overhead in the hosts.
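For reference, that's a per-dataset ZFS property; a minimal sketch of setting it from the host, with a made-up pool/dataset name:

    import subprocess

    # Keep only metadata in the ARC for guest volumes, leaving data caching to the
    # guests themselves. Dataset name is hypothetical.
    subprocess.run(
        ["zfs", "set", "primarycache=metadata", "tank/guest-volumes"],
        check=True,
    )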
Intel is losing on both client and server. Everyone is jumping ship to either ARM or AMD for client/server. Hopefully Intel's new engineer CEO can turn it around the way AMD's engineer CEO (Lisa Su) did.
Few companies buy on pure performance. Right now, AMD has the performance kings and the price/performance kings.
Intel could win price/performance, but they would need to cannibalize their own low-end and mid-range market. If they could make a good bet that they would have high yield in one more cycle, that would make sense. If they don't think that will happen, there's nothing much that will save them, and they're extracting the money that they can right now.
No, CPUs are more relevant. Linux runs on M1, and if they sold CPUs, someone would make a board they could be put on that fits in standard server form factors. For this type of comparison, people want CPUs, not the next version of Xserve.
But Apple doesn't sell the M1 CPU without a Mini or MacBook carrier included.
I think relatively few people want to buy a rackmount server based on a motherboard that hosts a cannibalized M1 limited to the 16GB of RAM that it came with, paying the price premium for the rest of the machine. An M1 seems to be roughly equivalent to a Ryzen 5000-series CPU, and you can get those for $300 (6c/12t) through $1K (16c/32t) without having to go through the labor cost of pulling the CPU out of a $900 carrier.
I'm not sure I follow. The relevant part of the thread as I saw it was "too bad Apple does not sell CPUs." Selling the CPUs would mean you wouldn't need to cannibalize anything, and in a discussion about the merits of AMD vs Intel as the part used in a platform like this, comparing the bare CPU seems more relevant.
Rack Mac Pros are targeted at racks that are mostly filled with audio/video equipment. Apple really doesn't seem to have any interest in selling server products again.
So? It's still rack-mountable workstation-class hardware that will probably be running on Apple Silicon at some point. And it will probably be possible to boot Linux on it, similar to existing M1 Macs. That's pretty indistinguishable from many servers.
You seem to be thinking that a server and a workstation are the same, ignoring that server SKUs need OOB management, APIs, hardware support, and so many other things as table stakes.
Not all of it, and not necessarily well, which was one reason they weren't super popular except when you really needed Apple software. They seemed more aimed at "I want to colocate a box somewhere, and I run Apple in the office and might also want to in the datacenter for some reason or another" rather than "this is a solid platform that offers all the bells and whistles I would expect because I'm deploying tens or hundreds of these".
Intel still has ~90% of the x86 server market in unit shipments, and slightly more in revenue. And their renewed roadmap under Pat Gelsinger seems to bring a lot of their products forward (rightly so).
That is speaking as someone who wants AMD to grab more market share (and who has been saying the same for nearly three years while constantly being told off by AMD fans that they are doing fine).
ARM I've seen evidence of people jumping ship to, but is the same true of AMD? This is the best shot they're going to get at it, and I for one haven't heard all that much pro-AMD noise.
They make really nice chips, but what happens if BigCorpXYZ just gets a quote from AMD and goes straight to Intel to get it matched - i.e. the Cloud isn't that performance-intensive, so now they get to stay on the Intel stack for less money.
This is a solid private-cloud play aimed at those corporations (probably mainly financials, but other sectors too I'm sure...) who don't want to outsource to the likes of AWS / GCP.
Not just “don’t want to”, I’d hope: They should be able to win on the economics, too, assuming customers that care more about TCO for a fixed or steady-growing workload rather than elasticity.
This is not my area of expertise, but it does look like that[0].
That "custom software," though, is where the magic often lies. As a software person that worked at hardware companies for most of my career, I know all too well, how disrespectful hardware people are of software. If they have a good software-respecting management chain, then it might be pretty awesome.
Bingo. I've personally had to deal with this in other high-density systems. Less cooling not only has the obvious effects, but also reduces PS efficiency which can cause other problems. Cosmic-ray-induced memory errors can also be a problem at those altitudes (or even half that). That's a bit easier to deal with in principle, but the rate of ECC scrubbing required can start to impact performance. Stack that on top of thermal CPU throttling, and you'll have a system that's just slower than it should be. Just as importantly, the slowdown will be uneven across components, so it's effectively a different kind of system when you're debugging low-level issues.
I think it's a good sign that they're aware of the additional support issues associated with higher altitude. Shows that they've really thought things through.
FYI: The CTO of the Oxide Computer Company is Bryan Cantrill. (He is responding in this discussion as "bcantrill" -- I assume[!].) You can read about him on Wiki: https://en.wikipedia.org/wiki/Bryan_Cantrill He also has many interesting and thoughtful recorded talks on YouTube. I highly recommend them.
I am posting this info because it seems their "team" page (https://oxide.computer/team/) is no longer working. I thought it was weird there was no way to see the senior leaders from the website. When I first opened the site, I vaguely remembered this brand name, but could not remember who was behind it.
@bcantrill: I assume this is a mistake. Plus I cannot find a 'Team' link anywhere on the current website.
After the acquisition of Joyent by Samsung, who here is interested in buying extremely locked-down hardware that seems to be a next-gen SmartOS (provisioning etc.), i.e. rethinking networking and nodes in a holistic sense?
Well, congrats then! I've been waiting for news (and listening to the "On The Metal" podcast) for a long while now, and this seems like a great way to push the envelope on server hardware.
Looks cool. Please add an RSS feed to your blog, Oxide, so people can keep up. The RSS logo is not a link (at least on mobile) and there is no auto-discovery tag (that my reader can find).
Also your site badly crashes Brave iOS browser fwiw.
I'm an idiot. I thought this was like the SGI UV300 where you'd view the whole thing as a single computer and everything would be NUMA'd away. It looks like it's not like that, though.
However, I also do not see when I would buy a full shelf of gear, even though I would love to. Will they also release a maxi version, a micro version, and a nano version? I.e. a 2U server, a PRO workstation, and a small-form-factor box?
I think the innovations they have brought to these computers deserve to be in more places than just a massive and awesome data-processing rack.
Actually, a standard Azure rack is on the order of $1.1MM of hardware depending on SKU, if I am not mistaken. So I would guess it could be more like $2MM. There is also the aspect of management, and other vendors (Dell/EMC + VMware) like you to pay way more than the hardware cost for e.g. VxRail/vSphere licences. That is the real target.
If someone manages to port bhyve to Linux, they will definitely make a name for themselves.
But honestly, the equivalent is just libvirt on commodity hardware with openzfs storage; the value here is high end hardware with custom firmware and well-integrated software, not really something you can port usefully.
> Some will say that we should be paying people differently based on different geographical locations. I know there are thoughtful people who pay folks differently based on their zip code, but (respectfully), we disagree with this approach. Companies spin this by explaining they are merely paying people based on their cost of living, but this is absurd: do we increase someone's salary when their spouse loses their job or when their kid goes to college? Do we slash it when they inherit money from their deceased parent or move in with someone? The answer to all of these is no, of course not: we pay people based on their work, not their costs. The truth is that companies pay people less in other geographies for a simple reason: because they can. We at Oxide just don't agree with this; we pay people the same regardless of where they pick up their mail.
I have a few legacy HP Proliant (cheap eBay) rackmount servers in my office closet. Oxide looks awesome, but obviously not targeted for home / small business use. I was hoping they would offer single-u servers.
All NVMe seems like a good starting point, but I'd hope that some day there will be a more capacity-oriented variant for people who actually know what they're doing with exabyte-scale storage.
Nothing any more, but I used to work on such systems at Facebook. The public name was Tectonic; there was a paper at FAST this year IIRC. As time goes by, this kind of scale is going to be more common. I still remember when having a single petabyte was something to brag about.
Looks fantastic, and the hardware specs appeal to me greatly - but I'm not sure there is an actual market outside the "cult of personality" bubble. A few SV wannabes will buy into this to trade off a Twitter relationship with the Oxide founders - but does anyone really see the IT teams at Daimler, Procter & Gamble, Morgan Stanley, et al. actually going for this over HPE/Dell and AWS/Azure? We are a long way away from "Nobody ever got fired for buying from Oxide".
You wouldn't have to pitch it initially as a replacement for your on-prem HPE/Dell. It could be pitched as a replacement for the hosted private cloud you have from IBM, Oracle, etc, that you're unhappy with.
Expensive, since they engaged Pentagram for branding.
More seriously, the animations have "ascii-animation" classes in the DOM; "TUI" is probably the more fitting aesthetic label if you ask me. Not sure if a lib is involved or it's custom. Either way, it is done very nicely.
"Only" 2048 CPU cores per rack is actually not that much by nowadays standards - its 16 U of 2x 64 core CPUs. Perhaps is could be more U if they used the lower core counts but e.g. higher frequency per core SKUs but I don't think they do. (And the picture kind of confirms it). They use 2U servers though so they are able to use lower speed but bigger fans and have more expansion cards and 2,5" form factor drives perhaps.
The of course have to fit storage, which needs lots of CPU PCIe lanes for all the NVMe storage and networking (probably 2 or 4 U) and power conversion to power the bus bar and more somewhere. They probably use the 42 U+ standard 19" racks to fit in standard customers DCs. They also don't have such a high power budget as custom DCs for cloud providers do.
1 PB of flash is quite a bit but you could get perhaps 5x as much with HDDs probably (even with a relatively low density of 40x 12 x 12 TB). The problem really is I think, they wouldn't be able to write the HDD firmware in Rust in time (or at all, because no HDD manufacturer would sell an HDD to them without making sure their proprietary firmware is used). SSDs don't necessarily have this property as they are much more like the other components of a modern server.
Sleek AF Pentagram-designed website with a nice balance of style and nerdiness like the ASCII art animations.
Can't miss the Halt and Catch Fire TV references, including Haley Clark alongside Woz and other tech legends, and, in the blog post about the launch, a terminal window with character names like Gordon Clark, etc. Love to see it.
The OS can still do cost-based memory allocation considering the latencies of going between nodes. These Milan chips have tons of memory controllers for local memory, and compute nodes can allocate all those PCIe lanes to talk to a shared memory module (IBM's OMI goes in that direction - a little bit of extra latency, but lots of bandwidth and the ability to go a little further than DDR4/5 can). I think the bigger POWER9 boxes do this kind of thing. Migrating processes to off-board cores is silly in this case, but core/socket/drawer pinning can go a long way towards making this seamless while enabling applications that wouldn't be feasible on more mundane boxes.
> The OS can still do cost-based memory allocation considering the latencies of going between nodes.
That's a rather seamless extension of what OS's have to do already in order to deal with NUMA. Pinning can definitely be worthwhile since the default pointless shuttling of workloads across cores is already killing performance on existing NUMA platforms. But that could be addressed more elegantly by adjusting some tuning knobs and still allowing for migration in rare cases.
Could one reimplement SSI at the OS layer, similar to existing distributed OS's? Distributed shared memory is usually dismissed because of the overhead involved in typical scenarios, but this kind of hardware platform might make it feasible.
There was Mosix at the OS layer in the 1990s and Virtual Iron at the hypervisor layer in the aughts. I think the cost and performance of software SSI just doesn't intersect with demand anywhere.
AIUI, Plan9 is not quite fully SSI. It is a distributed OS, and gets quite close, but it's missing support for distributed memory (i.e. software-implemented shared memory exposing a single global address space); it also does not do process checkpointing and auto-migration, without which you don't really have a "global" system image.
Mosix and VirtualIron worked at a time 1GBps ethernet was in its infancy. Today 10GBps are consumer grade and 40GBps can go over Cat 8 copper, roughly equivalent to a DDR4-4000 channel.
Not great, but this is near COTS hardware. They can do significantly better than that.
Uh no, DDR4-4000 (which servers can't use, BTW) is ~256 gigabits per second. Latency is also a killer; optimized InfiniBand is ~1 µs, which is 10x slower than local RAM at ~100 ns.
Sorry. I wrote GBps when I should have written Gbps and got myself fooled in the process. We can somewhat mitigate the latencies with good caches. The overall machine will suffer from a bad case of NUMA, but it would still behave better than a cluster.
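The numbers being juggled here, spelled out (DDR4-4000 is the theoretical peak of one channel, which is the generous comparison):

    ddr4_4000_gbps = 4000e6 * 64 / 1e9   # 4000 MT/s x 64-bit channel = 256 Gbit/s
    print(ddr4_4000_gbps)                # 256.0
    print(ddr4_4000_gbps / 40)           # ~6.4x a 40 Gbit/s link, bandwidth-wise
    print(1_000 / 100)                   # ~1 us fabric latency vs ~100 ns DRAM: 10x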
How can they offer a secure boot solution under the GPLv3? My understanding is that the anti-tivoization clauses mean they need to release their keys, or allow admins, hackers, and others to escape the secure boot chain if they are physically in front of the machine or own it.
I don't know anything, but I remember hearing it's secure boot with only their releases, if you want to run your own software it's not secure boot anymore, but you're free to run whatever you want.
One issue with this is that it normally requires a bypass-secure-boot option.
So the device is not set-and-forget secure; you have to physically secure it (not that you wouldn't, but now it can really matter if someone has physical access to the machine).
GPLv3 has been used more recently as a block to others taking code and developing things further given some of the clauses and incompatibilities it introduced.
This looks great. From a business perspective, I would be concerned that it would be hard to prevent companies like Dell from entering this space as a competitor quite rapidly.
I think it would more play in the space of a NetApp/Cisco FlexPod or VCE's Vblock, but what those customers are really purchasing is the certified validation of core enterprise apps on a particular hardware/software stack, as well as the massive engineering and support organizations that those companies can bring to bear to validate firmware upgrade and to swoop in in the event of an issue. You also seem to get a LOT more flexibility.
I am not a hater in the least, but I really am failing to understand what is unique about this offering. It seems like you have no options regarding the internals, so scaling compute separately from storage doesn't seem possible. I am also very suspicious of offerings like this that have not yet released a second version of their key parts. Everyone says they are going to be backwards compatible, but then the reality of managing heterogeneous generations of gear in a homogeneous fashion strikes and you get weird behavior all around.
Long story short, I would love to know what a customer of this scale of physical infrastructure is getting with Oxide that they would not be better served by going to one of the major vendors.
We use Nutanix where I work and this has made everyone very excited. Though they would need something similar to Nutanix CE to make us switch entirely (i.e. the ability to run non-production unsupported on commodity hardware).
To me this is a webpage that promises the technology of the days before AWS and GCP, with some cool ASCII-art animation.
Sure, they have a solid team of engineers, but what is the value proposition exactly? A black-box server built to their taste (AMD Milan - what if I want Intel Xeon?), with custom software to manage and monitor server health and notify you if you could upgrade/downgrade to a different size? Oh, and no cables, with lots of aesthetics to make your datacenter look pretty... And?
Cantrill has always said it's for people who want Facebook class on-premise infrastructure but don't have a $900B market cap and a hundred engineers designing and building custom boxes.
Can definitely see it for a company the size of Dropbox: big enough to already be working with ODMs, big enough to be sensitive to the kind of headaches you get from a heterogeneous fleet of ILOM processors designed by deranged engineers.
Dropbox is big enough that they could just acquire Oxide now before they even get to market. That might even be the plan all along. I can't imagine there are more than like a dozen companies that are their target market, i.e. big enough to need Facebook-level datacenters but not big enough (yet) to have that engineering team.
I get the target of HyperConverged infrastructure. It's a pretty big market: potentially all private/colo datacenters. And there are only a few players left. Dell/EMC/VMWare, Nutanix, Cisco's schizophrenic offerings, a waning HP, cloud providers trying to make a 'hybrid' play, etc. And most don't buy one or two of these things. It's rows and rows.
But most of those are so entrenched and wrapped up in their customers. I imagine the target here is actually acquisition, it would just be too hard to get a foothold as an up-and-comer.
Also it usually means giving loaner gear to companies for an extended period for them to evaluate pre-purchase, showing your support, etc. That's a lot of up-front cost for someone without a warchest.
I'm also kind of surprised by the site. It sells to geeks well, but isn't the normal "look at our customers in key industry segments!", "something something Gartner magic quadrant", "whitepapers!" thing. Selling to execs on these things is usually a matter of convincing them they're not making a "bad" decision. They're "cool", but enough industry people agree with them that it's not career limiting if it doesn't pan out.
I like the idea of the product, and it would be nice to have another player. But it's like starting a new car company, and I feel like they're selling to mechanics.
This level of integration surely won't come cheap. From what I recall of server purchasing, a target price of ~$200-500k per rack would be expected, with a TCO of roughly 2x the rack price over 3 years (assuming you are buying from Quanta/Supermicro or another commodity integrator).
It's possible the prices are different now, but you would need customers looking to drop > $1 million in CapEx for the management capabilities they are providing. Possibly the non-cloud Fortune 500?
That reminds me a lot of the Sun Microsystems mega-servers from 20 years ago. Those were kind of the cure-all solution for highly scalable web services before Google et al. pioneered cloud-like services on commodity hardware.
They'd have to sell these at a significant loss to make up for the risk any company would have to take to build out a DC on first generation hardware from a startup.
I mean, how many companies are building out DCs left and right these days? Not many. This will fit in nicely for brand new projects that require nothing more than a rack. Once a company puts something through the paces for 2-3 years, and the engineers managing it love it, then the slow migration from UCS (or Nutanix) to Oxide begins. This is usually how I've seen new hardware architectures introduced at mid and large size companies.
good question. not someone too large to want to pay the per-node margins. not someone too small to want to pay .. not someone who is satisfied with using VMs on a cloud provider. not someone who is selling these as part of a turnkey solution for whatever segment is left.
that said, I do feel persistently sad that we can't fix structural problems because the market is such a gradient descent world.
As for the website, some of the animations could use a spring-like movement profile to feel more physical. The website also isn't reachable over IPv6, so I would be very careful with the promised IPv6 capabilities of the server too ;-)
Bravo! Better servers for people who want to own their infra. Too many people seek out cloud services just to get a modern control plane. Server "UI" has been long neglected.
And finally, it's nice to see people with brains building real things with nary a mention of "blockchain".
Why are we hard coupling the hardware to the software? The whole secret of the success of M1 and ARM in servers is that lots of software has long ago stopped being hyper-aware of what hardware it is running on.
What software are we talking about anyways? It's all incredibly vague, but it seems to reach all the way into the Kubernetes sphere. Why would I run this over something I can use on my next job?
> Why are we hard coupling the hardware to the software? The whole secret of the success of M1 and ARM in servers is that lots of software has long ago stopped being hyper-aware of what hardware it is running on.
The software running on M1 is a bespoke fit for it. That's why the performance in macOS on M1 is phenomenal. It was custom made to execute optimally on it.
It's probs cheaper than AWS if you already have on prem infra. AWS has pretty damn good margins.
And the idea of "these racks are my kubernetes cluster and are supported by the OEM as such" has a lot of value to a lot of the medium sized IT departments I've run across.
Can you expand on what you mean on "coupling the hardware to the software"?
"Attests the software version is the version that is valid and shipped by the Oxide Computer Company"
this makes "Oxide Computer Company" the primary target and point of vulnerability in multiple ways.
1) rogue employees (state-sponsored, corporate espionage) could replace the software. customers could do nothing about it, and might not even be told.
2) sale of the company by the VCs or a Corporate take-over gives no guarantee that what is safe now will be safe in future, no matter what the VCs or the company says right now.
3) whatever expertise "Oxide Computer Company" thinks they have, they're the single-point-of-failure. the larger the number of customers, the less likely that a given vulnerability will be immediately fixed and distributed out.
this is just some of the possibilities. sorry to say that there's so many things wrong with this idea it's really hard to hold back and not say anything.
now, if the full source code right to the bedrock is available, and the CUSTOMER is given FULL CONTROL, THEN we do not have a problem.
by "full control", that includes:
* all DRM keys, including TPM signing private keys
* all peripheral initialisation source code (including DDR4, PCIe and USB3 firmware)
* BMC (Baseboard Management Controller) source code
* BIOS source code
* operating system source code
* full source code for all tools and toolchains for the above, to avoid vendor lock-in and the possibility of the toolchain itself introducing rogue code
this is one hell of a list and it's almost impossible to fulfil with today's "NDA'd proprietary firmware 3rd-party licensing" mindset. the only company in this secure-server space that to my knowledge has achieved it is Raptor Engineering with the TALOS II, when running with the Kestrel BMC replacement on the Lattice ECP5 FPGA.
"Attests the software version is the version that is valid and shipped by the Oxide Computer Company"
So in other words these servers will implement restrictive code signing practices and will be vendor-controlled, not owner-controlled?
This is not my idea of "secure", and really in the wake of things like the Solarwinds or RSA hacks it shouldn't be anyone's idea of secure. Vendor-holds-the-keys is not an acceptable security model.
A comment below mentions open firmware, open firmware is useless without the right to deploy modified versions of it.
I'm familiar with the concept. Does this mean that attestation to a different root of trust than Oxide will also be feasible, and that this is just a default?
Oxide makes the hardware, so it makes sense to use them as the root of trust since you already have to trust them to not make backdoored hardware. Why bother adding more parties? Also, for remote attestation to make sense, it needs to be done in the hardware itself (ie. keys burned into the silicon). I'm not sure how that's supposed to work if you add your own keys, or whether that would even make sense.
"it needs to be done in the hardware itself (ie. keys burned into the silicon)" - this isn't true; this is confusing Trusted Boot and Secure Boot, which are not the same thing (nor is it the only way of implementing Secure Boot).
Owner-controlled remote attestation is entirely viable, e.g. Talos II is capable of this with a FlexVer module.
> "it needs to be done in the hardware itself (ie. keys burned into the silicon)" - this isn't true; this is confusing Trusted Boot and Secure Boot, which are not the same thing (nor is it the only way of implementing Secure Boot).
I meant as opposed to keys/signing done in software.
>Owner-controlled remote attestation is entirely viable, e.g. Talos II is capable of this with a FlexVer module.
I skimmed the product brief[1] and it looks like it's basically a TPM that has a secure communications channel (as opposed to LPC which can be MITMed)? I'm not really sure how this is an improvement, because you're still relying on the hardware vendor to send the PCR values. So at the end of the day you still have to trust the hardware vendor, although the signing is done by you, but I'm not really sure how this adds any benefit.
FlexVer doesn't hardcode any keys - you can fully reinitialize the TPM to your liking, but doing so destroys any secrets already stored. So the trick is that you initialize it for your infrastructure and have to do secure reprovisioning if it ever fails to provide the same key answers (which would indicate tampering)
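To make the owner-controlled version concrete, here's a heavily simplified sketch of verifier-side attestation where the operator, not the vendor, holds the expected measurement and checks the device's signature. Everything here is hypothetical and omits what a real scheme needs (nonces to prevent replay, a certificate chain for the device key, an event log rather than a single hash):

    import hashlib
    from cryptography.hazmat.primitives.asymmetric.ed25519 import Ed25519PrivateKey

    # Device side (normally a root-of-trust chip): measure firmware, sign the hash.
    device_key = Ed25519PrivateKey.generate()      # owner-provisioned identity key
    firmware = b"...firmware image bytes..."
    measurement = hashlib.sha256(firmware).digest()
    quote = device_key.sign(measurement)

    # Verifier side (the operator): compare against a known-good hash they control,
    # then verify the signature with the device's public key.
    expected = hashlib.sha256(firmware).digest()
    assert measurement == expected, "unexpected firmware measurement"
    device_key.public_key().verify(quote, measurement)  # raises InvalidSignature if bad
    print("attestation ok")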
Hopefully some Oxide people can answer :-)