I was actually hoping that by the end of the blog post, the mystery of why the game didn't run would be cleared up, but sadly it wasn't. Looks like it had trouble accessing the HDD - but why should HDD access for a DOS game running under OS/2 require a math coprocessor?!
I have my own story of living happily without a math coprocessor for quite some time: from 1992 to 1997, I had an IBM PS/1000 with a "Blue Lightning" CPU which can be best described as a "486SX3" (25 MHz bus clock, 75 MHz internal). Then I tried to run the Quake demo and found out that "real" 3D games need an FPU :(
I got a math coprocessor relatively early for my 386SX20 - I wanted to use 3D Studio, the killer app for math coprocessors back in the day! The vast majority of games simply used integer math and didn't require, and couldn't take advantage of, a coprocessor, so I have no idea what was going on in the article. In the early 90s, the only games I can remember that could optionally leverage a coprocessor were the Falcon 3.0 family of simulators.
I remember being tired all day long after waking up many times during the night to check the progress of my renders, or to tweak them and restart. And then I got a math coprocessor, so I could wake up even more often in the night to check my renders.
Lucky you. I always dreamt of one - that, and a TARGA graphics card. The latter not only for the looks but for speed as well. Converting a *.TGA to indexed color took a considerable amount of time.
I remember that "Dune 2" (1992) could use a co-processor? Not sure - we never had a machine with a co-processor, but a friend did, and he said that at start-up the game printed something like "Co-processor found - let's use it!" and then the game ran faster, I believe (not sure).
> I was actually hoping that by the end of the blog post, the mystery of why the game didn't run would be cleared up, but sadly it wasn't.
My best guess would be something like the following: the software tries to determine, in some way, whether the CPU has an 80(2/3)87 attached. It thinks it does, then tries an FPU instruction, the error signal fires, and the eventual INT 02 blows up in its face. But I'm not limited by any actual knowledge here, so this might just as well be something completely different.
My guess is that the Infocom game was setting up some sort of FP emulation TSR and not cleaning it up, and BattleTech was coincidentally using it. Or whatever the equivalent of a Windows 3.x VxD is for OS/2, if that exists. Or it was otherwise messing with the interrupt tables involved in the arcane x87 protocol in a way that un-breaks things.
That sounds rather more plausible than my idea. FP emulation is always interesting when performed by multiple programs/companies messing with each other's implementations. Lots of fun with layers of interrupts, real/virtual/protected mode switches, and all the other magic the original '87 was not designed for.
"I'm not limited by any actual knowledge here" is an excellent way of putting it and I shall endeavour to remember it next time I'm in such a situation.
Poor though my memory is, it happens often enough that I have a decent chance.
My retro Socket 3 toy unit with a P83 @ 100 MHz is fast (40 MHz bus), but the DX5x86 @ 150 MHz (50 MHz bus) beats everything, except in Quake. Timings aren't that great for the memory, but the synchronous PCI bus (and a working PCI VGA card at that speed) beats everything you throw at it.
Carmack and Abrash optimized the heck out of Quake making use of a “quirk” in the Pentium where you can overlap integer operations and floating point DIV to eke out every last drop of performance.
Unfortunately the trick doesn’t work on early AMD K5/6 nor Cyrix CPUs - although Cyrix CPUs probably had other more serious problems.
Yeah. Their perspective correct texture mapping did a perspective div only every 16 pixels drawn – in itself a considerable optimization with almost imperceptible loss in quality – but the big deal was that on the Pentium specifically, the integer pipeline could be almost perfectly cycle-optimized to process sixteen pixels in the time the FPU executed the next perspective division in parallel so the result was ready just as it was needed!
K5/Cyrix could overlap integer and FPU operations; what they couldn't do was interleave (pipeline) FPU operations so that multiple floating point instructions ran in parallel.
Part of that was the bus and memory speed as well. The 486 bus ran at 33 MHz (the "DX4" is actually only clock-tripled to 100), while the Pentium bus and memory ran at 66.
Yes, I know. Setting the jumpers for the right bus speed and multiplier was key. This was the time of fine tuning and overclocking. Asus boards were the best for this job back then.
Yep. The P5 arch (original Pentium) superscalar design, combined with its much faster FPU and separate data and code caches, meant it could run circles around a 486, even one clocked at up to twice its speed.
My parents were architects and they used AutoCAD. A very early version, maybe 3? So our PC had a math coprocessor... on a 286!!! Yes, the 80287 was a thing.
In the 286 era there were even different types of co-processor; the Weitek implementation (2167) absolutely trounced the official 80287 (and its second-source versions) at CAD, so that's most likely what your parents had.
They had one (4167) for the 486 too, I have no idea if it only worked with 486SX systems, or if it could work alongside a DX (but then who'd have a co-processor socket on a DX board anyway, co-processor sockets had already become scarce in the 386 period).
By that era they appear to have mostly pivoted to their own 486 clones and VGA chipsets, though.
edit: I just realised it could have been like the Cyrix 487S, which installed as a PCB carrier that plugged into the 486DX socket and then had a socket on the carrier for the CPU itself, alongside the 487S chip.
The suggestion in the article, that his 386 was simply a crappy early one that may also be slightly defective, seems plausible. IIRC the 387 was a full 386 plus the floating point logic, and when you installed it the coprocessor took over most of the processing and the 386 was reduced to being a data broker.
I believe the 387 was an actual coprocessor; the _487_ may be what you're thinking of -- the 487SX contained a whole 486DX inside and just disabled the host processor [1].
I would concur: the 386SX was just 16-bit. The 387SX was for that 16-bit processor. The 486SX had no math co-processor, and the 487 was a whole 486DX, i.e. the vacancy slot, brought to you by Intel marketing.
Or installing the coprocessor reseated a slightly-loose cable or created a better connection on a broken trace. The "boot step" could simply result in the system being warmer, which would make marginal connections like this behave differently.
To be sure, the author should remove and reinstall the coprocessor 10 times, testing after each. :)
I replaced the 386 today with one from 1990. I was able to boot a disk I keep around that has MS-DOS 3.30a and Windows/386 2.11 on it. I also have Battletech on there and yes it ran with no issues! It didn’t work at all on the older processor.
When I pushed it into the socket it made such a loud "ker-chunk" noise that I wouldn't want to constantly pull out and reseat these chips. Especially since I don't have the Intel "rake"-like puller; I had to use a pair of card edge blanks to gently rock it out of the socket.
Although I do appreciate the 10x humor of it.
It’s weird that so many people focus on the software, which runs perfectly fine in emulation, but seem to lose their minds that an $8,000 machine from 1987 shipped with a buggy CPU…
Hopefully this will encourage someone out there with a 386 to try to replicate this.
My father wanted a computer for doing CAD, so of course he needed a 387 beside the 386.
From what I remember, unlike the 386, for which both Intel and AMD models were available, the 387 was produced only by Intel, which meant that the cheaper and more performant 40 MHz AMD processor had no corresponding FPU; so if you needed floating point operations you had to spend more money on a slower CPU.
What he ended up doing instead was to get a relative to assemble him a PC with a 40 MHz AMD 386 and a 20 MHz Intel 387. Surprisingly it did work, as long as one remembered to press the "Turbo" button to slow the 386 to 20 MHz before using the CAD software; otherwise it would crash.
The rest of the software would happily run at 40 MHz.
IIRC, the 386 and 387 were exactly the same chip, the difference between them being that the 387 had an activated math processor while the 386 did not.
If you plugged in a 387, it would disable the 386 chip and take over all the work of computation. Apparently this happened because the math coprocessor was where most of the manufacturing errors occurred, so Intel just sold the defective chips as 386s in order to increase throughput.
I think you're thinking of upgrading the 486SX, which didn't have the on-die coprocessor. That did disable the old chip, as you basically replaced it with a 486DX.
You're wrong, and so is the first comment. Sorry (and to @KayEss).
You're thinking of the 486.
The 80386DX was the first x86 CPU to be 32-bit and have an on-chip MMU. And nothing else: no cache, no FPU.
The FPU was a discrete part, the 80387DX.
Because OS/2 1.x didn't support the 80386, and so couldn't run DOS apps well, and so flopped, the 16-bit 80286 kept selling well. It ran DOS fast and it could run Windows 2/286 and Windows 3 in Standard Mode which was good enough. It could only address 16MB of RAM but that was fantastically expensive and it was more than enough for DOS and Windows 3.
So, because DOS still ruled, Intel made a cost-reduced version of the 80386DX, the 80386SX. This had a 16-bit data bus, so it could use cheaper 16-bit motherboards and 16-bit wide RAM, still limited to a max of 16MB. Still enough.
That needed a maths copro for hardware floating point, too: a different part, the 80387SX.
Then Windows 3 came along, which was also good enough, and started a move in PC apps to GUIs. Windows 3.1 (1992) was better still.
So Intel had a 2nd go at the 32-bit chip market with the 80486, marketed as the "486". This integrated a better 386DX-compatible CPU core with a few extra instructions, complete with MMU, plus a 387-style FPU, plus a small amount of L1 cache, all onto one die.
But it was expensive, and didn't sell well.
Also, all the 3rd party x86 makers leapt on the bandwagon and integrated the extra instructions into 16-bit bus 386SX compatible chips and branded them as 486s: the Cyrix and IBM "486slc" for instance. This ate into sales of the real 486.
So Intel came up with an ethically very dodgy borderline scam: it shipped 486s with the FPU disabled, calling them the "486DX" to reuse the branding that distinguished the 32-bit-bus models of 386 from the 16-bit-bus.
People don't understand stuff like bus widths or part numbers, as your post demonstrates, and I mean no offense. They don't.
So now there was a new model of 486, the 486SX with a disabled FPU, and the 486DX with it still turned on.
The "SX" model needed a new motherboard with a 2nd CPU socket that accepted a "floating point co-processor", called the "487", which was nothing of the kind. The "SX" was a lie and so was the "487 copro". The 487 was a 2nd complete 486 chip that disabled the original and took over.
But it reused the older branding, which is what you've remembered.
Later, briefly, Intel made a cheaper cost-reduced 486SX with a smaller die and no FPU present, but not many of them. The clock-doubled 486DX2 took over quite quickly and killed the 486DX and 486SX market.
Some commentators speculated that the 486DX vs 486SX marketing thing allowed Intel to sell defective 486s in which the FPU didn't work but if it did that was a tiny tiny number: a rounding error.
You have a typo of "486DX" in your first line introducing the 486SX.
>> Some commentators speculated that the 486DX vs 486SX marketing thing allowed Intel to sell defective 486s in which the FPU didn't work but if it did that was a tiny tiny number: a rounding error.
It was a significant amount of chips at the time, since the 486 was still new and the FPU is somewhere around 30% of the die, so it would come out to a lot of them. There was also an overlap, with the DX die with FPU disabled and the SX die both being shipped at the same time.
> You have a typo of "486DX" in your first line introducing the 486SX.
You're right -- but I can't fix it now. :-(
> It was a significant amount of chips at the time, since the 486 was still new
It might have been. If you have any pointers to numbers, I'm curious.
But when the 80486 first appeared, it was just the "486". The DX/SX thing came later, and the early 486SX units definitely had a full FPU on the die -- I remember X-rays, de-capping, and die-shots.
I just checked and the gap was longer than I realised:
(80386, 1985)
(386SX, 1988)
i486, 1989
(Windows 3.0, 1990)
486SX, 1991
486DX2, March 1992
(Windows 3.1, April 1992)
Just a year between the 486SX and the DX2, which rendered it largely irrelevant -- then, the budget option became a non-clock-doubled model, and the clock-doubled one became the premium product.
It is a different game to play, in "crossing the chasm" terms. There were the 8087, 80187, and 80287, and the 80387 was the last one. I don't know that only Intel made the 387, though.
That's interesting. I specifically remember buying a math coprocessor for my AMD 386DX-40. My motherboard definitely had the slot for a dedicated co-processor though, maybe that was the difference.
I was a kid listening to "grown-ups talking", so I could be remembering the wrong reason for the funny pairing, or maybe I'm getting old :-) (it's been 30+ years).
English Wikipedia isn't very helpful in this regard, but I found a page in German [1] that lists a number of 387DX-compatible chips running at 40 MHz, so maybe it was just a matter of price, local availability, or lack of trust in the brands.
I remember the day I got my AMD 287 co-processor for my old Samsung 286. IIRC I ordered it from an ad I saw in Compute! magazine. At the time I had a book about writing fractals in C, and I went back and recompiled all my code to use the co-processor instead of the emulation library that came with Turbo C - what a revelation. These days I only buy CPUs with math coprocessors.
Huh, this reminds me that my Raspberry Pi Pico has no FPU, being a pair of Cortex-M0+ cores. Maybe I should buy a recent Teensy, which has a Cortex-M7 with an FPU.
Oh man I used to love BattleTech on the '286! (Yes we had a coprocessor, doubt it mattered).
I kinda wondered why no one made RPG games in the BattleTech universe? The MechWarrior series was fine, but there's so much plot and intrigue that could be explored in this universe, and just _existing_ as a human around all these big machines could be pretty intense.
Another fun fact - I cut my teeth as a little kid hacking my BattleTech (Crescent Hawk's Inception) save games in XTree Gold. I can still remember the ASCII symbols and the rough layout of the save files. I'd max out my team's stats and skills, stick extra weapons on our mechs (they even programmed in powerful ones that weren't actually in the game), and mark the map as explored. EDIT: I even got so much money it would accidentally wrap around when you got some interest from the bank or whatever (it was some small signed integer type), and then you couldn't spend anything, lol.
This is tangential to the article and somewhat meta, but as I remember old video game cover art and ads (and seeing the pictures in the article), it occurred to me that we're finally at the point where (sometimes) the actual game looks as good as the cover art. That's a pretty remarkable achievement!
While playing around with modern NetBSD on a system with just 10 megabytes of memory, I ended up getting and adding an m68881 just so that I could remove the floating point emulation code from the kernel to save a few kilobytes.
OTOH, who doesn't like to toy around with the power to do 100,000 floating point operations a second? ;)
I do hope the author figures out the cause. The issue is an interesting one.
Strange error, why does it say "illegal instruction at ffffffff" but then the dump shows CS:EIP as 71a3:000056e5? Shouldn't the EIP part be :ffffffff or what am I missing?
The person in that photo has a very Moss-esque aesthetic ;)
The m68881 and m68882 are very good FPUs, and m68882s have continued to be made all the way through at least 2012, apparently. I recently bought one that was made in 2012 with Motorola, not Freescale, markings.
The Mac Plus could take a 68020 accelerator board, and mine came without a co-processor, so... I found a 68882 with a missing pin and soldered one on. It worked like a charm. Wait... Apple SANE did not come out until the Macintosh II... OH! You made a joke. OK: a Cray X-MP/48 is a math co-processor for a Macintosh.
That's about where it starts to lose coherency. There were dual-core 32-bit chips and so on. I'm also eliding changes in core design such as scalar, superscalar, decomposition into micro-operations, micro-op resequencing, etc. because:
[1] it muddies the waters.
[2] if people can't remember the difference between a 387 and a 487 then this will go way over their heads. :-(
That was probably down to having a blitter rather than mathematical performance. This was a memory copy function in one of the support chips (Agnus) that massively sped up things like sprite and window movement and line drawing.
Did the A1000 have hard floating point? I know it had a great F-18 flight sim that seems like it would have been difficult with integer math, but people do amazing things with integer math.
That was still true with VLB (VESA Local Bus) and later revisions of the Tseng ET4000 chipset, which was what made the SpeedStars fly. (At 50 MHz! Harr!)
I had the Maxis game El-fish, and the part of the game where you'd breed fish was almost unplayable on a 486SX because the SX lacked a math coprocessor. I wanted a math coprocessor, but as a kid in pre-internet times, I had no clue how to get one.
Maybe I'll try playing it in DOSBox and see what I was missing out on.
Technically you had one on the 486SX; it was just disabled. Probably*
As I recall the "co-processor" slot on those boards was another CPU slot, with an extra pin; and when filled it took over and disabled the original CPU.
My guess is that if there’s any truth to it that was early on before the process was fine-tuned. It’s hard to believe Intel would have tolerated a process producing that many errors since they could use the same die space to make more smaller chips.
Seems perfectly plausible. CPU binning has been done for years. Made an 8-core chip, but one core is faulty? Disable some cores and sell it as a 6- or 4-core chip. Chip won't run at the frequency expected? Sell it as a lower-spec CPU.
Intel used to sell cheap Core Solos back in the Core Duo days, which were just Duos where one of the cores was faulty, so they disabled it.
Same with the F-model (no iGPU) chips today. People have done teardowns and found that, more often than not, it's there on the die, just disabled because it's faulty.
Good memories. I had a Quantex 386 DX 40 with a math coprocessor. I used to churn out 3D Studio images one rendered block at a time over hours and hours and hours.
And I remember downloading 3D rendered images from BBSes back when I couldn't produce them myself; I was a kid at the time and 3D Studio was way above my abilities or my computer's power - I had an XT, then a 286 AT. Good times.
Don't ever believe them: I was taking Pascal and had just switched from a Prime computer to a PC for a large program. I had to finish some major part over the weekend, so I went to my brother's house. He had a co-processor, and Turbo-87 was a hell of a lot faster, so back at school I snuck into the physics lab with the 8087 machines and programmed there. When I got my own machine a few years later, I went out of my way to get an 8087. Lotus 1-2-3 and Turbo Pascal ran amazingly fast. Six years later I got a hand-me-down 33 MHz 386. Upgraded the RAM, upgraded the processor to 40 MHz, and got the co-processor (ITT 87-II). Again, with the math co-processor, amazing speed. Finally got a hand-me-down 486 to replace the 386, and ka-ching! It was not a plain 486, it was a Pentium OverDrive with co-processor. Got into AutoCAD at that time, and... people in the lab were complaining that there were no co-processors. I could not tell them that it ran much faster on my machine at home, because they would know where I got the software.
My first computer was a Tandy 486sx from RadioShack. One day I brought home the book "Becoming A Computer Animator". It came with a disc containing a copy of TrueSpace. But I couldn't run it because it required a math coprocessor.
My dad was not a computer person, so he was not very thrilled about us opening up this mysterious box. But, he agreed, and together we installed the chip. And the software worked!
One thing that was amazing back then was the step function of upgrading your computer. At that point in Moore's Law, big updates didn't come at the ridiculous pace they reached later, so if you had a 486, you had a 486 for a while. But add a SoundBlaster card (or the AdLib card for us lower-middle-class folks), and suddenly your computer came to life in a way you didn't realize was possible.
It's not surprising that the PS/2 model 80 would have MS-DOS games compatibility problems, even if running PC-DOS or MS-DOS rather than OS/2.
The Model 80 was one of the first-gen PS/2 systems (which tried to replace the original IBM PC's fairly open industry standard architecture with things like MCA).
IIRC the Model 80 was also IBM's first '386 PC, though IBM had been beaten to the '386 PC by Compaq and Advanced Logic Research (IIRC).
Also, for a long time, MS-DOS games didn't necessarily work out of the box even on more conventional PCs, and you might have to mess with drivers, IRQs, etc.
One of my first paid jobs, in the summer after my college freshman year (1988), was some consulting work for an insurance salesman. He had an IBM 8086-based machine, and it was taking 15 hours to run an insurance estimate.
I got him an 8087 math coprocessor and it reduced the time to about 10 minutes. I think it cost around $500 at the time.
I made my first $50 in tech, and his business was demonstrably improved. What a thrill!
> Well it’s been no secret, but OS/2 6.123 on my PS/2 model 80, is insanely unstable running simple MS-DOS based games (large EXE’s)
Lost me here. I developed several systems based on OS/2 (both 16 and 32 bit) and don't recall this kind of stability issue. Different use case, I guess. I was using IBM's C/C++ tool chain and did some scripting in REXX, no big DOS games.
I know everyone has super rose-tinted glasses when it comes to OS/2, but try it on period hardware, with what shipped in 1987. It's terrible. And yes, it runs great on Pentiums or anything new under emulation, as clearly the larger fault is the 1987 hardware, namely the 386.
I was using it on period H/W; in at least one case it was an IBM PS/2 with an 8514 graphics adapter, and I was writing to it using some graphics primitives. The only wonky thing I recall was that if I cranked up the compiler optimizations, it just drew lines at various angles all over the screen instead of the images I needed. IIRC there were a few cranky things about writing directly to the graphics, but I don't recall problems otherwise.
I'd suspect PS/2 compatibility problems, or "old machine" problems like dust in the empty socket that no longer makes contact when the socket has been filled with the FPU chip.
PS/2 systems weren't fully "PC compatible"; there was lots of stuff they would have trouble with. They had different port ranges, and the BIOS had some infelicities, as I recall.
AutoCAD for DOS shipped at some point with a cool NASA Space Shuttle demo, I remember seeing it render (can't remember if on a 286 or 386) without a co-processor and it took several minutes, as each line slowly rendered on screen.
The same demo with the same machine now equipped with a co-processor, it took just a few seconds.
As a kid, I knew nothing about FPU nor math co-processors but that incredible speed bump had a long lasting impression on me.
AutoCAD was supposed to require a math co-processor, but there were software co-processor emulators available which could, slowly, do floating point. That's probably what you saw.
AutoCAD was 64-bit floating point (and sometimes 80-bit internally). Trying to do 80-bit floating point computations on a 16-bit CPU was painfully slow.
Previous low-end CAD systems were 32-bit, and large drawings would have errors. That's why the Golden Gate Bridge drawing was a common AutoCAD demo. Zoomed out, you could see the whole bridge, and you could zoom in on the details. Big selling point.
(I did some of the early AutoCAD ports. Not all DOS machines were fully compatible with the IBM PC. There were many 80286 machines which were mostly-compatible. Each needed its own software.
Every display, mouse, and plotter needed its own driver. Autodesk had a big room in Sausalito with one each of everything they supported. For a while, it looked like the Texas Instruments TI-Pro was going to win, because it had a color display good enough for text. But it was an awful computer. No memory parity and bad parts. I had to compile everything twice and compare the results.)
I remember my first Mac lacked a math coprocessor. It was the Motorola 68LC040, and it came with the Quadra 604. I seem to remember it making Excel slow at doing some calculations. My next Mac was a PowerPC; I don't remember any of those lacking math coprocessors.
A whole range of CPUs have integrated GPUs, but nothing on the high end.
I think there are technical limitations re. die size and heat dissipation if we wanted to integrate a high end CPU and a high end GPU in the same package.
Precisely this. Most modern CPUs would be better called SoCs, with CPU/GPU/xPU cores all in a single package. If you want any sort of stopping power in terms of core count or clock speed, you need massive amounts of cooling, hence why GPUs are sold as bulky PCIe cards.
I've often thought Intel kind of ignored the discrete GPU market for so long because they just assumed they would eventually absorb those functions into the CPU like they did with everything else.
Current CPU vector instruction sets are as good as the ones on GPUs and they also have tensor/matrix instructions comparable to the ones on GPUs.
However, usually CPUs don't have enough cores and execution units to compete with GPUs, because they spend the transistors on out-of-order execution, cache and other features instead.
Intel Larrabee was the only attempt to actually make a CPU with a GPU-sized vector unit and they dropped it.
Wow. This is interesting. It used to be everything had to be on die... then on package... and with recent architectures, multiple dies, multiple processes... things are again exploding.
You read the article, which is cool, but Wi-Fi was on the chipset during the Centrino days, then WinModems (blows chunks), and now we have Wi-Fi/BT/audio/kitchen sink on chip.
>> The AC 9560 and 9460 family of wireless (Wi-Fi + Bluetooth) modules are the first generation of CNVi modules.[3] They are only compatible with systems running Intel Gen. 8 or 9 processor on adapted motherboards.