x86 came out 1978,
21 years after, x64 came out 1999
we are three years overdue for a shift, and I don’t mean to arm. Is there just no point to it? 128 bit computing is a thing and has been in the talks since 1976 according to Wikipedia. Why hasn’t it been widely adopted by now?
What are you gonna do with 128 that you can’t do with 64
Because there is no need from an address space or compute standpoint.
to understand how large 128bit memory space really is; you’d need a memory size larger than all the number of atoms in the solar system
In the rare cases where you need to deal with a 128bit integer or floating, you can do it in software with not that much overhead by concatenating registers/ops. There hasn’t been enough pressure in terms of use cases that need 128bit int/fp precision for manufacturers to invest the resources in die area to add direct HW support for it.
FWIW there have been 64bit computers since the 60s/70s.
I think what you need to know, in layman terms, is that 128bit is not the double of 64bit. 65bit is double the amount of 64bit.
128bit is an absurd huge amount. And 64 is so much that even I as a radar engineer do not have to worry about it for a second.
Lots of good responses regarding why 128-bit isn’t a thing, but I’d like to talk about something else.
Extrapolating from two data points is a folly. It simply can’t work. You can’t take two events, calculate the time between them, and then assume that another event will happen after the same amount of time.
Besides, your points are wrong. (Edit: That also has been mentioned in another response.)
x86 (8086) came out in 1978 as a 16-bit CPU. 32-bit came with the 386 in 1985. x64, although described in 1999, was released in 2003.
So now you have three data points: 1978 for 16-bit, 1985 for 32-bit and 2003 for 64-bit. Differences are 7 years and 18 years.
Not that extrapolating from 3 points is good practice, but at least it’s more meaningful. You could, for example, conclude that it took about 2.5 times more to move from 32-bit to 64-bit than it did from 16-bit to 32-bit. Multiply 18 years by 2.5 and you get 45 years. So the move from 64-bit to 128-bit would be expected in 2003+45 = 2048.
This is nonsense, of course, but at least it’s a calculation backed by some data (which is still rather meaningless data).
There have been a number of 128bit systems over the years.
As it is, 64bit should be good for the life of x86This is a bit pedantic, but x64 refers to Alpha, which existed long before 1999. 64 bit x86 (x86-64, or amd64) wasn’t purchasable until 2003, although it was announced in 2000.
There were several additional shifts between 1978 and 2003:
8088
/8086
has what’s essentially bank switched 16 bit addressing which gives 1 MB, or 2^20 bytes80286
has physical support for 16 megs, or 2^24 bytes80386
has physical support 4 gigs, or 2^32 bytesPentium Pro
has PAE support for 64 gigs, or 2^36 bytesAMD Opteron
from 2003 has support for 1024 gigs, or 1 terabyte, or 2^40 bytes- Current
AMD
andIntel
CPUs physically support anywhere between 2^48 and 2^57 bytes of physical hardware (256 terabytes to 128 petabytes)
But let’s just use three points of data:
8086
/8088
,80386
, and let’s say the first 64 bitAMD Opteron
supports 64 bits:8086
/8088
, 1978, 20 bits80386
, 1985, 32 bitsAMD Opteron
, 2003, 64 bits
1978 to 1985 is 7 years, with a change in addressing of 12 bits, or about .6 bits per year.
1985 to 2003 is 18 years, with a change in addressing of 32 bits, or about .56 bits per year. So far, pretty consistent.
How long would it take to go from 64 bits to 128 bits? At around .56 bits per year, that’d be about 114 years, and we’ve had twenty so far.
Check back in 94 years.
Let me put it this way: computing is evolving in a way where SMALLER registers are actually more important for new types of algorithmical necessities. AI/ML is a great example - you have to program in specialty frameworks such as CUDA or Tensorflow which want to have registers as small as 8bit so that things are done faster, in the GPU or in L1/2 cache. The hardware of GPUs for instance is made with 8 and 16b logical processing units in mind.
Larger registers only really help a portion of computing, while you can emulate the odd large register you may need without affecting performance THAT much with a combination of smaller registers.
Because 2 to the power of 64 is a stupidly big number.
It is many times less than 2 to the power of 32 because you’ve went ahead and doubled it 32 times to get to 64bits.
What is this? The console wars of the 90s all over again?
Because it was, vector registers have crazy size today, 1Kb+.
Other people have addressed why 64-bit is still fine, but I just want to say that “x86” and “x64” are not two different architectures the way that you’re presenting them. We still use the x86 architecture, it’s just that x86-64, or AMD64, or whatever you want to call it, is a 64-bit extension of that architecture.
And this isn’t the first time that happened; the original 8086 was a 16-bit processor, as was the 286. The 386, however, was a 32-bit processor with backward compatibility for the 16-bit software built for the 16-bit x86 CPUs.
The 386 came out in 1985, so there’s actually a 14 year gap, though actually actually an 18 year gap because a 64-bit x86 processor didn’t actually hit the market until 2003. And then there was a 7-year gap between 16 and 32-bit x86.
But ultimately as other people have said the answer is that we don’t need to go beyond 64-bit right now, and the reason there was such a short gap between 16 and 32-bit processors was because the limitations of a 16-bit architecture became practical obstacles to progress faster than they did for 32-bit, and it’s going to be much longer than that for 64-bit because the address space has grown exponentially, not linearly.
tl;dr: we could but what for?
Practically all comments here are wrong although a few does mention why they are wrong: the address space has nothing to do with the bitness of the CPU.
Now, let’s review what’s what.
Let’s say you want to get the word “GRADIENT” from the memory into the CPU. Using a 8 bit instruction set you need to loop eight instructions. A 16 bit instruction set need four instructions; GR, AD, IE, NT. A 32 bit CPU only two and a 64 bit instruction can read it in a single step. Most of the time the actual CPU facilities will match the instruction set – in the early days, the Motorola 68000 for example had a 16 bit internal data bus and a 16 bit ALU but had a 32 bit instruction set. This was fixed in the 68020. This “merely” meant the 68000 needed internally twice as much time as the 68020 to do anything.
Now, in the past the amount of memory addressable has often been larger than what a single register could address. For example, the famous 8086/8088 CPUs had 20 bit address space while they were 16 bit CPUs. The Pentium Pro was a 32 bit CPU with a 36 bit address bus. These tricks, as the RISC-V instruction set manual drily notes
History suggests that whenever it becomes clear that more than 64 bits of address space is needed, architects will repeat intensive debates about alternatives to extending the address space, including segmentation, 96-bit address spaces, and software workarounds, until, finally, flat 128- bit address spaces will be adopted as the simplest and best solution.
That manual thinks we might need more than 64 bit address space before 2030. And to be fair going to 128 bit is not a big engineering challenge, not for a long time now, after all as early as 1999 even desktop Intel CPUs have included some 128 bit registers although for vector processing only. (A computer with a 128 bit general processor register existed in the 70s.)
Let’s review why we needed 64 bit! Say you want to number your records in a database, if you do that with a 32 bit register then you can have four billion records and game over. Sure you can store your number on two machine words but it’ll be slower. As an example there are more than four billion humans so this was a very real, down-to-the-earth limit which we needed to move on from. Also as per the note above, it’s much nicer to have a big single address space than all the tricks which were running out fast, 64GB was addressable and even run-of-the-mill servers were able to reach 16GB. 64 bits can address 16 billion billion records or bytes of memory, this seems to be fine for now. Notably current CPUs can only address 57 bits worth of physical memory so a hundredfold increase is still possible compared to currently existing machines.
Going 128 bit would require defining a whole new instruction set or at least an extension of one existing. RISC-V has a draft for RV128I but even they didn’t bother fully fleshing it out yet. Each register, internal bus and processing unit widening to 128 bit would consume significant silicon area. The memory usage of everything would at least double (note Apple still selling 8GB laptops at top dollar in 2023). So there are significant drawbacks and so far we have been fine with delegating the 128 bit computing to vector processing units in CPUs and GPUs.
So:
- Addressing has tricks aplenty should a future system need addressing more than 16 exabytes.
- General purpose computing works fine with 64 bit for now.
The world’s biggest super computer, Frontier, has 9,2 PB of RAM. It’s not available to one CPU, so no need to address everything in one address space, but let’s say it is. That still leaves room to build around 1 000 times more RAM into that theoretical CPU. I’m not sure we would be able to build such a computer today. One that needs more than ~10 000 PB RAM to address, which is what 128 bits means.
Sure, RAM isn’t the only reason for bigger address space, but there are also other ways to handle data beyond one address space. For the consumer, we are far from there.
The 32 bit limit was a real constraint, 64 bit is not. Also, modern architectures do actually compute 128 bit data in parallel (say 4x32 bit), so it’d just be a matter of representing that data on the screen in a 128 bit way. Any actual need for 128 bit can just be emulated, and it’s likely you don’t need to process such data at the limit of a 2023 tier processor anyway. In fact if anything for machine learning the direction seems to be going in the other direction, preferring faster hardware at half-precision (https://en.wikipedia.org/wiki/Half-precision_floating-point_format)
Not needed yet, 64bit was a must back then since 32bit can only handle 4GB, 64bit can handle 18exabyte
I need that much memory to run chrome with 3 tabs.