BGB <
[email protected]> writes:
Say, how well IA-64 could perform if only given, say, 16K of L1I$ and
128K of L2 cache, ...
Itanium (Merced) has 16KB I-cache and 96KB L2 cache.
Itanium 2 (McKinley) has 16KB I-cache and 256KB L2 cache.
They both have L3 caches.
But these CPUs actually do fine (for their time) on HPC-style stuff,
so the cache sizes are not the main problem. They perform badly at
code where the compiler cannot predict the branches well, even on
code that tends to perform well with small caches.
Of course, the Cortex-A53 and Bonnell also performs badly, for the
same reason, and Intel learned the lesson and replaced the in-order
Bonnell with the OoO line beginning with Silvermont, and up to the
recent Gracemont (Alder Lake E-core). Apple also went for OoO
E-cores. Only ARM is sticking to in-order cores.
- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <
[email protected]>
--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)