This is a comparison of ARM instruction set architecture application processor cores designed by Arm Holdings (ARM Cortex-A) and 3rd parties. It does not include ARM Cortex-R, ARM Cortex-M, or legacy ARM cores.

ARMv7-A

This is a table comparing 32-bit central processing units that implement the ARMv7-A (A means Application) instruction set architecture and mandatory or optional extensions of it, the last AArch32.

CoreDecode widthExecution portsPipeline depthOut-of-order executionFPUPipelined VFPFPU registersNEON (SIMD)big.LITTLE roleVirtualizationProcess technologyL0 cacheL1 cacheL2 cacheCore configurationsSpeed per core (DMIPS/ MHz)ARM part number (in the main ID register)
ARM Cortex-A518NoVFPv4 (optional)16 × 64-bit64-bit wide (optional)NoNo40/28 nm4–64 KiB / core1, 2, 41.570xC05
ARM Cortex-A7258NoVFPv4 (optional)Yes(16 or 32) × 64-bit64-bit wideLITTLEYes40/28 nm8–64 KiB / coreup to 1 MiB (optional)1, 2, 4, 81.90xC07
ARM Cortex-A82213NoVFPv3No32 × 64-bit64-bit wideNoNo65/55/45 nm32 KiB + 32 KiB256 or 512 (typical) KiB12.00xC08
ARM Cortex-A9238–11YesVFPv3 (optional)Yes(16 or 32) × 64-bit64-bit wide (optional)Companion CoreNo65/45/40/32/28 nm32 KiB + 32 KiB1 MiB1, 2, 42.50xC09
ARM Cortex-A12211YesVFPv4Yes32 × 64-bit128-bit wideNoYes28 nm32–64 KiB + 32 KiB256 KiB, to 8 MiB1, 2, 43.00xC0D
ARM Cortex-A153815/17-25YesVFPv4Yes32 × 64-bit128-bit widebigYes32/28/20 nm32 KiB + 32 KiB per coreup to 4 MiB per cluster, up to 8 MiB per chip2, 4, 8 (4×2)3.5 to 4.010xC0F
ARM Cortex-A17211+YesVFPv4Yes32 × 64-bit128-bit widebigYes28 nm32 KiB + 32 KiB per core256 KiB, up to 8 MiBup to 44.00xC0E
Qualcomm Scorpion2310Yes (FXU&LSU only)VFPv3Yes128-bit wideNo65/45 nm32 KiB + 32 KiB256 KiB (single-core) 512 KiB (dual-core)1, 22.10x00F
Qualcomm Krait3711YesVFPv4Yes128-bit wideNo28 nm4 KiB + 4 KiB direct mapped16 KiB + 16 KiB 4-way set associative1 MiB 8-way set associative (dual-core) / 2 MiB (quad-core)2, 43.3 (Krait 200) 3.39 (Krait 300) 3.39 (Krait 400) 3.51 (Krait 450)0x04D 0x06F
Swift3512YesVFPv4Yes32 × 64-bit128-bit wideNo32 nm32 KiB + 32 KiB1 MiB23.5?
CoreDecode widthExecution portsPipeline depthOut-of-order executionFPUPipelined VFPFPU registersNEON (SIMD)big.LITTLE roleVirtualizationProcess technologyL0 cacheL1 cacheL2 cacheCore configurationsSpeed per core (DMIPS / MHz)ARM part number (in the main ID register)

ARMv8-A

This is a table of central processing units (CPUs) that implement the ARMv8-A instruction set architecture and mandatory or optional extensions of it. Almost all of these CPUs support the 64-bit AArch64 Execution State, and many of them support the 32-bit AArch32 Execution State for legacy applications. All chips of this type have a floating-point unit (FPU) that is better than the one in older ARMv7-A and NEON (SIMD) CPUs. Some of these CPUs have coprocessors also include cores from the older 32-bit architecture (ARMv7). Some of the CPUs are SoCs and can combine both ARM Cortex-A53 and ARM Cortex-A57, such as the Samsung Exynos 7 Octa.

CompanyCoreReleasedRevision32-bit supportDecodePipeline depthOut-of-order executionBranch predictionbig.LITTLE roleExec. portsSIMDFab (in nm)Simult. MTL0 cacheL1 cache Instr + Data (in KiB)L2 cacheL3 cacheCore configurationsSpeed per core (DMIPS/MHz)Clock rateARM part number (in the main ID register)
Have itEntries
ARMCortex-A322017ARMv8.0-A (only 32-bit)Yes2-wide8No0?LITTLE??28NoNo8–64 + 8–640–1 MiBNo1–4+2.3?0xD01
Cortex-A342019ARMv8.0-ANo2-wide8No0?LITTLE???NoNo8–64 + 8–640–1 MiBNo1–4+??0xD02
Cortex-A352017ARMv8.0-AYes2-wide8No0YesLITTLE??28 / 16 / 14 / 10NoNo8–64 + 8–640 / 128 KiB–1 MiBNo1–4+1.7-1.85?0xD04
Cortex-A532014ARMv8.0-AYes2-wide8No0Conditional+ Indirect branch predictionbig/LITTLE2?28 / 20 / 16 / 14 / 12 / 10 / 4NoNo8–64 + 8–64128 KiB–2 MiBNo1–4+2.24?0xD03
Cortex-A552017ARMv8.2-AYes2-wide8No0big/LITTLE2?28 / 20 / 16 / 14 / 12 / 10 / 5NoNo16–64 + 16–640–256 KiB/core0–4 MiB1–8+2.65?0xD05
Cortex-A572013ARMv8.0-AYes3-wide15Yes 3-wide dispatch??big8?28 / 20 / 16 / 14NoNo48 + 320.5–2 MiBNo1–4+4.1-4.8?0xD07
Cortex-A652019ARMv8.2-ANo2-wide10-12Yes 4-wide dispatchTwo-level?9?SMT2No32–64 + 32–64 KiB0, 64–256 KiB0, 0.5–4 MiB1-8??0xD06
Cortex-A65AE2019ARMv8.2-ANo??YesTwo-level?2?SMT2No32–64 + 32–64 KiB64–256 KiB0, 0.5–4 MiB1–8??0xD43
Cortex-A722015ARMv8.0-AYes3-wide15Yes 5-wide dispatchTwo-levelbig828 / 16NoNo48 + 320.5–4 MiBNo1–4+4.7-6.3?0xD08
Cortex-A732016ARMv8.0-AYes2-wide11–12Yes 4-wide dispatchTwo-levelbig728 / 16 / 10NoNo64 + 32/641–8 MiBNo1–4+4.8–8.5?0xD09
Cortex-A752017ARMv8.2-AYes3-wide11–13Yes 6-wide dispatchTwo-levelbig8?2*128b28 / 16 / 10NoNo64 + 64256–512 KiB/core0–4 MiB1–8+6.1–9.5?0xD0A
Cortex-A762018ARMv8.2-AEL0 only4-wide11–13Yes 8-wide dispatch128Two-levelbig82*128b10 / 7NoNo64 + 64256–512 KiB/core1–4 MiB1–46.4?0xD0B
Cortex-A76AE2018ARMv8.2-AEL0 only??Yes128Two-levelbig??NoNo??????0xD0E
Cortex-A772019ARMv8.2-AEL0 only4-wide11–13Yes 10-wide dispatch160Two-levelbig122*128b7No1.5K entries64 + 64256–512 KiB/core1–4 MiB1–47.3?0xD0D
Cortex-A782020ARMv8.2-AEL0 only4-wideYes160Yesbig132*128bNo1.5K entries32/64 + 32/64256–512 KiB/core1–4 MiB1–47.6-8.2?0xD41
Cortex-X12020ARMv8.2-AEL0 only5-wide?Yes224Yesbig154*128bNo3K entries64 + 64up to 1 MiBup to 8 MiBcustom10-11?0xD44
AppleCyclone2013ARMv8.0-A?6-wide16Yes192YesNo928NoNo64 + 641 MiB4 MiB2?1.3–1.4 GHz
Typhoon2014ARMv8.0‑A?6-wide16YesYesNo920NoNo64 + 641 MiB4 MiB2, 3 (A8X)?1.1–1.5 GHz
Twister2015ARMv8.0‑A?6-wide16YesYesNo916 / 14NoNo64 + 643 MiB4 MiB No (A9X)2?1.85–2.26 GHz
Hurricane2016ARMv8.0‑A?6-wide16Yes"big" (In A10/A10X paired with "LITTLE" Zephyr cores)93*128b16 (A10) 10 (A10X)NoNo64 + 643 MiB (A10) 8 MiB (A10X)4 MiB (A10) No (A10X)2x Hurricane (A10) 3x Hurricane (A10X)?2.34–2.36 GHz
ZephyrARMv8.0‑A?3-wide12YesLITTLE516 (A10) 10 (A10X)NoNo32 + 321 MiB4 MiB (A10) No (A10X)2x Zephyr (A10) 3x Zephyr (A10X)?1.09–1.3 GHz
Monsoon2017ARMv8.2‑A?7-wide16Yes"big" (In Apple A11 paired with "LITTLE" Mistral cores)113*128b10NoNo64 + 648 MiBNo2x Monsoon?2.39 GHz
MistralARMv8.2‑A?3-wide12YesLITTLE510NoNo32 + 321 MiBNoMistral?1.19 GHz
Vortex2018ARMv8.3‑A?7-wide16Yes"big" (In Apple A12/Apple A12X/Apple A12Z paired with "LITTLE" Tempest cores)113*128b7NoNo128 + 1288 MiBNo2x Vortex (A12) 4x Vortex (A12X/A12Z)?2.49 GHz
TempestARMv8.3‑A?3-wide12YesLITTLE57NoNo32 + 322 MiBNo4x Tempest?1.59 GHz
Lightning2019ARMv8.4‑A?8-wide16Yes560"big" (In Apple A13 paired with "LITTLE" Thunder cores)113*128b7NoNo128 + 1288 MiBNo2x Lightning?2.65 GHz
ThunderARMv8.4‑A?3-wide12YesLITTLE57NoNo96 + 484 MiBNo4x Thunder?1.8 GHz
Firestorm2020ARMv8.4-A?8-wideYes630"big" (In Apple A14 and Apple M1/M1 Pro/M1 Max/M1 Ultra paired with "LITTLE" Icestorm cores)144*128b5No192 + 1288 MiB (A14) 12 MiB (M1) 24 MiB (M1 Pro/M1 Max) 48 MiB (M1 Ultra)No2x Firestorm (A14) 4x Firestorm (M1) 6x or 8x Firestorm (M1 Pro) 8x Firestorm (M1 Max) 16x Firestorm (M1 Ultra)?3.0–3.23 GHz
IcestormARMv8.4-A?4-wideYes110LITTLE72*128b5No128 + 644 MiB 8 MiB (M1 Ultra)No4x Icestorm (A14/M1) 2x Icestorm (M1 Pro/Max) 4x Icestorm (M1 Ultra)?1.82–2.06 GHz
Avalanche2021ARMv8.6‑A?8-wideYes"big" (In Apple A15 and Apple M2/M2 Pro/M2 Max/M2 Ultra paired with "LITTLE" Blizzard cores)144*128b5No192 + 12812 MiB (A15) 16 MiB (M2) 32 MiB (M2 Pro/M2 Max) 64 MiB (M2 Ultra)No2x Avalanche (A15) 4x Avalanche (M2) 6x or 8x Avalanche (M2 Pro) 8x Avalanche (M2 Max) 16x Avalanche (M2 Ultra)?2.93–3.49 GHz
BlizzardARMv8.6‑A?4-wideYesLITTLE82*128b5No128 + 644 MiB 8 MiB (M2 Ultra)No4x Blizzard?2.02–2.42 GHz
Everest2022ARMv8.6‑A?8-wideYes"big" (In Apple A16 paired with "LITTLE" Sawtooth cores)144*128b5No192 + 12816 MiBNo2x Everest?3.46 GHz
SawtoothARMv8.6‑A?4-wideYesLITTLE82*128b5No128 + 644 MiBNo4x Sawtooth?2.02 GHz
NvidiaDenver2014ARMv8‑A?2-wide ARM or binary translated VLIW13If translated into VLIW code by softwareDirect+ Indirect branch predictionNo728NoNo128 + 642 MiBNo2??
Denver 22016ARMv8‑A??13If translated into VLIW code by softwareDirect+ Indirect branch prediction"Super" Nvidia's own implementation?16NoNo128 + 642 MiBNo2??
Carmel2018ARMv8.2‑A??Direct+ Indirect branch prediction?12NoNo128 + 642 MiB(4 MiB @ 8 cores)2 (+ 8)6.5-7.4?
CaviumThunderX2014ARMv8-A?2-wide9YesTwo-level?28NoNo78 + 3216 MiBNo8–16, 24–48??
ThunderX22018ARMv8.1-A?4-wide "4 μops"?YesMulti-level??16SMT4No32 + 32 (data 8-way)256 KiB per core1 MiB per core16–32??
MarvellThunderX32020ARMv8.3+?8-wide?Yes 4-wide dispatchMulti-level?77SMT4?64 + 32512 KiB per core90 MiB60??
Applied MicroHelix2014????????40 / 28NoNo32 + 32 (per core; write-through w/parity)256 KiB shared per core pair (with ECC)1 MiB/core2, 4, 8??
X-Gene2013??4-wide15Yes???40NoNo8 MiB84.2?
X-Gene 22015??4-wide15Yes???28NoNo8 MiB84.2?
X-Gene 32017????????16NoNo??32 MiB32??
QualcommKryo2015ARMv8-A???YesTwo-level?"big" or "LITTLE" Qualcomm's own similar implementation?14NoNo32+240.5–1 MiB2+26.3?
Kryo 2002016ARMv8-A?2-wide11–12Yes 7-wide dispatchTwo-levelbig714 / 11 / 10 / 6NoNo64 + 32/64?512 KiB/Gold CoreNo4?1.8–2.45 GHz
?2-wide8No0Conditional+ Indirect branch predictionLITTLE28–64? + 8–64?256 KiB/Silver Core4?1.8–1.9 GHz
Kryo 3002017ARMv8.2-A?3-wide11–13Yes 8-wide dispatchTwo-levelbig810NoNo64+64256 KiB/Gold Core2 MiB2, 4?2.0–2.95 GHz
?2-wide8No0Conditional+ Indirect branch predictionLITTLE2816–64? + 16–64?128 KiB/Silver4, 6?1.7–1.8 GHz
Kryo 4002018ARMv8.2-A?4-wide11–13Yes 8-wide dispatchYesbig811 / 8 / 7NoNo64 + 64512 KiB/Gold Prime 256 KiB/Gold2 MiB2, 1+1, 4, 1+3?2.0–2.96 GHz
?2-wide8No0Conditional+ Indirect branch predictionLITTLE216–64? + 16–64?128 KiB/Silver4, 6?1.7–1.8 GHz
Kryo 5002019ARMv8.2-A?4-wide11–13Yes 8-wide dispatchYesbig8 / 7No?512 KiB/Gold Prime 256 KiB/Gold3 MiB2, 1+3?2.0–3.2 GHz
?2-wide8No0Conditional+ Indirect branch predictionLITTLE2?128 KiB/Silver4, 6?1.7–1.8 GHz
Kryo 6002020ARMv8.4-A?4-wide11–13Yes 8-wide dispatchYesbig6 / 5No?64 + 641024 KiB/Gold Prime 512 KiB/Gold4 MiB2, 1+3?2.2–3.0 GHz
?2-wide8No0Conditional+ Indirect branch predictionLITTLE2?128 KiB/Silver4, 6?1.7–1.8 GHz
Falkor2017"ARMv8.1-A features"No4-wide10–15Yes 8-wide dispatchYes?810No24 KiB88 + 32500KiB1.25MiB40–48??
SamsungM12016ARMv8-A?4-wide13Yes 9-wide dispatch96big814NoNo64 + 322 MiBNo4?2.6 GHz
M22017ARMv8-A?4-wide100Two-levelbig10NoNo64 + 642 MiBNo4?2.3 GHz
M32018ARMv8.2-A?6-wide15Yes 12-wide dispatch228Two-levelbig1210NoNo64 + 64512 KiB per core4096KB4?2.7 GHz
M42019ARMv8.2-A?6-wide15Yes 12-wide dispatch228Two-levelbig128 / 7NoNo64 + 64512 KiB per core3072KB2?2.73 GHz
M52020ARMv8.2-A?6-wideYes 12-wide dispatch228Two-levelbig7NoNo64 + 64512 KiB per core3072KB2?2.73 GHz
FujitsuA64FX2019ARMv8.2-A?4/2-wide7+Yes 5-way?Yes—N/a8+2*512b7NoNo64 + 648MiB per 12+1 coresNo48+4?1.9 GHz+
HiSiliconTaiShan V1102019ARMv8.2-A?4-wide?Yes—N/a87NoNo64 + 64512 KiB per core1 MiB per core???
CompanyCoreReleasedRevision32-bit supportDecodePipeline depthOut-of-order executionBranch predictionbig.LITTLE roleExec. portsSIMDFab (in nm)Simult. MTL0 cacheL1 cache Instr + Data (in KiB)L2 cacheL3 cacheCore configurationsSpeed per core (DMIPS/MHz)Clock rateARM part number (in the main ID register)

ARMv9-A

CompanyCoreReleasedRevision32-bit supportDecodePipeline depthOut-of-order executionBranch predictionbig.LITTLE roleExec. portsSIMDFab (in nm)Simult. MTL0 cacheL1 cache Instr + Data (in KiB)L2 cacheL3 cacheCore configurationsSpeed per core (DMIPS/MHz)Clock rateARM part number (in the main ID register)
Have itEntries
ARMCortex-A510May 2021ARMv9-AEL0 only (2022 refresh only)3-wide8 stagesNoN/A (does not support out-of-order execution)Advanced techniques similar to larger cores, specifics not disclosedLITTLE2 execution portsYes5nmNoN/A32 or 64 KB eachConfigurable, typically 128 KB to 512 KBN/ATypically paired with Cortex-A710 in configurations (e.g., 1+3)Not explicitly stated, but performance uplift of 35% over A55Up to 2.85 GHz (varies by implementation)0xD46
Cortex-A710May 2021ARMv9-AEL0 only4-wide10 stagesYes160 entriesEnhanced with larger structures and better accuracybig13 execution portsYes5nmYesNot specified64/128 KiB each256/512 KiBOptional, up to 16 MiBTypically 1+3+4 (big.LITTLE)Not specified in resultsUp to 3.0 GHz (approx.)0xD47
Cortex-A715June 2022ARMv9-ANo5-wide10 stagesYes160 entriesAdvanced branch prediction capabilitiesbig13 execution portsYes4nmYesNot specified64 KiB each1 MiB16 MiB (in certain configurations)1+3+4 or similar setupsNot specified, but designed for high efficiencyUp to 2.8 GHz0xD4D
Cortex-X2May 2021ARMv9-ANo5-wide10 stagesYes288 entriesAdvanced, with improved accuracybig15 execution portsYes5nmYesNot specified64 KiB each1 MiB8 MiB1+3+4 (X2+A710+A510)Not specifiedUp to 3.2 GHz0xD48
Cortex-X3June 2022ARMv9-ANo6-wide9 stagesYes320 entriesAdvanced branch prediction capabilitiesbig15 execution portsYes4nmYesNot specified64 KiB each1 MiB16 MiB1+3+4 or up to 8+4Not specifiedUp to 3.6 GHz0xD4E

See also

Notes