| |||
| Home > Programmers Model > Instruction throughput and latency > Instruction throughput and latency | |||
Table 2.1 shows:
the FPU instruction throughput and latency cycles for all operations except loads, stores and system register accesses
the old ARM assembler mnemonics and the ARM Unified Assembler Language (UAL) mnemonics.
Table 2.1. FPU instruction throughput and latency cycles
| Old ARM assembler mnemonic | UAL | Single Precision | Double Precision | ||||
|---|---|---|---|---|---|---|---|
| Throughput | Latency | Throughput | Latency | ||||
| Fwd | Wbck | Fwd | Wbck | ||||
|
| 1 | 4 | 1 | 4 | ||
|
| 1 | 5 | 2 | 6 | ||
|
| 1 | 8 | 2 | 9 | ||
|
| 1 | 1 | 2 | 1 | 1 | 2 |
| VMOV[a] | 1 | - | 3[b] | 1 | - | 3[b] |
|
| 1 | 1 | 2 | 1 | 1 | 2 |
FMSTAT | VMRS | 1 | - | 3[b] | 1 | - | 3[b] |
FDIV |
| 10 | 15 | 20 | 25 | ||
FSQRT |
| 13 | 17 | 28 | 32 | ||
|
| 1 | 1 | 4 | 1 | 1 | 4 |
| - |
| 1 | 2 | 2 | - | - | - |
| - |
| 1 | - | 4 | - | - | - |
[a] FPU to ARM. [b] The writeback number for these instructions is given from an ARM core writeback point of view. It reflects the penalty of moving data from the FPU into the ARM core register file before the following ARM instruction can use the moved data. [c] ARM to FPU. | |||||||