Introduction

This programming manual provides information for application and system-level software developers. It gives a full description of the STM32F7 Series and STM32H7 Series Cortex®-M7 processor programming model, instruction set and core peripherals.

STM32F7 Series and STM32H7 Series Cortex-M7 processors are high-performance 32-bit processors designed for the microcontroller market.

The Arm® Cortex®-M7 is the highest-performance Cortex-M processor. It combines a six-stage, superscalar pipeline with flexible system and memory interfaces including AXI, AHB, caches and tightly-coupled memories, and delivers a high integer, floating-point and DSP performance in the STM32F7 Series and STM32H7 Series MCUs. It also supports dual-issue of load/load and load/store instruction pairs to multiple memory interfaces.

The Cortex-M7 processor takes advantage of the same easy-to-use, C friendly programmer’s model and is 100% binary compatible with the existing Cortex-M processors and tools. Along with all Cortex-M series processors, it enjoys full support from the Arm Cortex-M ecosystem. The software compatibility enables a simple migration from Cortex-M3 and Cortex-M4 processors.
## Contents

1 About this document ................................................. 14
   1.1 Typographical conventions ................................. 14
   1.2 List of abbreviations for registers ....................... 14
   1.3 About the Cortex®-M7 processor and core peripherals .... 15
      1.3.1 System level interface ................................. 16
      1.3.2 Integrated configurable debug ......................... 16
      1.3.3 Cortex®-M7 processor features and benefits summary . 17
      1.3.4 Cortex®-M7 processor core peripherals ................. 17

2 The Cortex-M7 processor ........................................... 19
   2.1 Programmers model ........................................... 19
      2.1.1 Processor mode and privilege levels for software execution . 19
      2.1.2 Stacks .................................................. 19
      2.1.3 Core registers .......................................... 20
      2.1.4 Exceptions and interrupts ............................. 28
      2.1.5 Data types .............................................. 29
      2.1.6 The Cortex Microcontroller Software Interface Standard (CMSIS) . 29
   2.2 Cortex®-M7 configurations ................................... 30
   2.3 Memory model ................................................ 32
      2.3.1 Memory regions, types and attributes ................ 33
      2.3.2 Memory system ordering of memory accesses .......... 33
      2.3.3 Behavior of memory accesses .......................... 34
      2.3.4 Software ordering of memory accesses ................. 36
      2.3.5 Memory endianness .................................... 36
      2.3.6 Synchronization primitives ............................ 37
      2.3.7 Programming hints for the synchronization primitives . 38
   2.4 Exception model .............................................. 39
      2.4.1 Exception states ....................................... 39
      2.4.2 Exception types ........................................ 39
      2.4.3 Exception handlers .................................... 41
      2.4.4 Vector table ........................................... 42
      2.4.5 Exception priorities ................................... 43
      2.4.6 Interrupt priority grouping ............................ 43
      2.4.7 Exception entry and return ............................ 44
### 2.5 Fault handling

- **2.5.1 Fault types**
- **2.5.2 Fault escalation and hard faults**
- **2.5.3 Synchronous and Asynchronous bus faults**
- **2.5.4 Fault status registers and fault address registers**
- **2.5.5 Lockup**

### 2.6 Power management

- **2.6.1 Entering sleep mode**
- **2.6.2 Wakeup from sleep mode**
- **2.6.3 The external event input**
- **2.6.4 Power management programming hints**

### 3 The Cortex-M7 instruction set

- **3.1 Instruction set summary**
- **3.1.1 Binary compatibility with other Cortex processors**
- **3.2 CMSIS functions**
- **3.3 About the instruction descriptions**
- **3.3.1 Operands**
- **3.3.2 Restrictions when using PC or SP**
- **3.3.3 Flexible second operand**
- **3.3.4 Shift operations**
- **3.3.5 Address alignment**
- **3.3.6 PC-relative expressions**
- **3.3.7 Conditional execution**
- **3.3.8 Instruction width selection**
- **3.4 Memory access instructions**
- **3.4.1 ADR**
- **3.4.2 LDR and STR, immediate offset**
- **3.4.3 LDR and STR, register offset**
- **3.4.4 LDR and STR, unprivileged**
- **3.4.5 LDR, PC-relative**
- **3.4.6 LDM and STM**
- **3.4.7 PLD**
- **3.4.8 PUSH and POP**
- **3.4.9 LDREX and STREX**
- **3.4.10 CLREX**
3.5 General data processing instructions ........................................ 85
  3.5.1 ADD, ADC, SUB, SBC, and RSB ........................................ 87
  3.5.2 AND, ORR, EOR, BIC, and ORN ........................................ 89
  3.5.3 ASR, LSL, LSR, ROR, and RRX ......................................... 90
  3.5.4 CLZ ............................................................... 91
  3.5.5 CMP and CMN ....................................................... 92
  3.5.6 MOV and MVN ....................................................... 93
  3.5.7 MOVT ............................................................ 94
  3.5.8 REV, REV16, REVSH, and RBIT ..................................... 95
  3.5.9 SADD16 and SADD8 .................................................. 96
  3.5.10 SHADD16 and SHADD8 .............................................. 97
  3.5.11 SHASX and SHSAX ................................................... 98
  3.5.12 SHSUB16 and SHSUB8 .............................................. 99
  3.5.13 SSSUB16 and SSUB8 ............................................... 100
  3.5.14 SASX and SSAX .................................................... 101
  3.5.15 TST and TEQ ...................................................... 102
  3.5.16 UADD16 and UADD8 ............................................... 103
  3.5.17 UASX and USAX .................................................... 104
  3.5.18 UHADD16 and UHADD8 ............................................ 105
  3.5.19 UHASX and UHSAX ................................................ 106
  3.5.20 UHSUB16 and UHSUB8 ............................................ 107
  3.5.21 SEL .............................................................. 108
  3.5.22 USAD8 ........................................................... 108
  3.5.23 USADA8 ........................................................... 109
  3.5.24 USUB16 and USUB8 ............................................... 110

3.6 Multiply and divide instructions .......................................... 111
  3.6.1 MUL, MLA, and MLS ................................................ 112
  3.6.2 UMULL, UMAAL, UMLAL ........................................... 113
  3.6.3 SMLA and SMLAW .................................................. 115
  3.6.4 SMLAD ............................................................. 116
  3.6.5 SMLAL and SMLALD ............................................... 117
  3.6.6 SMLSD and SMLSLD ............................................... 119
  3.6.7 SMMLA and SMMLS ............................................... 121
  3.6.8 SMUL ............................................................ 122
  3.6.9 SMUAD and SMUSD ............................................... 123
  3.6.10 SMUL and SMULW ................................................ 124
  3.6.11 UMULL, UMLAL, SMULL, and SMLAL ............................ 126
## Contents

<table>
<thead>
<tr>
<th>Section</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>3.6.12</td>
<td>SDIV and UDIV</td>
</tr>
<tr>
<td>3.7</td>
<td>Saturating instructions</td>
</tr>
<tr>
<td>3.7.1</td>
<td>SSAT and USAT</td>
</tr>
<tr>
<td>3.7.2</td>
<td>SSAT16 and USAT16</td>
</tr>
<tr>
<td>3.7.3</td>
<td>QADD and QSUB</td>
</tr>
<tr>
<td>3.7.4</td>
<td>QASX and QSAX</td>
</tr>
<tr>
<td>3.7.5</td>
<td>QDADD and QDSUB</td>
</tr>
<tr>
<td>3.7.6</td>
<td>UQASX and UQSAX</td>
</tr>
<tr>
<td>3.7.7</td>
<td>UQADD and UQSUB</td>
</tr>
<tr>
<td>3.8</td>
<td>Packing and unpacking instructions</td>
</tr>
<tr>
<td>3.8.1</td>
<td>PKHBT and PKHTB</td>
</tr>
<tr>
<td>3.8.2</td>
<td>SXT and UXT</td>
</tr>
<tr>
<td>3.8.3</td>
<td>SXTA and UXTA</td>
</tr>
<tr>
<td>3.9</td>
<td>Bit field instructions</td>
</tr>
<tr>
<td>3.9.1</td>
<td>BFC and BFI</td>
</tr>
<tr>
<td>3.9.2</td>
<td>SBFX and UBFX</td>
</tr>
<tr>
<td>3.9.3</td>
<td>SXT and UXT</td>
</tr>
<tr>
<td>3.10</td>
<td>Branch and control instructions</td>
</tr>
<tr>
<td>3.10.1</td>
<td>B, BL, BX, and BLX</td>
</tr>
<tr>
<td>3.10.2</td>
<td>CBZ and CBNZ</td>
</tr>
<tr>
<td>3.10.3</td>
<td>IT</td>
</tr>
<tr>
<td>3.10.4</td>
<td>TBB and TBH</td>
</tr>
<tr>
<td>3.11</td>
<td>Floating-point instructions</td>
</tr>
<tr>
<td>3.11.1</td>
<td>VABS</td>
</tr>
<tr>
<td>3.11.2</td>
<td>VADD</td>
</tr>
<tr>
<td>3.11.3</td>
<td>VCMP, VCME</td>
</tr>
<tr>
<td>3.11.4</td>
<td>VCVT, VCVTR between floating-point and integer</td>
</tr>
<tr>
<td>3.11.5</td>
<td>VCVT between floating-point and fixed-point</td>
</tr>
<tr>
<td>3.11.6</td>
<td>VCVTB, VCVTT</td>
</tr>
<tr>
<td>3.11.7</td>
<td>VDIV</td>
</tr>
<tr>
<td>3.11.8</td>
<td>VFMA, VFMS</td>
</tr>
<tr>
<td>3.11.9</td>
<td>VFNMA, VFNMS</td>
</tr>
<tr>
<td>3.11.10</td>
<td>VLDM</td>
</tr>
<tr>
<td>3.11.11</td>
<td>VLDR</td>
</tr>
<tr>
<td>3.11.12</td>
<td>VMLA, VMLS</td>
</tr>
<tr>
<td>3.11.13</td>
<td>VMOV immediate</td>
</tr>
</tbody>
</table>
3.11.14 VMOV register .......................................................... 162
3.11.15 VMOV scalar to Arm core register ................................. 163
3.11.16 VMOV Arm core register to single-precision ................... 163
3.11.17 VMOV two Arm core registers to two single-precision registers ... 164
3.11.18 VMOV two Arm core registers and a double-precision register ...... 164
3.11.19 VMOV Arm core register to scalar ................................. 165
3.11.20 VMRS ................................................................. 165
3.11.21 VMSR ................................................................. 166
3.11.22 VMUL ................................................................. 166
3.11.23 VNEG ................................................................. 167
3.11.24 VNMLA, VNMLS, VNMUL ............................................. 167
3.11.25 VPOP ................................................................. 168
3.11.26 VPUSH ............................................................... 169
3.11.27 VSQRT ............................................................... 169
3.11.28 VSTM ............................................................... 170
3.11.29 VSTR ............................................................... 170
3.11.30 VSUB ............................................................... 171
3.11.31 VSEL ............................................................... 172
3.11.32 VMAXNM, VMINNM ................................................... 172
3.11.33 VCVTA, VCVTN, VCVTP, VCVTM ................................... 173
3.11.34 VRINTR, VRINTX ..................................................... 173
3.11.35 VRINTA, VRINTN, VRINTP, VRINTM, VRINTZ ...................... 174
3.12 Miscellaneous instructions .................................................... 175
3.12.1 BKPT ................................................................. 175
3.12.2 CPS ................................................................. 176
3.12.3 DMB ................................................................. 177
3.12.4 DSB ................................................................. 177
3.12.5 ISB ................................................................. 178
3.12.6 MRS ................................................................. 178
3.12.7 MSR ................................................................. 179
3.12.8 NOP ................................................................. 180
3.12.9 SEV ................................................................. 180
3.12.10 SVC ............................................................... 181
3.12.11 WFE ............................................................... 181
3.12.12 WFI ............................................................... 182

4 Cortex-M7 peripherals ......................................................... 183
4.1 About the Cortex-M7 peripherals ........................................ 183
4.2 Nested Vectored Interrupt Controller ............................... 184
  4.2.1 Accessing the Cortex®-M7 NVIC registers using CMSIS .... 185
  4.2.2 Interrupt set-enable registers ............................... 185
  4.2.3 Interrupt clear-enable registers ............................ 186
  4.2.4 Interrupt set-pending registers ............................. 186
  4.2.5 Interrupt clear-pending registers ........................... 187
  4.2.6 Interrupt active bit registers .............................. 188
  4.2.7 Interrupt priority registers ............................... 188
  4.2.8 Software trigger interrupt register ....................... 189
  4.2.9 Level-sensitive and pulse interrupts ....................... 190
  4.2.10 NVIC design hints and tips ................................ 191
4.3 System control block ............................................... 192
  4.3.1 Auxiliary control register .................................. 193
  4.3.2 CPUID base register ....................................... 194
  4.3.3 Interrupt control and state register ....................... 194
  4.3.4 Vector table offset register ............................... 197
  4.3.5 Application interrupt and reset control register ......... 197
  4.3.6 System control register ................................... 199
  4.3.7 Configuration and control register ......................... 200
  4.3.8 System handler priority registers ......................... 202
  4.3.9 System handler control and state register ................. 204
  4.3.10 Configurable fault status register ......................... 205
  4.3.11 HardFault status register ................................ 210
  4.3.12 MemManage fault address register ......................... 211
  4.3.13 BusFault address register ................................ 212
  4.3.14 System control block design hints and tips ............... 212
4.4 System timer, SysTick ............................................. 212
  4.4.1 SysTick control and status register ....................... 213
  4.4.2 SysTick reload value register ............................ 214
  4.4.3 SysTick current value register ........................... 214
  4.4.4 SysTick calibration value register ......................... 215
  4.4.5 SysTick design hints and tips ........................... 216
4.5 Processor features .................................................. 217
  4.5.1 Cache level ID register .................................. 217
  4.5.2 Cache type register ....................................... 218
4.5.3 Cache size ID register ............................................. 219
4.5.4 Cache size selection register ................................... 220

4.6 Memory protection unit ............................................. 221
4.6.1 MPU type register .................................................. 223
4.6.2 MPU control register ............................................. 223
4.6.3 MPU region number register ................................... 225
4.6.4 MPU region base address register ......................... 225
4.6.5 MPU region attribute and size register ................. 226
4.6.6 MPU access permission attributes ...................... 228
4.6.7 MPU mismatch ...................................................... 230
4.6.8 Updating an MPU region .................................. 230
4.6.9 MPU design hints and tips .................................. 232

4.7 Floating-point unit ................................................. 233
4.7.1 Coprocessor access control register ................. 233
4.7.2 Floating-point context control register ............ 234
4.7.3 Floating-point context address register ........... 236
4.7.4 Floating-point status control register .............. 236
4.7.5 Floating-point default status control register . 237
4.7.6 Enabling the FPU ................................................. 238
4.7.7 Enabling and clearing FPU exception interrupts .... 239

4.8 Cache maintenance operations ................................. 240
4.8.1 Full instruction cache operation ....................... 241
4.8.2 Instruction and data cache operations by address .... 241
4.8.3 Data cache operations by set-way ....................... 241
4.8.4 Cortex®-M7 cache maintenance operations using CMSIS 242
4.8.5 Initializing and enabling the L1-cache .............. 242
4.8.6 Faults handling considerations ..................... 244
4.8.7 Cache maintenance design hints and tips .......... 244

4.9 Access control ...................................................... 245
4.9.1 Instruction and data tightly-coupled memory control registers 246
4.9.2 AHBP control register ........................................ 248
4.9.3 Auxiliary cache control register ...................... 249
4.9.4 AHB slave control register ............................... 250
4.9.5 Auxiliary bus fault status register .................. 251

5 Revision history ....................................................... 253
List of tables

Table 1. Summary of processor mode, execution privilege level, and stack use options .......... 20
Table 2. Core register set summary .................................................. 21
Table 3. PSR register combinations .................................................. 22
Table 4. APSPR bit assignments ...................................................... 23
Table 5. IPSR bit assignments .......................................................... 24
Table 6. EPSR bit assignments ......................................................... 24
Table 7. PRIMASK register bit assignments ....................................... 26
Table 8. FAULTMASK register bit assignments ................................. 26
Table 9. BASEPRI register bit assignments ..................................... 27
Table 10. Control register bit assignments ...................................... 27
Table 11. STM32F746xx/STM32F756xx Cortex®-M7 configuration .......... 30
Table 12. STM32F76xxx/STM32F77xxx Cortex®-M7 configuration .......... 30
Table 13. STM32F72xxx/STM32F73xxx Cortex®-M7 configuration .......... 31
Table 14. STM32H7 Series Cortex®-M7 configuration .......................... 31
Table 15. Ordering of memory accesses .......................................... 34
Table 16. Memory access behavior ................................................ 34
Table 17. Memory region shareability and cache policies ................... 35
Table 18. CMSIS functions for exclusive access instructions .............. 38
Table 19. Properties of the different exception types ....................... 40
Table 20. Exception return behavior ............................................. 46
Table 21. Faults ............................................................................. 47
Table 22. Fault status and fault address registers ............................. 49
Table 23. Cortex®-M7 instructions ................................................ 49
Table 24. CMSIS functions to generate some Cortex®-M7 processor instructions .......... 62
Table 25. CMSIS functions to access the special registers ................. 63
Table 26. Condition code suffixes ................................................ 70
Table 27. Memory access instructions ............................................ 72
Table 28. Offset ranges ................................................................. 75
Table 29. Offset ranges ................................................................. 78
Table 30. Data processing instructions ........................................... 85
Table 31. Multiply and divide instructions ..................................... 111
Table 32. Saturating instructions .................................................... 128
Table 33. Packing and unpacking instructions ................................ 137
Table 34. Packing and unpacking instructions ................................ 141
Table 35. Branch and control instructions ..................................... 145
Table 36. Branch ranges ............................................................... 146
Table 37. Floating-point instructions ............................................ 151
Table 38. Miscellaneous instructions .......................................... 175
Table 39. Core peripheral register regions .................................... 183
Table 40. NVIC register summary ................................................. 184
Table 41. CMSIS access NVIC functions ....................................... 185
Table 42. ISER bit assignments ....................................................... 185
Table 43. ICER bit assignments ....................................................... 186
Table 44. ISPR bit assignments ......................................................... 187
Table 45. ICPRI bit assignments ..................................................... 187
Table 46. IABR bit assignments ....................................................... 188
Table 47. IPR bit assignments ........................................................ 189
Table 48. STIR bit assignments ....................................................... 189
Table 49. CMSIS functions for NVIC control ................................................................. 191
Table 50. Summary of the system control block registers ........................................... 192
Table 51. ACTLR bit assignments ................................................................................. 193
Table 52. CPUID bit assignments .................................................................................. 194
Table 53. ICSR bit assignments ..................................................................................... 195
Table 54. VTOR bit assignments .................................................................................... 197
Table 55. AIRCR bit assignments ................................................................................. 198
Table 56. Priority grouping ........................................................................................ 198
Table 57. SCR bit assignments ...................................................................................... 199
Table 58. CCR bit assignments ...................................................................................... 201
Table 59. System fault handler priority fields ................................................................. 202
Table 60. SHPR1 register bit assignments .................................................................... 203
Table 61. SHPR2 register bit assignments .................................................................... 203
Table 62. SHPR3 register bit assignments .................................................................... 203
Table 63. SHCSR bit assignments ............................................................................... 204
Table 64. MMFSR bit assignments ............................................................................... 206
Table 65. BFSR bit assignments .................................................................................... 208
Table 66. UFSR bit assignments .................................................................................... 209
Table 67. HFSR bit assignments .................................................................................... 211
Table 68. MMFAR bit assignments ................................................................................. 211
Table 69. BFAR bit assignments .................................................................................... 212
Table 70. CMSIS function for system control ............................................................... 212
Table 71. System timer registers summary ................................................................. 213
Table 72. SysTick SYST_CSR bit assignments .............................................................. 213
Table 73. SYST_RVR bit assignments .......................................................................... 214
Table 74. SYST_CVR bit assignments ......................................................................... 215
Table 75. SYST_CALIB bit assignments ...................................................................... 215
Table 76. CMSIS functions for SysTick control ............................................................ 216
Table 77. Identification space summary ..................................................................... 217
Table 78. CLIDR bit assignments .................................................................................. 217
Table 79. CTR bit assignments ..................................................................................... 218
Table 80. CCSIDR bit assignments ............................................................................... 219
Table 81. CCSIDR encodings ....................................................................................... 220
Table 82. CSSELR bit assignments ............................................................................... 220
Table 83. Memory attributes summary ....................................................................... 221
Table 84. MPU registers summary ............................................................................... 222
Table 85. TYPE bit assignments ................................................................................... 223
Table 86. MPU_CTRL bit assignments ....................................................................... 224
Table 87. MPU_RNR bit assignments ........................................................................... 225
Table 88. MPU_RBAR bit assignments ....................................................................... 226
Table 89. MPU_RASR bit assignments ....................................................................... 227
Table 90. Example SIZE field values ........................................................................ 228
Table 91. TEX, C, B, and S encoding .......................................................................... 228
Table 92. Cache policy for memory attribute encoding ............................................... 229
Table 93. AP encoding ................................................................................................. 229
Table 94. Cortex®-M7 floating-point system registers ................................................. 233
Table 95. CPACR bit assignments ............................................................................... 234
Table 96. FPCCR bit assignments ............................................................................... 234
Table 97. FPCR bit assignments .................................................................................. 236
Table 98. FPSR bit assignments .................................................................................. 236
Table 99. FPSCR bit assignments ............................................................................... 237
Table 100. Cache maintenance space register summary .............................................. 240
Table 101. Cache operation registers bit assignments. ........................................... 241
Table 102. Cache operations by set-way bit assignments ........................................ 241
Table 103. CMSIS access cache maintenance operations ......................................... 242
Table 104. Access control register summary ......................................................... 245
Table 105. ITCMCR and DTCMCR bit assignments .............................................. 246
Table 106. AHBPCR bit assignments ................................................................. 248
Table 107. CACR bit assignments ................................................................. 249
Table 108. AHBSCR bit assignments ................................................................. 250
Table 109. ABFSR bit assignments ................................................................. 251
Table 110. Document revision history ................................................................. 253
List of figures

Figure 1. STM32 Cortex®-M7 implementation processor .................................................. 15
Figure 2. Processor core registers ................................................................................. 20
Figure 3. APSR, IPSR and EPSR bit assignments .......................................................... 22
Figure 4. PRIMASK bit assignments: ............................................................................ 26
Figure 5. FAULTMASK bit assignments ........................................................................ 26
Figure 6. BASEPRI bit assignments .............................................................................. 27
Figure 7. Control bit assignments .................................................................................. 27
Figure 8. Processor memory map .................................................................................. 32
Figure 9. Little-endian format ....................................................................................... 37
Figure 10. Vector table .................................................................................................. 42
Figure 11. Exception stack frame ................................................................................... 45
Figure 12. ASR .............................................................................................................. 66
Figure 13. LSR ............................................................................................................. 66
Figure 14. LSL ............................................................................................................. 67
Figure 15. ROR ............................................................................................................ 67
Figure 16. RRX ............................................................................................................ 67
Figure 17. ISER bit assignments .................................................................................... 185
Figure 18. ICER bit assignment .................................................................................... 186
Figure 19. ISPR bit assignments ................................................................................... 186
Figure 20. ICPR bit assignments ................................................................................... 187
Figure 21. IABR bit assignments ................................................................................... 188
Figure 22. IPR bit assignments ...................................................................................... 188
Figure 23. STIR bit assignments ................................................................................... 189
Figure 24. ACTLR bit assignments .............................................................................. 193
Figure 25. CPUID bit assignments .............................................................................. 194
Figure 26. ICSR bit assignments ................................................................................... 195
Figure 27. VTOR bit assignments .................................................................................. 197
Figure 28. AIRCR bit assignments ................................................................................ 197
Figure 29. SCR bit assignments: .................................................................................. 199
Figure 30. CCR bit assignments ................................................................................... 200
Figure 31. SHPR1 bit assignments ............................................................................... 202
Figure 32. SHPR2 bit assignments ............................................................................... 203
Figure 33. SHPR3 bit assignments ............................................................................... 203
Figure 34. SHCSR bit assignments .............................................................................. 204
Figure 35. CFSR bit assignments .................................................................................. 205
Figure 36. MMFSR bit assignments .............................................................................. 206
Figure 37. BFSR bit assignments .................................................................................. 207
Figure 38. UFSR bit assignments .................................................................................. 209
Figure 39. HFSR bit assignments .................................................................................. 210
Figure 40. SysTick SYST_CSR bit assignments ............................................................. 213
Figure 41. SYST_RVR bit assignments ........................................................................ 214
Figure 42. SYST_CVR bit assignments: ....................................................................... 214
Figure 43. SYST_CALIB bit assignments ..................................................................... 215
Figure 44. CLIDR bit assignments ............................................................................... 217
Figure 45. CTR bit assignments ................................................................................... 218
Figure 46. CCSIDR bit assignments .............................................................................. 219
Figure 47. CSSELIR bit assignments ............................................................................ 220
Figure 48. TYPE bit assignments .................................................................................. 223
Figure 49. MPU_CTRL bit assignments ........................................................... 223
Figure 50. MPU_RNR bit assignments .......................................................... 225
Figure 51. MPU_RBAR bit assignments ........................................................ 225
Figure 52. MPU_RASR bit assignments ......................................................... 227
Figure 53. Example of disabling subregion ................................................... 232
Figure 54. CPACR bit assignments ............................................................... 233
Figure 55. FPCCR bit assignments ............................................................... 234
Figure 56. FPCAR bit assignments ............................................................... 236
Figure 57. FPSCR bit assignments ............................................................... 236
Figure 58. FPDSCR bit assignments ............................................................ 237
Figure 59. Cache operation bit assignments ................................................. 241
Figure 60. ITCMR and DTCMR bit assignments .......................................... 246
Figure 61. AHBPCR bit assignments ............................................................ 248
Figure 62. CACR bit assignments ............................................................... 249
Figure 63. AHBSR bit assignments ............................................................. 250
Figure 64. ABFSR bit assignments ............................................................. 251
1 About this document

This document provides information required for application and system-level software development. It does not provide information on debug components, features, or operation.

STM32F7 Series and STM32H7 Series 32-bit MCUs are based on Arm® Cortex®-M processors.

This material is for microcontroller software and hardware engineers, including those who have no experience of Arm products.

1.1 Typographical conventions

The typographical conventions used in this document are:

- *italic*: Highlights important notes, introduces special terminology, denotes internal cross-references, and citations.
- **bold**: Highlights interface elements, such as menu names. Denotes signal names. Also used for terms in descriptive lists, where appropriate.
- `monospace`: Denotes text that the user can enter at the keyboard, such as commands, file and program names, and source code.
- `monospace` *italic*: Denotes arguments to monospace text where the argument is to be replaced by a specific value.
- `&lt;` and `&gt;`: Enclose replaceable terms for assembler syntax where they appear in code or code fragments. For example: `LDRSB<cond> <Rt>, [<Rn>, #<offset>]`

1.2 List of abbreviations for registers

The following abbreviations are used in register descriptions:

- `read/write (rw)`: Software can read and write to these bits.
- `read-only (r)`: Software can only read these bits.
- `write-only (w)`: Software can only write to this bit. Reading the bit returns the reset value.
- `read/clear (rc_w)`: Software can read as well as clear this bit by writing any value.

a. Arm is a registered trademark of Arm Limited (or its subsidiaries) in the US and/or elsewhere.
1.3 About the Cortex®-M7 processor and core peripherals

The Cortex®-M7 processor is a high performance 32-bit processor designed for the microcontroller market. It offers significant benefits to developers, including:

- Outstanding processing performance combined with fast interrupt handling.
- Enhanced system debug with extensive breakpoint and trace capabilities.
- Efficient processor core, system and memories.
- Low-power consumption with integrated sleep modes.
- Platform security robustness, with integrated Memory Protection Unit (MPU).

The Cortex®-M7 processor is built on a high-performance processor core, with a 6-stage pipeline Harvard architecture, making it ideal for demanding embedded applications. The in-
order superscalar processor delivers exceptional power efficiency through an efficient instruction set and extensively optimized design, providing high-end processing hardware including IEEE754-compliant single-precision and double-precision floating-point computation, a range of single-cycle and SIMD multiplication and multiply-with-accumulate capabilities, saturating arithmetic and dedicated hardware division.

To facilitate the design of cost-sensitive devices, the Cortex®-M7 processor implements tightly-coupled system components that reduce processor area while significantly improving interrupt handling and system debug capabilities. The Cortex®-M7 processor implements a version of the Thumb™ instruction set based on Thumb-2 technology, ensuring high code density and reduced program memory requirements. The Cortex®-M7 instruction set provides the exceptional performance expected of a modern 32-bit architecture, with the high code density of 8-bit and 16-bit microcontrollers.

The Cortex®-M7 processor closely integrates a configurable NVIC, to deliver industry-leading interrupt performance. The NVIC includes a Non Maskable Interrupt (NMI), and provides up to 256 interrupt priority levels. The tight integration of the processor core and NVIC provides fast execution of interrupt service routines (ISRs), dramatically reducing the interrupt latency. This is achieved through the hardware stacking of registers, and the ability to suspend load-multiple and store-multiple operations. Interrupt handlers do not require wrapping in assembler code, removing any code overhead from the ISRs. A tail-chain optimization also significantly reduces the overhead when switching from one ISR to another.

To optimize low-power designs, the NVIC integrates with the sleep modes, that include a deep sleep function that enables the entire device to be rapidly powered down while still retaining program state.

The reliability is increased with automatic fault detection and handling built-in. The Cortex®-M7 processor uses ECC and SECDED on accesses to memory and has Memory Build-in Self Test (MBIST) capability. The Cortex®-M7 processor is dual-redundant, which means it can operate in lock-step. The MCU vendor determines the reliability features configuration and therefore this can differ across different devices and families.

1.3.1 System level interface

The Cortex®-M7 processor provides multiple interfaces using AMBA® technology to provide high speed, low latency memory accesses. It supports unaligned data accesses.

The Cortex®-M7 processor has an MPU that provides fine grain memory control, enabling applications to utilize multiple privilege levels, separating and protecting code, data and stack on a task-by-task basis. Such requirements are becoming critical in many embedded applications such as automotive.

1.3.2 Integrated configurable debug

The Cortex®-M7 processor implements a complete hardware debug solution. This provides high system visibility of the processor and memory through either a traditional JTAG port or a 2-pin Serial Wire Debug (SWD) port that is ideal for microcontrollers and other small package devices. The MCU vendor determines the debug feature configuration and therefore this can differ across different devices and families.
For system trace the processor integrates an Instrumentation Trace Macrocell (ITM) together with data watchpoints and a profiling unit. To enable simple and cost-effective profiling of the system events these generate, a Serial Wire Viewer (SWV) can export a stream of software-generated messages, data trace, and profiling information through a single pin.

The optional CoreSight technology components, Embedded Trace Macrocell™ (ETM), delivers unrivalled instruction trace and data trace capture in an area far smaller than traditional trace units, enabling many low cost MCUs to implement full instruction trace for the first time.

The Breakpoint Unit provides up to eight hardware breakpoint comparators that debuggers can use.

1.3.3 Cortex®-M7 processor features and benefits summary

- Tight integration of system peripherals reduces area and development costs.
- Thumb instruction set combines high code density with 32-bit performance.
- IEEE754-compliant single-precision and double-precision Floating-Point Unit (FPU).
- Power control optimization of system components.
- Integrated sleep modes for low-power consumption.
- Fast code execution permits slower processor clock or increases sleep mode time.
- Hardware division and fast digital-signal-processing orientated multiply accumulate.
- Saturating arithmetic for signal processing.
- Deterministic, high-performance interrupt handling for time-critical applications.
- MPU for safety-critical applications.
- Arm Cortex®-M7 with instruction cache and data cache
- Memory system features such as caches, Tightly-Coupled Memory (TCM) with DMA port, and a high performance AXI external memory interface.
- Dedicated AHB slave (AHBS) interface for system access to TCMs
- Extensive debug and trace capabilities:
  - Serial Wire Debug and Serial Wire Trace reduce the number of pins required for debugging, tracing, and code profiling.

1.3.4 Cortex®-M7 processor core peripherals

The Cortex®-M7 processor core peripherals are:

**Nested Vectored Interrupt Controller**

The NVIC is an embedded interrupt controller that supports low latency interrupt processing.

**System Control Block**

The System Control Block (SCB) is the programmers model interface to the processor. It provides system implementation information and system control, including configuration, control, and reporting of system exceptions.
**Integrated instruction and data caches**

The instruction and data caches provide fast access to frequently accessed data and instructions, providing support for increased average performance when using system based memory.

**System timer**

The system timer, SysTick, is a 24-bit count-down timer. Use this as a Real Time Operating System (RTOS) tick timer or as a simple counter.

**Memory Protection Unit**

The Memory Protection Unit (MPU) improves system reliability by defining the memory attributes for different memory regions. It provides up to 8 different regions, and an optional predefined background region.

**Floating-point unit**

The FPU provides IEEE754-compliant operations on 32-bit single-precision and 64-bit double-precision floating-point values.
2 The Cortex-M7 processor

2.1 Programmers model

This section describes the Cortex®-M7 programmers model. In addition to the individual core register descriptions, it contains information about the processor modes and privilege levels for software execution and stacks.

2.1.1 Processor mode and privilege levels for software execution

The processor modes are:

- **Thread mode**: Executes application software. The processor enters Thread mode when it comes out of reset.
- **Handler mode**: Handles exceptions. The processor returns to Thread mode when it has finished all exception processing.

The privilege levels for software execution are:

- **Unprivileged**: The software:
  - Has limited access to system registers using the MSR and MRS instructions, and cannot use the CPS instruction to mask interrupts.
  - Cannot access the system timer, NVIC, or system control block.
  - Might have restricted access to memory or peripherals.

  Unprivileged software executes at the unprivileged level.

- **Privileged**: The software can use all the instructions and has access to all resources.

  Privileged software executes at the privileged level.

In Thread mode, the CONTROL register controls whether software execution is privileged or unprivileged, see CONTROL register on page 27. In Handler mode, software execution is always privileged.

Only privileged software can write to the CONTROL register to change the privilege level for software execution in Thread mode. Unprivileged software can use the SVC instruction to make a supervisor call to transfer control to privileged software.

2.1.2 Stacks

The processor uses a full descending stack. This means the stack pointer holds the address of the last stacked item in memory. When the processor pushes a new item onto the stack, it decrements the stack pointer and then writes the item to the new memory location. The processor implements two stacks, the **main stack** and the **process stack**, with a pointer for each held in independent registers, see Stack pointer on page 21.

In Thread mode, the CONTROL register controls whether the processor uses the main stack or the process stack, see CONTROL register on page 27. In Handler mode, the processor always uses the main stack. The options for processor operations are:
Table 1. Summary of processor mode, execution privilege level, and stack use options

<table>
<thead>
<tr>
<th>Processor mode</th>
<th>Used to execute</th>
<th>Privilege level for software execution</th>
<th>Stack used</th>
</tr>
</thead>
<tbody>
<tr>
<td>Thread</td>
<td>Applications</td>
<td>Privileged or unprivileged(^{(1)})</td>
<td>Main stack or process stack(^{(1)})</td>
</tr>
<tr>
<td>Handler</td>
<td>Exception handlers</td>
<td>Always privileged</td>
<td>Main stack</td>
</tr>
</tbody>
</table>

1. See CONTROL register on page 27.

### 2.1.3 Core registers

The processor core registers are:

![Figure 2. Processor core registers](image)

- **Low registers**
  - R0
  - R1
  - R2
  - R3
  - R4
  - R5
  - R6
  - R7

- **High registers**
  - R8
  - R9
  - R10
  - R11
  - R12

- **Stack Pointer**
  - SP (R13)

- **Link Register**
  - LR (R14)

- **Program Counter**
  - PC (R15)

**General-purpose registers**

- **Program status register (PSR)**
- **Exception mask registers**
  - PRIMASK
  - FAULTMASK
  - BASEPRI
- **CONTROL register**

\(^{(1)}\)Banked version of SP
General-purpose registers

R0-R12 are 32-bit general-purpose registers for data operations.

Stack pointer

The Stack Pointer (SP) is register R13. In Thread mode, bit[1] of the CONTROL register indicates the stack pointer to use:

- 0 = Main Stack Pointer (MSP). This is the reset value.
- 1 = Process Stack Pointer (PSP).

On reset, the processor loads the MSP with the value from address 0x00000000.

Link register

The Link Register (LR) is register R14. It stores the return information for subroutines, function calls, and exceptions. On reset, the processor sets the LR value to 0xFFFFFFFF.

Table 2. Core register set summary

<table>
<thead>
<tr>
<th>Register</th>
<th>Name</th>
<th>Type(1)</th>
<th>Required privilege(2)</th>
<th>Reset value</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Stack pointer</td>
<td>MSP</td>
<td>RW</td>
<td>Either</td>
<td>See description</td>
<td>Stack pointer on page 21.</td>
</tr>
<tr>
<td>Stack pointer</td>
<td>PSP</td>
<td>RW</td>
<td>Either</td>
<td>Unknown</td>
<td>Stack pointer on page 21.</td>
</tr>
<tr>
<td>Link register</td>
<td>LR</td>
<td>RW</td>
<td>Either</td>
<td>0xFFFFFFFF</td>
<td>Link register on page 21</td>
</tr>
<tr>
<td>Program counter</td>
<td>PC</td>
<td>RW</td>
<td>Either</td>
<td>See description</td>
<td>Program counter on page 21</td>
</tr>
<tr>
<td>Program status register</td>
<td>PSR</td>
<td>RW</td>
<td>Either</td>
<td>0x01000000(3)</td>
<td>Program status register on page 22</td>
</tr>
<tr>
<td>Application program status register</td>
<td>APSR</td>
<td>RW</td>
<td>Either</td>
<td>Unknown</td>
<td>Application program status register on page 23</td>
</tr>
<tr>
<td>Interrupt program status register</td>
<td>IPSR</td>
<td>RO</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Interrupt program status register on page 23</td>
</tr>
<tr>
<td>Execution program Status register</td>
<td>EPSR</td>
<td>RO</td>
<td>Privileged</td>
<td>0x01000000(3)</td>
<td>Execution program status register on page 24</td>
</tr>
<tr>
<td>Priority mask register</td>
<td>PRIMASK</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Priority mask register on page 25</td>
</tr>
<tr>
<td>Fault mask register</td>
<td>FAULTMASK</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Fault mask register on page 26</td>
</tr>
<tr>
<td>Base priority mask register</td>
<td>BASEPRIS</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Priority mask register on page 25</td>
</tr>
<tr>
<td>Control register</td>
<td>CONTROL</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>CONTROL register on page 27</td>
</tr>
</tbody>
</table>

1. Describes access type during program execution in Thread mode and Handler mode. Debug access can differ.
2. An entry of Either means privileged and unprivileged software can access the register.
3. The EPSR reads as zero when executing an MRS instruction.
Program counter

The Program Counter (PC) is register R15. It contains the current program address. On reset, the processor loads the PC with the value of the reset vector, which is at address 0x00000004. Bit[0] of the value is loaded into the EPSR T-bit at reset and must be 1.

Program status register

The Program Status register (PSR) combines:

- Application Program Status register (APSR).
- Interrupt Program Status register (IPSR).
- Execution Program Status register (EPSR).

These registers are mutually exclusive bit fields in the 32-bit PSR. The bit assignments are

![Figure 3. APSR, IPSR and EPSR bit assignments](MSv3963TV1)

Access these registers individually or as a combination of any two or all three registers, using the register name as an argument to the MSR or MRS instructions. For example:

- Read all of the registers using PSR with the MRS instruction.
- Write to the APSR N, Z, C, V, and Q bits using APSR_nzcvq with the MSR instruction.

The PSR combinations and attributes are:

<table>
<thead>
<tr>
<th>Register Type Combination</th>
<th>Register Type</th>
<th>Combination</th>
</tr>
</thead>
<tbody>
<tr>
<td>APSR, EPSR, and IPSR.</td>
<td>APSR</td>
<td>APSR, EPSR, and IPSR.</td>
</tr>
<tr>
<td>EPSR and IPSR.</td>
<td>IEPSR</td>
<td>EPSR and IPSR.</td>
</tr>
<tr>
<td>APSR and IPSR.</td>
<td>IAPSR</td>
<td>APSR and IPSR.</td>
</tr>
<tr>
<td>APSR and EPSR.</td>
<td>EAPSR</td>
<td>APSR and EPSR.</td>
</tr>
</tbody>
</table>

1. The processor ignores writes to the IPSR bits.
2. Reads of the EPSR bits return zero, and the processor ignores writes to these bits.

See the instruction descriptions **MRS on page 178** and **MSR on page 179** for more information about how to access the program status registers.
Application program status register

The APSR contains the current state of the condition flags from previous instruction executions. See the register summary in *Table 4 on page 23* for its attributes. The bit assignments are:

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31]</td>
<td>N</td>
<td>Negative flag.</td>
</tr>
<tr>
<td>[29]</td>
<td>C</td>
<td>Carry or borrow flag.</td>
</tr>
<tr>
<td>[28]</td>
<td>V</td>
<td>Overflow flag.</td>
</tr>
<tr>
<td>[27]</td>
<td>Q</td>
<td>DSP overflow and saturation flag</td>
</tr>
<tr>
<td>[26:20]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[15:0]</td>
<td>-</td>
<td>Reserved</td>
</tr>
</tbody>
</table>

Interrupt program status register

The IPSR contains the exception type number of the current *Interrupt Service Routine* (ISR). See the register summary in *Table 5 on page 24* for its attributes. The bit assignments are:
Execution program status register

The EPSR contains the Thumb state bit, and the execution state bits for either the:

- **If-Then (IT) instruction.**
- **Interruptible-Continuable Instruction (ICI) field** for an interrupted load multiple or store multiple instruction.

See the register summary in *Table 6 on page 24* for the EPSR attributes. The bit assignments are:

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:9]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[8:0]</td>
<td>ISR_NUMBER</td>
<td>This is the number of the current exception:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = Thread mode.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Reserved.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>2 = NMI.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>3 = HardFault.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>4 = MemManage.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>5 = BusFault</td>
</tr>
<tr>
<td></td>
<td></td>
<td>6 = UsageFault</td>
</tr>
<tr>
<td></td>
<td></td>
<td>7-10 = Reserved</td>
</tr>
<tr>
<td></td>
<td></td>
<td>11 = SVCall.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>12 = Reserved for debug</td>
</tr>
<tr>
<td></td>
<td></td>
<td>13 = Reserved</td>
</tr>
<tr>
<td></td>
<td></td>
<td>14 = PendSV.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>15 = SysTick.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>16 = IRQ0.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>256 = IRQ239.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>see <em>Exception types on page 39</em> for more information.</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:27]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[26:25], [15:10]</td>
<td>IT</td>
<td>Indicates the execution state bits of the IT instruction, see <em>IT on page 148.</em></td>
</tr>
<tr>
<td>[24]</td>
<td>T</td>
<td>Thumb state bit, see <em>Thumb state.</em></td>
</tr>
<tr>
<td>[23:16]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[9:0]</td>
<td>-</td>
<td>Reserved</td>
</tr>
</tbody>
</table>
The attempts to read the EPSR directly through application software using the MSR instruction always return zero. The attempts to write the EPSR using the MSR instruction in application software are ignored.

**Interruptible-continuable instructions**

When an interrupt occurs during the execution of an LDM, STM, PUSH, POP, VLDM, VSTM, VPUSH, or VPOP instruction, the processor:
- Stops the load multiple or store multiple instruction operation temporarily.
- Stores the next register operand in the multiple operation to EPSR bits[15:12].

After servicing the interrupt, the processor:
- Returns to the register pointed to by bits[15:12].
- Resumes execution of the multiple load or store instruction.

When the EPSR holds ICI execution state, bits[26:25,11:10] are zero.

**If-Then block**

The If-Then block contains up to four instructions following an IT instruction. Each instruction in the block is conditional. The conditions for the instructions are either all the same, or some can be the inverse of others. See IT on page 148 for more information.

**Thumb state**

The Cortex®-M7 processor only supports execution of instructions in Thumb state. The following can clear the T bit to 0:
- Instructions BLX, BX and POP{PC}.
- Restoration from the stacked xPSR value on an exception return.
- Bit[0] of the vector value on an exception entry or reset.

Attempting to execute instructions when the T bit is 0 results in a fault or lockup. See Lockup on page 49 for more information.

**Exception mask registers**

The exception mask registers disable the handling of exceptions by the processor. Disable exceptions where they might impact on timing critical tasks.

To access the exception mask registers use the MSR and MRS instructions, or the CPS instruction to change the value of PRIMASK or FAULTMASK. See MRS on page 178, MSR on page 179, and CPS on page 176 for more information.

**Priority mask register**

The PRIMASK register prevents the activation of all exceptions with a configurable priority. See the register summary in Table 7 for its attributes. The bit assignments are
The Cortex-M7 processor

Figure 4. PRIMASK bit assignments:

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:1]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
</tbody>
</table>
| [0] | PRIMASK | Prioritizable interrupt mask:  
0 = No effect.  
1 = Prevents the activation of all exceptions with configurable priority. |

Fault mask register

The FAULTMASK register prevents activation of all exceptions except for Non Maskable Interrupt (NMI). See the register summary in Table 8 on page 26 for its attributes. The bit assignments are:

Figure 5. FAULTMASK bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:1]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
</tbody>
</table>
| [0] | FAULTMASK | Prioritizable interrupt mask:  
0 = No effect.  
1 = Prevents the activation of all exceptions except for NMI. |

The processor clears the FAULTMASK bit to 0 on exit from any exception handler except the NMI handler.

Base priority mask register

The BASEPRI register defines the minimum priority for exception processing. When BASEPRI is set to a nonzero value, it prevents the activation of all exceptions with the same or lower priority level as the BASEPRI value. See the register summary in Table 9 on page 27 for its attributes. The bit assignments are:
CONTROL register

The CONTROL register controls the stack used and the privilege level for software execution when the processor is in Thread mode and indicates whether the FPU state is active. See the register summary in Table 10 on page 27 for its attributes. The bit assignments are:

Table 9. BASEPRI register bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:8]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[7:0]</td>
<td>BASEPRI(1)</td>
<td>Priority mask bits:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x00 No effect</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Nonzero: Defines the base priority for exception processing.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>The processor does not process any exception with a priority value</td>
</tr>
<tr>
<td></td>
<td></td>
<td>greater than or equal to BASEPRI.</td>
</tr>
</tbody>
</table>

1. This field is similar to the priority fields in the interrupt priority registers. The device implements only bits[7:M] of this field, bits[M-1:0] read as zero and ignore writes. See Interrupt program status register on page 23 for more information. Remember that higher priority field values correspond to lower exception priorities.

Table 10. Control register bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:3]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[2]</td>
<td>FPCA</td>
<td>Indicates whether floating-point context is currently active:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No floating-point context active.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Floating-point context active.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>This bit is used to determine whether to preserve floating-point state</td>
</tr>
<tr>
<td></td>
<td></td>
<td>when processing an exception.</td>
</tr>
</tbody>
</table>
The Cortex-M7 processor always uses the MSP, so the processor ignores explicit writes to the active stack pointer bit of the CONTROL register when in Handler mode. The exception entry and return mechanisms automatically update the CONTROL register based on the EXC_RETURN value, see Table 20 on page 46.

In an OS environment, Arm recommends that threads running in Thread mode use the process stack and the kernel and exception handlers use the main stack.

By default, Thread mode uses the MSP. To switch the stack pointer used in Thread mode to the PSP, either:

- Use the MSR instruction to set the CONTROL.SPSEL bit, the current active stack pointer bit, to 1, see MSR on page 179.
- Perform an exception return to Thread mode with the appropriate EXC_RETURN value, see Table 20 on page 46.

When changing the stack pointer, software must use an ISB instruction immediately after the MSR instruction. This ensures that instructions after the ISB instruction execute using the new stack pointer. See ISB on page 178.

### 2.1.4 Exceptions and interrupts

The Cortex®-M7 processor supports interrupts and system exceptions. The processor and the NVIC prioritize and handle all exceptions. An exception changes the normal flow of software control. The processor uses Handler mode to handle all exceptions except for reset. See Exception entry on page 44 and Exception return on page 46 for more information.

The NVIC registers control interrupt handling. See Nested Vectored Interrupt Controller on page 184 for more information.
2.1.5 Data types

The processor:
- Supports the following data types:
  - 32-bit words.
  - 16-bit halfwords.
  - 8-bit bytes.
  - 32-bit single-precision floating point numbers.
  - 64-bit double-precision floating point numbers.
- Manages all data memory accesses as little-endian. See Memory regions, types and attributes on page 33 for more information.

2.1.6 The Cortex Microcontroller Software Interface Standard (CMSIS)

For a Cortex®-M7 microcontroller system, the Cortex Microcontroller Software Interface Standard (CMSIS) defines:
- A common way to:
  - Access peripheral registers.
  - Define exception vectors.
- The names of:
  - The registers of the core peripherals.
  - The core exception vectors.
- A device-independent interface for RTOS kernels, including a debug channel.

The CMSIS includes address definitions and data structures for the core peripherals in the Cortex®-M7 processor.

CMSIS simplifies software development by enabling the reuse of template code and the combination of CMSIS-compliant software components from various middleware vendors. Software vendors can expand the CMSIS to include their peripheral definitions and access functions for those peripherals.

This document includes the register names defined by the CMSIS, and gives short descriptions of the CMSIS functions that address the processor core and the core peripherals.

This document uses the register short names defined by the CMSIS. In a few cases these differ from the architectural short names that might be used in other documents.

The following sections give more information about the CMSIS:
- Power management programming hints on page 51.
- CMSIS functions on page 62.
- SysTick design hints and tips on page 216
- Accessing the Cortex®-M7 NVIC registers using CMSIS on page 185.
- NVIC programming hints on page 191.
- Cortex®-M7 cache maintenance operations using CMSIS on page 242
2.2 Cortex®-M7 configurations

*Table 11, Table 12* and *Table 13* show the configurations for STM32F7 Series Cortex-M7.

**Table 11. STM32F746xx/STM32F756xx Cortex®-M7 configuration**

<table>
<thead>
<tr>
<th>Features</th>
<th>STM32F746xx/STM32F756xx</th>
</tr>
</thead>
<tbody>
<tr>
<td>Floating Point Unit</td>
<td>Single precision floating point unit</td>
</tr>
<tr>
<td>MPU</td>
<td>8 regions</td>
</tr>
<tr>
<td>Instruction TCM size</td>
<td>Flash TCM: 1 Mbyte, RAM ITCM: 16 Kbytes</td>
</tr>
<tr>
<td>Data TCM size</td>
<td>64 Kbytes</td>
</tr>
<tr>
<td>Instruction cache size</td>
<td>4 Kbytes</td>
</tr>
<tr>
<td>Data cache size</td>
<td>4 Kbytes</td>
</tr>
<tr>
<td>Cache ECC</td>
<td>Not implemented</td>
</tr>
<tr>
<td>Interrupt priority levels</td>
<td>16 priority levels</td>
</tr>
<tr>
<td>Number of IRQ</td>
<td>98</td>
</tr>
<tr>
<td>WIC, CTI</td>
<td>Not implemented</td>
</tr>
<tr>
<td>Debug</td>
<td>JTAG &amp; Serial-Wire Debug Ports, 8 breakpoints and 4 watchpoints.</td>
</tr>
<tr>
<td>ITM support</td>
<td>Data Trace (DWT), and instrumentation trace (ITM)</td>
</tr>
<tr>
<td>ETM support</td>
<td>Instruction Trace interface</td>
</tr>
</tbody>
</table>

**Table 12. STM32F76xxx/STM32F77xxx Cortex®-M7 configuration**

<table>
<thead>
<tr>
<th>Features</th>
<th>STM32F76xxx/STM32F77xxx</th>
</tr>
</thead>
<tbody>
<tr>
<td>Floating Point Unit</td>
<td>Double and single precision floating point unit</td>
</tr>
<tr>
<td>MPU</td>
<td>8 regions</td>
</tr>
<tr>
<td>Instruction TCM size</td>
<td>Flash TCM: 2 Mbytes, RAM ITCM: 16 Kbytes</td>
</tr>
<tr>
<td>Data TCM size</td>
<td>128 Kbytes</td>
</tr>
<tr>
<td>Instruction cache size</td>
<td>16 Kbytes</td>
</tr>
<tr>
<td>Data cache size</td>
<td>16 Kbytes</td>
</tr>
<tr>
<td>Cache ECC</td>
<td>Not implemented</td>
</tr>
<tr>
<td>Interrupt priority levels</td>
<td>16 priority levels</td>
</tr>
<tr>
<td>Number of IRQ</td>
<td>110</td>
</tr>
<tr>
<td>WIC, CTI</td>
<td>Not implemented</td>
</tr>
<tr>
<td>Debug</td>
<td>JTAG &amp; Serial-Wire Debug Ports, 8 breakpoints and 4 watchpoints.</td>
</tr>
<tr>
<td>ITM support</td>
<td>Data Trace (DWT), and instrumentation trace (ITM)</td>
</tr>
<tr>
<td>ETM support</td>
<td>Instruction Trace interface</td>
</tr>
</tbody>
</table>
Table 13. STM32F72xxx/STM32F73xxx Cortex®-M7 configuration

<table>
<thead>
<tr>
<th>Features</th>
<th>STM32F72xxx/STM32F73xxx</th>
</tr>
</thead>
<tbody>
<tr>
<td>Floating Point Unit</td>
<td>Single precision floating point unit</td>
</tr>
<tr>
<td>MPU</td>
<td>8 regions</td>
</tr>
<tr>
<td>Instruction TCM size</td>
<td>Flash TCM: 512 Kbytes</td>
</tr>
<tr>
<td></td>
<td>RAM ITCM: 16 Kbytes</td>
</tr>
<tr>
<td>Data TCM size</td>
<td>64 Kbytes</td>
</tr>
<tr>
<td>Instruction cache size</td>
<td>8 Kbytes</td>
</tr>
<tr>
<td>Data cache size</td>
<td>8 Kbytes</td>
</tr>
<tr>
<td>Cache ECC</td>
<td>Not implemented</td>
</tr>
<tr>
<td>Interrupt priority levels</td>
<td>16 priority levels</td>
</tr>
<tr>
<td>Number of IRQ</td>
<td>104</td>
</tr>
<tr>
<td>WIC, CTI</td>
<td>Not implemented</td>
</tr>
<tr>
<td>Debug</td>
<td>JTAG &amp; Serial-Wire Debug Ports</td>
</tr>
<tr>
<td></td>
<td>8 breakpoints and 4 watchpoints.</td>
</tr>
<tr>
<td>ITM support</td>
<td>Data Trace (DWT), and instrumentation trace (ITM)</td>
</tr>
<tr>
<td>ETM support</td>
<td>Instruction Trace interface</td>
</tr>
</tbody>
</table>

Table 14 shows the configurations for STM32H7 Series Cortex-M7.

Table 14. STM32H7 Series Cortex®-M7 configuration

<table>
<thead>
<tr>
<th>Features</th>
<th>STM32H7 Series</th>
</tr>
</thead>
<tbody>
<tr>
<td>Floating Point Unit</td>
<td>Double and single precision floating point unit</td>
</tr>
<tr>
<td>MPU</td>
<td>16 regions</td>
</tr>
<tr>
<td>Instruction TCM size</td>
<td>RAM ITCM: 64 Kbytes</td>
</tr>
<tr>
<td>Data TCM size</td>
<td>128 Kbytes</td>
</tr>
<tr>
<td>Instruction cache size</td>
<td>16 Kbytes</td>
</tr>
<tr>
<td>Data cache size</td>
<td>16 Kbytes</td>
</tr>
<tr>
<td>Cache ECC</td>
<td>Implemented</td>
</tr>
<tr>
<td>Interrupt priority levels</td>
<td>16 priority levels</td>
</tr>
<tr>
<td>Number of IRQ</td>
<td>149</td>
</tr>
<tr>
<td>WIC, CTI</td>
<td>WIC Not implemented, CTI implemented</td>
</tr>
<tr>
<td>Debug</td>
<td>JTAG &amp; Serial-Wire Debug Ports</td>
</tr>
<tr>
<td></td>
<td>8 breakpoints and 4 watchpoints.</td>
</tr>
<tr>
<td>ITM support</td>
<td>Data Trace (DWT), and instrumentation trace (ITM)</td>
</tr>
<tr>
<td>ETM support</td>
<td>Instruction Trace interface</td>
</tr>
</tbody>
</table>
2.3 Memory model

This section describes the processor memory map and the behavior of memory accesses. The processor has a fixed default memory map that provides up to 4 Gbytes of addressable memory. The memory map is:

![Processor memory map](image)

The processor reserves regions of the *Private peripheral bus* (PPB) address range for core peripheral registers, see *About the Cortex-M7 peripherals on page 183.*
2.3.1 Memory regions, types and attributes

The memory map and the programming of the MPU split the memory map into regions. Each region has a defined memory type, and some regions have additional memory attributes. The memory type and attributes determine the behavior of accesses to the region.

The memory types are:

- **Normal**: The processor can re-order transactions for efficiency, or perform speculative reads.
- **Device and Strongly-ordered**: The processor preserves transaction order relative to other transactions to Device or Strongly-ordered memory.

The different ordering requirements for Device and Strongly-ordered memory mean that the memory system can buffer a write to Device memory, but must not buffer a write to Strongly-ordered memory.

The additional memory attributes include:

- **Shareable**: For a shareable memory region, the memory system provides data synchronization between bus masters in a system with multiple bus masters, for example, a processor with a DMA controller. Strongly-ordered memory is always shareable. If multiple bus masters can access a non-shareable memory region, software must ensure data coherency between the bus masters.
- **Execute Never (XN)**: Means the processor prevents instruction accesses. A HardFault exception is generated on executing an instruction fetched from an XN region of memory.

2.3.2 Memory system ordering of memory accesses

For most of memory accesses caused by explicit memory access instructions, the memory system does not guarantee that the order in which the accesses complete matches the program order of the instructions. Providing any re-ordering does not affect the behavior of the instruction sequence. Normally, if a correct program execution depends on two memory accesses completing in the program order, the software must insert a memory barrier instruction between the memory access instructions, see 2.3.4: Software ordering of memory accesses on page 36.

However, the memory system does guarantee some ordering of accesses to Device and Strongly-ordered memory. For two memory access instructions A1 and A2, if A1 occurs before A2 in the program order, the ordering of the memory accesses caused by two instructions is:
The Cortex-M7 processor

### 2.3.3 Behavior of memory accesses

The behavior of accesses to each region in the memory map is:

<table>
<thead>
<tr>
<th>Address range</th>
<th>Memory region</th>
<th>Memory type</th>
<th>XN</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0x00000000-0x1FFFFFFF</td>
<td>Code</td>
<td>Normal</td>
<td>-</td>
<td>Executable region for program code. The user can also put data here. Instruction fetches and data accesses are performed over the ITCM or AXIM interface.</td>
</tr>
<tr>
<td>0x20000000-0x3FFFFFFF</td>
<td>SRAM</td>
<td>Normal</td>
<td>-</td>
<td>Executable region for data. The user can also put code here. Instruction fetches and data accesses are performed over the DTCM or AXIM interface.</td>
</tr>
<tr>
<td>0x40000000-0x5FFFFFFF</td>
<td>Peripheral</td>
<td>Device</td>
<td>XN</td>
<td>External device memory. Data accesses are performed over the AHBP or AXIM interface.</td>
</tr>
<tr>
<td>0x60000000-0x9FFFFFFF</td>
<td>External RAM</td>
<td>Normal</td>
<td>-</td>
<td>Executable region for data. Instruction fetches and data accesses are performed over the AXIM interface.</td>
</tr>
<tr>
<td>0xA0000000-0xDFFFFFFF</td>
<td>External device</td>
<td>Device</td>
<td>XN</td>
<td>External device memory. Instruction fetches and data accesses are performed over the AXIM interface.</td>
</tr>
<tr>
<td>0xE0000000-0xE00FFFFF</td>
<td>Private Peripheral Bus</td>
<td>Strongly-ordered</td>
<td>XN</td>
<td>This region includes the NVIC, System timer, and System Control Block. Only word accesses can be used in this region.</td>
</tr>
<tr>
<td>0xE0100000-0xE01FFFFF</td>
<td>Vendor-specific device</td>
<td>Device</td>
<td>XN</td>
<td>Accesses to this region are to vendor-specific peripherals.</td>
</tr>
</tbody>
</table>

1. See Memory regions, types and attributes on page 33 for more information.

The Code, SRAM, and external RAM regions can hold programs.

---

Table 15. Ordering of memory accesses

<table>
<thead>
<tr>
<th>A1</th>
<th>A2</th>
</tr>
</thead>
<tbody>
<tr>
<td>Normal access</td>
<td>-</td>
</tr>
<tr>
<td>Device access, non-shareable</td>
<td>- &lt;</td>
</tr>
<tr>
<td>Device access, shareable</td>
<td>- &lt;</td>
</tr>
<tr>
<td>Strongly ordered access</td>
<td>&lt; &lt;</td>
</tr>
</tbody>
</table>

1. - means that the memory system does not guarantee the ordering of the accesses. < means that accesses are observed in program order, that is, A1 is always observed before A2.

Table 16. Memory access behavior

<table>
<thead>
<tr>
<th>Address range</th>
<th>Memory region</th>
<th>Memory type</th>
<th>XN</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0x00000000-0x1FFFFFFF</td>
<td>Code</td>
<td>Normal</td>
<td>-</td>
<td>Executable region for program code. The user can also put data here. Instruction fetches and data accesses are performed over the ITCM or AXIM interface.</td>
</tr>
<tr>
<td>0x20000000-0x3FFFFFFF</td>
<td>SRAM</td>
<td>Normal</td>
<td>-</td>
<td>Executable region for data. The user can also put code here. Instruction fetches and data accesses are performed over the DTCM or AXIM interface.</td>
</tr>
<tr>
<td>0x40000000-0x5FFFFFFF</td>
<td>Peripheral</td>
<td>Device</td>
<td>XN</td>
<td>External device memory. Data accesses are performed over the AHBP or AXIM interface.</td>
</tr>
<tr>
<td>0x60000000-0x9FFFFFFF</td>
<td>External RAM</td>
<td>Normal</td>
<td>-</td>
<td>Executable region for data. Instruction fetches and data accesses are performed over the AXIM interface.</td>
</tr>
<tr>
<td>0xA0000000-0xDFFFFFFF</td>
<td>External device</td>
<td>Device</td>
<td>XN</td>
<td>External device memory. Instruction fetches and data accesses are performed over the AXIM interface.</td>
</tr>
<tr>
<td>0xE0000000-0xE00FFFFF</td>
<td>Private Peripheral Bus</td>
<td>Strongly-ordered</td>
<td>XN</td>
<td>This region includes the NVIC, System timer, and System Control Block. Only word accesses can be used in this region.</td>
</tr>
<tr>
<td>0xE0100000-0xE01FFFFF</td>
<td>Vendor-specific device</td>
<td>Device</td>
<td>XN</td>
<td>Accesses to this region are to vendor-specific peripherals.</td>
</tr>
</tbody>
</table>
The MPU can override the default memory access behavior described in this section. For more information, see Memory protection unit on page 221.

Additional memory access constraints for caches and shared memory

When a system includes caches or shared memory, some memory regions have additional access constraints, and some regions are subdivided, as Table 17 shows:

### Table 17. Memory region shareability and cache policies

<table>
<thead>
<tr>
<th>Address range</th>
<th>Memory region</th>
<th>Memory type (1)</th>
<th>Shareability (1)</th>
<th>Cache policy (2)</th>
</tr>
</thead>
<tbody>
<tr>
<td>0x00000000-0xFFFFFFF</td>
<td>Code</td>
<td>Normal</td>
<td>Non-shareable</td>
<td>WT</td>
</tr>
<tr>
<td>0x20000000-0x3FFFFF</td>
<td>SRAM</td>
<td>Normal</td>
<td>Non-shareable</td>
<td>WBWA</td>
</tr>
<tr>
<td>0x40000000-0xFFFFF</td>
<td>Peripheral</td>
<td>Device</td>
<td>Non-shareable</td>
<td>-</td>
</tr>
<tr>
<td>0x60000000-0x7FFFFF</td>
<td>External RAM</td>
<td>Normal</td>
<td>Non-shareable</td>
<td>-</td>
</tr>
<tr>
<td>0x80000000-0x9FFFFF</td>
<td>External device</td>
<td>Device</td>
<td>Shareable</td>
<td>-</td>
</tr>
<tr>
<td>0xA0000000-0xBFFFFF</td>
<td>External device</td>
<td>Device</td>
<td>Non-shareable</td>
<td>-</td>
</tr>
<tr>
<td>0xC0000000-0xDFFFFF</td>
<td>Private Peripheral Bus</td>
<td>Strongly-ordered</td>
<td>Shareable</td>
<td>-</td>
</tr>
<tr>
<td>0xE0000000-0xE00FFFFF</td>
<td>Vendor-specific device</td>
<td>Device</td>
<td>Non-shareable</td>
<td>-</td>
</tr>
</tbody>
</table>

1. See Section 2.3.1: Memory regions, types and attributes on page 33 for more information.
2. WT = Write through, no write allocate. WBWA = Write back, write allocate.

Instruction prefetch and branch prediction

The Cortex®-M7 processor:
- Prefetches instructions ahead of execution.
- Speculatively prefetches from branch target addresses.
2.3.4 **Software ordering of memory accesses**

The order of instructions in the program flow does not always guarantee the order of the corresponding memory transactions. This is because:

- The processor can reorder some memory accesses to improve efficiency, providing this does not affect the behavior of the instruction sequence.
- The processor has multiple bus interfaces.
- Memory or devices in the memory map have different wait states.
- Some memory accesses are buffered or speculative.

*Memory system ordering of memory accesses on page 33* describes the cases where the memory system guarantees the order of memory accesses. Otherwise, if the order of memory accesses is critical, software must include memory barrier instructions to force that ordering. The processor provides the following memory barrier instructions:

- **DMB** The Data Memory Barrier (DMB) instruction ensures that outstanding memory transactions complete before subsequent memory transactions. See *DMB on page 177*.
- **DSB** The Data Synchronization Barrier (DSB) instruction ensures that outstanding memory transactions complete before subsequent instructions execute. See *DSB on page 177*.
- **ISB** The Instruction Synchronization Barrier (ISB) ensures that the effect of all completed memory transactions is recognizable by subsequent instructions. See *ISB on page 178*.

**MPU programming**

Use a DSB, followed by an ISB instruction or exception return to ensure that the new MPU configuration is used by subsequent instructions.

2.3.5 **Memory endianness**

The processor views memory as a linear collection of bytes numbered in ascending order from zero. For example, bytes 0-3 hold the first stored word, and bytes 4-7 hold the second stored word. *Little-endian format on page 36* describes how words of data are stored in memory.

**Little-endian format**

In little-endian format, the processor stores the least significant byte of a word at the lowest-numbered byte, and the most significant byte at the highest-numbered byte. For example:
Figure 9. Little-endian format

2.3.6 Synchronization primitives

The instruction set support for the Cortex®-M7 processor includes pairs of synchronization primitives. These provide a non-blocking mechanism that a thread or process can use to obtain exclusive access to a memory location. Software can use them to perform a guaranteed read-modify-write memory update sequence, or for a semaphore mechanism.

A pair of synchronization primitives comprises:

A Load-Exclusive instruction

Used to read the value of a memory location, requesting exclusive access to that location.

A Store-Exclusive instruction

Used to attempt to write to the same memory location, returning a status bit to a register. If this bit is:

- 0 it indicates that the thread or process gained exclusive access to the memory, and the write succeeds,
- 1 it indicates that the thread or process did not gain exclusive access to the memory, and no write was performed.

The pairs of Load-Exclusive and Store-Exclusive instructions are:

- The word instructions LDREX and STREX.
- The halfword instructions LDREXH and STREXH.
- The byte instructions LDREXB and STREXB.

Software must use a Load-Exclusive instruction with the corresponding Store-Exclusive instruction.

To perform an exclusive read-modify-write of a memory location, software must:

1. Use a Load-Exclusive instruction to read the value of the location.
2. Modify the value, as required.
3. Use a Store-Exclusive instruction to attempt to write the new value back to the memory location.
4. Test the returned status bit. If this bit is:
   - 0: The read-modify-write completed successfully.
   - 1: No write was performed. This indicates that the value returned at step 1 might be out of date. The software must retry the entire read-modify-write sequence.
Software can use the synchronization primitives to implement a semaphore as follows:

1. Use a Load-Exclusive instruction to read from the semaphore address to check whether the semaphore is free.
2. If the semaphore is free, use a Store-Exclusive to write the claim value to the semaphore address.
3. If the returned status bit from step 2 indicates that the Store-Exclusive succeeded then the software has claimed the semaphore. However, if the Store-Exclusive failed, another process might have claimed the semaphore after the software performed step 1.

The Cortex®-M7 processor includes an exclusive access monitor, that tags the fact that the processor has executed a Load-Exclusive instruction. If the processor is part of a multiprocessor system and the address is in a shared region of memory, the system also globally tags the memory locations addressed by exclusive accesses by each processor.

The processor removes its exclusive access tag if:
- It executes a CLREX instruction.
- It executes a STREX instruction, regardless of whether the write succeeds.
- An exception occurs. This means the processor can resolve semaphore conflicts between different threads.

In a multiprocessor implementation:
- Executing a CLREX instruction removes only the local exclusive access tag for the processor.
- Executing a STREX instruction, or an exception, removes the local exclusive access tags for the processor.
- Executing a STREX instruction to a shared memory region can also remove the global exclusive access tags for the processor in the system.

For more information about the synchronization primitive instructions, see LDREX and STREX on page 83 and CLREX on page 84.

### 2.3.7 Programming hints for the synchronization primitives

ISO/IEC C cannot directly generate the exclusive access instructions. CMSIS provides intrinsic functions for generation of these instructions:

<table>
<thead>
<tr>
<th>Instruction</th>
<th>CMSIS function</th>
</tr>
</thead>
<tbody>
<tr>
<td>LDREX</td>
<td>uint32_t __LDREXW (uint32_t *addr)</td>
</tr>
<tr>
<td>LDREXH</td>
<td>uint16_t __LDREXH (uint16_t *addr)</td>
</tr>
<tr>
<td>LDREXB</td>
<td>uint8_t __LDREXB (uint8_t *addr)</td>
</tr>
<tr>
<td>STREX</td>
<td>uint32_t __STREXW (uint32_t value, uint32_t *addr)</td>
</tr>
<tr>
<td>STREXH</td>
<td>uint32_t __STREXH (uint16_t value, uint16_t *addr)</td>
</tr>
<tr>
<td>STREXB</td>
<td>uint32_t __STREXB (uint8_t value, uint8_t *addr)</td>
</tr>
<tr>
<td>CLREX</td>
<td>void __CLREX (void)</td>
</tr>
</tbody>
</table>
For example:

```c
uint16_t  value;
uint16_t *address = 0x20001002;
value = __LDREXH (address);    // load 16-bit value from memory address 0x20001002
```

### 2.4 Exception model

This section describes the exception model. It describes:

- Exception states.
- Exception types.
- Exception handlers on page 41.
- Vector table on page 42.
- Exception priorities on page 43.
- Interrupt priority grouping on page 43.
- Exception entry and return on page 44.

#### 2.4.1 Exception states

Each exception is in one of the following states:

- **Inactive**
  - The exception is not active and not pending.

- **Pending**
  - The exception is waiting to be serviced by the processor.
  - An interrupt request from a peripheral or from software can change the state of the corresponding interrupt to pending.

- **Active**
  - An exception that is being serviced by the processor but has not completed.
  - **Note:** An exception handler can interrupt the execution of another exception handler. In this case both exceptions are in the active state.

- **Active and pending**
  - The exception is being serviced by the processor and there is a pending exception from the same source.

#### 2.4.2 Exception types

The exception types are:

- **Reset**
  - Reset is invoked on power up or a warm reset. The exception model treats reset as a special form of exception. When reset is asserted, the operation of the processor stops, potentially at any point in an instruction. When reset is deasserted, execution restarts from the address provided by the reset entry in the vector table. Execution restarts as privileged execution in Thread mode.
NMI

A NonMaskable Interrupt (NMI) can be signalled by a peripheral or triggered by software. This is the highest priority exception other than reset. It is permanently enabled and has a fixed priority of -2. NMIs cannot be:

- Masked or prevented from activation by any other exception.
- Preempted by any exception other than Reset.

HardFault

A HardFault is an exception that occurs because of an error during normal or exception processing. HardFaults have a fixed priority of -1, meaning they have higher priority than any exception with configurable priority.

SVCall

A Supervisor Call (SVC) is an exception that is triggered by the SVC instruction. In an OS environment, applications can use SVC instructions to access OS kernel functions and device drivers.

PendSV

PendSV is an interrupt-driven request for system-level service. In an OS environment, use PendSV for context switching when no other exception is active.

SysTick

A SysTick exception is an exception the system timer generates when it reaches zero. Software can also generate a SysTick exception. In an OS environment, the processor can use this exception as system tick.

Interrupt (IRQ)

An interrupt, or IRQ, is an exception signalled by a peripheral, or generated by a software request. All interrupts are asynchronous to instruction execution. In the system, peripherals use interrupts to communicate with the processor.

Table 19. Properties of the different exception types

<table>
<thead>
<tr>
<th>Exception number(1)</th>
<th>IRQ number(1)</th>
<th>Exception type</th>
<th>Priority</th>
<th>Vector address(2)</th>
<th>Activation</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>-</td>
<td>Reset</td>
<td>-3, the highest</td>
<td>0x00000004</td>
<td>Asynchronous</td>
</tr>
<tr>
<td>2</td>
<td>-14</td>
<td>NMI</td>
<td>-2</td>
<td>0x00000008</td>
<td>Asynchronous</td>
</tr>
<tr>
<td>3</td>
<td>-13</td>
<td>HardFault</td>
<td>-1</td>
<td>0x0000000C</td>
<td>Synchronous</td>
</tr>
<tr>
<td>4</td>
<td>-12</td>
<td>MemManage</td>
<td>Configurable(3)</td>
<td>0x00000010</td>
<td>Synchronous</td>
</tr>
<tr>
<td>5</td>
<td>-11</td>
<td>BusFault</td>
<td>Configurable(3)</td>
<td>0x00000014</td>
<td>Synchronous when precise, asynchronous when imprecise</td>
</tr>
<tr>
<td>6</td>
<td>-10</td>
<td>UsageFault</td>
<td>Configurable(3)</td>
<td>0x00000018</td>
<td>Synchronous</td>
</tr>
<tr>
<td>7-10</td>
<td>-</td>
<td>Reserved</td>
<td>-</td>
<td>-</td>
<td>-</td>
</tr>
<tr>
<td>11</td>
<td>-5</td>
<td>SVCall</td>
<td>Configurable(3)</td>
<td>0x0000002C</td>
<td>Synchronous</td>
</tr>
<tr>
<td>12-13</td>
<td>-</td>
<td>Reserved</td>
<td>-</td>
<td>-</td>
<td>-</td>
</tr>
<tr>
<td>14</td>
<td>-2</td>
<td>PendSV</td>
<td>Configurable(3)</td>
<td>0x00000038</td>
<td>Asynchronous</td>
</tr>
</tbody>
</table>
For an asynchronous exception, other than reset, the processor can execute additional instructions between when the exception is triggered and when the processor enters the exception handler.

Privileged software can disable the exceptions that Table 19 on page 40 shows as having configurable priority, see:

- System handler control and state register on page 204
- Interrupt clear-enable registers on page 186.

For more information about HardFaults, MemManage faults, BusFaults, and UsageFaults, see Section 2.5: Fault handling on page 47

### 2.4.3 Exception handlers

The processor handles exceptions using:

**Interrupt Service Routines (ISRs)** Interrupts IRQ0 to IRQ239 are the exceptions handled by ISRs

**Fault handler** HardFault, MemManage fault, UsageFault, and BusFault are fault exceptions handled by the fault handler.

**System handlers** NMI, PendSV, SVC, and SysTick, and the fault exceptions are all system exceptions handled by system handlers.
2.4.4 Vector table

The vector table contains the reset value of the stack pointer, and the start addresses, also called exception vectors, for all exception handlers. Figure 10 on page 42 shows the order of the exception vectors in the vector table. The least-significant bit of each vector must be 1, indicating that the exception handler is Thumb code, see Thumb state on page 25.

![Vector Table](image)

On system reset, the vector table is at address 0x00000000. Privileged software can write to the VTOR to relocate the vector table start address to a different memory location, in the range 0x00000000 to 0xFFFFFF80.

The silicon vendor must configure the top range value, which is dependent on the number of interrupts implemented. The minimum alignment is 32 words, enough for up to 16 interrupts. For more interrupts, adjust the alignment by rounding up to the next power of two. For example, if the user requires 21 interrupts, the alignment must be on a 64-word boundary because the required table size is 37 words, and the next power of two is 64, see Vector table offset register on page 197.

Arm recommends that the user locates the vector table in either the CODE, SRAM, External RAM, or External Device areas of the system memory map, see Cortex®-M7 configurations on page 30. Using the Peripheral, Private peripheral bus, or Vendor-specific memory areas.
can lead to unpredictable behavior in some systems. This is because the processor uses a different interfaces for load/store instructions and vector fetch in these memory areas. If the vector table is located in a region of memory that is cacheable, core must treat any load or store to the vector as self-modifying code and use cache maintenance instructions to synchronize the update to the data and instruction caches, see Cache maintenance design hints and tips on page 244.

2.4.5 Exception priorities

As Table 19 on page 40 shows, all the exceptions have an associated priority, with:

- A lower priority value indicating a higher priority.
- Configurable priorities for all the exceptions except Reset, HardFault, and NMI.

If the software does not configure any priorities, then all the exceptions with a configurable priority have a priority of 0. For information about configuring the exception priorities see

- System handler priority registers on page 202.
- Interrupt priority registers on page 188.

Note: Configurable priority values are in the range 0-255. This means that the Reset, HardFault, and NMI exceptions, with fixed negative priority values, always have higher priority than any other exception.

For example, assigning a higher priority value to IRQ[0] and a lower priority value to IRQ[1] means that IRQ[1] has higher priority than IRQ[0]. If both IRQ[1] and IRQ[0] are asserted, IRQ[1] is processed before IRQ[0].

If multiple pending exceptions have the same priority, the pending exception with the lowest exception number takes precedence. For example, if both IRQ[0] and IRQ[1] are pending and have the same priority, then IRQ[0] is processed before IRQ[1].

When the processor is executing an exception handler, the exception handler is preempted if a higher priority exception occurs. If an exception occurs with the same priority as the exception being handled, the handler is not preempted, irrespective of the exception number. However, the status of the new interrupt changes to pending.

2.4.6 Interrupt priority grouping

To increase priority control in systems with interrupts, the NVIC supports priority grouping. This divides each interrupt priority register entry into two fields:

- An upper field that defines the group priority.
- A lower field that defines a subpriority within the group.

Only the group priority determines preemption of interrupt exceptions. When the processor is executing an interrupt exception handler, another interrupt with the same group priority as the interrupt being handled does not preempt the handler.

If multiple pending interrupts have the same group priority, the subpriority field determines the order in which they are processed. If multiple pending interrupts have the same group priority and subpriority, the interrupt with the lowest IRQ number is processed first.

For information about splitting the interrupt priority fields into group priority and subpriority, see Application interrupt and reset control register on page 197.
2.4.7 Exception entry and return

Descriptions of exception handling use the following terms:

Preemption
When the processor is executing an exception handler, an exception can preempt the exception handler if its priority is higher than the priority of the exception being handled. See Interrupt priority grouping on page 43 for more information about preemption by an interrupt.

When one exception preempts another, the exceptions are called nested exceptions. See Exception entry on page 44 more information.

Return
This occurs when the exception handler is completed, and:

- There is no pending exception with sufficient priority to be serviced.
- The completed exception handler was not handling a late-arriving exception.

The processor pops the stack and restores the processor state to the state it had before the interrupt occurred. See Exception return on page 46 for more information.

Tail-chaining
This mechanism speeds up exception servicing. On completion of an exception handler, if there is a pending exception that meets the requirements for exception entry, the stack pop is skipped and control transfers to the new exception handler.

Late-arriving
This mechanism speeds up preemption. If a higher priority exception occurs during state saving for a previous exception, the processor switches to handle the higher priority exception and initiates the vector fetch for that exception. State saving is not affected by late arrival because the state saved is the same for both exceptions. Therefore the state saving continues uninterrupted. The processor can accept a late arriving exception until the first instruction of the exception handler of the original exception enters the execute stage of the processor. On return from the exception handler of the late-arriving exception, the normal tail-chaining rules apply.

Exception entry

The exception entry occurs when there is a pending exception with sufficient priority and either:

- The processor is in Thread mode.
- The new exception is of higher priority than the exception being handled, in which case the new exception preempts the original exception.

When one exception preempts another, the exceptions are nested.

Sufficient priority means the exception has more priority than any limits set by the mask registers, see Exception mask registers on page 25. An exception with less priority than this is pending but is not handled by the processor.

When the processor takes an exception, unless the exception is a tail-chained or a late-arriving exception, the processor pushes information onto the current stack. This operation is referred to as stacking and the structure of eight data words is referred as the stack frame.
When using floating-point routines, the Cortex®-M7 processor automatically stacks the architected floating-point state on exception entry. Figure 11 on page 45 shows the Cortex®-M7 stack frame layout when floating-point state is preserved on the stack as the result of an interrupt or an exception.

Note: Where stack space for floating-point state is not allocated, the stack frame is the same as that of Armv7-M implementations without an FPU. Figure 11 on page 45 shows this stack frame also.

Figure 11. Exception stack frame

Immediately after stacking, the stack pointer indicates the lowest address in the stack frame. The alignment of the stack frame is controlled using the STKALIGN bit of the Configuration Control register (CCR).

Note: In the Cortex®-M7 processor CCR.STKALIGN is read-only and has a value of 1. This means the stack address is always 8-byte aligned.

The stack frame includes the return address. This is the address of the next instruction in the interrupted program. This value is restored to the PC at exception return so that the interrupted program resumes.

In parallel to the stacking operation, the processor performs a vector fetch that reads the exception handler start address from the vector table. When stacking is complete, the
The Cortex-M7 processor starts executing the exception handler. At the same time, the processor writes an EXC_RETURN value to the LR. This indicates which stack pointer corresponds to the stack frame and what operation mode the processor was in before the entry occurred.

If no higher priority exception occurs during exception entry, the processor starts executing the exception handler and automatically changes the status of the corresponding pending interrupt to active.

If another higher priority exception occurs during exception entry, the processor starts executing the exception handler for this exception and does not change the pending status of the earlier exception. This is the late arrival case.

**Exception return**

The exception return occurs when the processor is in Handler mode and executes one of the following instructions to load the EXC_RETURN value into the PC:

- An LDM or POP instruction that loads the PC.
- An LDR instruction with PC as the destination.
- A BX instruction using any register.

EXC_RETURN is the value loaded into the LR on exception entry. The exception mechanism relies on this value to detect when the processor has completed an exception handler. The lowest five bits of this value provide information on the return stack and processor mode. *Table 20* shows the EXC_RETURN values with a description of the exception return behavior.

All EXC_RETURN values have bits[31:5] set to one. When this value is loaded into the PC it indicates to the processor that the exception is complete, and the processor initiates the appropriate exception return sequence.

<table>
<thead>
<tr>
<th>EXC_RETURN[31:0]</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xFFFFFFFF1</td>
<td>Return to Handler mode, exception return uses non-floating-point state from the MSP and execution uses MSP after return.</td>
</tr>
<tr>
<td>0xFFFFFFFF9</td>
<td>Return to Thread mode, exception return uses non-floating-point state from MSP and execution uses MSP after return.</td>
</tr>
<tr>
<td>0xFFFFFFFD</td>
<td>Return to Thread mode, exception return uses non-floating-point state from the PSP and execution uses PSP after return.</td>
</tr>
<tr>
<td>0xFFFFFFFFE1</td>
<td>Return to Handler mode, exception return uses floating-point-state from MSP and execution uses MSP after return.</td>
</tr>
<tr>
<td>0xFFFFFFFFE9</td>
<td>Return to Thread mode, exception return uses floating-point state from MSP and execution uses MSP after return.</td>
</tr>
<tr>
<td>0xFFFFFFFFED</td>
<td>Return to Thread mode, exception return uses floating-point state from PSP and execution uses PSP after return.</td>
</tr>
</tbody>
</table>
2.5 Fault handling

Faults are a subset of the exceptions, see *Exception model on page 39*. Faults are generated by:

- A bus error on:
  - An instruction fetch or vector table load.
  - A data access.
- An internally-detected error such as an undefined instruction.
- Attempting to execute an instruction from a memory region marked as *Execute Never* (XN).
- A privilege violation or an attempt to access an unmanaged region causing an MPU fault.

2.5.1 Fault types

*Table 21* shows the types of fault, the handler used for the fault, the corresponding fault status register, and the register bit that indicates that the fault has occurred. See *Configuration and control register on page 200* for more information about the fault status registers.

<table>
<thead>
<tr>
<th>Fault</th>
<th>Handler</th>
<th>Bit name</th>
<th>Fault status register</th>
</tr>
</thead>
<tbody>
<tr>
<td>Bus error on a vector read</td>
<td>HardFault</td>
<td>VECTTBL</td>
<td>HardFault status register on page 210</td>
</tr>
<tr>
<td>Fault escalated to a hard fault</td>
<td></td>
<td>FORCED</td>
<td></td>
</tr>
<tr>
<td>MPU or default memory map mismatch:</td>
<td>MemManage</td>
<td>-</td>
<td></td>
</tr>
<tr>
<td>On instruction access</td>
<td></td>
<td>IACCVIOL (1)</td>
<td>MemManage fault status register on page 206</td>
</tr>
<tr>
<td>On data access</td>
<td>MemManage</td>
<td>DACCVIOL</td>
<td></td>
</tr>
<tr>
<td>During exception stacking</td>
<td>MemManage</td>
<td>MSTKERR</td>
<td></td>
</tr>
<tr>
<td>During exception unstacking</td>
<td>MemManage</td>
<td>MUNSKERR</td>
<td></td>
</tr>
<tr>
<td>During lazy floating-point state preservation</td>
<td>MemManage</td>
<td>MLSPERR</td>
<td></td>
</tr>
<tr>
<td>Bus error:</td>
<td></td>
<td>-</td>
<td></td>
</tr>
<tr>
<td>During exception stacking</td>
<td>BusFault</td>
<td>STKERR</td>
<td>BusFault status register on page 207</td>
</tr>
<tr>
<td>During exception unstacking</td>
<td>BusFault</td>
<td>UNSTKERR</td>
<td></td>
</tr>
<tr>
<td>During instruction prefetch</td>
<td>BusFault</td>
<td>IBUSERR</td>
<td></td>
</tr>
<tr>
<td>During lazy floating-point state preservation</td>
<td>BusFault</td>
<td>LSPERR</td>
<td></td>
</tr>
<tr>
<td>Precise data bus error</td>
<td></td>
<td>PRECISERR</td>
<td></td>
</tr>
<tr>
<td>Imprecise data bus error</td>
<td></td>
<td>IMPRECISERR</td>
<td></td>
</tr>
</tbody>
</table>
2.5.2 Fault escalation and hard faults

All the fault exceptions except for HardFault have configurable exception priority, see System handler priority registers on page 202. The software can disable the execution of the handlers for these faults, see System handler control and state register on page 204.

Usually, the exception priority, together with the values of the exception mask registers, determines whether the processor enters the fault handler, and whether a fault handler can preempt another fault handler, as described in Exception model on page 39.

In some situations, a fault with configurable priority is treated as a HardFault. This is called priority escalation, and the fault is described as escalated to HardFault. Escalation to HardFault occurs when:

- A fault handler causes the same kind of fault as the one it is servicing. This escalation to HardFault occurs because a fault handler cannot preempt itself because it must have the same priority as the current priority level.
- A fault handler causes a fault with the same or lower priority as the fault it is servicing. This is because the handler for the new fault cannot preempt the currently executing fault handler.
- An exception handler causes a fault for which the priority is the same as or lower than the currently executing exception.
- A fault occurs and the handler for that fault is not enabled.

If a BusFault occurs during a stack push when entering a BusFault handler, the BusFault does not escalate to a HardFault. This means that if a corrupted stack causes a fault, the fault handler executes even though the stack push for the handler failed. The fault handler operates but the stack contents are corrupted.

Note: Only Reset and NMI can preempt the fixed priority HardFault. A HardFault can preempt any exception other than Reset, NMI, or another HardFault.

<table>
<thead>
<tr>
<th>Fault</th>
<th>Handler</th>
<th>Bit name</th>
<th>Fault status register</th>
</tr>
</thead>
<tbody>
<tr>
<td>Attempt to access a coprocessor</td>
<td>UsageFault</td>
<td>NOCP</td>
<td>UsageFault status register on page 209</td>
</tr>
<tr>
<td>Undefined instruction</td>
<td></td>
<td>UNDEFINSTR</td>
<td></td>
</tr>
<tr>
<td>Attempt to enter an invalid instruction set state(^{(2)})</td>
<td></td>
<td>INVSTATE</td>
<td></td>
</tr>
<tr>
<td>Invalid EXC_RETURN value</td>
<td></td>
<td>INVPC</td>
<td></td>
</tr>
<tr>
<td>Illegal unaligned load or store</td>
<td></td>
<td>UNALIGNED</td>
<td></td>
</tr>
<tr>
<td>Divide By 0</td>
<td></td>
<td>DIVBYZERO</td>
<td></td>
</tr>
</tbody>
</table>

1. Occurs on an access to an XN region even if the processor does not include an MPU or the MPU is disabled.
2. Attempting to use an instruction set other than the Thumb instruction set or returns to a non load/store-multiple instruction with ICI continuation.

Table 21. Faults (continued)
2.5.3 Synchronous and Asynchronous bus faults

In the Cortex®-M7 processor all bus faults triggered by:

- Processor load operations are synchronous.
- Processor store operations are asynchronous, including stores to Device and Strongly-ordered regions.
- Debugger load or store accesses are synchronous, and are visible to the debugger interface only.

When an asynchronous bus fault is triggered, the BusFault exception is pended. If the BusFault handler is not enabled, the HardFault exception is pended instead. The HardFault caused by the asynchronous BusFault never escalates into lockup.

If an IRQ is triggered after the write, the write buffer might not drain before the ISR is executed. Therefore an asynchronous BusFault can occur across context boundaries.

A synchronous BusFault can escalate into lockup if it occurs inside a NMI or HardFault handler.

Cache maintenance operations can also trigger a bus fault. See Faults handling considerations on page 244 for more information.

2.5.4 Fault status registers and fault address registers

The fault status registers indicate the cause of a fault. For synchronous BusFaults and MemManage faults, the fault address register indicates the address accessed by the operation that caused the fault, as shown in Table 22.

<table>
<thead>
<tr>
<th>Handler</th>
<th>Status register name</th>
<th>Address register name</th>
<th>Register description</th>
</tr>
</thead>
<tbody>
<tr>
<td>HardFault</td>
<td>HFSR</td>
<td>-</td>
<td>HardFault status register on page 210</td>
</tr>
<tr>
<td>MemManage</td>
<td>MMFSR</td>
<td>MMFAR</td>
<td>MemManage fault status register on page 206</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>MemManage fault address register on page 211</td>
</tr>
<tr>
<td>BusFault</td>
<td>BFSR</td>
<td>BFAR</td>
<td>BusFault status register on page 207</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>BusFault address register on page 212</td>
</tr>
<tr>
<td>UsageFault</td>
<td>UFSR</td>
<td>-</td>
<td>UsageFault status register on page 209</td>
</tr>
</tbody>
</table>

2.5.5 Lockup

The processor enters a lockup state if a fault occurs when executing the NMI or HardFault handlers. When the processor is in lockup state it does not execute any instructions. The processor remains in lockup state until either:

- It is reset.
- An NMI occurs.
- It is halted by a debugger.

Note: If a lockup state occurs from the NMI handler a subsequent NMI does not cause the processor to leave lockup state.
2.6 Power management

The Cortex®-M7 processor sleep modes reduce power consumption:
- Sleep mode stops the processor clock.
- Deep sleep mode stops the system clock and switches off the PLL and the Flash memory.

The SLEEPDEEP bit of the SCR selects which sleep mode is used, see System control register on page 199.

For more information about the behavior of the sleep modes see specific STM32 product referene manual.

This section describes the mechanisms for entering sleep mode, and the conditions for waking up from sleep mode.

2.6.1 Entering sleep mode

This section describes the mechanisms software can use to put the processor into sleep mode.

The system can generate spurious wakeup events, for example a debug operation wakes up the processor. Therefore software must be able to put the processor back into sleep mode after such an event. A program might have an idle loop to put the processor back to sleep mode.

Wait for interrupt

The wait for interrupt instruction, WFI, causes immediate entry to sleep mode unless the wakeup condition is true, see Wakeup from WFI or sleep-on-exit on page 51. When the processor executes a WFI instruction it stops executing instructions and enters sleep mode. See WFI on page 182 for more information.

Wait for event

The wait for event instruction, WFE, causes entry to sleep mode depending on the value of a one-bit event register. When the processor executes a WFE instruction, it checks the value of the event register:
- 0: The processor stops executing instructions and enters sleep mode.
- 1: The processor clears the register to 0 and continues executing instructions without entering sleep mode.

See WFE on page 181 for more information.

If the event register is 1, this indicates that the processor must not enter sleep mode on execution of a WFE instruction. Typically, this is because an external event signal is asserted, or a processor in the system has executed an SEV instruction, see SEV on page 180. Software cannot access this register directly.

Sleep-on-exit

If the SLEEPONEXIT bit of the SCR is set to 1, when the processor completes the execution of all exception handlers it returns to Thread mode and immediately enters sleep mode. Use this mechanism in applications that only require the processor to run when an exception occurs.
2.6.2 Wakeup from sleep mode

The conditions for the processor to wakeup depend on the mechanism that cause it to enter sleep mode.

Wakeup from WFI or sleep-on-exit

Normally, the processor wakes up only when it detects an exception with sufficient priority to cause exception entry. Some embedded systems might have to execute system restore tasks after the processor wakes up, and before it executes an interrupt handler. To achieve this set the PRIMASK bit to 1 and the FAULTMASK bit to 0. If an interrupt arrives that is enabled and has a higher priority than the current exception priority, the processor wakes up but does not execute the interrupt handler until the processor sets PRIMASK to zero. For more information about PRIMASK and FAULTMASK see Exception mask registers on page 25.

Wakeup from WFE

The processor wakes up if:
- It detects an exception with sufficient priority to cause exception entry.
- It detects an external event signal, see The external event input.
- In a multiprocessor system, another processor in the system executes an SEV instruction.

In addition, if the SEVONPEND bit in the SCR is set to 1, any new pending interrupt triggers an event and wakes up the processor, even if the interrupt is disabled or has insufficient priority to cause exception entry. For more information about the SCR see System control register on page 199.

2.6.3 The external event input

The processor provides an external event input signal. Peripherals can drive this signal, either to wake the processor from WFE, or to set the internal WFE event register to one to indicate that the processor must not enter sleep mode on a later WFE instruction. See Wait for event on page 50 for more information.

2.6.4 Power management programming hints

ISO/IEC C cannot directly generate the WFI and WFE instructions. The CMSIS provides the following functions for these instructions:

```c
void __WFE(void) // Wait for Event
void __WFI(void) // Wait for Interrupt
```
3 The Cortex-M7 instruction set

3.1 Instruction set summary

The processor implements Armv7-M instruction set and features provided by the Armv7E-M architecture profile. Table 23 lists the supported instructions.

In Table 23:
- Angle brackets, <>, enclose alternative forms of the operand.
- Braces, {}, enclose optional operands.
- The Operands column is not exhaustive.
- Op2 is a flexible second operand that can be either a register or a constant.
- Most instructions can use an optional condition code suffix.

For more information on the instructions and operands, see the instruction descriptions.

### Table 23. Cortex®-M7 instructions

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Operands</th>
<th>Brief description</th>
<th>Flags</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADC, ADCS</td>
<td>(Rd,) Rn, Op2</td>
<td>Add with Carry</td>
<td>N,Z,C,V</td>
<td>3.5.1 on page 87</td>
</tr>
<tr>
<td>ADD, ADDS</td>
<td>(Rd,) Rn, Op2</td>
<td>Add</td>
<td>N,Z,C,V</td>
<td>3.5.1 on page 87</td>
</tr>
<tr>
<td>ADD, ADDW</td>
<td>(Rd,) Rn, #imm12</td>
<td>Add</td>
<td>-</td>
<td>3.5.1 on page 87</td>
</tr>
<tr>
<td>ADR</td>
<td>Rd, label</td>
<td>Load PC-relative Address</td>
<td>-</td>
<td>3.4.1 on page 73</td>
</tr>
<tr>
<td>AND, ANDS</td>
<td>(Rd,) Rn, Op2</td>
<td>Logical AND</td>
<td>N,Z,C</td>
<td>3.5.2 on page 89</td>
</tr>
<tr>
<td>ASR, ASRS</td>
<td>Rd, Rm, &lt;Rs</td>
<td>#n&gt;</td>
<td>Arithmetic Shift Right</td>
<td>N,Z,C</td>
</tr>
<tr>
<td>B</td>
<td>label</td>
<td>Branch</td>
<td>-</td>
<td>3.10.1 on page 145</td>
</tr>
<tr>
<td>BFC</td>
<td>Rd, #lsb, #width</td>
<td>Bit Field Clear</td>
<td>-</td>
<td>3.9.1 on page 142</td>
</tr>
<tr>
<td>BFI</td>
<td>Rd, Rn, #lsb, #width</td>
<td>Bit Field Insert</td>
<td>-</td>
<td>3.9.1 on page 142</td>
</tr>
<tr>
<td>BIC, BICS</td>
<td>(Rd,) Rn, Op2</td>
<td>Bit Clear</td>
<td>N,Z,C</td>
<td>3.5.2 on page 89</td>
</tr>
<tr>
<td>BKPT</td>
<td>#imm</td>
<td>Breakpoint</td>
<td>-</td>
<td>3.12.1 on page 175</td>
</tr>
<tr>
<td>BL</td>
<td>label</td>
<td>Branch with Link</td>
<td>-</td>
<td>3.10.1 on page 145</td>
</tr>
<tr>
<td>BLX</td>
<td>Rm</td>
<td>Branch indirect with Link</td>
<td>-</td>
<td>3.10.1 on page 145</td>
</tr>
<tr>
<td>BX</td>
<td>Rm</td>
<td>Branch indirect</td>
<td>-</td>
<td>3.10.1 on page 145</td>
</tr>
<tr>
<td>CBNZ</td>
<td>Rn, label</td>
<td>Compare and Branch if Non Zero</td>
<td>-</td>
<td>3.10.2 on page 147</td>
</tr>
<tr>
<td>CBZ</td>
<td>Rn, label</td>
<td>Compare and Branch if Zero</td>
<td>-</td>
<td>3.10.2 on page 147</td>
</tr>
<tr>
<td>CLREX</td>
<td>-</td>
<td>Clear Exclusive</td>
<td>-</td>
<td>3.4.10 on page 84</td>
</tr>
<tr>
<td>CLZ</td>
<td>Rd, Rm</td>
<td>Count Leading Zeros</td>
<td>-</td>
<td>3.5.4 on page 91</td>
</tr>
<tr>
<td>CMN</td>
<td>Rn, Op2</td>
<td>Compare Negative</td>
<td>N,Z,C,V</td>
<td>3.5.5 on page 92</td>
</tr>
<tr>
<td>CMP</td>
<td>Rn, Op2</td>
<td>Compare</td>
<td>N,Z,C,V</td>
<td>3.5.5 on page 92</td>
</tr>
<tr>
<td>Mnemonic</td>
<td>Operands</td>
<td>Brief description</td>
<td>Flags</td>
<td>Page</td>
</tr>
<tr>
<td>----------</td>
<td>----------</td>
<td>------------------</td>
<td>-------</td>
<td>------</td>
</tr>
<tr>
<td>CPSID</td>
<td>i</td>
<td>Change Processor State, Disable Interrupts</td>
<td>-</td>
<td>3.12.2 on page 176</td>
</tr>
<tr>
<td>CPSIE</td>
<td>i</td>
<td>Change Processor State, Enable Interrupts</td>
<td>-</td>
<td>3.12.2 on page 176</td>
</tr>
<tr>
<td>DMB</td>
<td>-</td>
<td>Data Memory Barrier</td>
<td>-</td>
<td>3.12.3 on page 177</td>
</tr>
<tr>
<td>DSB</td>
<td>-</td>
<td>Data Synchronization Barrier</td>
<td>-</td>
<td>3.12.4 on page 177</td>
</tr>
<tr>
<td>EOR, EORS</td>
<td>(Rd,) Rn, Op2</td>
<td>Exclusive OR</td>
<td>N,Z,C</td>
<td>3.5.2 on page 89</td>
</tr>
<tr>
<td>ISB</td>
<td>-</td>
<td>Instruction Synchronization Barrier</td>
<td>-</td>
<td>3.12.5 on page 178</td>
</tr>
<tr>
<td>IT</td>
<td>-</td>
<td>If-Then condition block</td>
<td>-</td>
<td>3.10.3 on page 148</td>
</tr>
<tr>
<td>LDM</td>
<td>Rn(!), reglist</td>
<td>Load Multiple registers, increment after</td>
<td>-</td>
<td>3.4.6 on page 79</td>
</tr>
<tr>
<td>LDMDB, LDMEA</td>
<td>Rn(!), reglist</td>
<td>Load Multiple registers, decrement before</td>
<td>-</td>
<td>3.4.6 on page 79</td>
</tr>
<tr>
<td>LDMFD, LDMIA</td>
<td>Rn(!), reglist</td>
<td>Load Multiple registers, increment after</td>
<td>-</td>
<td>3.4.6 on page 79</td>
</tr>
<tr>
<td>LDR</td>
<td>Rt, [Rn, #offset]</td>
<td>Load register with word</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>LDRB, LDRBT</td>
<td>Rt, [Rn, #offset]</td>
<td>Load register with byte</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>LDRD</td>
<td>Rt, Rt2, [Rn, #offset]</td>
<td>Load register with two bytes</td>
<td>-</td>
<td>3.4.4 on page 77</td>
</tr>
<tr>
<td>LDREX</td>
<td>Rt, [Rn, #offset]</td>
<td>Load register Exclusive</td>
<td>-</td>
<td>3.4.9 on page 83</td>
</tr>
<tr>
<td>LDREXB</td>
<td>Rt, [Rn]</td>
<td>Load register Exclusive with Byte</td>
<td>-</td>
<td>3.4.9 on page 83</td>
</tr>
<tr>
<td>LDREXH</td>
<td>Rt, [Rn]</td>
<td>Load register Exclusive with Halfword</td>
<td>-</td>
<td>3.4.9 on page 83</td>
</tr>
<tr>
<td>LDRH, LDRHT</td>
<td>Rt, [Rn, #offset]</td>
<td>Load register with Halfword</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>LDRSB, LDRSBT</td>
<td>Rt, [Rn, #offset]</td>
<td>Load register with Signed Byte</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>LDRSH, LDRSHT</td>
<td>Rt, [Rn, #offset]</td>
<td>Load register with Signed Halfword</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>LDRT</td>
<td>Rt, [Rn, #offset]</td>
<td>Load register with word</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>LSL, LLSLS</td>
<td>Rd, Rm, &lt;Rs</td>
<td>#n&gt;</td>
<td>Logical Shift Left</td>
<td>N,Z,C</td>
</tr>
<tr>
<td>LSR, LSRS</td>
<td>Rd, Rm, &lt;Rs</td>
<td>#n&gt;</td>
<td>Logical Shift Right</td>
<td>N,Z,C</td>
</tr>
<tr>
<td>MLA</td>
<td>Rd, Rn, Rm, Ra</td>
<td>Multiply with Accumulate, 32-bit result</td>
<td>-</td>
<td>3.6.1 on page 112</td>
</tr>
<tr>
<td>MLS</td>
<td>Rd, Rn, Rm, Ra</td>
<td>Multiply and Subtract, 32-bit result</td>
<td>-</td>
<td>3.6.1 on page 112</td>
</tr>
<tr>
<td>MOV, MOVS</td>
<td>Rd, Op2</td>
<td>Move</td>
<td>N,Z,C</td>
<td>3.5.6 on page 93</td>
</tr>
<tr>
<td>MOVT</td>
<td>Rd, #imm16</td>
<td>Move Top</td>
<td>-</td>
<td>3.5.7 on page 94</td>
</tr>
<tr>
<td>Mnemonic</td>
<td>Operands</td>
<td>Brief description</td>
<td>Flags</td>
<td>Page</td>
</tr>
<tr>
<td>---------------</td>
<td>----------</td>
<td>------------------------------------</td>
<td>-------</td>
<td>------------</td>
</tr>
<tr>
<td>MOVW, MOV</td>
<td>Rd, #imm16</td>
<td>Move 16-bit constant</td>
<td>N,Z,C</td>
<td>3.5.6 on page 93</td>
</tr>
<tr>
<td>MRS</td>
<td>Rd, spec_reg</td>
<td>Move from Special register to general register</td>
<td>-</td>
<td>3.12.6 on page 178</td>
</tr>
<tr>
<td>MSR</td>
<td>spec_reg, Rm</td>
<td>Move from general register to Special register</td>
<td>N,Z,C,V</td>
<td>3.12.7 on page 179</td>
</tr>
<tr>
<td>MUL, MULS</td>
<td>(Rd,) Rn, Rm</td>
<td>Multiply, 32-bit result</td>
<td>N,Z</td>
<td>3.6.1 on page 112</td>
</tr>
<tr>
<td>MVN, MVNS</td>
<td>Rd, Op2</td>
<td>Move NOT</td>
<td>N,Z,C</td>
<td>3.5.6 on page 93</td>
</tr>
<tr>
<td>NOP</td>
<td>-</td>
<td>No Operation</td>
<td>-</td>
<td>3.12.8 on page 180</td>
</tr>
<tr>
<td>ORN, ORNS</td>
<td>(Rd,) Rn, Op2</td>
<td>Logical OR NOT</td>
<td>N,Z,C</td>
<td>3.5.2 on page 89</td>
</tr>
<tr>
<td>ORR, ORRS</td>
<td>(Rd,) Rn, Op2</td>
<td>Logical OR</td>
<td>N,Z,C</td>
<td>3.5.2 on page 89</td>
</tr>
<tr>
<td>PKHTB, PKHBT</td>
<td>(Rd,) Rn, Rm, Op2</td>
<td>Pack Halfword</td>
<td>-</td>
<td>3.8.1 on page 138</td>
</tr>
<tr>
<td>PLD</td>
<td>[Rn, #offset]</td>
<td>Preload Data</td>
<td>-</td>
<td>3.4.7 on page 81</td>
</tr>
<tr>
<td>POP</td>
<td>reglist</td>
<td>Pop registers from stack</td>
<td>-</td>
<td>3.4.8 on page 82</td>
</tr>
<tr>
<td>PUSH</td>
<td>reglist</td>
<td>Push registers onto stack</td>
<td>-</td>
<td>3.4.8 on page 82</td>
</tr>
<tr>
<td>QADD</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating double and Add</td>
<td>Q</td>
<td>3.7.3 on page 131</td>
</tr>
<tr>
<td>QADD16</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating Add 16</td>
<td>-</td>
<td>3.7.3 on page 131</td>
</tr>
<tr>
<td>QADD8</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating Add 8</td>
<td>-</td>
<td>3.7.3 on page 131</td>
</tr>
<tr>
<td>QASX</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating Add and Subtract with Exchange</td>
<td>-</td>
<td>3.7.4 on page 132</td>
</tr>
<tr>
<td>QDADD</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating Add</td>
<td>Q</td>
<td>3.7.5 on page 133</td>
</tr>
<tr>
<td>QDSUB</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating double and Subtract</td>
<td>Q</td>
<td>3.7.3 on page 131</td>
</tr>
<tr>
<td>QSAX</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating Subtract and Add with Exchange</td>
<td>-</td>
<td>3.7.4 on page 132</td>
</tr>
<tr>
<td>QSUB</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating Subtract</td>
<td>Q</td>
<td>3.7.3 on page 131</td>
</tr>
<tr>
<td>QSUB16</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating Subtract 16</td>
<td>-</td>
<td>3.7.3 on page 131</td>
</tr>
<tr>
<td>QSUB8</td>
<td>(Rd,) Rn, Rm</td>
<td>Saturating Subtract 8</td>
<td>-</td>
<td>3.7.3 on page 131</td>
</tr>
<tr>
<td>RBIT</td>
<td>Rd, Rn</td>
<td>Reverse Bits</td>
<td>-</td>
<td>3.5.8 on page 95</td>
</tr>
<tr>
<td>REV</td>
<td>Rd, Rn</td>
<td>Reverse byte order in a word</td>
<td>-</td>
<td>3.5.8 on page 95</td>
</tr>
<tr>
<td>REV16</td>
<td>Rd, Rn</td>
<td>Reverse byte order in each halfword</td>
<td>-</td>
<td>3.5.8 on page 95</td>
</tr>
<tr>
<td>REVSH</td>
<td>Rd, Rn</td>
<td>Reverse byte order in bottom halfword and sign extend</td>
<td>-</td>
<td>3.5.8 on page 95</td>
</tr>
<tr>
<td>ROR, RORS</td>
<td>Rd, Rm, &lt;Rs</td>
<td>#n&gt;</td>
<td>Rotate Right</td>
<td>N,Z,C</td>
</tr>
<tr>
<td>RRX, RRXS</td>
<td>Rd, Rm</td>
<td>Rotate Right with Extend</td>
<td>N,Z,C</td>
<td>3.5.3 on page 90</td>
</tr>
<tr>
<td>RSB, RSBS</td>
<td>(Rd,) Rn, Op2</td>
<td>Reverse Subtract</td>
<td>N,Z,C,V</td>
<td>3.5.1 on page 87</td>
</tr>
</tbody>
</table>
### Table 23. Cortex®-M7 instructions (continued)

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Operands</th>
<th>Brief description</th>
<th>Flags</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>SADD16</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Add 16</td>
<td>GE</td>
<td>3.5.9 on page 96</td>
</tr>
<tr>
<td>SADD8</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Add 8</td>
<td>GE</td>
<td>3.5.9 on page 96</td>
</tr>
<tr>
<td>SASX</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Add and Subtract with Exchange</td>
<td>GE</td>
<td>3.5.14 on page 101</td>
</tr>
<tr>
<td>SBC, SBCS</td>
<td>(Rd,) Rn, Op2</td>
<td>Subtract with Carry</td>
<td>N,Z,C,V</td>
<td>3.5.1 on page 87</td>
</tr>
<tr>
<td>SBFX</td>
<td>Rd, Rn, #lsb, #width</td>
<td>Signed Bit Field Extract</td>
<td>-</td>
<td>3.9.2 on page 143</td>
</tr>
<tr>
<td>SDIV</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Divide</td>
<td>-</td>
<td>3.6.12 on page 127</td>
</tr>
<tr>
<td>SEL</td>
<td>(Rd,) Rn, Rm</td>
<td>Select bytes</td>
<td>-</td>
<td>3.5.21 on page 108</td>
</tr>
<tr>
<td>SEV</td>
<td>-</td>
<td>Send Event</td>
<td>-</td>
<td>3.12.9 on page 180</td>
</tr>
<tr>
<td>SHADD16</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Halving Add 16</td>
<td>-</td>
<td>3.5.10 on page 97</td>
</tr>
<tr>
<td>SHADD8</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Halving Add 8</td>
<td>-</td>
<td>3.5.10 on page 97</td>
</tr>
<tr>
<td>SHASX</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Halving Add and Subtract with Exchange</td>
<td>-</td>
<td>3.5.11 on page 98</td>
</tr>
<tr>
<td>SHSUB16</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Halving Subtract 16</td>
<td>-</td>
<td>3.5.12 on page 99</td>
</tr>
<tr>
<td>SHSUB8</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Halving Subtract 8</td>
<td>-</td>
<td>3.5.12 on page 99</td>
</tr>
<tr>
<td>SMLABB, SMLABT, SMLATB, SMLATT</td>
<td>Rd, Rn, Rm, Ra</td>
<td>Signed Multiply Accumulate Long (halfwords)</td>
<td>Q</td>
<td>3.6.3 on page 115</td>
</tr>
<tr>
<td>SMLAD, SMLADX</td>
<td>Rd, Rn, Rm, Ra</td>
<td>Signed Multiply Accumulate Dual</td>
<td>Q</td>
<td>3.6.4 on page 116</td>
</tr>
<tr>
<td>SMLAL</td>
<td>RdLo, RdHi, Rn, Rm</td>
<td>Signed Multiply with Accumulate (32 x 32 + 64), 64-bit result</td>
<td>-</td>
<td>3.6.5 on page 117</td>
</tr>
<tr>
<td>SMLALBB, SMLALBT, SMLALTB, SMLALTT</td>
<td>RdLo, RdHi, Rn, Rm</td>
<td>Signed Multiply Accumulate Long, halfwords</td>
<td>-</td>
<td>3.6.5 on page 117</td>
</tr>
<tr>
<td>SMLALD, SMLALDX</td>
<td>RdLo, RdHi, Rn, Rm</td>
<td>Signed Multiply Accumulate Long Dual</td>
<td>-</td>
<td>3.6.5 on page 117</td>
</tr>
<tr>
<td>SMLAWB, SMLAWT</td>
<td>Rd, Rn, Rm, Ra</td>
<td>Signed Multiply Accumulate, word by halfword</td>
<td>Q</td>
<td>3.6.3 on page 115</td>
</tr>
<tr>
<td>SMLSD</td>
<td>Rd, Rn, Rm, Ra</td>
<td>Signed Multiply Subtract Dual</td>
<td>Q</td>
<td>3.6.6 on page 119</td>
</tr>
<tr>
<td>SMLSLD</td>
<td>RdLo, RdHi, Rn, Rm</td>
<td>Signed Multiply Subtract Long Dual</td>
<td>-</td>
<td>3.6.6 on page 119</td>
</tr>
<tr>
<td>SMMLA</td>
<td>Rd, Rn, Rm, Ra</td>
<td>Signed Most significant word Multiply Accumulate</td>
<td>-</td>
<td>3.6.7 on page 121</td>
</tr>
</tbody>
</table>
### Table 23. Cortex®-M7 instructions (continued)

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Operands</th>
<th>Brief description</th>
<th>Flags</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>SMMLS, SMMLR</td>
<td>Rd, Rn, Rm, Ra</td>
<td>Signed Most significant word Multiply Subtract</td>
<td>-</td>
<td>3.6.7 on page 121</td>
</tr>
<tr>
<td>SMMUL, SMMULR</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Most significant word Multiply</td>
<td>-</td>
<td>3.6.8 on page 122</td>
</tr>
<tr>
<td>SMUAD</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed dual Multiply Add</td>
<td>Q</td>
<td>3.6.9 on page 123</td>
</tr>
<tr>
<td>SMULBB, SMULBT</td>
<td>Rd, Rn, Rm</td>
<td>Signed Multiply (halfwords)</td>
<td>-</td>
<td>3.6.10 on page 124</td>
</tr>
<tr>
<td>SMULL</td>
<td>RdLo, RdHi, Rn, Rm</td>
<td>Signed Multiply (32 x 32), 64-bit result</td>
<td></td>
<td>3.6.11 on page 126</td>
</tr>
<tr>
<td>SMULWB, SMULWT</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Multiply word by halfword</td>
<td>-</td>
<td>3.6.10 on page 124</td>
</tr>
<tr>
<td>SMUSD, SMUSDX</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed dual Multiply Subtract</td>
<td>-</td>
<td>3.6.9 on page 123</td>
</tr>
<tr>
<td>SSAT</td>
<td>Rd, #n, Rm (,shift #s)</td>
<td>Signed Saturate Q</td>
<td>Q</td>
<td>3.7.1 on page 129</td>
</tr>
<tr>
<td>SSAT16</td>
<td>Rd, #n, Rm</td>
<td>Signed Saturate 16 Q</td>
<td>Q</td>
<td>3.7.2 on page 130</td>
</tr>
<tr>
<td>SSAX</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Subtract and Add with Exchange</td>
<td>GE</td>
<td>3.5.14 on page 110</td>
</tr>
<tr>
<td>SSUB16</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Subtract 16 Q</td>
<td></td>
<td>3.5.13 on page 100</td>
</tr>
<tr>
<td>SSUB8</td>
<td>(Rd,) Rn, Rm</td>
<td>Signed Subtract 8 Q</td>
<td></td>
<td>3.5.13 on page 100</td>
</tr>
<tr>
<td>STM</td>
<td>Rn(!), reglist</td>
<td>Store Multiple registers, increment after</td>
<td>-</td>
<td>3.4.6 on page 79</td>
</tr>
<tr>
<td>STMDB, STMEA</td>
<td>Rn(!), reglist</td>
<td>Store Multiple registers, decrement before</td>
<td>-</td>
<td>3.4.6 on page 79</td>
</tr>
<tr>
<td>STMFD, STMIA</td>
<td>Rn(!), reglist</td>
<td>Store Multiple registers, increment after</td>
<td>-</td>
<td>3.4.6 on page 79</td>
</tr>
<tr>
<td>STR</td>
<td>Rt, [Rn, #offset]</td>
<td>Store register word</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>STRB, STRBT</td>
<td>Rt, [Rn, #offset]</td>
<td>Store register byte</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>STRD</td>
<td>Rt, Rt2, [Rn, #offset]</td>
<td>Store register two words</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>STREX</td>
<td>Rd, Rt, [Rn, #offset]</td>
<td>Store register Exclusive</td>
<td>-</td>
<td>3.4.9 on page 83</td>
</tr>
<tr>
<td>STREXB</td>
<td>Rd, Rt, [Rn]</td>
<td>Store register Exclusive Byte</td>
<td>-</td>
<td>3.4.9 on page 83</td>
</tr>
<tr>
<td>STREXH</td>
<td>Rd, Rt, [Rn]</td>
<td>Store register Exclusive Halfword</td>
<td>-</td>
<td>3.4.9 on page 83</td>
</tr>
<tr>
<td>STRH, STRHT</td>
<td>Rt, [Rn, #offset]</td>
<td>Store register Halfword</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>STRT</td>
<td>Rt, [Rn, #offset]</td>
<td>Store register word</td>
<td>-</td>
<td>3.4.2 on page 73</td>
</tr>
<tr>
<td>SUB, SUBS</td>
<td>(Rd,) Rn, Op2</td>
<td>Subtract</td>
<td>N,Z,C,V</td>
<td>3.5.1 on page 87</td>
</tr>
<tr>
<td>Mnemonic</td>
<td>Operands</td>
<td>Brief description</td>
<td>Flags</td>
<td>Page</td>
</tr>
<tr>
<td>----------</td>
<td>----------</td>
<td>------------------</td>
<td>-------</td>
<td>------</td>
</tr>
<tr>
<td>SUB, SUBW</td>
<td>(Rd,) Rn, #imm12</td>
<td>Subtract</td>
<td>-</td>
<td>3.5.1 on page 87</td>
</tr>
<tr>
<td>SVC</td>
<td>#imm</td>
<td>Supervisor Call</td>
<td>-</td>
<td>3.5.1 on page 87</td>
</tr>
<tr>
<td>SXTAB</td>
<td>(Rd,) Rn, Rm, (,ROR #)</td>
<td>Extend 8 bits to 32 and add</td>
<td>-</td>
<td>3.8.3 on page 140</td>
</tr>
<tr>
<td>SXTAB16</td>
<td>(Rd,) Rn, Rm, (,ROR #)</td>
<td>Dual extend 8 bits to 16 and add</td>
<td>-</td>
<td>3.8.3 on page 140</td>
</tr>
<tr>
<td>SXTAH</td>
<td>(Rd,) Rn, Rm, (,ROR #)</td>
<td>Extend 16 bits to 32 and add</td>
<td>-</td>
<td>3.8.3 on page 140</td>
</tr>
<tr>
<td>SXTB16</td>
<td>(Rd,) Rm (,ROR #n)</td>
<td>Signed Extend Byte 16</td>
<td>-</td>
<td>3.9.3 on page 144</td>
</tr>
<tr>
<td>SXTB</td>
<td>(Rd,) Rm (,ROR #n)</td>
<td>Sign extend a byte</td>
<td>-</td>
<td>3.9.3 on page 144</td>
</tr>
<tr>
<td>SXTH</td>
<td>(Rd,) Rm (,ROR #n)</td>
<td>Sign extend a halfword</td>
<td>-</td>
<td>3.9.3 on page 144</td>
</tr>
<tr>
<td>TBB</td>
<td>[Rn, Rm]</td>
<td>Table Branch Byte</td>
<td>-</td>
<td>3.10.4 on page 150</td>
</tr>
<tr>
<td>TBH</td>
<td>[Rn, Rm, LSL #1]</td>
<td>Table Branch Halfword</td>
<td>-</td>
<td>3.10.4 on page 150</td>
</tr>
<tr>
<td>TEQ</td>
<td>Rn, Op2</td>
<td>Test Equivalence</td>
<td>N,Z,C</td>
<td>3.5.15 on page 102</td>
</tr>
<tr>
<td>TST</td>
<td>Rn, Op2</td>
<td>Test</td>
<td>N,Z,C</td>
<td>3.5.15 on page 102</td>
</tr>
<tr>
<td>UADD16</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Add 16</td>
<td>GE</td>
<td>3.5.16 on page 103</td>
</tr>
<tr>
<td>UADD8</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Add 8</td>
<td>GE</td>
<td>3.5.16 on page 103</td>
</tr>
<tr>
<td>USAX</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Subtract and Add with Exchange</td>
<td>GE</td>
<td>3.5.17 on page 104</td>
</tr>
<tr>
<td>UHADD16</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Halving Add 16</td>
<td>-</td>
<td>3.5.18 on page 105</td>
</tr>
<tr>
<td>UHADD8</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Halving Add 8</td>
<td>-</td>
<td>3.5.18 on page 105</td>
</tr>
<tr>
<td>UHASX</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Halving Add and Subtract with Exchange</td>
<td>-</td>
<td>3.5.19 on page 106</td>
</tr>
<tr>
<td>UHSAX</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Halving Subtract and Add with Exchange</td>
<td>-</td>
<td>3.5.19 on page 106</td>
</tr>
<tr>
<td>UHSUB16</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Halving Subtract 16</td>
<td>-</td>
<td>3.5.20 on page 107</td>
</tr>
<tr>
<td>UHSUB8</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Halving Subtract 8</td>
<td>-</td>
<td>3.5.20 on page 107</td>
</tr>
<tr>
<td>UBFX</td>
<td>Rd, Rn, #lsb, #width</td>
<td>Unsigned Bit Field Extract</td>
<td>-</td>
<td>3.9.2 on page 143</td>
</tr>
<tr>
<td>UDIV</td>
<td>(Rd,) Rn, Rm</td>
<td>Unsigned Divide</td>
<td>-</td>
<td>3.6.12 on page 127</td>
</tr>
<tr>
<td>UMAAL</td>
<td>RdLo, RdHi, Rn, Rm</td>
<td>Unsigned Multiply Accumulate Accumulate Long (32 x 32 + 32 +32), 64-bit result</td>
<td>-</td>
<td>3.6.2 on page 113</td>
</tr>
<tr>
<td>UMLAL</td>
<td>RdLo, RdHi, Rn, Rm</td>
<td>Unsigned Multiply with Accumulate (32 x 32 + 64), 64-bit result</td>
<td>-</td>
<td>3.6.2 on page 113</td>
</tr>
<tr>
<td>UMULL</td>
<td>RdLo, RdHi, Rn, Rm</td>
<td>Unsigned Multiply (32 x 32), 64-bit result</td>
<td>-</td>
<td>3.6.2 on page 113</td>
</tr>
<tr>
<td>Mnemonic</td>
<td>Operands</td>
<td>Brief description</td>
<td>Flags</td>
<td>Page</td>
</tr>
<tr>
<td>-------------------</td>
<td>----------</td>
<td>-------------------------------------------------------</td>
<td>-------</td>
<td>------------</td>
</tr>
<tr>
<td>UQADD16</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Saturating Add 16</td>
<td>-</td>
<td>3.7.7 on page 136</td>
</tr>
<tr>
<td>UQADD8</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Saturating Add 8</td>
<td>-</td>
<td>3.7.7 on page 136</td>
</tr>
<tr>
<td>UQASX</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Saturating Add and Subtract with Exchange</td>
<td>-</td>
<td>3.7.6 on page 134</td>
</tr>
<tr>
<td>UQASAX</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Saturating Subtract and Add with Exchange</td>
<td>-</td>
<td>3.7.6 on page 134</td>
</tr>
<tr>
<td>UQSUB16</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Saturating Subtract 16</td>
<td>-</td>
<td>3.7.7 on page 136</td>
</tr>
<tr>
<td>UQSUB8</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Saturating Subtract 8</td>
<td>-</td>
<td>3.7.7 on page 136</td>
</tr>
<tr>
<td>USAD8</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Sum of Absolute Differences</td>
<td>-</td>
<td>3.5.22 on page 108</td>
</tr>
<tr>
<td>USADA8</td>
<td>(Rd, Rn, Rm, Ra)</td>
<td>Unsigned Sum of Absolute Differences and Accumulate</td>
<td>-</td>
<td>3.5.23 on page 109</td>
</tr>
<tr>
<td>USAT</td>
<td>Rd, #n, Rm, (,shift #s)</td>
<td>Unsigned Saturate</td>
<td>Q</td>
<td>3.7.1 on page 129</td>
</tr>
<tr>
<td>USAT16</td>
<td>Rd, #n, Rm</td>
<td>Unsigned Saturate 16</td>
<td>Q</td>
<td>3.7.2 on page 130</td>
</tr>
<tr>
<td>UASX</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Add and Subtract with Exchange</td>
<td>GE</td>
<td>3.5.17 on page 104</td>
</tr>
<tr>
<td>USUB16</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Subtract 16</td>
<td>GE</td>
<td>3.5.24 on page 110</td>
</tr>
<tr>
<td>USUB8</td>
<td>(Rd, Rn, Rm)</td>
<td>Unsigned Subtract 8</td>
<td>GE</td>
<td>3.5.24 on page 110</td>
</tr>
<tr>
<td>UXTAB</td>
<td>(Rd, Rn, Rm, (,ROR #)</td>
<td>Rotate, extend 8 bits to 32 and Add</td>
<td>-</td>
<td>3.8.3 on page 140</td>
</tr>
<tr>
<td>UXTAB16</td>
<td>(Rd, Rn, Rm, (,ROR #)</td>
<td>Rotate, dual extend 8 bits to 16 and Add</td>
<td>-</td>
<td>3.8.3 on page 140</td>
</tr>
<tr>
<td>UXTAH</td>
<td>(Rd, Rn, Rm, (,ROR #)</td>
<td>Rotate, unsigned extend and Add Halfword</td>
<td>-</td>
<td>3.8.3 on page 140</td>
</tr>
<tr>
<td>UXTB</td>
<td>(Rd, Rm, (,ROR #n)</td>
<td>Zero extend a Byte</td>
<td>-</td>
<td>3.9.3 on page 144</td>
</tr>
<tr>
<td>UXTB16</td>
<td>(Rd, Rm, (,ROR #n)</td>
<td>Unsigned Extend Byte 16</td>
<td>-</td>
<td>3.9.3 on page 144</td>
</tr>
<tr>
<td>UXTH</td>
<td>(Rd, Rm, (,ROR #n)</td>
<td>Zero extend a Halfword</td>
<td>-</td>
<td>3.9.3 on page 144</td>
</tr>
<tr>
<td>VABS.F&lt;32</td>
<td>64&gt;</td>
<td>&lt;Sd</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VADD.F&lt;32</td>
<td>64&gt;</td>
<td>(&lt;Sd</td>
<td>Dd&gt;, ) &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
</tr>
<tr>
<td>VCMP.F&lt;32</td>
<td>64&gt;</td>
<td>&lt;Sd</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
</tbody>
</table>
Table 23. Cortex®-M7 instructions (continued)

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Operands</th>
<th>Brief description</th>
<th>Flags</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>VCMPE.F&lt;32</td>
<td>Dd&gt;, &lt;&lt;Sm</td>
<td>Dm&gt;</td>
<td>Compare two floating-point registers, or one floating-point register and zero with Invalid Operation check</td>
<td>FPSCR</td>
</tr>
<tr>
<td>VVCVT</td>
<td>F&lt;32</td>
<td>64&gt;,&lt;S</td>
<td>U&gt;, &lt;16</td>
<td>32&gt;</td>
</tr>
<tr>
<td>VVCVT</td>
<td>&lt;S</td>
<td>U&gt;, &lt;16</td>
<td>32&gt;.F&lt;32</td>
<td>64&gt;</td>
</tr>
<tr>
<td>VVCVT.S32.F&lt;32</td>
<td>64&gt;</td>
<td>&lt;&lt;Sd</td>
<td>Dd&gt;, &lt;&lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VVCVT.&lt;B</td>
<td>T&gt;.F&lt;32</td>
<td>64&gt;.F16</td>
<td>&lt;&lt;Sd</td>
<td>Dd&gt;, &lt;Sm&gt;</td>
</tr>
<tr>
<td>VVCVT.A.F&lt;32</td>
<td>64&gt;</td>
<td>&lt;&lt;Sd</td>
<td>Dd&gt;, &lt;&lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VVCVTM.F&lt;32</td>
<td>64&gt;</td>
<td>&lt;&lt;Sd</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VVCVTN.F&lt;32</td>
<td>64&gt;</td>
<td>&lt;&lt;Sd</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VVCVT.P.F&lt;32</td>
<td>64&gt;</td>
<td>&lt;&lt;Sd</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VVCVT.R.S32.F&lt;32</td>
<td>64&gt;</td>
<td>&lt;&lt;Sd</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VVCVT.&lt;B</td>
<td>T&gt;.F16,F&lt;32</td>
<td>64&gt;</td>
<td>Sd, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VDIV.F&lt;32</td>
<td>64&gt;</td>
<td>{&lt;&lt;Sd</td>
<td>Dd&gt;,} &lt;Sn</td>
<td>Dm&gt;, &lt;Sm</td>
</tr>
<tr>
<td>VFMA.F&lt;32</td>
<td>64&gt;</td>
<td>{&lt;&lt;Sd</td>
<td>Dd&gt;,} &lt;Sn</td>
<td>Dm&gt;, &lt;Sm</td>
</tr>
<tr>
<td>VFMS.F&lt;32</td>
<td>64&gt;</td>
<td>{&lt;&lt;Sd</td>
<td>Dd&gt;,} &lt;Sn</td>
<td>Dm&gt;, &lt;Sm</td>
</tr>
<tr>
<td>VFNMA.F&lt;32</td>
<td>64&gt;</td>
<td>{&lt;&lt;Sd</td>
<td>Dd&gt;,} &lt;Sn</td>
<td>Dm&gt;, &lt;Sm</td>
</tr>
<tr>
<td>VFNMS.F&lt;32</td>
<td>64&gt;</td>
<td>{&lt;&lt;Sd</td>
<td>Dd&gt;,} &lt;Sn</td>
<td>Dm&gt;, &lt;Sm</td>
</tr>
<tr>
<td>Mnemonic</td>
<td>Operands</td>
<td>Brief description</td>
<td>Flags</td>
<td>Page</td>
</tr>
<tr>
<td>----------</td>
<td>----------</td>
<td>------------------</td>
<td>-------</td>
<td>------</td>
</tr>
<tr>
<td>VLDM.F&lt;32</td>
<td>Rn(!), list</td>
<td>Load Multiple extension registers</td>
<td>-</td>
<td>3.11.10 on page 159</td>
</tr>
<tr>
<td>VLDR.F&lt;32</td>
<td>&lt;Sd</td>
<td>Dd&gt;, [&lt;Rn&gt;{, #+/--&lt;imm&gt;}]</td>
<td>Load an extension register from memory</td>
<td>-</td>
</tr>
<tr>
<td>VLDR.F&lt;32</td>
<td>&lt;Sd</td>
<td>Dd&gt;, &lt;label&gt;</td>
<td>Load an extension register from memory</td>
<td>-</td>
</tr>
<tr>
<td>VLDR.F&lt;32</td>
<td>&lt;Sd</td>
<td>Dd&gt;, [PC,#-0]</td>
<td>Load an extension register from memory</td>
<td>-</td>
</tr>
<tr>
<td>VMLA.F&lt;32</td>
<td>(&lt;Sd</td>
<td>Dd&gt;,) &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VMLS.F&lt;32</td>
<td>(&lt;Sd</td>
<td>Dd&gt;,) &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VMAXNM.F&lt;32</td>
<td>&lt;Sd</td>
<td>Dd&gt;, &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VMINNM.F&lt;32</td>
<td>&lt;Sd</td>
<td>Dd&gt;, &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VMOV</td>
<td>&lt;Sd</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Floating-point Move register</td>
</tr>
<tr>
<td>VMOV</td>
<td>&lt;Sn</td>
<td>Dn&gt;, Rt</td>
<td>Copy Arm core register to single-precision</td>
<td>-</td>
</tr>
<tr>
<td>VMOV</td>
<td>&lt;Sm</td>
<td>Dm&gt;, &lt;Sm</td>
<td>Dm1&gt;, Rt, Rt2</td>
<td>Copy two Arm core registers to two single-precision</td>
</tr>
<tr>
<td>VMOV</td>
<td>Dd[x], Rt</td>
<td>Copy Arm core register to scalar</td>
<td>-</td>
<td>3.11.19 on page 165</td>
</tr>
<tr>
<td>VMOV</td>
<td>Rt, Dn[x]</td>
<td>Copy scalar to Arm core register</td>
<td>-</td>
<td>3.11.19 on page 165</td>
</tr>
<tr>
<td>VMOV.F&lt;32</td>
<td>&lt;Sd</td>
<td>Dd&gt;, #imm</td>
<td>Floating-point Move immediate</td>
<td>-</td>
</tr>
<tr>
<td>VMUL.F&lt;32</td>
<td>(&lt;Sd</td>
<td>Dd&gt;,) &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VNEG.F&lt;32</td>
<td>&lt;Sd</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Floating-point Negate</td>
</tr>
<tr>
<td>VNMLA.F&lt;32</td>
<td>&lt;Sd</td>
<td>Dd&gt;, &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VNMLS.F&lt;32</td>
<td>&lt;Sd</td>
<td>Dd&gt;, &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
<tr>
<td>VNMULT.F&lt;32</td>
<td>(&lt;Sd</td>
<td>Dd&gt;,) &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
</tr>
</tbody>
</table>
3.1.1 Binary compatibility with other Cortex processors

The processor implements the Armv7-M instruction set and features provided by the Armv7-M architecture profile, and is binary compatible with the instruction sets and features implemented in other Cortex®-M profile processors. The user cannot move software from the Cortex®-M7 processor to:

- The Cortex®-M3 processor if it contains floating-point operations or DSP extensions.
- The Cortex®-M4 processor if it contains double-precision floating-point operations.
- The Cortex®-M0 or Cortex®-M0+ processors because these are implementations of the Armv6-M Architecture.

The code designed for other Cortex®-M processors is compatible with Cortex®-M7 as long as it does not rely on bit-banding.

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Operands</th>
<th>Brief description</th>
<th>Flags</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>VRINTA.F&lt;32</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Float to integer in floating-point format conversion with directed rounding</td>
<td>-</td>
</tr>
<tr>
<td>VRINTM.F&lt;32</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Float to integer in floating-point format conversion with directed rounding</td>
<td>-</td>
</tr>
<tr>
<td>VRINTN.F&lt;32</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Float to integer in floating-point format conversion with directed rounding</td>
<td>-</td>
</tr>
<tr>
<td>VRINTP.F&lt;32</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Float to integer in floating-point format conversion with directed rounding</td>
<td>-</td>
</tr>
<tr>
<td>VRINTR.F&lt;32</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Float to integer in floating-point format conversion</td>
<td>-</td>
</tr>
<tr>
<td>VRINTX.F&lt;32</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Float to integer in floating-point format conversion</td>
<td>-</td>
</tr>
<tr>
<td>VRINTZ.F&lt;32</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Float to integer in floating-point format conversion</td>
<td>-</td>
</tr>
<tr>
<td>VSEL.F&lt;32</td>
<td>Dd&gt;, &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Select register, alternative to a pair of conditional VMOV</td>
</tr>
<tr>
<td>VSQRT.F&lt;32</td>
<td>Dd&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Calculates floating-point Square Root</td>
<td>-</td>
</tr>
<tr>
<td>VSTR.F&lt;32</td>
<td>Dd&gt;, [Rn]</td>
<td>Stores an extension register to memory</td>
<td>-</td>
<td>3.11.29 on page 170</td>
</tr>
<tr>
<td>VSUB.F&lt;32</td>
<td>Dd&gt;, &lt;Sn</td>
<td>Dn&gt;, &lt;Sm</td>
<td>Dm&gt;</td>
<td>Floating-point Subtract</td>
</tr>
<tr>
<td>WFE</td>
<td>-</td>
<td>Wait For Event</td>
<td>-</td>
<td>3.12.11 on page 181</td>
</tr>
<tr>
<td>WFI</td>
<td>-</td>
<td>Wait For Interrupt</td>
<td>-</td>
<td>3.12.12 on page 182</td>
</tr>
</tbody>
</table>
To ensure a smooth transition, Arm recommends that the code designed to operate on other Cortex®-M profile processor architectures obey the following rules and that the Configuration and Control Register (CCR) be appropriately configured:

- Use word transfers only to access registers in the NVIC and System Control Space (SCS).
- Treat all unused SCS registers and register fields on the processor as Do-Not-Modify.
- Configure the following fields in the CCR register:
  - STKALIGN bit to 1.
  - UNALIGN_TRP bit to 1.
  - Leave all other bits in the CCR register at their original value.

3.2 CMSIS functions

ISO/IEC C code cannot directly access some Cortex®-M7 processor instructions. This section describes intrinsic functions that can generate these instructions, provided by the CMSIS and that might be provided by a C compiler. If a C compiler does not support an appropriate intrinsic function, the user might have to use inline assembler to access some instructions.

The CMSIS provides the following intrinsic functions to generate instructions that ISO/IEC C code cannot directly access:

<table>
<thead>
<tr>
<th>Instruction</th>
<th>CMSIS function</th>
</tr>
</thead>
<tbody>
<tr>
<td>CPSIE I</td>
<td>void __enable_irq(void)</td>
</tr>
<tr>
<td>CPSID I</td>
<td>void __disable_irq(void)</td>
</tr>
<tr>
<td>CPSIE F</td>
<td>void __enable_fault_irq(void)</td>
</tr>
<tr>
<td>CPSID F</td>
<td>void __disable_fault_irq(void)</td>
</tr>
<tr>
<td>ISB</td>
<td>void __ISB(void)</td>
</tr>
<tr>
<td>DSB</td>
<td>void __DSB(void)</td>
</tr>
<tr>
<td>DMB</td>
<td>void __DMB(void)</td>
</tr>
<tr>
<td>REV</td>
<td>uint32_t __REV(uint32_t int value)</td>
</tr>
<tr>
<td>REV16</td>
<td>uint32_t __REV16(uint32_t int value)</td>
</tr>
<tr>
<td>REVSH</td>
<td>uint32_t __REVSH(uint32_t int value)</td>
</tr>
<tr>
<td>RBIT</td>
<td>uint32_t __RBIT(uint32_t int value)</td>
</tr>
<tr>
<td>SEV</td>
<td>void __SEV(void)</td>
</tr>
<tr>
<td>WFE</td>
<td>void __WFE(void)</td>
</tr>
<tr>
<td>WFI</td>
<td>void __WFI(void)</td>
</tr>
</tbody>
</table>

The CMSIS also provides a number of functions for accessing the special registers using MRS and MSR instructions:
3.3 | About the instruction descriptions

The following sections give more information about using the instructions:

- Operands on page 63.
- Restrictions when using PC or SP on page 63.
- Flexible second operand on page 64.
- Shift operations on page 65.
- Address alignment on page 68.
- PC-relative expressions on page 68.
- Conditional execution on page 68.
- Instruction width selection on page 71.

### 3.3.1 Operands

An instruction operand can be an Arm register, a constant, or another instruction-specific parameter. Instructions act on the operands and often store the result in a destination register. When there is a destination register in the instruction, it is usually specified before the operands.

Operands in some instructions are flexible in that they can either be a register or a constant. See Flexible second operand on page 64.

### 3.3.2 Restrictions when using PC or SP

Many instructions have restrictions on whether the user can use the Program Counter (PC) or Stack Pointer (SP) for the operands or destination register. See instruction descriptions for more information.

---

**Table 25. CMSIS functions to access the special registers**

<table>
<thead>
<tr>
<th>Special register</th>
<th>Access</th>
<th>CMSIS function</th>
</tr>
</thead>
<tbody>
<tr>
<td>PRIMASK</td>
<td>Read</td>
<td>uint32_t __get_PRIMASK (void)</td>
</tr>
<tr>
<td></td>
<td>Write</td>
<td>void __set_PRIMASK (uint32_t value)</td>
</tr>
<tr>
<td>FAULTMASK</td>
<td>Read</td>
<td>uint32_t __getFAULTMASK (void)</td>
</tr>
<tr>
<td></td>
<td>Write</td>
<td>void __setFAULTMASK (uint32_t value)</td>
</tr>
<tr>
<td>BASEPRI</td>
<td>Read</td>
<td>uint32_t __getBASEPRI (void)</td>
</tr>
<tr>
<td></td>
<td>Write</td>
<td>void __setBASEPRI (uint32_t value)</td>
</tr>
<tr>
<td>CONTROL</td>
<td>Read</td>
<td>uint32_t __getCONTROL (void)</td>
</tr>
<tr>
<td></td>
<td>Write</td>
<td>void __setCONTROL (uint32_t value)</td>
</tr>
<tr>
<td>MSP</td>
<td>Read</td>
<td>uint32_t __getMSP (void)</td>
</tr>
<tr>
<td></td>
<td>Write</td>
<td>void __setMSP (uint32_t TopOfMainStack)</td>
</tr>
<tr>
<td>PSP</td>
<td>Read</td>
<td>uint32_t __getPSP (void)</td>
</tr>
<tr>
<td></td>
<td>Write</td>
<td>void __setPSP (uint32_t TopOfProcStack)</td>
</tr>
</tbody>
</table>
Note: Bit[0] of any address written to the PC with a BX, BLX, LDM, LDR, or POP instruction must be 1 for correct execution, because this bit indicates the required instruction set, and the Cortex®-M7 processor only supports Thumb instructions.

3.3.3 Flexible second operand

Many general data processing instructions have a flexible second operand. This is shown as Operand2 in the descriptions of the syntax of each instruction.

Operand2 can be a:

- **Constant.**
- **Register with optional shift on page 64.**

**Constant**

Specify an Operand2 constant in the form:

```
#constant
```

where `constant` can be:

- Any constant that can be produced by shifting an 8-bit value left by any number of bits within a 32-bit word.
- Any constant of the form 0x00XY00XY.
- Any constant of the form 0xXY00XY00.
- Any constant of the form 0xXYXYXYXY.

Note: In these constants, X and Y are hexadecimal digits.

In addition, in a small number of instructions, `constant` can take a wider range of values. These are described in the individual instruction descriptions.

When an Operand2 constant is used with the instructions MOVs, MVNS, ANDs, ORRs, ORNS, EORS, BICS, TEQ or TST, the carry flag is updated to bit[31] of the constant, if the constant is greater than 255 and can be produced by shifting an 8-bit value. These instructions do not affect the carry flag if Operand2 is any other constant.

**Instruction substitution**

The assembler might be able to produce an equivalent instruction if an unpermitted constant is specified. For example, the instruction CMP Rd, #0xFFFFFFFE might be assembled as the equivalent of instruction CMN Rd, #0x2.

**Register with optional shift**

Specify an Operand2 register in the form:

```
Rm {, shift}
```

Where:

- `Rm` Is the register holding the data for the second operand.
- `shift` Is an optional shift to be applied to `Rm`. It can be one of:
  
  - **ASR #n** Arithmetic shift right n bits, 1 ≤ n ≤ 32.
  - **LSL #n** Logical shift left n bits, 1 ≤ n ≤ 31.
  - **LSR #n** Logical shift right n bits, 1 ≤ n ≤ 32.
ROR #n  Rotate right n bits, $1 \leq n \leq 31$.
RRX  Rotate right one bit, with extend.
- If omitted, no shift occurs, equivalent to LSL #0.

If the shift is omitted or LSL #0 specified, the instruction uses the value in Rm.

If the shift is specified, it is applied to the value in Rm, and the resulting 32-bit value is used by the instruction. However, the contents in the register Rm remain unchanged. Specifying a register with shift also updates the carry flag when used with certain instructions. For information on the shift operations and how they affect the carry flag, see Shift operations.

3.3.4 Shift operations

Register shift operations move the bits in a register left or right by a specified number of bits, the shift length. Register shift can be performed:

- Directly by the instructions ASR, LSR, LSL, ROR, and RRX, and the result is written to a destination register.
- During the calculation of Operand2 by the instructions that specify the second operand as a register with shift, see Flexible second operand on page 64. The result is used by the instruction.

The permitted shift lengths depend on the shift type and the instruction, see the individual instruction description or Flexible second operand on page 64. If the shift length is 0, no shift occurs. Register shift operations update the carry flag except when the specified shift length is 0. The following sub-sections describe the various shift operations and how they affect the carry flag. In these descriptions, Rm is the register containing the value to be shifted, and n is the shift length.

ASR

Arithmetic shift right by n bits moves the left-hand 32-n bits of the register Rm, to the right by n places, into the right-hand 32-n bits of the result. And it copies the original bit[31] of the register into the left-hand n bits of the result. See Figure 12 on page 66.

The ASR #n operation can be used to divide the value in the register Rm by $2^n$, with the result being rounded towards negative-infinity.

When the instruction is ASRS or when ASR #n is used in Operand2 with the instructions MOVX, MVNS, ANDS, ORRS, ORNS, EORS, BICS, TEQ or TST, the carry flag is updated to the last bit shifted out, bit[$n-1$], of the register Rm.

- If n is 32 or more, then all the bits in the result are set to the value of bit[31] of Rm.
- If n is 32 or more and the carry flag is updated, it is updated to the value of bit[31] of Rm.
LSR

Logical shift right by \(n\) bits moves the left-hand 32-\(n\) bits of the register \(Rm\), to the right by \(n\) places, into the right-hand 32-\(n\) bits of the result. And it sets the left-hand \(n\) bits of the result to 0. See Figure 13.

The LSR \#n operation can be used to divide the value in the register \(Rm\) by \(2^n\), if the value is regarded as an unsigned integer.

When the instruction is LSRS or when LSR \#n is used in Operand2 with the instructions MOVs, MVNS, ANDS, ORRS, ORNS, EORS, BICS, TEQ or TST, the carry flag is updated to the last bit shifted out, bit[\(n-1\)], of the register \(Rm\).

- If \(n\) is 32 or more, then all the bits in the result are cleared to 0.
- If \(n\) is 33 or more and the carry flag is updated, it is updated to 0.

LSL

Logical shift left by \(n\) bits moves the right-hand 32-\(n\) bits of the register \(Rm\), to the left by \(n\) places, into the left-hand 32-\(n\) bits of the result. And it sets the right-hand \(n\) bits of the result to 0. See Figure 14 on page 67.

The user can use the LSL \#n operation to multiply the value in the register \(Rm\) by \(2^n\), if the value is regarded as an unsigned integer or a two's complement signed integer. Overflow can occur without warning.

When the instruction is LSLS or when LSL \#n, with non-zero \(n\), is used in Operand2 with the instructions MOVs, MVNS, ANDS, ORRS, ORNS, EORS, BICS, TEQ or TST, the carry flag is updated to the last bit shifted out, bit[32-\(n\)], of the register \(Rm\). These instructions do not affect the carry flag when used with LSL \#0.

- If \(n\) is 32 or more, then all the bits in the result are cleared to 0.
- If \(n\) is 33 or more and the carry flag is updated, it is updated to 0.
ROR

Rotate right by \(n\) bits moves the left-hand \(32-n\) bits of the register \(Rm\), to the right by \(n\) places, into the right-hand \(32-n\) bits of the result. And it moves the right-hand \(n\) bits of the register into the left-hand \(n\) bits of the result. See Figure 15.

When the instruction is RORS or when ROR \#n\) is used in Operand2 with the instructions MOVs, MVNS, ANDs, ORRs, ORNS, EORS, BICS, TEQ or TST, the carry flag is updated to the last bit rotation, bit[\(n-1\)], of the register \(Rm\).

- If \(n\) is 32, then the value of the result is same as the value in \(Rm\), and if the carry flag is updated, it is updated to bit[31] of \(Rm\).
- ROR with shift length, \(n\), more than 32 is the same as ROR with shift length \(n-32\).

RRX

Rotate right with extend moves the bits of the register \(Rm\) to the right by one bit. And it copies the carry flag into bit[31] of the result. See Figure 16 on page 67.

When the instruction is RRXS or when RRX is used in Operand2 with the instructions MOVs, MVNS, ANDs, ORRs, ORNS, EORS, BICS, TEQ or TST, the carry flag is updated to bit[0] of the register \(Rm\).
3.3.5 Address alignment

An aligned access is an operation where a word-aligned address is used for a word, dual word, or multiple word access, or where a halfword-aligned address is used for a halfword access. Byte accesses are always aligned.

The Cortex®-M7 processor supports unaligned access only for the following instructions:

- LDR, LDRT.
- LDRH, LDRHT.
- LDRSH, LDRSHT.
- STR, STRT.
- STRH, STRHT.

All other load and store instructions generate a UsageFault exception if they perform an unaligned access, and therefore their accesses must be address aligned. For more information about UsageFaults see Fault handling on page 47.

Unaligned accesses are usually slower than aligned accesses. In addition, some memory regions might not support unaligned accesses. Therefore, Arm recommends that programmers ensure that accesses are aligned. To trap accidental generation of unaligned accesses, use the UNALIGN_TRP bit in the Configuration and Control register, see Configuration and control register on page 200.

3.3.6 PC-relative expressions

A PC-relative expression or label is a symbol that represents the address of an instruction or literal data. It is represented in the instruction as the PC value plus or minus a numeric offset. The assembler calculates the required offset from the label and the address of the current instruction. If the offset is too big, the assembler produces an error.

- For B, BL, CBNZ, and CBZ instructions, the value of the PC is the address of the current instruction plus 4 bytes.
- For all other instructions that use labels, the value of the PC is the address of the current instruction plus 4 bytes, with bit[1] of the result cleared to 0 to make it word-aligned.
- The assembler might permit other syntaxes for PC-relative expressions, such as a label plus or minus a number, or an expression of the form [PC, #number].

3.3.7 Conditional execution

Most data processing instructions can optionally update the condition flags in the Application Program Status register (APSR) according to the result of the operation, see Application program status register on page 23. Some instructions update all flags, and some only update a subset. If a flag is not updated, the original value is preserved. See the instruction descriptions for the flags they affect.

The user can execute an instruction conditionally, based on the condition flags set in another instruction, either:

- Immediately after the instruction that updated the flags.
- After any number of intervening instructions that have not updated the flags.

Conditional execution is available by using conditional branches or by adding condition code suffixes to instructions. See Table 26 on page 70 for a list of the suffixes to add to
instructions to make them conditional instructions. The condition code suffix enables the processor to test a condition based on the flags. If the condition test of a conditional instruction fails, the instruction:

- Does not execute.
- Does not write any value to its destination register.
- Does not affect any of the flags.
- Does not generate any exception.

Conditional instructions, except for conditional branches, must be inside an if-then instruction block. See IT on page 148 for more information and restrictions when using the IT instruction. Depending on the vendor, the assembler might automatically insert an IT instruction if there are conditional instructions outside the IT block.

Use the CBZ and CBNZ instructions to compare the value of a register against zero and branch on the result.

This section describes:

- The condition flags on page 69.
- Condition code suffixes on page 70.

The condition flags

The APSR contains the following condition flags:

- **N** Set to 1 when the result of the operation was negative, cleared to 0 otherwise.
- **Z** Set to 1 when the result of the operation was zero, cleared to 0 otherwise.
- **C** Set to 1 when the operation resulted in a carry, cleared to 0 otherwise.
- **V** Set to 1 when the operation caused overflow, cleared to 0 otherwise.

For more information about the APSR see Program status register on page 22.

The C condition flag is set in one of four ways:

- For an addition, including the comparison instruction CMN, C is set to 1 if the addition produced a carry (that is, an unsigned overflow), and to 0 otherwise.
- For a subtraction, including the comparison instruction CMP, C is set to 0 if the subtraction produced a borrow (that is, an unsigned underflow), and to 1 otherwise.
- For non-addition or subtractions that incorporate a shift operation, C is set to the last bit shifted out of the value by the shifter.
- For other non-addition or subtractions, C is normally left unchanged. See the individual instruction descriptions for any special cases.

Overflow occurs when the sign of the result, in bit[31], does not match the sign of the result had the operation been performed at infinite precision, for example:

- If adding two negative values results in a positive value.
- If adding two positive values results in a negative value.
- If subtracting a positive value from a negative value generates a positive value.
- If subtracting a negative value from a positive value generates a negative value.

The Compare operations are identical to subtracting, for CMP, or adding, for CMN, except that the result is discarded. See the instruction descriptions for more information.

Most instructions update the status flags only if the S suffix is specified. See the instruction descriptions for more information.
Condition code suffixes

The instructions that can be conditional have an optional condition code, shown in syntax descriptions as `{cond}`. Conditional execution requires a preceding IT instruction. An instruction with a condition code is only executed if the condition code flags in the APSR meet the specified condition. Table 26 shows the condition codes to use.

The conditional execution can be used with the IT instruction to reduce the number of branch instructions in the code.

Table 26 also shows the relationship between condition code suffixes and the N, Z, C, and V flags.

Table 26. Condition code suffixes

<table>
<thead>
<tr>
<th>Suffix</th>
<th>Flags</th>
<th>Meaning</th>
</tr>
</thead>
<tbody>
<tr>
<td>EQ</td>
<td>Z = 1</td>
<td>Equal</td>
</tr>
<tr>
<td>NE</td>
<td>Z = 0</td>
<td>Not equal</td>
</tr>
<tr>
<td>CS or HS</td>
<td>C = 1</td>
<td>Higher or same, unsigned</td>
</tr>
<tr>
<td>CC or LO</td>
<td>C = 0</td>
<td>Lower, unsigned</td>
</tr>
<tr>
<td>MI</td>
<td>N = 1</td>
<td>Negative</td>
</tr>
<tr>
<td>PL</td>
<td>N = 0</td>
<td>Positive or zero</td>
</tr>
<tr>
<td>VS</td>
<td>V = 1</td>
<td>Overflow</td>
</tr>
<tr>
<td>VC</td>
<td>V = 0</td>
<td>No overflow</td>
</tr>
<tr>
<td>HI</td>
<td>C = 1 and Z = 0</td>
<td>Higher, unsigned</td>
</tr>
<tr>
<td>LS</td>
<td>C = 0 or Z = 1</td>
<td>Lower or same, unsigned</td>
</tr>
<tr>
<td>GE</td>
<td>N = V</td>
<td>Greater than or equal, signed</td>
</tr>
<tr>
<td>LT</td>
<td>N != V</td>
<td>Less than, signed</td>
</tr>
<tr>
<td>GT</td>
<td>Z = 0 and N = V</td>
<td>Greater than, signed</td>
</tr>
<tr>
<td>LE</td>
<td>Z = 1 and N != V</td>
<td>Less than or equal, signed</td>
</tr>
<tr>
<td>AL</td>
<td>Can have any value</td>
<td>Always. This is the default when no suffix is specified.</td>
</tr>
</tbody>
</table>

Example 3-1: absolute value shows the use of a conditional instruction to find the absolute value of a number. R0 = abs(R1).

Example 3-1: absolute value

```
MOVS    R0, R1 ; R0 = R1, setting flags.
IT      MI ; Skipping next instruction if value 0 or positive.
RSBMI   R0, R0, #0 ; If negative, R0 = -R0.
```
Example 3-2: compare and update value shows the use of conditional instructions to update the value of R4 if the signed values R0 is greater than R1 and R2 is greater than R3.

Example 3-2: compare and update value

```
CMP   R0, R1       ; Compare R0 and R1, setting flags.
ITT   GT           ; Skip next two instructions unless GT condition holds.
CMPT   R2, R3       ; If 'greater than', compare R2 and R3, setting flags.
MOVGT R4, R5       ; If still 'greater than', do R4 = R5.
```

3.3.8 Instruction width selection

There are many instructions that can generate either a 16-bit encoding or a 32-bit encoding depending on the operands and destination register specified. For some of these instructions, a specific instruction size can be forced by using an instruction width suffix. The .W suffix forces a 32-bit instruction encoding. The .N suffix forces a 16-bit instruction encoding.

If an instruction width suffix is specified and the assembler cannot generate an instruction encoding of the requested width, it generates an error.

In some cases it might be necessary to specify the .W suffix, for example if the operand is the label of an instruction or literal data, as in the case of branch instructions. This is because the assembler might not automatically generate the right size encoding.

To use an instruction width suffix, place it immediately after the instruction mnemonic and condition code, if any. Example 3-3: instruction width selection shows instructions with the instruction width suffix.

Example 3-3: instruction width selection

```
BCSW  label       ; Creates a 32-bit instruction even for a short branch.
ADDSW R0, R0, R1  ; Creates a 32-bit instruction even though the same operation can be done by a 16-bit instruction.
```
### Memory access instructions

*Table 27* shows the memory access instructions:

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADR</td>
<td>Generate PC-relative address</td>
<td>ADR on page 73</td>
</tr>
<tr>
<td>CLREX</td>
<td>Clear Exclusive</td>
<td>CLREX on page 84</td>
</tr>
<tr>
<td>LDM(mode)</td>
<td>Load Multiple registers</td>
<td>LDM and STM on page 79</td>
</tr>
<tr>
<td>LDR(type)</td>
<td>Load register using immediate offset</td>
<td>LDR and STR, immediate offset on page 73</td>
</tr>
<tr>
<td>LDR(type)</td>
<td>Load register using register offset</td>
<td>LDR and STR, register offset on page 76</td>
</tr>
<tr>
<td>LDR(type)T</td>
<td>Load register with unprivileged access</td>
<td>LDR and STR, unprivileged on page 77</td>
</tr>
<tr>
<td>LDR</td>
<td>Load register using PC-relative address</td>
<td>LDR, PC-relative on page 78</td>
</tr>
<tr>
<td>LDRD</td>
<td>Load register Dual</td>
<td>LDR and STR, immediate offset on page 73</td>
</tr>
<tr>
<td>LDREX(type)</td>
<td>Load register Exclusive</td>
<td>LDREX and STREX on page 83</td>
</tr>
<tr>
<td>PLD</td>
<td>Preload Data.</td>
<td>PLD on page 81</td>
</tr>
<tr>
<td>POP</td>
<td>Pop registers from stack</td>
<td>PUSH and POP on page 82</td>
</tr>
<tr>
<td>PUSH</td>
<td>Push registers onto stack</td>
<td>PUSH and POP on page 82</td>
</tr>
<tr>
<td>STM(mode)</td>
<td>Store Multiple registers</td>
<td>LDM and STM on page 79</td>
</tr>
<tr>
<td>STR(type)</td>
<td>Store register using immediate offset</td>
<td>LDR and STR, immediate offset on page 73</td>
</tr>
<tr>
<td>STR(type)</td>
<td>Store register using register offset</td>
<td>LDR and STR, register offset on page 76</td>
</tr>
<tr>
<td>STR(type)T</td>
<td>Store register with unprivileged access</td>
<td>LDR and STR, unprivileged on page 77</td>
</tr>
<tr>
<td>STREX(type)</td>
<td>Store register Exclusive</td>
<td>LDREX and STREX on page 83</td>
</tr>
</tbody>
</table>
3.4.1 ADR

Generate PC-relative address.

Syntax

ADR{cond} Rd, label

Where:
cond Is an optional condition code. See Conditional execution on page 68.
Rd Is the destination register.
label Is a PC-relative expression. See PC-relative expressions on page 68.

Operation

ADR generates an address by adding an immediate value to the PC, and writes the result to the destination register.

ADR provides the means by which position-independent code can be generated, because the address is PC-relative.

If ADR is used to generate a target address for a BX or BLX instruction, The user must ensure that bit[0] of the address generated is set to 1 for correct execution.

Values of label must be within the range of -4095 to +4095 from the address in the PC.

The user might have to use the .W suffix to get the maximum offset range or to generate addresses that are not word-aligned. See Instruction width selection on page 71.

Restrictions

Rd must not be SP and must not be PC.

Condition flags

This instruction does not change the flags.

Examples

ADR R1, TextMessage ; Write address value of a location labelled ; as TextMessage to R1.

3.4.2 LDR and STR, immediate offset

Load and Store with immediate offset, pre-indexed immediate offset, or post-indexed immediate offset.

Syntax

\[
\begin{align*}
 & \text{op(type){cond} } \text{Rt, } [\text{Rn } , \ #\text{offset}] & \text{; immediate offset} \\
 & \text{op(type){cond} } \text{Rt, } [\text{Rn}, \ #\text{offset}]! & \text{; pre-indexed} \\
 & \text{op(type){cond} } \text{Rt, } [\text{Rn}], \ #\text{offset} & \text{; post-indexed} \\
 & \text{opD{cond} } \text{Rt, } \text{Rt2, } [\text{Rn } , \ #\text{offset}] & \text{; immediate offset, two words} \\
 & \text{opD{cond} } \text{Rt, } \text{Rt2, } [\text{Rn}, \ #\text{offset}]! & \text{; pre-indexed, two words} \\
 & \text{opD{cond} } \text{Rt, } \text{Rt2, } [\text{Rn}], \ #\text{offset} & \text{; post-indexed, two words}
\end{align*}
\]
Where:

- **op** Is one of:
  - **LDR** Load register.
  - **STR** Store register.

- **type** Is one of:
  - **B** Unsigned byte, zero extend to 32 bits on loads.
  - **SB** Signed byte, sign extend to 32 bits (**LDR** only).
  - **H** Unsigned halfword, zero extend to 32 bits on loads.
  - **SH** Signed halfword, sign extend to 32 bits (**LDR** only).
  - **-** Omit, for word.

- **cond** Is an optional condition code. See *Conditional execution on page 68*.

- **Rt** Is the register to load or store.

- **Rn** Is the register on which the memory address is based.

- **offset** Is an offset from **Rn**. If **offset** is omitted, the address is the contents of **Rn**.

- **Rt2** Is the additional register to load or store for two-word operations.

**Operation**

LDR instructions load one or two registers with a value from memory.

STR instructions store one or two register values to memory.

Load and store instructions with immediate offset can use the following addressing modes:

**Offset addressing**

The offset value is added to or subtracted from the address obtained from the register **Rn**. The result is used as the address for the memory access. The register **Rn** is unaltered. The assembly language syntax for this mode is:

```
[Rn, #offset]
```

**Pre-indexed addressing**

The offset value is added to or subtracted from the address obtained from the register **Rn**. The result is used as the address for the memory access and written back into the register **Rn**. The assembly language syntax for this mode is:

```
[Rn, #offset]!
```

**Post-indexed addressing**

The address obtained from the register **Rn** is used as the address for the memory access. The offset value is added to or subtracted from the address, and written back into the register **Rn**. The assembly language syntax for this mode is:

```
[Rn], #offset
```

The value to load or store can be a byte, halfword, word, or two words. Bytes and halfwords can either be signed or unsigned. See *Address alignment on page 68*.

*Table 28* shows the ranges of offset for immediate, pre-indexed and post-indexed forms.
Restrictions

For load instructions:
- \(Rt\) can be SP or PC for word loads only.
- \(Rt\) must be different from \(Rt2\) for two-word loads.
- \(Rn\) must be different from \(Rt\) and \(Rt2\) in the pre-indexed or post-indexed forms.

When \(Rt\) is PC in a word load instruction:
- Bit[0] of the loaded value must be 1 for correct execution.
- A branch occurs to the address created by changing bit[0] of the loaded value to 0.
- If the instruction is conditional, it must be the last instruction in the IT block.

For store instructions:
- \(Rt\) can be SP for word stores only.
- \(Rt\) must not be PC.
- \(Rn\) must not be PC.
- \(Rn\) must be different from \(Rt\) and \(Rt2\) in the pre-indexed or post-indexed forms.

Condition flags

These instructions do not change the flags.

Examples

```
LDR R8, [R10] ; Loads R8 from the address in R10.
LDRNE R2, [R5, #960]! ; Loads (conditionally) R2 from a word
                     ; 960 bytes above the address in R5,
                     ; and increments R5 by 960.
STR R2, [R9,#const-struc] ; const-struc is an expression
                          ; evaluating to a constant in the range
                          ; 0-4095.
STRH R3, [R4], #4 ; Store R3 as halfword data into
                   ; address in R4, then increment R4 by
                   ; 4.
LDRD R8, R9, [R3, #0x20] ; Load R8 from a word 32 bytes above
                        ; the address in R3, and load R9 from a
                        ; word 36 bytes above the address in
                        ; R3.
STRD R0, R1, [R8], #-16 ; Store R0 to address in R8, and store
                       ; R1 to a word 4 bytes above the
                       ; address in R8, and then decrement R8
                       ; by 16.
```
3.4.3 LDR and STR, register offset

Load and Store with register offset.

Syntax

\[ \text{op}\{\text{type}\}\{\text{cond}\} \ R_t, \ [R_n, \ R_m \ (, \ LSL \ #n)] \]

Where:

- \( \text{op} \) is one of:
  - \( \text{LDR} \): Load register.
  - \( \text{STR} \): Store register.

- \( \text{type} \) is one of:
  - \( \text{B} \): Unsigned byte, zero extend to 32 bits on loads.
  - \( \text{SB} \): Signed byte, sign extend to 32 bits (LDR only).
  - \( \text{H} \): Unsigned halfword, zero extend to 32 bits on loads.
  - \( \text{SH} \): Signed halfword, sign extend to 32 bits (LDR only).
  - \( \text{A} \): Omit, for word.

- \( \text{cond} \) is an optional condition code. See Conditional execution on page 68.

- \( R_t \) is the register to load or store.

- \( R_n \) is the register on which the memory address is based.

- \( R_m \) is a register containing a value to be used as the offset.

- \( LSL \ #n \) is an optional shift, with \( n \) in the range 0 to 3.

Operation

LDR instructions load a register with a value from memory.

STR instructions store a register value into memory.

The memory address to load from or store to is at an offset from the register \( R_n \). The offset is specified by the register \( R_m \) and can be shifted left by up to 3 bits using LSL.

The value to load or store can be a byte, halfword, or word. For load instructions, bytes and halfwords can either be signed or unsigned. See Address alignment on page 68.

Restrictions

In these instructions:

- \( R_n \) must not be PC.
- \( R_m \) must not be SP and must not be PC.
- \( R_t \) can be SP only for word loads and word stores.
- \( R_t \) can be PC only for word loads.

When \( R_t \) is PC in a word load instruction:

- Bit[0] of the loaded value must be 1 for correct execution, and a branch occurs to this halfword-aligned address.
- If the instruction is conditional, it must be the last instruction in the IT block.

Condition flags

These instructions do not change the flags.
Examples

```
STR   R0, [R5, R1] ; Store value of R0 into an address equal to sum of R5 and R1.
LDRSB R0, [R5, R1, LSL #1] ; Read byte value from an address equal to; sum of R5 and two times R1, sign; extended it to a word value and put it, in R0.
STR   R0, [R1, R2, LSL #2] ; Stores R0 to an address equal to sum of; R1 and four times R2.
```

3.4.4 LDR and STR, unprivileged

Load and Store with unprivileged access.

Syntax

```
op(type)T(cond) Rt, [Rn {, #offset}] ; immediate offset
```

Where:

- **op** is one of:
  - LDR: Load register.
  - STR: Store register.

- **type** is one of:
  - B: Unsigned byte, zero extend to 32 bits on loads.
  - SB: Signed byte, sign extend to 32 bits (LDR only).
  - H: Unsigned halfword, zero extend to 32 bits on loads.
  - SH: Signed halfword, sign extend to 32 bits (LDR only).
  - -: Omit, for word.

- **cond** is an optional condition code. See Conditional execution on page 68.

- **Rt** is the register to load or store.

- **Rn** is the register on which the memory address is based.

- **offset** is an offset from Rn and can be 0 to 255. If offset is omitted, the address is the value in Rn.

Operation

These load and store instructions perform the same function as the memory access instructions with immediate offset, see LDR and STR, immediate offset on page 73. The difference is that these instructions have only unprivileged access even when used in privileged software.

When used in unprivileged software, these instructions behave in exactly the same way as normal memory access instructions with immediate offset.

Restrictions

In these instructions:

- **Rn** must not be PC.
- **Rt** must not be SP and must not be PC.
Condition flags
These instructions do not change the flags.

Examples

STRBTEQ  R4, [R7] ; Conditionally store least significant byte
                 ; in R4 to an address in R7, with unprivileged
                 ; access.

LDRHT   R2, [R2, #8] ; Load halfword value from an address equal to
                    ; sum of R2 and 8 into R2, with unprivileged
                    ; access.

3.4.5 LDR, PC-relative
Load register from memory.

Syntax

LDR(type)(cond)  Rt, label
LDRD(cond)  Rt, Rt2, label ; Load two words

Where:

<table>
<thead>
<tr>
<th>type</th>
<th>Is one of:</th>
</tr>
</thead>
<tbody>
<tr>
<td>B</td>
<td>Unsigned byte, zero extend to 32 bits.</td>
</tr>
<tr>
<td>SB</td>
<td>Signed byte, sign extend to 32 bits.</td>
</tr>
<tr>
<td>H</td>
<td>Unsigned halfword, zero extend to 32 bits.</td>
</tr>
<tr>
<td>SH</td>
<td>Signed halfword, sign extend to 32 bits.</td>
</tr>
<tr>
<td>-</td>
<td>Omit, for word.</td>
</tr>
</tbody>
</table>

cond   Is an optional condition code. See Conditional execution on page 68.
Rt     Is the register to load or store.
Rt2    Is the second register to load or store.
label Is a PC-relative expression. See PC-relative expressions on page 68.

Operation

LDR loads a register with a value from a PC-relative memory address. The memory address
is specified by a label or by an offset from the PC.

The value to load or store can be a byte, halfword, or word. For load instructions, bytes and
halfwords can either be signed or unsigned. See Address alignment on page 68.

label must be within a limited range of the current instruction. Table 29 shows the possible
offsets between label and the PC.

Table 29. Offset ranges

<table>
<thead>
<tr>
<th>Instruction type</th>
<th>Offset range</th>
</tr>
</thead>
<tbody>
<tr>
<td>Word, halfword, signed halfword, byte, signed byte</td>
<td>-4095 to 4095</td>
</tr>
<tr>
<td>Two words</td>
<td>-1020 to 1020</td>
</tr>
</tbody>
</table>
Note: The user might have to use the .W suffix to get the maximum offset range. See Instruction width selection on page 71.

Restrictions

In these instructions:

- $Rt$ can be SP or PC only for word loads.
- $Rt2$ must not be SP and must not be PC.
- $Rt$ must be different from $Rt2$.

When $Rt$ is PC in a word load instruction:

- Bit[0] of the loaded value must be 1 for correct execution, and a branch occurs to this halfword-aligned address.
- If the instruction is conditional, it must be the last instruction in the IT block.

Condition flags

These instructions do not change the flags.

Examples

```
LDR    R0, LookUpTable ; Load R0 with a word of data from an address labelled as LookUpTable.
LDRSB  R7, locadata    ; Load a byte value from an address labelled as locadata, sign extend it to a word value, and put it in R7.
```

3.4.6 LDM and STM

Load and Store Multiple registers.

Syntax

```
op(addr_mode){cond} Rn(!!), reglist
```

Where:

- $op$ is one of:
  - LDM: Load Multiple registers.
  - STM: Store Multiple registers.

- $addr_mode$ is any one of the following:
  - IA: Increment address After each access. This is the default.
  - DB: Decrement address Before each access.

- $cond$ is an optional condition code. See Conditional execution on page 68.

- $Rn$ is the register on which the memory addresses are based.

- $!!$ is an optional writeback suffix. If $!!$ is present the final address, that is loaded from or stored to, is written back into $Rn$.

- $reglist$ is a list of one or more registers to be loaded or stored, enclosed in braces. It can contain register ranges. It must be comma separated if it contains more than one register or register range, see Examples on page 80.

LDM and LDMFD are synonyms for LDMIA. LDMFD refers to its use for popping data from Full Descending stacks.
LDMEA is a synonym for LDMDB, and refers to its use for popping data from Empty Ascending stacks.

STM and STMEA are synonyms for STMIA. STMEA refers to its use for pushing data onto Empty Ascending stacks.

STMFD is a synonym for STMDB, and refers to its use for pushing data onto Full Descending stacks

**Operation**

LDM instructions load the registers in `reglist` with word values from memory addresses based on `Rn`.

STM instructions store the word values in the registers in `reglist` to memory addresses based on `Rn`.

For LDM, LDMIA, LDMFD, STM, STMIA, and STMEA the memory addresses used for the accesses are at 4-byte intervals ranging from `Rn` to `Rn + 4 * (n-1)`, where `n` is the number of registers in `reglist`. The accesses happen in order of increasing register numbers, with the lowest numbered register using the lowest memory address and the highest number register using the highest memory address. If the writeback suffix is specified, the value of `Rn + 4 * (n-1)` is written back to `Rn`.

For LDMDB, LDMEA, STMDB, and STMFD the memory addresses used for the accesses are at 4-byte intervals ranging from `Rn` to `Rn - 4 * (n-1)`, where `n` is the number of registers in `reglist`. The accesses happen in order of decreasing register numbers, with the highest numbered register using the highest memory address and the lowest number register using the lowest memory address. If the writeback suffix is specified, the value of `Rn - 4 * (n-1)` is written back to `Rn`.

The PUSH and POP instructions can be expressed in this form. See *PUSH and POP on page 82* for details.

**Restrictions**

In these instructions:

- `Rn` must not be PC.
- `reglist` must not contain SP.
- In any STM instruction, `reglist` must not contain PC.
- In any LDM instruction, `reglist` must not contain PC if it contains LR.
- `reglist` must not contain `Rn` if the writeback suffix is specified.

When PC is in `reglist` in an LDM instruction:

- Bit[0] of the value loaded to the PC must be 1 for correct execution, and a branch occurs to this halfword-aligned address
- If the instruction is conditional, it must be the last instruction in the IT block.

**Condition flags**

These instructions do not change the flags.

**Examples**

```
LDM R8,(R0,R2,R9) ; LDMIA is a synonym for LDM.
STMDB R1!,(R3-R6,R11,R12)
```
Incorrect examples

STM  R5!, (R5, R4, R9) ; Value stored for R5 is unpredictable.
LDM  R2, {} ; There must be at least one register in the list.

3.4.7 PLD
Preload Data.

Syntax

PLD [<Rn>, #<imm12>]
PLD [<Rn>, <Rm> {, LSL #<shift>}]   
PLD <label>

where:

< <Rn> >             Is the base register.
< <imm> >            Is the immediate offset used to form the address.
< <Rm> >             Is the optionally shifted offset register.
< <shift> >          Specifies the shift to apply to the value read from <Rm>, in the range 0-3. If this option is omitted, a shift by 0 is assumed.
< <label> >          The label of the literal item that is likely to be accessed in the near future.

Operation

PLD signals the memory system that data memory accesses from a specified address are likely in the near future. If the address is cacheable then the memory system responds by pre-loading the cache line containing the specified address into the data cache. If the address is not cacheable, or the data cache is disabled, this instruction behaves as no operation.

Restrictions

There are no restrictions.

Condition flags

These instructions do not change the flags.
3.4.8    PUSH and POP

Push registers onto, and pop registers off a full-descending stack.

Syntax

PUSH\(\text{cond}\) reglist
POP\(\text{cond}\) reglist

Where:

\(\text{cond}\) Is an optional condition code. See Conditional execution on page 68.

\(\text{reglist}\) Is a non-empty list of registers, enclosed in braces. It can contain register ranges. It must be comma separated if it contains more than one register or register range.

PUSH and POP are synonyms for STMDB and LDM (or LDMIA) with the memory addresses for the access based on SP, and with the final address for the access written back to the SP. PUSH and POP are the preferred mnemonics in these cases.

Operation

PUSH stores registers on the stack, with the lowest numbered register using the lowest memory address and the highest numbered register using the highest memory address.

POP loads registers from the stack, with the lowest numbered register using the lowest memory address and the highest numbered register using the highest memory address.

PUSH uses the value in the SP register minus four as the highest memory address, POP uses the value in the SP register as the lowest memory address, implementing a full-descending stack. On completion, PUSH updates the SP register to point to the location of the lowest store value, POP updates the SP register to point to the location above the highest location loaded.

If a POP instruction includes PC in its reglist, a branch to this location is performed when the POP instruction has completed. Bit[0] of the value read for the PC is used to update the APSR T-bit. This bit must be 1 to ensure correct operation.

See LDM and STM on page 79 for more information.

Restrictions

In these instructions:

- \(\text{reglist}\) must not contain SP.
- For the PUSH instruction, \(\text{reglist}\) must not contain PC.
- For the POP instruction, \(\text{reglist}\) must not contain PC if it contains LR.

When PC is in \(\text{reglist}\) in a POP instruction:

- Bit[0] of the value loaded to the PC must be 1 for correct execution, and a branch occurs to this halfword-aligned address.
- If the instruction is conditional, it must be the last instruction in the IT block.

Condition flags

These instructions do not change the flags.
Examples

PUSH (R0,R4-R7) ; Push R0,R4,R5,R6,R7 onto the stack
PUSH (R2,LR) ; Push R2 and the link-register onto the stack
POP (R0,R6,PC) ; Pop r0,r6 and PC from the stack, then branch to the
; new PC.

3.4.9 LDREX and STREX
Load and Store Register Exclusive.

Syntax

LDREX(cond) Rt, [Rn {, #offset}]
STREX(cond) Rd, Rt, [Rn {, #offset}]
LDREXB(cond) Rt, [Rn]
STREXB(cond) Rd, Rt, [Rn]
LDREXH(cond) Rt, [Rn]
STREXH(cond) Rd, Rt, [Rn]

Where:

cond Is an optional condition code. See Conditional execution on page 68.
Rd Is the destination register for the returned status.
Rt Is the register to load or store.
Rn Is the register on which the memory address is based.
offset Is an optional offset applied to the value in Rn. If offset is omitted, the address is the value in Rn.

Operation

LDREX, LDREXB, and LDREXH load a word, byte, and halfword respectively from a memory address.
STREX, STREXB, and STREXH attempt to store a word, byte, and halfword respectively to a memory address. The address used in any Store-Exclusive instruction must be the same as the address in the most recently executed Load-exclusive instruction. The value stored by the Store-Exclusive instruction must also have the same data size as the value loaded by the preceding Load-exclusive instruction. This means software must always use a Load-exclusive instruction and a matching Store-Exclusive instruction to perform a synchronization operation, see Synchronization primitives on page 37.

If a Store-Exclusive instruction performs the store, it writes 0 to its destination register. If it does not perform the store, it writes 1 to its destination register. If the Store-Exclusive instruction writes 0 to the destination register, it is guaranteed that no other process in the system has accessed the memory location between the Load-exclusive and Store-Exclusive instructions.

For reasons of performance, keep the number of instructions between corresponding Load-Exclusive and Store-Exclusive instruction to a minimum.

The result of executing a Store-Exclusive instruction to an address that is different from that used in the preceding Load-Exclusive instruction is unpredictable.
Restrictions

In these instructions:

- Do not use PC.
- Do not use SP for Rd and Rt.
- For STREX, Rd must be different from both Rt and Rn.
- The value of offset must be a multiple of four in the range 0-1020.

Condition flags

These instructions do not change the flags.

Examples

```
MOV     R1, #0x1 ; Initialize the 'lock taken' value try
LDREX   R0, [LockAddr] ; Load the lock value
CMP     R0, #0 ; Is the lock free?
ITT     EQ ; IT instruction for STREXEQ and CMPEQ
STREXEQ R0, R1, [LockAddr] ; Try and claim the lock
CMPEQ   R0, #0 ; Did this succeed?
BNE     try ; No - try again
.... ; Yes - we have the lock.
```

3.4.10 CLREX

Clear Exclusive.

Syntax

```
CLREX{cond}
```

Where:

- **cond** is an optional condition code. See Conditional execution on page 68.

Operation

Use CLREX to make the next STREX, STREXB, or STREXH instruction write 1 to its
destination register and fail to perform the store. It is useful in exception handler code to
force the failure of the store exclusive if the exception occurs between a load exclusive
instruction and the matching store exclusive instruction in a synchronization operation.

See Synchronization primitives on page 37 for more information.

Condition flags

This instruction does not change the flags.

Examples

```
CLREX
```
### General data processing instructions

Table 30 shows the data processing instructions:

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADC</td>
<td>Add with Carry</td>
<td>ADD, ADC, SUB, SBC, and RSB on page 87</td>
</tr>
<tr>
<td>ADD</td>
<td>Add</td>
<td>ADD, ADC, SUB, SBC, and RSB on page 87</td>
</tr>
<tr>
<td>ADDW</td>
<td>Add</td>
<td>ADD, ADC, SUB, SBC, and RSB on page 87</td>
</tr>
<tr>
<td>AND</td>
<td>Logical AND</td>
<td>AND, ORR, EOR, BIC, and ORN on page 89</td>
</tr>
<tr>
<td>ASR</td>
<td>Arithmetic Shift Right</td>
<td>ASR, LSL, LSR, ROR, and RRX on page 90</td>
</tr>
<tr>
<td>BIC</td>
<td>Bit Clear</td>
<td>AND, ORR, EOR, BIC, and ORN on page 89</td>
</tr>
<tr>
<td>CLZ</td>
<td>Count leading zeros</td>
<td>CLZ on page 91</td>
</tr>
<tr>
<td>CMN</td>
<td>Compare Negative</td>
<td>CMP and CMN on page 92</td>
</tr>
<tr>
<td>CMP</td>
<td>Compare</td>
<td>CMP and CMN on page 92</td>
</tr>
<tr>
<td>EOR</td>
<td>Exclusive OR</td>
<td>AND, ORR, EOR, BIC, and ORN on page 89</td>
</tr>
<tr>
<td>LSL</td>
<td>Logical Shift Left</td>
<td>ASR, LSL, LSR, ROR, and RRX on page 90</td>
</tr>
<tr>
<td>LSR</td>
<td>Logical Shift Right</td>
<td>ASR, LSL, LSR, ROR, and RRX on page 90</td>
</tr>
<tr>
<td>MOV</td>
<td>Move</td>
<td>MOV and MVN on page 93</td>
</tr>
<tr>
<td>MOVT</td>
<td>Move Top</td>
<td>MOVT on page 94</td>
</tr>
<tr>
<td>MOVW</td>
<td>Move 16-bit constant</td>
<td>MOV and MVN on page 93</td>
</tr>
<tr>
<td>MVN</td>
<td>Move NOT</td>
<td>MOV and MVN on page 93</td>
</tr>
<tr>
<td>ORN</td>
<td>Logical OR NOT</td>
<td>AND, ORR, EOR, BIC, and ORN on page 89</td>
</tr>
<tr>
<td>ORR</td>
<td>Logical OR</td>
<td>AND, ORR, EOR, BIC, and ORN on page 89</td>
</tr>
<tr>
<td>RBIT</td>
<td>Reverse Bits</td>
<td>REV, REV16, REVSH, and RBIT on page 95</td>
</tr>
<tr>
<td>REV</td>
<td>Reverse byte order in a word</td>
<td>REV, REV16, REVSH, and RBIT on page 95</td>
</tr>
<tr>
<td>REV16</td>
<td>Reverse byte order in each halfword</td>
<td>REV, REV16, REVSH, and RBIT on page 95</td>
</tr>
<tr>
<td>REVSH</td>
<td>Reverse byte order in bottom halfword and sign extend</td>
<td>REV, REV16, REVSH, and RBIT on page 95</td>
</tr>
<tr>
<td>ROR</td>
<td>Rotate Right</td>
<td>ASR, LSL, LSR, ROR, and RRX on page 90</td>
</tr>
<tr>
<td>RRX</td>
<td>Rotate Right with Extend</td>
<td>ASR, LSL, LSR, ROR, and RRX on page 90</td>
</tr>
<tr>
<td>RSB</td>
<td>Reverse Subtract</td>
<td>ADD, ADC, SUB, SBC, and RSB on page 87</td>
</tr>
<tr>
<td>SADD16</td>
<td>Signed Add 16</td>
<td>SADD16 and SADD8 on page 96</td>
</tr>
<tr>
<td>SADD8</td>
<td>Signed Add 8</td>
<td>SADD16 and SADD8 on page 96</td>
</tr>
<tr>
<td>SASX</td>
<td>Signed Add and Subtract with Exchange</td>
<td>SASX and SSAX on page 101</td>
</tr>
<tr>
<td>SSAX</td>
<td>Signed Subtract and Add with Exchange</td>
<td>SASX and SSAX on page 101</td>
</tr>
<tr>
<td>SBC</td>
<td>Subtract with Carry</td>
<td>ADD, ADC, SUB, SBC, and RSB on page 87</td>
</tr>
<tr>
<td>SHADD16</td>
<td>Signed Halving Add 16</td>
<td>SHADD16 and SHADD8 on page 97</td>
</tr>
<tr>
<td>Mnemonic</td>
<td>Brief description</td>
<td>See</td>
</tr>
<tr>
<td>----------</td>
<td>-------------------------------------------------</td>
<td>------------------------------------------</td>
</tr>
<tr>
<td>SHADD8</td>
<td>Signed Halving Add 8</td>
<td>SHADD16 and SHADD8 on page 97</td>
</tr>
<tr>
<td>SHASX</td>
<td>Signed Halving Add and Subtract with Exchange</td>
<td>SHASX and SHSAX on page 98</td>
</tr>
<tr>
<td>SHSAX</td>
<td>Signed Halving Subtract and Add with Exchange</td>
<td>SHASX and SHSAX on page 98</td>
</tr>
<tr>
<td>SHSUB16</td>
<td>Signed Halving Subtract 16</td>
<td>SHSUB16 and SHSUB8 on page 99</td>
</tr>
<tr>
<td>SHSUB8</td>
<td>Signed Halving Subtract 8</td>
<td>SHSUB16 and SHSUB8 on page 99</td>
</tr>
<tr>
<td>SSUB16</td>
<td>Signed Subtract 16</td>
<td>SSUB16 and SSUB8 on page 100</td>
</tr>
<tr>
<td>SSUB8</td>
<td>Signed Subtract 8</td>
<td>SSUB16 and SSUB8 on page 100</td>
</tr>
<tr>
<td>SUB</td>
<td>Subtract</td>
<td>ADD, ADC, SUB, SBC, and RSB on page 87</td>
</tr>
<tr>
<td>SUBW</td>
<td>Subtract</td>
<td>ADD, ADC, SUB, SBC, and RSB on page 87</td>
</tr>
<tr>
<td>TEQ</td>
<td>Test Equivalence</td>
<td>TST and TEQ on page 102</td>
</tr>
<tr>
<td>TST</td>
<td>Test</td>
<td>TST and TEQ on page 102</td>
</tr>
<tr>
<td>UADD16</td>
<td>Unsigned Add 16</td>
<td>UADD16 and UADD8 on page 103</td>
</tr>
<tr>
<td>UADD8</td>
<td>Unsigned Add 8</td>
<td>UADD16 and UADD8 on page 103</td>
</tr>
<tr>
<td>UASX</td>
<td>Unsigned Add and Subtract with Exchange</td>
<td>UASX and USAX on page 104</td>
</tr>
<tr>
<td>USAX</td>
<td>Unsigned Subtract and Add with Exchange</td>
<td>UASX and USAX on page 104</td>
</tr>
<tr>
<td>UHADD16</td>
<td>Unsigned Halving Add 16</td>
<td>UHADD16 and UHADD8 on page 105</td>
</tr>
<tr>
<td>UHADD8</td>
<td>Unsigned Halving Add 8</td>
<td>UHADD16 and UHADD8 on page 105</td>
</tr>
<tr>
<td>UHASX</td>
<td>Unsigned Halving Add and Subtract with Exchange</td>
<td>UHASX and UHSAX on page 106</td>
</tr>
<tr>
<td>UHSAX</td>
<td>Unsigned Halving Subtract and Add with Exchange</td>
<td>UHASX and UHSAX on page 106</td>
</tr>
<tr>
<td>UHSUB16</td>
<td>Unsigned Halving Subtract 16</td>
<td>UHSUB16 and UHSUB8 on page 107</td>
</tr>
<tr>
<td>UHSUB8</td>
<td>Unsigned Halving Subtract 8</td>
<td>UHSUB16 and UHSUB8 on page 107</td>
</tr>
<tr>
<td>USAD8</td>
<td>Unsigned Sum of Absolute Differences</td>
<td>USAD8 on page 108</td>
</tr>
<tr>
<td>USADA8</td>
<td>Unsigned Sum of Absolute Differences and Accumulate</td>
<td>USADA8 on page 109</td>
</tr>
<tr>
<td>USUB16</td>
<td>Unsigned Subtract 16</td>
<td>USUB16 and USUB8 on page 110</td>
</tr>
<tr>
<td>USUB8</td>
<td>Unsigned Subtract 8</td>
<td>USUB16 and USUB8 on page 110</td>
</tr>
</tbody>
</table>
3.5.1 **ADD, ADC, SUB, SBC, and RSB**

Add, Add with carry, Subtract, Subtract with carry, and Reverse Subtract.

**Syntax**

\[
\text{op}(\{S\})\{condition\}\ (Rd,\)\ Rn,\ Operand2 \\
\text{op}\{condition\}\ (Rd,\)\ Rn,\ #\text{imm12}\quad;\text{ADD}\text{ and }\text{SUB}\text{ only}
\]

Where:

- \(\text{op}\) is one of:
  - ADD: Add.
  - ADC: Add with Carry.
  - SUB: Subtract.
  - SBC: Subtract with Carry.
  - RSB: Reverse Subtract.

- \(S\) is an optional suffix. If \(S\) is specified, the condition code flags are updated on the result of the operation, see *Conditional execution on page 68*.

- \(condition\) is an optional condition code. See *Conditional execution on page 68*.

- \(Rd\) is the destination register. If \(Rd\) is omitted, the destination register is \(Rn\).

- \(Rn\) is the register holding the first operand.

- \(Operand2\) is a flexible second operand. See *Flexible second operand on page 64* for details of the options.

- \#\text{imm12}\ is any value in the range 0-4095.

**Operation**

The ADD instruction adds the value of \(Operand2\) or \#\text{imm12}\ to the value in \(Rn\).

The ADC instruction adds the values in \(Rn\) and \(Operand2\), together with the carry flag.

The SUB instruction subtracts the value of \(Operand2\) or \#\text{imm12}\ from the value in \(Rn\).

The SBC instruction subtracts the value of \(Operand2\) from the value in \(Rn\). If the carry flag is clear, the result is reduced by one.

The RSB instruction subtracts the value in \(Rn\) from the value of \(Operand2\). This is useful because of the wide range of options for \(Operand2\).

Use ADC and SBC to synthesize multiword arithmetic, see *Multiword arithmetic examples on page 88*.

See also *ADR on page 73*.

ADDW is equivalent to the ADD syntax that uses the \#\text{imm12}\ operand. SUBW is equivalent to the SUB syntax that uses the \#\text{imm12}\ operand.
Restrictions

In these instructions:

- **Operand2** must not be SP and must not be PC
- **Rd** can be SP only in ADD and SUB, and only with the additional restrictions:
  - **Rn** must also be SP.
  - Any shift in **Operand2** must be limited to a maximum of 3 bits using LSL.
- **Rn** can be SP only in ADD and SUB
- **Rd** can be PC only in the **ADD**(cond) PC, PC, Rm instruction where:
  - The S suffix must not be specified.
  - **Rm** must not be PC and must not be SP.
  - If the instruction is conditional, it must be the last instruction in the IT block.
- With the exception of the **ADD**(cond) PC, PC, Rm instruction, **Rn** can be PC only in ADD and SUB, and only with the additional restrictions:
  - The S suffix must not be specified.
  - The second operand must be a constant in the range 0 to 4095.
  - When using the PC for an addition or a subtraction, bits[1:0] of the PC are rounded to 0b00 before performing the calculation, making the base address for the calculation word-aligned.
  - If the user wants to generate the address of an instruction, the constant has to be adjusted based on the value of the PC. Arm recommends using the ADR instruction instead of ADD or SUB with **Rn** equal to the PC, because the assembler automatically calculates the correct constant for the ADR instruction.

When **Rd** is PC in the **ADD**(cond) PC, PC, Rm instruction:

- Bit[0] of the value written to the PC is ignored.
- A branch occurs to the address created by forcing bit[0] of that value to 0.

Condition flags

If **S** is specified, these instructions update the N, Z, C and V flags according to the result.

Examples

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Operands</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADD</td>
<td>R2, R1, R3</td>
<td>; Sets the flags on the result.</td>
</tr>
<tr>
<td>SUBS</td>
<td>R8, R6, #240</td>
<td>; Subtracts contents of R4 from 1280.</td>
</tr>
<tr>
<td>RSB</td>
<td>R4, R4, #1280</td>
<td>; Only executed if C flag set and Z. flag clear.</td>
</tr>
<tr>
<td>ADCHI</td>
<td>R11, R0, R3</td>
<td></td>
</tr>
</tbody>
</table>

Multiword arithmetic examples

*Example 3-4: 64-bit addition* shows two instructions that add a 64-bit integer contained in R2 and R3 to another 64-bit integer contained in R0 and R1, and place the result in R4 and R5.

**Example 3-4: 64-bit addition**

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Operands</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADDS</td>
<td>R4, R0, R2</td>
<td>; Add the least significant words.</td>
</tr>
<tr>
<td>ADC</td>
<td>R5, R1, R3</td>
<td>; Add the most significant words with carry.</td>
</tr>
</tbody>
</table>

The multiword values do not have to use consecutive registers. *Example 3-5: 96-bit*
subtraction shows instructions that subtract a 96-bit integer contained in R9, R1, and R11 from another contained in R6, R2, and R8. The example stores the result in R6, R9, and R2.

Example 3-5: 96-bit subtraction

```
SUBS  R6, R6, R9  ; Subtract the least significant words.
SBCS  R9, R2, R1  ; Subtract the middle words with carry.
SBC  R2, R8, R11  ; Subtract the most significant words with carry.
```

3.5.2 AND, ORR, EOR, BIC, and ORN

Logical AND, OR, Exclusive OR, Bit Clear, and OR NOT.

**Syntax**

```
op{S}{cond} {Rd,} Rn, Operand2
```

Where:

- **op** Is one of:
  - AND  Logical AND.
  - ORR  Logical OR, or bit set.
  - EOR  Logical Exclusive OR.
  - BIC  Logical AND NOT, or bit clear.
  - ORN  Logical OR NOT.

- **S** Is an optional suffix. If S is specified, the condition code flags are updated on the result of the operation, see Conditional execution on page 68.

- **cond** Is an optional condition code. See Conditional execution on page 68.

- **Rd** Is the destination register.

- **Rn** Is the register holding the first operand.

- **Operand2** Is a flexible second operand. See Flexible second operand on page 64 for details of the options.

**Operation**

The AND, EOR, and ORR instructions perform bitwise AND, Exclusive OR, and OR operations on the values in Rn and Operand2.

The BIC instruction performs an AND operation on the bits in Rn with the complements of the corresponding bits in the value of Operand2.

The ORN instruction performs an OR operation on the bits in Rn with the complements of the corresponding bits in the value of Operand2.

**Restrictions**

Do not use SP and do not use PC.
Condition flags

If S is specified, these instructions:

- Update the N and Z flags according to the result.
- Can update the C flag during the calculation of Operand2, see Example 3-5: 96-bit subtraction on page 89.
- Do not affect the V flag.

Examples

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>AND R9, R2, #0xFF00</td>
<td></td>
</tr>
<tr>
<td>ORREQ R2, R0, R5</td>
<td></td>
</tr>
<tr>
<td>ANDS R9, R8, #0x19</td>
<td></td>
</tr>
<tr>
<td>EORS R7, R11, #0x18181818</td>
<td></td>
</tr>
<tr>
<td>BIC R0, R1, #0x1ab</td>
<td></td>
</tr>
<tr>
<td>ORN R7, R11, R14, ROR #4</td>
<td></td>
</tr>
<tr>
<td>ORNS R7, R11, R14, ASR #32</td>
<td></td>
</tr>
</tbody>
</table>

3.5.3 ASR, LSL, LSR, ROR, and RRX

Arithmetic Shift Right, Logical Shift Left, Logical Shift Right, Rotate Right, and Rotate Right with Extend.

Syntax

\[
\text{op(S)\{cond\} Rd, Rm, Rs} \\
\text{op(S)\{cond\} Rd, Rm, #n} \\
\text{RRX(S)\{cond\} Rd, Rm}
\]

Where:

- **op** Is one of:
  - **ASR** Arithmetic Shift Right.
  - **LSL** Logical Shift Left.
  - **LSR** Logical Shift Right.
  - **ROR** Rotate Right.

- **S** Is an optional suffix. If S is specified, the condition code flags are updated on the result of the operation, see Conditional execution on page 68.

- **Rd** Is the destination register.

- **Rm** Is the register holding the value to be shifted.

- **Rs** Is the register holding the shift length to apply to the value in Rm. Only the least significant byte is used and can be in the range 0 to 255.

- **n** Is the shift length. The range of shift length depends on the instruction:
  - **ASR** Shift length from 1 to 32
  - **LSL** Shift length from 0 to 31
  - **LSR** Shift length from 1 to 32
  - **ROR** Shift length from 1 to 31

`MOVS Rd, Rm` is the preferred syntax for `LSLS Rd, Rm, #0`. 
### Operation

ASR, LSL, LSR, and ROR move the bits in the register $Rm$ to the left or right by the number of places specified by constant $n$ or register $Rs$.

RRX moves the bits in register $Rm$ to the right by 1.

In all these instructions, the result is written to $Rd$, but the value in register $Rm$ remains unchanged. For details on what result is generated by the different instructions, see *Shift operations on page 65*.

### Restrictions

Do not use SP and do not use PC.

### Condition flags

If $S$ is specified:

- These instructions update the N and Z flags according to the result.
- The C flag is updated to the last bit shifted out, except when the shift length is 0, see *Shift operations on page 65*.

### Examples

- `ASR R7, R8, #9`; Arithmetic shift right by 9 bits.
- `LSLS R1, R2, #3`; Logical shift left by 3 bits with flag update.
- `LSR R4, R5, #6`; Logical shift right by 6 bits.
- `ROR R4, R5, R6`; Rotate right by the value in the bottom byte of $R6$.
- `RRX R4, R5`; Rotate right with extend.

#### 3.5.4 CLZ

Count Leading Zeros.

### Syntax

\[
\text{CLZ}\{\text{cond}\} \quad Rd, Rm
\]

*Where:*

- $\text{cond}$ is an optional condition code. See *Conditional execution on page 68*.
- $Rd$ is the destination register.
- $Rm$ is the operand register.

### Operation

The CLZ instruction counts the number of leading zeros in the value in $Rm$ and returns the result in $Rd$. The result value is 32 if no bits are set and zero if bit[31] is set.

### Restrictions

Do not use SP and do not use PC.

### Condition flags

This instruction does not change the flags.
Examples

CLZ R4, R9
CLZNE R2, R3

3.5.5 CMP and CMN

Compare and Compare Negative.

Syntax

CMP{cond} Rn, Operand2
CMN{cond} Rn, Operand2

Where:

cond Is an optional condition code. See Conditional execution on page 68.
Rn Is the register holding the first operand.
Operand2 Is a flexible second operand. See Flexible second operand on page 64 for details of the options.

Operation

These instructions compare the value in a register with Operand2. They update the condition flags on the result, but do not write the result to a register.

The CMP instruction subtracts the value of Operand2 from the value in Rn. This is the same as a SUBS instruction, except that the result is discarded.

The CMN instruction adds the value of Operand2 to the value in Rn. This is the same as an ADDS instruction, except that the result is discarded.

Restrictions

In these instructions:

- Do not use PC.
- Operand2 must not be SP.

Condition flags

These instructions update the N, Z, C and V flags according to the result.

Examples

CMP R2, R9
CMN R0, #6400
CMPGT SP, R7, LSL #2
3.5.6 MOV and MVN

Move and Move NOT.

Syntax

\[
\text{MOV(S)(cond) Rd, Operand2} \\
\text{MOV(cond) Rd, #imm16} \\
\text{MVN(S)(cond) Rd, Operand2}
\]

Where:

- **S** is an optional suffix. If S is specified, the condition code flags are updated on the result of the operation, see Conditional execution on page 68.
- **cond** is an optional condition code. See Conditional execution on page 68.
- **Rd** is the destination register.
- **Operand2** is a flexible second operand. See Flexible second operand on page 64 for details of the options.
- **imm16** is any value in the range 0-65535.

Operation

The MOV instruction copies the value of **Operand2** into **Rd**.

When **Operand2** in a MOV instruction is a register with a shift other than LSL #0, the preferred syntax is the corresponding shift instruction:

- \( \text{ASR(S)(cond) Rd, Rm, #n is the preferred syntax for MOV(S)(cond) Rd, Rm, ASR #n.} \)
- \( \text{LSL(S)(cond) Rd, Rm, #n is the preferred syntax for MOV(S)(cond) Rd, Rm, LSL #n if n != 0.} \)
- \( \text{LSR(S)(cond) Rd, Rm, #n is the preferred syntax for MOV(S)(cond) Rd, Rm, LSR #n.} \)
- \( \text{ROR(S)(cond) Rd, Rm, #n is the preferred syntax for MOV(S)(cond) Rd, Rm, ROR #n.} \)
- \( \text{RRX(S)(cond) Rd, Rm is the preferred syntax for MOV(S)(cond) Rd, Rm, RRX.} \)

Also, the MOV instruction permits additional forms of **Operand2** as synonyms for shift instructions:

- \( \text{MOV(S)(cond) Rd, Rm, ASR Rs is a synonym for ASR(S)(cond) Rd, Rm, Rs.} \)
- \( \text{MOV(S)(cond) Rd, Rm, LSL Rs is a synonym for LSL(S)(cond) Rd, Rm, Rs.} \)
- \( \text{MOV(S)(cond) Rd, Rm, LSR Rs is a synonym for LSR(S)(cond) Rd, Rm, Rs.} \)
- \( \text{MOV(S)(cond) Rd, Rm, ROR Rs is a synonym for ROR(S)(cond) Rd, Rm, Rs} \)

See **ASR, LSL, LSR, ROR, and RRX** on page 90.

The MVN instruction takes the value of **Operand2**, performs a bitwise logical NOT operation on the value, and places the result into **Rd**.

The MOVW instruction provides the same function as MOV, but is restricted to using the **imm16** operand.
Restrictions

The user can use SP and PC only in the MOV instruction, with the following restrictions:

- The second operand must be a register without shift.
- The S suffix must not be specified.

When Rd is PC in a MOV instruction:

- Bit[0] of the value written to the PC is ignored.
- A branch occurs to the address created by forcing bit[0] of that value to 0.

Though it is possible to use MOV as a branch instruction, Arm strongly recommends the use of a BX or BLX instruction to branch for software portability to the Arm instruction set.

Condition flags

If S is specified, these instructions:

- Update the N and Z flags according to the result.
- Can update the C flag during the calculation of Operand2, see Flexible second operand on page 64.
- Do not affect the V flag.

Example

- MOVS R11, #0x000B; Write value of 0x000B to R11, flags get updated.
- MOV R1, #0xFA05; Write value of 0xFA05 to R1, flags are not updated.
- MOVS R10, R12; Write value in R12 to R10, flags get updated.
- MOV R3, #23; Write value of 23 to R3.
- MOV R8, SP; Write value of stack pointer to R8.
- MVNS R2, #0xF; Write value of 0xFFFFFFF0 (bitwise inverse of 0xF) to the R2 and update flags.

3.5.7 MOVT

Move Top.

Syntax

MOVT{cond} Rd, #imm16

Where:

- cond Is an optional condition code. See Conditional execution on page 68.
- Rd Is the destination register.
- imm16 Is a 16-bit immediate constant.

Operation

MOVT writes a 16-bit immediate value, imm16, to the top halfword, Rd[31:16], of its destination register. The write does not affect Rd[15:0].

The MOV, MOVT instruction pair enables to generate any 32-bit constant.

Restrictions

Rd must not be SP and must not be PC.
Condition flags
This instruction does not change the flags.

Examples

MOVTR3, #'0xF123'; Write 0xF123 to upper halfword of R3, lower
halfword and APSR are unchanged.

3.5.8 REV, REV16, REVSH, and RBIT
Reverse bytes and Reverse bits.

Syntax

\[ \text{op}\{\text{cond}\} \ Rd, \ Rn \]

Where:

\( \text{op} \)
Is one of:

- REV    Reverse byte order in a word.
- REV16  Reverse byte order in each halfword independently.
- REVSH  Reverse byte order in the bottom halfword, and sign extend to 32
         bits.
- RBIT   Reverse the bit order in a 32-bit word.

\( \text{cond} \)
Is an optional condition code. See \textit{Conditional execution on page 68}.

\( Rd \)
Is the destination register.

\( Rn \)
Is the register holding the operand.

Operation
Use these instructions to change endianness of data:

REV  converts either:

- 32-bit big-endian data into little-endian data.
- 32-bit little-endian data into big-endian data.

REV16 converts either:

- 16-bit big-endian data into little-endian data.
- 16-bit little-endian data into big-endian data.

REVSH converts either:

- 16-bit signed big-endian data into 32-bit signed little-endian data.
- 16-bit signed little-endian data into 32-bit signed big-endian data.

Restrictions
Do not use SP and do not use PC.

Condition flags
These instructions do not change the flags.
Examples

REV  R3, R7 ; Reverse byte order of value in R7 and write it to R3.
REV16 R0, R0 ; Reverse byte order of each 16-bit halfword in R0.
REVSH R0, R5 ; Reverse Signed Halfword.
REVHS R3, R7 ; Reverse with Higher or Same condition.
RBIT R7, R8 ; Reverse bit order of value in R8 and write the result to R7.

3.5.9 SADD16 and SADD8

Signed Add 16 and Signed Add 8.

Syntax

\[ \text{op}(\text{cond})(Rd,) \text{ } Rn, \text{ } Rm \]

Where:

- \( \text{op} \) Is one of:
  - \text{SADD16} Performs two 16-bit signed integer additions.
  - \text{SADD8} Performs four 8-bit signed integer additions.

- \( \text{cond} \) Is an optional condition code. See Conditional execution on page 68.

- \( Rd \) Is the destination register.

- \( Rn \) Is the first register holding the operand.

- \( Rm \) Is the second register holding the operand.

Operation

Use these instructions to perform a halfword or byte add in parallel.

The SADD16 instruction:
1. Adds each halfword from the first operand to the corresponding halfword of the second operand.
2. Writes the result in the corresponding halfwords of the destination register.

The SADD8 instruction:
1. Adds each byte of the first operand to the corresponding byte of the second operand.
2. Writes the result in the corresponding bytes of the destination register.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not change the flags.

Examples

\[ \text{SADD16} \text{ } R1, \text{ } R0 \] ; Adds the halfwords in R0 to the corresponding halfwords of R1 and writes to corresponding halfword of R1.
\[ \text{SADD8} \text{ } R4, \text{ } R0, \text{ } R5 \] ; Adds bytes of R0 to the corresponding byte in R5 and writes to the corresponding byte in R4.
### 3.5.10 SHADD16 and SHADD8

Signed Halving Add 16 and Signed Halving Add 8.

#### Syntax

\[
\text{op}(\text{cond})\{Rd,\} \ Rn, \ Rm
\]

Where:

- \(op\) is one of:
  - SHADD16: Signed Halving Add 16
  - SHADD8: Signed Halving Add 8
- \(cond\) is an optional condition code. See *Conditional execution on page 68*.
- \(Rd\) is the destination register.
- \(Rn\) is the first operand register.
- \(Rm\) is the second operand register.

#### Operation

Use these instructions to add 16-bit and 8-bit data and then to halve the result before writing the result to the destination register.

The SHADD16 instruction:
1. Adds each halfword from the first operand to the corresponding halfword of the second operand.
2. Shuffles the result by one bit to the right, halving the data.
3. Writes the halfword results in the destination register.

The SHADD8 instruction:
1. Adds each byte of the first operand to the corresponding byte of the second operand.
2. Shuffles the result by one bit to the right, halving the data.
3. Writes the byte results in the destination register.

#### Restrictions

Do not use SP and do not use PC.

#### Condition flags

These instructions do not change the flags.

#### Examples

- `SHADD16 R1, R0` ; Adds halfwords in R0 to corresponding halfword of R1
  ; and writes halved result to corresponding halfword in
  ; R1.
- `SHADD8 R4, R0, R5` ; Adds bytes of R0 to corresponding byte in R5 and
  ; writes halved result to corresponding byte in R4.
3.5.11 SHASX and SHSAX

Signed Halving Add and Subtract with Exchange and Signed Halving Subtract and Add with Exchange.

Syntax

\[ \text{op}(\text{cond}) \{\text{Rd}\}, \text{Rn}, \text{Rm} \]

Where:

- \( \text{op} \) is one of:
  - \( \text{SHASX} \): Add and Subtract with Exchange and Halving.
  - \( \text{SHSAX} \): Subtract and Add with Exchange and Halving.
- \( \text{cond} \) is an optional condition code. See Conditional execution on page 68.
- \( \text{Rd} \) is the destination register.
- \( \text{Rn}, \text{Rm} \) are registers holding the first and second operands.

Operation

The SHASX instruction:

1. Adds the top halfword of the first operand with the bottom halfword of the second operand.
2. Writes the halfword result of the addition to the top halfword of the destination register, shifted by one bit to the right causing a divide by two, or halving.
3. Subtracts the top halfword of the second operand from the bottom highword of the first operand.
4. Writes the halfword result of the division in the bottom halfword of the destination register, shifted by one bit to the right causing a divide by two, or halving.

The SHSAX instruction:

1. Subtracts the bottom halfword of the second operand from the top highword of the first operand.
2. Writes the halfword result of the addition to the bottom halfword of the destination register, shifted by one bit to the right causing a divide by two, or halving.
3. Adds the bottom halfword of the first operand with the top halfword of the second operand.
4. Writes the halfword result of the division in the top halfword of the destination register, shifted by one bit to the right causing a divide by two, or halving.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not affect the condition code flags.

Examples

\[ \text{SHASX R7, R4, R2} \]

; Adds top halfword of R4 to bottom halfword of R2 and writes halved result to top halfword of R7. Subtracts top halfword of R2 from bottom
3.5.12 SHSUB16 and SHSUB8

Signed Halving Subtract 16 and Signed Halving Subtract 8.

Syntax

\[
\text{op(\{cond\})\{Rd,\} Rn, Rm}
\]

Where:

- \text{op} is one of:
  - \text{SHSUB16} Signed Halving Subtract 16.
  - \text{SHSUB8} Signed Halving Subtract 8.

- \text{cond} is an optional condition code. See Conditional execution on page 68.

- \text{Rd} is the destination register.
- \text{Rn} is the first operand register.
- \text{Rm} is the second operand register.

Operation

Use these instructions to add 16-bit and 8-bit data and then to halve the result before writing the result to the destination register.

The SHSUB16 instruction:
1. Subtracts each halfword of the second operand from the corresponding halfwords of the first operand.
2. Shuffles the result by one bit to the right, halving the data.
3. Writes the halved halfword results in the destination register.

The SHSUB8 instruction:
1. Subtracts each byte of the second operand from the corresponding byte of the first operand.
2. Shuffles the result by one bit to the right, halving the data.
3. Writes the corresponding signed byte results in the destination register.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not change the flags.
Examples

SHSUB16 R1, R0 ; Subtracts halfwords in R0 from corresponding halfword of R1 and writes to corresponding halfword of R1.
SHSUB8 R4, R0, R5 ; Subtracts bytes of R0 from corresponding byte in R5, and writes to corresponding byte in R4.

3.5.13 SSUB16 and SSUB8

Signed Subtract 16 and Signed Subtract 8.

Syntax

\[ \text{op} \{ \text{cond} \} \{ \text{Rd} \}, \text{Rn}, \text{Rm} \]

Where:

- \( \text{op} \) Is one of:
  - \( \text{SSUB16} \) Performs two 16-bit signed integer subtractions.
  - \( \text{SSUB8} \) Performs four 8-bit signed integer subtractions.
- \( \text{cond} \) Is an optional condition code. See Conditional execution on page 68.
- \( \text{Rd} \) Is the destination register.
- \( \text{Rn} \) Is the first operand register.
- \( \text{Rm} \) Is the second operand register.

Operation

Use these instructions to change endianness of data.

The SSUB16 instruction:
1. Subtracts each halfword from the second operand from the corresponding halfword of the first operand.
2. Writes the difference result of two signed halfwords in the corresponding halfword of the destination register.

The SSUB8 instruction:
1. Subtracts each byte of the second operand from the corresponding byte of the first operand.
2. Writes the difference result of four signed bytes in the corresponding byte of the destination register.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not change the flags.

Examples

SSUB16 R1, R0 ; Subtracts halfwords in R0 from corresponding halfword of R1
; halfword of R1 and writes to corresponding halfword
; of R1.
SSUB8 R4, R0, R5 ; Subtract bytes of R5 from corresponding byte in
; R0, and writes to corresponding byte of R4.

3.5.14 SASX and SSAX
Signed Add and Subtract with Exchange and Signed Subtract and Add with Exchange.

Syntax

\[ \text{op} \{\text{cond}\} \{\text{Rd}\}, \text{Rm}, \text{Rn} \]

Where:

- \( \text{op} \) Is one of:
  - SASX Signed Add and Subtract with Exchange.
  - SSAX Signed Subtract and Add with Exchange.

- \( \text{cond} \) Is an optional condition code. See \textit{Conditional execution on page 68}.

- \( \text{Rd} \) Is the destination register.

- \( \text{Rn}, \text{Rm} \) Are registers holding the first and second operands.

Operation

The SASX instruction:
1. Adds the signed top halfword of the first operand with the signed bottom halfword of the second operand.
2. Writes the signed result of the addition to the top halfword of the destination register.
3. Subtracts the signed bottom halfword of the second operand from the top signed highword of the first operand.
4. Writes the signed result of the subtraction to the bottom halfword of the destination register.

The SSAX instruction:
1. Subtracts the signed bottom halfword of the second operand from the top signed highword of the first operand.
2. Writes the signed result of the addition to the bottom halfword of the destination register.
3. Adds the signed top halfword of the first operand with the signed bottom halfword of the second operand.
4. Writes the signed result of the subtraction to the top halfword of the destination register.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not affect the condition code flags.

Examples

SASX R0, R4, R5 ; Adds top halfword of R4 to bottom halfword of R5
3.5.15 TST and TEQ

Test bits and Test Equivalence.

Syntax

TST{(cond)} Rn, Operand2
TEQ{(cond)} Rn, Operand2

Where:

cond Is an optional condition code. See Conditional execution on page 68.
Rn Is the register holding the first operand.
Operand2 Is a flexible second operand. See Flexible second operand on page 64 for details of the options.

Operation

These instructions test the value in a register against Operand2. They update the condition flags based on the result, but do not write the result to a register.

The TST instruction performs a bitwise AND operation on the value in Rn and the value of Operand2. This is the same as the ANDS instruction, except that it discards the result. To test whether a bit of Rn is 0 or 1, use the TST instruction with an Operand2 constant that has that bit set to 1 and all other bits cleared to 0.

The TEQ instruction performs a bitwise Exclusive OR operation on the value in Rn and the value of Operand2. This is the same as the EORS instruction, except that it discards the result.

Use the TEQ instruction to test if two values are equal without affecting the V or C flags. TEQ is also useful for testing the sign of a value. After the comparison, the N flag is the logical Exclusive OR of the sign bits of the two operands.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions:

- Update the N and Z flags according to the result.
- Can update the C flag during the calculation of Operand2, see Flexible second operand on page 64.
- Do not affect the V flag.
Examples

TST  R0, #0x3F8 ; Perform bitwise AND of R0 value to 0x3F8,
       ; APSR is updated but result is discarded
TEQEQ R10, R9 ; Conditionally test if value in R10 is equal to
              ; value in R9, APSR is updated but result is
              ; discarded.

3.5.16 UADD16 and UADD8

Unsigned Add 16 and Unsigned Add 8.

Syntax

\texttt{op} (cond) \{Rd, \} \ Rn, \ Rm

Where:

- \texttt{op} is one of:
  - UADD16: Performs two 16-bit unsigned integer additions.
  - UADD8: Performs four 8-bit unsigned integer additions.

- \texttt{cond} is an optional condition code. See Conditional execution on page 68.

- \texttt{Rd} is the destination register.

- \texttt{Rn} is the first register holding the operand.

- \texttt{Rm} is the second register holding the operand.

Operation

Use these instructions to add 16- and 8-bit unsigned data.

The UADD16 instruction:
1. Adds each halfword from the first operand to the corresponding halfword of the second operand.
2. Writes the unsigned result in the corresponding halfwords of the destination register.

The UADD16 instruction:
1. Adds each byte of the first operand to the corresponding byte of the second operand.
2. Writes the unsigned result in the corresponding byte of the destination register.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not change the flags.

Examples

UADD16 R1, R0 ; Adds halfwords in R0 to corresponding halfword of R1,
               ; writes to corresponding halfword of R1.
UADD8  R4, R0, R5 ; Adds bytes of R0 to corresponding byte in R5 and
                   ; writes to corresponding byte in R4.
3.5.17 UASX and USAX

Add and Subtract with Exchange and Subtract and Add with Exchange.

Syntax

\[ \text{op}(\text{cond}) \{\text{Rd}\}, \text{Rn}, \text{Rm} \]

Where:

- **op** is one of:
  - **UASX** Add and Subtract with Exchange.
  - **USAX** Subtract and Add with Exchange.
- **cond** is an optional condition code. See Conditional execution on page 68.
- **Rd** is the destination register.
- **Rn**, **Rm** are registers holding the first and second operands.

Operation

The UASX instruction:

1. Subtracts the top halfword of the second operand from the bottom halfword of the first operand.
2. Writes the unsigned result from the subtraction to the bottom halfword of the destination register.
3. Adds the top halfword of the first operand with the bottom halfword of the second operand.
4. Writes the unsigned result of the addition to the top halfword of the destination register.

The USAX instruction:

1. Adds the bottom halfword of the first operand with the top halfword of the second operand.
2. Writes the unsigned result of the addition to the bottom halfword of the destination register.
3. Subtracts the bottom halfword of the second operand from the top halfword of the first operand.
4. Writes the unsigned result from the subtraction to the top halfword of the destination register.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not affect the condition code flags.

Examples

UASX R0, R4, R5 \; Adds top halfword of R4 to bottom halfword of R5 \; and writes to top halfword of R0. \; Subtracts bottom halfword of R5 from top halfword \; of R0 and writes to bottom halfword of R0.

USAX R7, R3, R2 \; Subtracts top halfword of R2 from bottom halfword \; of R3 and writes to bottom halfword of R7. \; Adds top halfword of R3 to bottom halfword of R2
3.5.18 UHADD16 and UHADD8

Unsigned Halving Add 16 and Unsigned Halving Add 8.

Syntax

\[ \text{op}\{\text{cond}\}\{\text{Rd},\} \text{Rn}, \text{Rm} \]

Where:

- \( \text{op} \) is one of:
  - UHADD16Unsigned Halving Add 16.
  - UHADD8Unsigned Halving Add 8.
- \( \text{cond} \) is an optional condition code. See Conditional execution on page 68.
- \( \text{Rd} \) is the destination register.
- \( \text{Rn} \) is the register holding the first operand.
- \( \text{Rm} \) is the register holding the second operand.

Operation

Use these instructions to add 16- and 8-bit data and then to halve the result before writing the result to the destination register.

The UHADD16 instruction:

1. Adds each halfword from the first operand to the corresponding halfword of the second operand.
2. Shuffles the halfword result by one bit to the right, halving the data.
3. Writes the unsigned results to the corresponding halfword in the destination register.

The UHADD8 instruction:

1. Adds each byte of the first operand to the corresponding byte of the second operand.
2. Shuffles the byte result by one bit to the right, halving the data.
3. Writes the unsigned results in the corresponding byte in the destination register.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not change the flags.

Examples

UHADD16 R7, R3 ; Adds halfwords in R7 to corresponding halfword of R3
; and writes halved result to corresponding halfword
; in R7.

UHADD8 R4, R0, R5 ; Adds bytes of R0 to corresponding byte in R5 and
; writes halved result to corresponding byte in R4.
3.5.19 UHASX and UHSAX

Unsigned Halving Add and Subtract with Exchange and Unsigned Halving Subtract and Add with Exchange.

Syntax

\[ \text{op}\{\text{cond}\} \{\text{Rd}\}, \text{Rn}, \text{Rm} \]

Where:

- \text{op} is one of:
  - \text{UHASX} Add and Subtract with Exchange and Halving.
  - \text{UHSAX} Subtract and Add with Exchange and Halving.
- \text{cond} is an optional condition code. See Conditional execution on page 68.
- \text{Rd} is the destination register.
- \text{Rn}, \text{Rm} are registers holding the first and second operands.

Operation

The UHASX instruction:
1. Adds the top halfword of the first operand with the bottom halfword of the second operand.
2. Shifts the result by one bit to the right causing a divide by two, or halving.
3. Writes the halfword result of the addition to the top halfword of the destination register.
4. Subtracts the top halfword of the second operand from the bottom highword of the first operand.
5. Shifts the result by one bit to the right causing a divide by two, or halving.
6. Writes the halfword result of the division in the bottom halfword of the destination register.

The UHSAX instruction:
1. Subtracts the bottom halfword of the second operand from the top highword of the first operand.
2. Shifts the result by one bit to the right causing a divide by two, or halving.
3. Writes the halfword result of the subtraction in the top halfword of the destination register.
4. Adds the bottom halfword of the first operand with the top halfword of the second operand.
5. Shifts the result by one bit to the right causing a divide by two, or halving.
6. Writes the halfword result of the addition to the bottom halfword of the destination register.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not affect the condition code flags.

Examples

\text{UHASX} \ R7, \ R4, \ R2 \ ; \text{Adds top halfword of R4 with bottom halfword of}
; R2 and writes halved result to top halfword of R7.
; Subtracts top halfword of R2 from bottom halfword of R7 and writes halved result to bottom halfword of R7.
UHSAX R0, R3, R5
; Subtracts bottom halfword of R5 from top halfword of R3 and writes halved result to top halfword of R0.
; Adds top halfword of R5 to bottom halfword of R3 and writes halved result to bottom halfword of R0.

3.5.20 UHSUB16 and UHSUB8

Unsigned Halving Subtract 16 and Unsigned Halving Subtract 8.

Syntax

\[
\text{op}(\text{cond})\{\text{Rd},\} \, \text{Rn}, \, \text{Rm}
\]

Where:

- **op**
  - Is one of:
    - **UHSUB16**: Performs two unsigned 16-bit integer additions, halves the results, and writes the results to the destination register.
    - **UHSUB8**: Performs four unsigned 8-bit integer additions, halves the results, and writes the results to the destination register.

- **cond**
  - Is an optional condition code. See *Conditional execution on page 68*.

- **Rd**
  - Is the destination register.

- **Rn**
  - Is the first register holding the operand.

- **Rm**
  - Is the second register holding the operand.

Operation

Use these instructions to add 16-bit and 8-bit data and then to halve the result before writing the result to the destination register.

The UHSUB16 instruction:

1. Subtracts each halfword of the second operand from the corresponding halfword of the first operand.
2. Shuffles each halfword result to the right by one bit, halving the data.
3. Writes each unsigned halfword result to the corresponding halfwords in the destination register.

The UHSUB8 instruction:

1. Subtracts each byte of second operand from the corresponding byte of the first operand.
2. Shuffles each byte result by one bit to the right, halving the data.
3. Writes the unsigned byte results to the corresponding byte of the destination register.

Restrictions

Do not use SP and do not use PC.
Condition flags

These instructions do not change the flags.

Examples

UHSUB16 R1, R0  ; Subtracts halfwords in R0 from corresponding
                 ; halfword of R1 and writes halved result to
                 ; corresponding halfword in R1.
UHSUB8 R4, R0, R5 ; Subtracts bytes of R5 from corresponding byte in R0
                 ; and writes halved result to corresponding byte in
                 ; R4.

3.5.21 SEL

Select Bytes. Selects each byte of its result from either its first operand or its second
operand, according to the values of the GE flags.

Syntax

SEL{<c>}{<q>} {<Rd>,} <Rn>, <Rm>

Where:

<c>, <q>   Is a standard assembler syntax field.
<Rd>       Is the destination register.
<Rn>       Is the first operand register.
<Rm>       Is the second operand register.

Operation

The SEL instruction:
1. Reads the value of each bit of APSR.GE.
2. Depending on the value of APSR.GE, assigns the destination register the value of
   either the first or second operand register.

Restrictions

None.

Condition flags

These instructions do not change the flags.

Examples

SADD16 R0, R1, R2  ; Set GE bits based on result.
SEL R0, R0, R3      ; Select bytes from R0 or R3, based on GE.

3.5.22 USAD8

Unsigned Sum of Absolute Differences.

Syntax

USAD8{<cond>}{Rd,} Rn, Rm
Where:

\[ \text{cond} \] is an optional condition code. See *Conditional execution on page 68*.

\[ Rd \] is the destination register.

\[ Rn \] is the first operand register.

\[ Rm \] is the second operand register.

**Operation**

The USAD8 instruction:

1. Subtracts each byte of the second operand register from the corresponding byte of the first operand register.
2. Adds the absolute values of the differences together.
3. Writes the result to the destination register.

**Restrictions**

Do not use SP and do not use PC.

**Condition flags**

These instructions do not change the flags.

**Examples**

\[
\text{USAD8} \ R1, \ R4, \ R0 \quad \text{; Subtracts each byte in R0 from corresponding byte of R4 adds the differences and writes to R1.}
\]

\[
\text{USAD8} \ R0, \ R5 \quad \text{; Subtracts bytes of R5 from corresponding byte in R0, adds the differences and writes to R0.}
\]

3.5.23 **USADA8**

Unsigned Sum of Absolute Differences and Accumulate.

**Syntax**

\[
\text{USADA8}\{\text{cond}\}\{Rd\}, \ Rn, \ Rm, \ Ra
\]

**Where:**

\[ \text{cond} \] is an optional condition code. See *Conditional execution on page 68*.

\[ Rd \] is the destination register.

\[ Rn \] is the first operand register.

\[ Rm \] is the second operand register.

\[ Ra \] is the register that contains the accumulation value.

**Operation**

The USADA8 instruction:
1. Subtracts each byte of the second operand register from the corresponding byte of the first operand register.
2. Adds the unsigned absolute differences together.
3. Adds the accumulation value to the sum of the absolute differences.
4. Writes the result to the destination register.

Restrictions
Do not use SP and do not use PC.

Condition flags
These instructions do not change the flags.

Examples
```
USADA8 R1, R0, R6 ; Subtracts bytes in R0 from corresponding halfword
                    ; of R1 adds differences, adds value of R6, writes
                    ; to R1.
USADA8 R4, R0, R5, R2 ; Subtracts bytes of R5 from corresponding byte in
                    ; R0 adds differences, adds value of R2 writes to
                    ; R4.
```

3.5.24 USUB16 and USUB8
Unsigned Subtract 16 and Unsigned Subtract 8.

Syntax
```
op{cond}{Rd,} Rn, Rm
```
Where:
- `op` Is one of:
  - USUB16 Unsigned Subtract 16.
  - USUB8 Unsigned Subtract 8.
- `cond` Is an optional condition code. See Conditional execution on page 68.
- `Rd` Is the destination register.
- `Rn` Is the first operand register.
- `Rm` Is the second operand register.

Operation
Use these instructions to subtract 16-bit and 8-bit data before writing the result to the destination register.

The USUB16 instruction:
1. Subtracts each halfword from the second operand register from the corresponding halfword of the first operand register.
2. Writes the unsigned result in the corresponding halfwords of the destination register.

The USUB8 instruction:
1. Subtracts each byte of the second operand register from the corresponding byte of the first operand register.
2. Writes the unsigned byte result in the corresponding byte of the destination register.

Restrictions
Do not use SP and do not use PC.

Condition flags
These instructions do not change the flags.

Examples
USUB16 R1, R0 ; Subtracts halfwords in R0 from corresponding halfword of R1 and writes to corresponding halfword in R1.
USUB8 R4, R0, R5 ; Subtracts bytes of R5 from corresponding byte in R0 and writes to the corresponding byte in R4.

3.6 Multiply and divide instructions

Table 31 shows the multiply and divide instructions:

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>MLA</td>
<td>Multiply with Accumulate, 32-bit result</td>
<td>MUL, MLA, and MLS on page 112</td>
</tr>
<tr>
<td>MLS</td>
<td>Multiply and Subtract, 32-bit result</td>
<td>MUL, MLA, and MLS on page 112</td>
</tr>
<tr>
<td>MUL</td>
<td>Multiply, 32-bit result</td>
<td>MUL, MLA, and MLS on page 112</td>
</tr>
<tr>
<td>SDIV</td>
<td>Signed Divide</td>
<td>SDIV and UDIV on page 127</td>
</tr>
<tr>
<td>SMLA[B,T]</td>
<td>Signed Multiply Accumulate (halfwords)</td>
<td>SMLLA and SMMLS on page 121</td>
</tr>
<tr>
<td>SMLAD, SMLADX</td>
<td>Signed Multiply Accumulate Dual</td>
<td>SMLAD on page 116</td>
</tr>
<tr>
<td>SMLAL</td>
<td>Signed Multiply with Accumulate (32x32+64), 64-bit result</td>
<td>UMULL, UMLAL, SMULL, and SMLAL on page 126</td>
</tr>
<tr>
<td>SMLAL[B,T]</td>
<td>Signed Multiply Accumulate Long (halfwords)</td>
<td>SMLAL and SMLALD on page 117</td>
</tr>
<tr>
<td>SMLALD, SMLALDX</td>
<td>Signed Multiply Accumulate Long Dual</td>
<td>SMLAL and SMLALD on page 117</td>
</tr>
<tr>
<td>SMLAW[B,T]</td>
<td>Signed Multiply Accumulate (word by halfword)</td>
<td>SMLA and SMLAW on page 115</td>
</tr>
<tr>
<td>SMLSD</td>
<td>Signed Multiply Subtract Dual</td>
<td>SMLSD and SMLSLD on page 119</td>
</tr>
<tr>
<td>SMLSLD</td>
<td>Signed Multiply Subtract Long Dual</td>
<td>SMLSD and SMLSLD on page 119</td>
</tr>
<tr>
<td>SMMLA</td>
<td>Signed Most Significant Word Multiply Accumulate</td>
<td>SMMLA and SMMLS on page 121</td>
</tr>
<tr>
<td>SMMLS, SMMLSR</td>
<td>Signed Most Significant Word Multiply Subtract</td>
<td>SMMLA and SMMLS on page 121</td>
</tr>
</tbody>
</table>
3.6.1 MUL, MLA, and MLS

Multiply, Multiply with Accumulate, and Multiply with Subtract, using 32-bit operands, and producing a 32-bit result.

Syntax

\[
\text{MUL}\{S\}\{\text{cond}\}\{\text{Rd},\} \ Rn, Rm \ ; \text{Multiply} \\
\text{MLA}\{\text{cond}\} \ Rd, Rn, Rm, Ra \ ; \text{Multiply with accumulate} \\
\text{MLS}\{\text{cond}\} \ Rd, Rn, Rm, Ra \ ; \text{Multiply with subtract}
\]

Where:

\[\text{cond}\] Is an optional condition code. See Conditional execution on page 68.

\[S\] Is an optional suffix. If \(S\) is specified, the condition code flags are updated on the result of the operation, see Conditional execution on page 68.

\[Rd\] Is the destination register. If \(Rd\) is omitted, the destination register is \(Rn\).

\[Rn, Rm\] Are registers holding the values to be multiplied.

\[Ra\] Is a register holding the value to be added or subtracted from.

Operation

The MUL instruction multiplies the values from \(Rn\) and \(Rm\), and places the least significant 32 bits of the result in \(Rd\).

The MLA instruction multiplies the values from \(Rn\) and \(Rm\), adds the value from \(Ra\), and places the least significant 32 bits of the result in \(Rd\).
The MLS instruction multiplies the values from \( R_n \) and \( R_m \), subtracts the product from the value from \( R_a \), and places the least significant 32 bits of the result in \( R_d \).

The results of these instructions do not depend on whether the operands are signed or unsigned.

**Restrictions**

In these instructions, do not use SP and do not use PC.

If the S suffix is used with the MUL instruction:
- \( R_d, R_n, \) and \( R_m \) must all be in the range R0 to R7.
- \( R_d \) must be the same as \( R_m \).
- The \( cond \) suffix must not be used.

**Condition flags**

If S is specified, the MUL instruction:
- Updates the N and Z flags according to the result.
- Does not affect the C and V flags.

**Examples**

- `MUL R10, R2, R5`  ; Multiply, \( R_{10} = R_2 \times R_5 \)
- `MLA R10, R2, R1, R5`  ; Multiply with accumulate, \( R_{10} = (R_2 \times R_1) + R_5 \)
- `MULS R0, R2, R2`  ; Multiply with flag update, \( R_0 = R_2 \times R_2 \)
- `MULLT R2, R3, R2`  ; Conditionally multiply, \( R_2 = R_3 \times R_2 \)
- `MLS R4, R5, R6, R7`  ; Multiply with subtract, \( R_4 = R_7 - (R_5 \times R_6) \)

### 3.6.2 UMULL, UMAAL, UMLAL

Unsigned Long Multiply, with optional Accumulate, using 32-bit operands and producing a 64-bit result.

**Syntax**

\[ op\\{ cond \} \ R_d\Lo, \ R_d\Hi, \ R_n, \ R_m \]

Where:

- \( op \)
  - Is one of:
    - \( \text{UMULL} \) Unsigned Long Multiply.
    - \( \text{UMAAL} \) Unsigned Long Multiply with Accumulate.
    - \( \text{UMLAL} \) Unsigned Long Multiply, with Accumulate.

- \( cond \)
  - Is an optional condition code. See *Conditional execution on page 68*.

- \( R_d\Hi, \ R_d\Lo \)
  - Are the destination registers. For \( \text{UMAAL, UMLAL and UMLAL} \) they also hold the accumulating value.

- \( R_n, R_m \)
  - Are registers holding the first and second operands.
Operation

These instructions interpret the values from \( Rn \) and \( Rm \) as unsigned 32-bit integers. The UMULL instruction:

- Multiplies the two unsigned integers in the first and second operands.
- Writes the least significant 32 bits of the result in \( RdLo \).
- Writes the most significant 32 bits of the result in \( RdHi \).

The UMAAL instruction:

- Multiplies the two unsigned 32-bit integers in the first and second operands.
- Adds the unsigned 32-bit integer in \( RdHi \) to the 64-bit result of the multiplication.
- Adds the unsigned 32-bit integer in \( RdLo \) to the 64-bit result of the addition.
- Writes the top 32-bits of the result to \( RdHi \).
- Writes the lower 32-bits of the result to \( RdLo \).

The UMLAL instruction:

- Multiplies the two unsigned integers in the first and second operands.
- Adds the 64-bit result to the 64-bit unsigned integer contained in \( RdHi \) and \( RdLo \).
- Writes the result back to \( RdHi \) and \( RdLo \).

Restrictions

In these instructions:

- Do not use SP and do not use PC.
- \( RdHi \) and \( RdLo \) must be different registers.

Condition flags

These instructions do not affect the condition code flags.

Examples

```
UMULL   R0, R4, R5, R6   ; Multiplies R5 and R6, writes the top 32 bits to
                   ; R4 and the bottom 32 bits to R0.
UMAAL   R3, R6, R2, R7   ; Multiplies R2 and R7, adds R6, adds R3, writes
                   ; the top 32 bits to R6, and the bottom 32 bits
                   ; to R3.
UMLAL   R2, R1, R3, R5   ; Multiplies R5 and R3, adds R1:R2, writes to
                   ; R1:R2.
```
3.6.3  **SMLA and SMLAW**

Signed Multiply Accumulate (halfwords).

**Syntax**

\[
\begin{align*}
&\text{op}(XY)\{\text{cond}\} \quad \text{Rd}, \quad \text{Rn}, \quad \text{Rm} \\
&\text{op}(Y)\{\text{cond}\} \quad \text{Rd}, \quad \text{Rn}, \quad \text{Rm}, \quad \text{Ra}
\end{align*}
\]

Where:

- **op** Is one of:
  - **SMLA** Signed Multiply Accumulate Long (halfwords)
    - \(X\) and \(Y\) specifies which half of the source registers \(\text{Rn}\) and \(\text{Rm}\) are used as the first and second multiply operand.
    - If \(X\) is \(B\), then the bottom halfword, bits [15:0], of \(\text{Rn}\) is used.
    - If \(X\) is \(T\), then the top halfword, bits [31:16], of \(\text{Rn}\) is used.
    - If \(Y\) is \(B\), then the bottom halfword, bits [15:0], of \(\text{Rm}\) is used.
    - If \(Y\) is \(T\), then the top halfword, bits [31:16], of \(\text{Rm}\) is used.
  - **SMLAW** Signed Multiply Accumulate (word by halfword)
    - \(Y\) specifies which half of the source register \(\text{Rm}\) is used as the second multiply operand.
    - If \(Y\) is \(T\), then the top halfword, bits [31:16] of \(\text{Rm}\) is used.
    - If \(Y\) is \(B\), then the bottom halfword, bits [15:0] of \(\text{Rm}\) is used.

- **cond** Is an optional condition code. See *Conditional execution on page 68*.

- **Rd** Is the destination register. If \(\text{Rd}\) is omitted, the destination register is \(\text{Rn}\).

- **Rn, Rm** Are registers holding the values to be multiplied.

- **Ra** Is a register holding the value to be added or subtracted from.

**Operation**

The **SMALBB, SMLABT, SMLATB, SMLATT** instructions:

- Multiplies the specified signed halfword, top or bottom, values from \(\text{Rn}\) and \(\text{Rm}\).
- Adds the value in \(\text{Ra}\) to the resulting 32-bit product.
- Writes the result of the multiplication and addition in \(\text{Rd}\).

The non-specified halfwords of the source registers are ignored.

The **SMLAWB and SMLAWT** instructions:

- Multiply the 32-bit signed values in \(\text{Rn}\) with:
  - The top signed halfword of \(\text{Rm}\), \(T\) instruction suffix.
  - The bottom signed halfword of \(\text{Rm}\), \(B\) instruction suffix.
- Add the 32-bit signed value in \(\text{Ra}\) to the top 32 bits of the 48-bit product
- Writes the result of the multiplication and addition in \(\text{Rd}\).

The bottom 16 bits of the 48-bit product are ignored.

If overflow occurs during the addition of the accumulate value, the instruction sets the Q flag in the APSR. No overflow can occur during the multiplication.
Restrictions
In these instructions, do not use SP and do not use PC.

Condition flags
If an overflow is detected, the Q flag is set.

Examples

SMLABB R5, R6, R4, R1 ; Multiplies bottom halfwords of R6 and R4, adds ; R1 and writes to R5.
SMLATB R5, R6, R4, R1 ; Multiplies top halfword of R6 with bottom ; halfword of R4, adds R1 and writes to R5.
SMLATT R5, R6, R4, R1 ; Multiplies top halfwords of R6 and R4, adds ; R1 and writes the sum to R5.
SMLABT R5, R6, R4, R1 ; Multiplies bottom halfword of R6 with top ; halfword of R4, adds R1 and writes to R5.
SMLABT R4, R3, R2 ; Multiplies bottom halfword of R6 with top ; halfword of R4, adds R1 and writes to R5.
SMLAWB R10, R2, R5, R3 ; Multiplies R2 with bottom halfword of R5, adds ; R3 to the result and writes top 32-bits to R10.
SMLAHT R10, R2, R1, R5 ; Multiplies R2 with top halfword of R1, adds R5 ; and writes top 32-bits to R10.

3.6.4 SMLAD
Signed Multiply Accumulate Long Dual.

Syntax
op{X}{cond} Rd, Rn, Rm, Ra

Where:
op Is one of:
SMLAD Signed Multiply Accumulate Dual.
SMLADX Signed Multiply Accumulate Dual Reverse.

X specifies which halfword of the source register Rn is used as the multiply operand.
If X is omitted, the multiplications are bottom × bottom and top × top.
If X is present, the multiplications are bottom × top and top × bottom.

cond Is an optional condition code. See Conditional execution on page 68.
Rd Is the destination register.
Rn Is the first operand register holding the values to be multiplied.
Rm Is the second operand register.
Ra Is the accumulate value.
Operation

The SMLAD and SMLADX instructions regard the two operands as four halfword 16-bit values. The SMLAD and SMLADX instructions:

- If X is not present, multiply the top signed halfword value in \( Rn \) with the top signed halfword of \( Rm \) and the bottom signed halfword values in \( Rn \) with the bottom signed halfword of \( Rm \).
- Or if X is present, multiply the top signed halfword value in \( Rn \) with the bottom signed halfword of \( Rm \) and the bottom signed halfword values in \( Rn \) with the top signed halfword of \( Rm \).
- Add both multiplication results to the signed 32-bit value in \( Ra \).
- Writes the 32-bit signed result of the multiplication and addition to \( Rd \).

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not change the flags.

Examples

```
SMLAD  R10, R2, R1, R5 ; Multiplies two halfword values in R2 with
 ; corresponding halfwords in R1, adds R5 and
 ; writes to R10.
SMLADDX R0, R2, R4, R6 ; Multiplies top halfword of R2 with bottom
 ; halfword of R4, multiplies bottom halfword of R2
 ; with top halfword of R4, adds R6 and writes to
 ; R0.
```

3.6.5 SMLAL and SMLALD

Signed Multiply Accumulate Long, Signed Multiply Accumulate Long (halfwords) and Signed Multiply Accumulate Long Dual.

Syntax

```
op{cond} RdLo, RdHi, Rn, Rm
op{XY}{cond} RdLo, RdHi, Rn, Rm
op{X}{cond} RdLo, RdHi, Rn, Rm
```

Where:

- Is one of:
  - SMLAL Signed Multiply Accumulate Long.
  - SMLALXY Signed Multiply Accumulate Long (halfwords, X and Y).
    - X and Y specify which halfword of the source registers \( Rn \) and \( Rm \) are used as the first and second multiply operand:
      - If \( X \) is B, then the bottom halfword, bits [15:0], of \( Rn \) is used.
      - If \( X \) is T, then the top halfword, bits [31:16], of \( Rn \) is used.
      - If \( Y \) is B, then the bottom halfword, bits [15:0], of \( Rm \) is used.
      - If \( Y \) is T, then the top halfword, bits [31:16], of \( Rm \) is used.
**SMLALD**  Signed Multiply Accumulate Long Dual.

**SMLALDX**  Signed Multiply Accumulate Long Dual Reversed.

If the \(x\) is omitted, the multiplications are bottom × bottom and top × top.

If \(x\) is present, the multiplications are bottom × top and top × bottom.

\(cond\) is an optional condition code. See *Conditional execution on page 68*.

\(RdHi, RdLo\) are the destination registers.

\(RdLo\) is the lower 32 bits and \(RdHi\) is the upper 32 bits of the 64-bit integer.

For SMLAL, SMLALBB, SMLALBT, SMLALTB, SMLALTT, SMLALD and SMLALDX, they also hold the accumulating value.

\(Rn, Rm\) are registers holding the first and second operands.

**Operation**

The SMLAL instruction:

- Multiplies the two’s complement signed word values from \(Rn\) and \(Rm\).
- Adds the 64-bit value in \(RdLo\) and \(RdHi\) to the resulting 64-bit product.
- Writes the 64-bit result of the multiplication and addition in \(RdLo\) and \(RdHi\).

The SMLALBB, SMLALBT, SMLALTB and SMLALTT instructions:

- Multiplies the specified signed halfword, Top or Bottom, values from \(Rn\) and \(Rm\).
- Adds the resulting sign-extended 32-bit product to the 64-bit value in \(RdLo\) and \(RdHi\).
- Writes the 64-bit result of the multiplication and addition in \(RdLo\) and \(RdHi\).

The non-specified halfwords of the source registers are ignored.

The SMLALD and SMLALDX instructions interpret the values from \(Rn\) and \(Rm\) as four halfword two’s complement signed 16-bit integers. These instructions:

- If \(X\) is not present, multiply the top signed halfword value of \(Rn\) with the top signed halfword of \(Rm\) and the bottom signed halfword values of \(Rn\) with the bottom signed halfword of \(Rm\).
- Or if \(X\) is present, multiply the top signed halfword value of \(Rn\) with the bottom signed halfword of \(Rm\) and the bottom signed halfword values of \(Rn\) with the top signed halfword of \(Rm\).
- Add the two multiplication results to the signed 64-bit value in \(RdLo\) and \(RdHi\) to create the resulting 64-bit product.
- Write the 64-bit product in \(RdLo\) and \(RdHi\).

**Restrictions**

In these instructions:

- Do not use SP and do not use PC.
- \(RdHi\) and \(RdLo\) must be different registers.

**Condition flags**

These instructions do not affect the condition code flags.

**Examples**

SMLAL  \(R4, R5, R3, R8\)  ; Multiplies R3 and R8, adds R5:R4 and writes
3.6.6 SMLSD and SMLSLD

Signed Multiply Subtract Dual and Signed Multiply Subtract Long Dual.

Syntax

\[
op(X)(\text{cond}) \ Rd, \ Rn, \ Rm, \ Ra
\]

Where:

\( op \)

Is one of:

- \( \text{SMLSD} \) Signed Multiply Subtract Dual.
- \( \text{SMLSDX} \) Signed Multiply Subtract Dual Reversed.
- \( \text{SMLSLD} \) Signed Multiply Subtract Long Dual.
- \( \text{SMLSLDX} \) Signed Multiply Subtract Long Dual Reversed.

If \( X \) is present, the multiplications are bottom × top and top × bottom.
If the \( X \) is omitted, the multiplications are bottom × bottom and top × top.

\( \text{cond} \)

Is an optional condition code. See Conditional execution on page 68.

\( \text{Rd} \)

Is the destination condition register.

\( \text{Rn, Rm} \)

Are registers holding the first and second operands.

\( \text{Ra} \)

Is the register holding the accumulate value.

Operation

The SMLSD instruction interprets the values from the first and second operands as four signed halfwords. This instruction:

- Optionally rotates the halfwords of the second operand.
- Performs two signed 16 × 16-bit halfword multiplications.
- Subtracts the result of the upper halfword multiplication from the result of the lower halfword multiplication.
- Adds the signed accumulate value to the result of the subtraction.
- Writes the result of the addition to the destination register.
The Cortex-M7 instruction set

The SMLSLD instruction interprets the values from $Rn$ and $Rm$ as four signed halfwords. This instruction:

- Optionally rotates the halfwords of the second operand.
- Performs two signed $16 \times 16$-bit halfword multiplications.
- Subtracts the result of the upper halfword multiplication from the result of the lower halfword multiplication.
- Adds the 64-bit value in $RdHi$ and $RdLo$ to the result of the subtraction.
- Writes the 64-bit result of the addition to the $RdHi$ and $RdLo$.

**Restrictions**

In these instructions:

- Do not use SP and do not use PC.

**Condition flags**

This instruction sets the Q flag if the accumulate operation overflows. Overflow cannot occur during the multiplications or subtraction.

For the Thumb instruction set, these instructions do not affect the condition code flags.

**Examples**

<table>
<thead>
<tr>
<th>SMLSD</th>
<th>R0, R4, R5, R6</th>
<th>Multiplies bottom halfword of R4 with bottom halfword of R5, multiplies top halfword of R4 with top halfword of R5, substracts second from first, adds R6, writes to R0.</th>
</tr>
</thead>
<tbody>
<tr>
<td>SMLSDX</td>
<td>R1, R3, R2, R0</td>
<td>Multiplies bottom halfword of R3 with top halfword of R2, multiplies top halfword of R3 with bottom halfword of R2, substracts second from first, adds R0, writes to R1.</td>
</tr>
<tr>
<td>SMLSLD</td>
<td>R3, R6, R2, R7</td>
<td>Multiplies bottom halfword of R6 with bottom halfword of R2, multiplies top halfword of R6 with top halfword of R2, substracts second first, adds R6:R3, writes to R6:R3.</td>
</tr>
<tr>
<td>SMLSLDX</td>
<td>R3, R6, R2, R7</td>
<td>Multiplies bottom halfword of R6 with top halfword of R2, multiplies top halfword of R6 with bottom halfword of R2, substracts second from first, adds R6:R3, writes to R6:R3.</td>
</tr>
</tbody>
</table>
3.6.7 SMMLA and SMMLS

Signed Most Significant Word Multiply Accumulate and Signed Most Significant Word Multiply Subtract.

Syntax

\[ \text{op}(R)(\text{cond}) \text{ Rd, Rn, Rm, Ra} \]

Where:

- \( \text{op} \) is one of:
  - \( \text{SMMLA} \) Signed Most Significant Word Multiply Accumulate.
  - \( \text{SMMLS} \) Signed Most Significant Word Multiply Subtract.
- \( R \) is a rounding error flag. If \( R \) is specified, the result is rounded instead of being truncated. In this case the constant 0x80000000 is added to the product before the high word is extracted.
- \( \text{cond} \) is an optional condition code. See Conditional execution on page 68.
- \( \text{Rd} \) is the destination register.
- \( \text{Rn}, \text{Rm} \) are registers holding the first and second multiply operands.
- \( \text{Ra} \) is the register holding the accumulate value.

Operation

The SMMLA instruction interprets the values from \( \text{Rn} \) and \( \text{Rm} \) as signed 32-bit words. The SMMLA instruction:

- Multiples the values in \( \text{Rn} \) and \( \text{Rm} \).
- Optionally rounds the result by adding 0x80000000.
- Extracts the most significant 32 bits of the result.
- Adds the value of \( \text{Ra} \) to the signed extracted value.
- Writes the result of the addition in \( \text{Rd} \).

The SMMLS instruction interprets the values from \( \text{Rn} \) and \( \text{Rm} \) as signed 32-bit words. The SMMLS instruction:

- Multiples the values in \( \text{Rn} \) and \( \text{Rm} \).
- Optionally rounds the result by adding 0x80000000.
- Extracts the most significant 32 bits of the result.
- Subtracts the extracted value of the result from the value in \( \text{Ra} \).
- Writes the result of the subtraction in \( \text{Rd} \).

Restrictions

In these instructions:

- Do not use SP and do not use PC.

Condition flags

These instructions do not affect the condition code flags.
3.6.8 SMMUL

Signed Most Significant Word Multiply.

Syntax

op{R}{cond} Rd, Rn, Rm

Where:

- op Is one of:
  - SMMUL  Signed Most Significant Word Multiply
- R Is a rounding error flag. If R is specified, the result is rounded instead of being truncated. In this case the constant 0x80000000 is added to the product before the high word is extracted.
- cond Is an optional condition code. See Conditional execution on page 68.
- Rd Is the destination register.
- Rn, Rm Are registers holding the first and second operands.

Operation

The SMMUL instruction interprets the values from Rn and Rm as two’s complement 32-bit signed integers. The SMMUL instruction:

- Multiplies the values from Rn and Rm.
- Optionally rounds the result, otherwise truncates the result.
- Writes the most significant signed 32 bits of the result in Rd.

Restrictions

In this instruction:

- Do not use SP and do not use PC.

Condition flags

This instruction does not affect the condition code flags.

Examples

```assembly
SMULL R0, R4, R5 ; Multiplies R4 and R5, truncates top 32 bits and writes to R0.
SMULLR R6, R2 ; Multiplies R6 and R2, rounds the top 32 bits and writes to R6.
```
3.6.9 SMUAD and SMUSD

Signed Dual Multiply Add and Signed Dual Multiply Subtract.

Syntax

\[ \text{op}(X)(\text{cond}) \quad \text{Rd}, \text{Rn}, \text{Rm} \]

Where:

- \text{op} is one of:
  - \text{SMUAD} Signed Dual Multiply Add.
  - \text{SMUADX} Signed Dual Multiply Add Reversed.
  - \text{SMUSD} Signed Dual Multiply Subtract.
  - \text{SMUSDX} Signed Dual Multiply Subtract Reversed.

- If \( X \) is present, the multiplications are bottom \( \times \) top and top \( \times \) bottom.
  - If the \( X \) is omitted, the multiplications are bottom \( \times \) bottom and top \( \times \) top.

- \text{cond} is an optional condition code. See Conditional execution on page 68.

- \text{Rd} is the destination register.

- \text{Rn, Rm} are registers holding the first and the second operands.

Operation

The SMUAD instruction interprets the values from the first and second operands as two signed halfwords in each operand. This instruction:

- Optionally rotates the halfwords of the second operand.
- Performs two signed 16 \( \times \) 16-bit multiplications.
- Adds the two multiplication results together.
- Writes the result of the addition to the destination register.

The SMUSD instruction interprets the values from the first and second operands as two’s complement signed integers. This instruction:

- Optionally rotates the halfwords of the second operand.
- Performs two signed 16 \( \times \) 16-bit multiplications.
- Subtracts the result of the top halfword multiplication from the result of the bottom halfword multiplication.
- Writes the result of the subtraction to the destination register.

Restrictions

In these instructions:

- Do not use SP and do not use PC.

Condition flags

Sets the Q flag if the addition overflows. The multiplications cannot overflow.

Examples

\[ \text{SMUAD} \quad \text{R0, R4, R5} \quad ; \text{Multiplies bottom halfword of R4 with the} \]

\[ \text{; bottom halfword of R5, adds multiplication of} \]


3.6.10 SMUL and SMULW

Signed Multiply (halfwords) and Signed Multiply (word by halfword).

Syntax

\[ \text{op} \{XY\} \{\text{cond}\} \text{Rd}, \text{Rn}, \text{Rm} \]
\[ \text{op} \{Y\} \{\text{cond}\} \text{Rd}, \text{Rn}, \text{Rm} \]

For SMULXY only:

op Is one of:

- \( \text{SMUL}\{XY\} \) Signed Multiply (halfwords)
  - \( X \) and \( Y \) specify which halfword of the source registers \( Rn \) and \( Rm \) is used as the first and second multiply operand.
  - If \( X \) is \( B \), then the bottom halfword, bits [15:0] of \( Rn \) is used.
  - If \( X \) is \( T \), then the top halfword, bits [31:16] of \( Rn \) is used.
  - If \( Y \) is \( B \), then the bottom halfword, bits [15:0], of \( Rm \) is used.
  - If \( Y \) is \( T \), then the top halfword, bits [31:16], of \( Rm \) is used.

- \( \text{SMUL}\{Y\} \) Signed Multiply (word by halfword)
  - \( Y \) specifies which halfword of the source register \( Rm \) is used as the second multiply operand.
  - If \( Y \) is \( B \), then the bottom halfword (bits [15:0]) of \( Rm \) is used.
  - If \( Y \) is \( T \), then the top halfword (bits [31:16]) of \( Rm \) is used.

cond Is an optional condition code. See Conditional execution on page 68.

Rd Is the destination register.

Rn, Rm Are registers holding the first and second operands.

Operation

The SMULBB, SMULTB, SMULBT and SMULTT instructions interprets the values from \( Rn \) and \( Rm \) as four signed 16-bit integers. These instructions:

- Multiply the specified signed halfword, Top or Bottom, values from \( Rn \) and \( Rm \).
- Write the 32-bit result of the multiplication in \( Rd \).
The SMULWT and SMULWB instructions interpret the values from $Rn$ as a 32-bit signed integer and $Rm$ as two halfword 16-bit signed integers. These instructions:

- Multiply the first operand and the top, T suffix, or the bottom, B suffix, halfword of the second operand.
- Write the signed most significant 32 bits of the 48-bit result in the destination register.

**Restrictions**

In these instructions:

- Do not use SP and do not use PC.
- $RdHi$ and $RdLo$ must be different registers.

**Examples**

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Operands</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>SMULBT</td>
<td>R0, R4, R5</td>
<td>Multiplies the bottom halfword of R4 with the top halfword of R5, multiplies results and writes to R0.</td>
</tr>
<tr>
<td>SMULBB</td>
<td>R0, R4, R5</td>
<td>Multiplies the bottom halfword of R4 with the bottom halfword of R5, multiplies results and writes to R0.</td>
</tr>
<tr>
<td>SMULTT</td>
<td>R0, R4, R5</td>
<td>Multiplies the top halfword of R4 with the top halfword of R5, multiplies results and writes to R0.</td>
</tr>
<tr>
<td>SMULTB</td>
<td>R0, R4, R5</td>
<td>Multiplies the top halfword of R4 with the bottom halfword of R5, multiplies results and writes to R0.</td>
</tr>
<tr>
<td>SMULWT</td>
<td>R4, R5, R3</td>
<td>Multiplies R5 with the top halfword of R3, extracts top 32 bits and writes to R4.</td>
</tr>
<tr>
<td>SMULWB</td>
<td>R4, R5, R3</td>
<td>Multiplies R5 with the bottom halfword of R3, extracts top 32 bits and writes to R4.</td>
</tr>
</tbody>
</table>
3.6.11 **UMULL, UMLAL, SMULL, and SMLAL**

Signed and Unsigned Long Multiply, with optional Accumulate, using 32-bit operands and producing a 64-bit result.

**Syntax**

\[ \text{op(\text{cond}) RdLo, RdHi, Rn, Rm} \]

Where:

- **op** Is one of:
  - `UMULL` Unsigned Long Multiply.
  - `UMLAL` Unsigned Long Multiply, with Accumulate.
  - `SMULL` Signed Long Multiply.
  - `SMLAL` Signed Long Multiply, with Accumulate.

- **cond** Is an optional condition code. See *Conditional execution on page 68*.

**RdHi, RdLo** Are the destination registers. For `UMLAL` and `SMLAL` they also hold the accumulating value.

**Rn, Rm** Are registers holding the operands.

**Operation**

The UMULL instruction interprets the values from `Rn` and `Rm` as unsigned integers. It multiplies these integers and places the least significant 32 bits of the result in `RdLo`, and the most significant 32 bits of the result in `RdHi`.

The UMLAL instruction interprets the values from `Rn` and `Rm` as unsigned integers. It multiplies these integers, adds the 64-bit result to the 64-bit unsigned integer contained in `RdHi` and `RdLo`, and writes the result back to `RdHi` and `RdLo`.

The SMULL instruction interprets the values from `Rn` and `Rm` as two’s complement signed integers. It multiplies these integers and places the least significant 32 bits of the result in `RdLo`, and the most significant 32 bits of the result in `RdHi`.

The SMLAL instruction interprets the values from `Rn` and `Rm` as two’s complement signed integers. It multiplies these integers, adds the 64-bit result to the 64-bit signed integer contained in `RdHi` and `RdLo`, and writes the result back to `RdHi` and `RdLo`.

**Restrictions**

In these instructions:

- Do not use SP and do not use PC.
- `RdHi` and `RdLo` must be different registers.

**Condition flags**

These instructions do not affect the condition code flags.

**Examples**

- **UMULL** \( R0, R4, R5, R6 \) ; Unsigned \((R4,R0) = R5 \times R6\)
- **SMLAL** \( R4, R5, R3, R8 \) ; Signed \((R5,R4) = (R5,R4) + R3 \times R8\)
3.6.12 SDIV and UDIV

Signed Divide and Unsigned Divide.

Syntax

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Syntax</th>
</tr>
</thead>
<tbody>
<tr>
<td>SDIV</td>
<td>SDIV(cond) (Rd,) Rn, Rm</td>
</tr>
<tr>
<td>UDIV</td>
<td>UDIV(cond) (Rd,) Rn, Rm</td>
</tr>
</tbody>
</table>

Where:

- **cond** is an optional condition code. See *Conditional execution on page 68*.
- **Rd** is the destination register. If **Rd** is omitted, the destination register is **Rn**.
- **Rn** is the register holding the value to be divided.
- **Rm** is a register holding the divisor.

Operation

The SDIV instruction performs a signed integer division of the value in **Rn** by the value in **Rm**.

The UDIV instruction performs an unsigned integer division of the value in **Rn** by the value in **Rm**.

For both instructions, if the value in **Rn** is not divisible by the value in **Rm**, the result is rounded towards zero.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not change the flags.

Examples

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Syntax</th>
</tr>
</thead>
<tbody>
<tr>
<td>SDIV</td>
<td>SDIV R0, R2, R4 ; Signed divide, R0 = R2/R4</td>
</tr>
<tr>
<td>UDIV</td>
<td>UDIV R8, R8, R1 ; Unsigned divide, R8 = R8/R1</td>
</tr>
</tbody>
</table>
3.7 Saturating instructions

*Table 32* shows the saturating instructions:

### Table 32. Saturating instructions

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>SSAT</td>
<td>Signed Saturate</td>
<td>SSAT and USAT on page 129</td>
</tr>
<tr>
<td>SSAT16</td>
<td>Signed Saturate Halfword</td>
<td>SSAT16 and USAT16 on page 130</td>
</tr>
<tr>
<td>USAT</td>
<td>Unsigned Saturate</td>
<td>SSAT and USAT on page 129</td>
</tr>
<tr>
<td>USAT16</td>
<td>Unsigned Saturate Halfword</td>
<td>SSAT16 and USAT16 on page 130</td>
</tr>
<tr>
<td>QADD</td>
<td>Saturating Add</td>
<td>QADD and QSUB on page 131</td>
</tr>
<tr>
<td>QSUB</td>
<td>Saturating Subtract</td>
<td>QADD and QSUB on page 131</td>
</tr>
<tr>
<td>QSUB16</td>
<td>Saturating Subtract 16</td>
<td>QADD and QSUB on page 131</td>
</tr>
<tr>
<td>QASX</td>
<td>Saturating Add and Subtract with Exchange</td>
<td>QASX and QSAX on page 132</td>
</tr>
<tr>
<td>QSAX</td>
<td>Saturating Subtract and Add with Exchange</td>
<td>QASX and QSAX on page 132</td>
</tr>
<tr>
<td>QDADD</td>
<td>Saturating Double and Add</td>
<td>QDADD and QDSUB on page 133</td>
</tr>
<tr>
<td>QDSUB</td>
<td>Saturating Double and Subtract</td>
<td>QDADD and QDSUB on page 133</td>
</tr>
<tr>
<td>UQADD16</td>
<td>Unsigned Saturating Add 16</td>
<td>UQADD and UQSUB on page 136</td>
</tr>
<tr>
<td>UQADD8</td>
<td>Unsigned Saturating Add 8</td>
<td>UQADD and UQSUB on page 136</td>
</tr>
<tr>
<td>UQASX</td>
<td>Unsigned Saturating Add and Subtract with Exchange</td>
<td>UQASX and UQSAX on page 134</td>
</tr>
<tr>
<td>UQSAX</td>
<td>Unsigned Saturating Subtract and Add with Exchange</td>
<td>UQASX and UQSAX on page 134</td>
</tr>
<tr>
<td>UQSUB16</td>
<td>Unsigned Saturating Subtract 16</td>
<td>UQADD and UQSUB on page 136</td>
</tr>
<tr>
<td>UQSUB8</td>
<td>Unsigned Saturating Subtract 8</td>
<td>UQADD and UQSUB on page 136</td>
</tr>
</tbody>
</table>

For signed \(n\)-bit saturation, this means that:
- If the value to be saturated is less than \(-2^{n-1}\), the result returned is \(-2^{n-1}\)
- If the value to be saturated is greater than \(2^{n-1}-1\), the result returned is \(2^{n-1}-1\)
- Otherwise, the result returned is the same as the value to be saturated.

For unsigned \(n\)-bit saturation, this means that:
- If the value to be saturated is less than 0, the result returned is 0
- If the value to be saturated is greater than \(2^n-1\), the result returned is \(2^n-1\)
- Otherwise, the result returned is the same as the value to be saturated.

If the returned result is different from the value to be saturated, it is called *saturation*. If the saturation occurs, the instruction sets the Q flag to 1 in the APSR. Otherwise, it leaves the Q flag unchanged. To clear the Q flag to 0, the MSR instruction must be used, see *MSR on page 179*.

To read the state of the Q flag, use the MRS instruction, see *MRS on page 178*. 
3.7.1 SSAT and USAT

Signed Saturate and Unsigned Saturate to any bit position, with optional shift before saturating.

Syntax

\[ \text{op}(\text{cond}) \text{ Rd, } \#n, \text{ Rm (, shift } \#s) \]

Where:

- **op** is one of:
  - SSAT: Saturates a signed value to a signed range.
  - USAT: Saturates a signed value to an unsigned range.

- **cond** is an optional condition code. See *Conditional execution on page 68*.

- **Rd** is the destination register.

- **n** specifies the bit position to saturate to:
  - \( n \) ranges from 1 to 32 for SSAT.
  - \( n \) ranges from 0 to 31 for USAT.

- **Rm** is the register containing the value to saturate.

- **shift \#s** is an optional shift applied to \( Rm \) before saturating. It must be one of the following:
  - ASR \( \#s \) where \( s \) is in the range 1 to 31.
  - LSL \( \#s \) where \( s \) is in the range 0 to 31.

Operation

These instructions saturate to a signed or unsigned \( n \)-bit value.

The SSAT instruction applies the specified shift, then saturates to the signed range \( -2^{n-1} \leq x \leq 2^{n-1} - 1 \).

The USAT instruction applies the specified shift, then saturates to the unsigned range \( 0 \leq x \leq 2^{n-1} \).

Restrictions

- Do not use SP and do not use PC.

Condition flags

These instructions do not affect the condition code flags.

If saturation occurs, these instructions set the Q flag to 1.

Examples

```
SSAT    R7, #16, R7, LSL #4 ; Logical shift left value in R7 by 4,
      ; then saturate it as a signed 16-bit
      ; value and write it back to R7.
USATNE  R0, #7, R5         ; Conditionally saturate value in R5 as a
      ; an unsigned 7 bit value and write it to
      ; R0.
```
3.7.2 SSAT16 and USAT16

Signed Saturate and Unsigned Saturate to any bit position for two halfwords.

Syntax

\[ \text{op}(\text{cond}) \text{ Rd, } \#n, \text{ Rm} \]

Where:

- \( \text{op} \) is one of:
  - \( \text{SSAT16} \) Saturates a signed halfword value to a signed range.
  - \( \text{USAT16} \) Saturates a signed halfword value to an unsigned range.

- \( \text{cond} \) is an optional condition code. See Conditional execution on page 68.

- \( \text{Rd} \) is the destination register.

- \( n \) specifies the bit position to saturate to:
  - \( n \) ranges from 1 to 16 for \( \text{SSAT} \).
  - \( n \) ranges from 0 to 15 for \( \text{USAT} \).

- \( \text{Rm} \) is the register containing the value to saturate.

Operation

The \( \text{SSAT16} \) instruction:

1. Saturates two signed 16-bit halfword values of the register with the value to saturate from selected by the bit position in \( n \).
2. Writes the results as two signed 16-bit halfwords to the destination register.

The \( \text{USAT16} \) instruction:

1. Saturates two unsigned 16-bit halfword values of the register with the value to saturate from selected by the bit position in \( n \).
2. Writes the results as two unsigned halfwords in the destination register.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not affect the condition code flags. If saturation occurs, these instructions set the Q flag to 1.

Examples

\[ \text{SSAT16} \text{ R7, } \#9, \text{ R2} \]

; Saturates the top and bottom highwords of R2 as 9-bit values, writes to corresponding halfword of R7.

\[ \text{USAT16NE} \text{ R0, } \#13, \text{ R5} \]

; Conditionally saturates the top and bottom halfwords of R5 as 13-bit values, writes to corresponding halfword of R0.
3.7.3 **QADD and QSUB**

Saturating Add and Saturating Subtract, signed.

**Syntax**

\[
\text{op}(\text{cond}) \{\text{Rd}\}, \text{Rn}, \text{Rm}
\]

Where:

- \( \text{op} \) is one of:
  - QADD: Saturating 32-bit add.
  - QADD8: Saturating four 8-bit integer additions.
  - QADD16: Saturating two 16-bit integer additions.
  - QSUB: Saturating 32-bit subtraction.
  - QSUB8: Saturating four 8-bit integer subtraction.
  - QSUB16: Saturating two 16-bit integer subtraction.

- \( \text{cond} \) is an optional condition code. See *Conditional execution on page 68*.

- \( \text{Rd} \) is the destination register.

- \( \text{Rn}, \text{Rm} \) are registers holding the first and second operands.

**Operation**

These instructions add or subtract two, four or eight values from the first and second operands and then writes a signed saturated value in the destination register.

The QADD and QSUB instructions apply the specified add or subtract, and then saturate the result to the signed range \(-2^{n-1} \leq x \leq 2^{n-1} - 1\), where \( x \) is given by the number of bits applied in the instruction, 32, 16 or 8.

If the returned result is different from the value to be saturated, it is called *saturation*. If saturation occurs, the QADD and QSUB instructions set the Q flag to 1 in the APSR. Otherwise, it leaves the Q flag unchanged. The 8-bit and 16-bit QADD and QSUB instructions always leave the Q flag unchanged.

To clear the Q flag to 0, the MSR instruction must be used, see *MSR on page 179*.

To read the state of the Q flag, use the MRS instruction, see *MRS on page 178*.

**Restrictions**

Do not use SP and do not use PC.

**Condition flags**

These instructions do not affect the condition code flags.

If saturation occurs, these instructions set the Q flag to 1.

**Examples**

QADD16  R7, R4, R2  ; Adds halfwords of R4 with corresponding halfword of R2, saturates to 16 bits and writes to corresponding halfword of R7.
QADD8  R3, R1, R6  ; Adds bytes of R1 to the corresponding bytes of R6, 
; saturates to 8 bits and writes to corresponding 
; byte of R3.

QSUB16 R4, R2, R3  ; Subtracts halfwords of R3 from corresponding 
; halfword of R2, saturates to 16 bits, writes to 
; corresponding halfword of R4.

QSUB8  R4, R2, R5  ; Subtracts bytes of R5 from the corresponding byte 
; in R2, saturates to 8 bits, writes to corresponding 
; byte of R4.

3.7.4 QASX and QSAX

Saturating Add and Subtract with Exchange and Saturating Subtract and Add with Exchange, signed.

Syntax

\[
\text{op}\{\text{cond}\}\{\text{Rd}\}, \text{Rm}, \text{Rn}
\]

Where:

- \text{op}  
  Is one of:
  - \text{QASX}  Add and Subtract with Exchange and Saturate.
  - \text{QSAX}  Subtract and Add with Exchange and Saturate.

- \text{cond}  
  Is an optional condition code. See Conditional execution on page 68.

- \text{Rd}  
  Is the destination register.

- \text{Rn}, \text{Rm}  
  Are registers holding the first and second operands.

Operation

The QASX instruction:

1. Adds the top halfword of the source operand with the bottom halfword of the second operand.
2. Subtracts the top halfword of the second operand from the bottom highword of the first operand.
3. Saturates the result of the subtraction and writes a 16-bit signed integer in the range \( -2^{15} \leq x \leq 2^{15} - 1 \), where \( x \) equals 16, to the bottom halfword of the destination register.
4. Saturates the results of the sum and writes a 16-bit signed integer in the range \( -2^{15} \leq x \leq 2^{15} - 1 \), where \( x \) equals 16, to the top halfword of the destination register.

The QSAX instruction:

1. Subtracts the bottom halfword of the second operand from the top highword of the first operand.
2. Adds the bottom halfword of the source operand with the top halfword of the second operand.
3. Saturates the results of the sum and writes a 16-bit signed integer in the range \( -2^{15} \leq x \leq 2^{15} - 1 \), where \( x \) equals 16, to the bottom halfword of the destination register.
4. Saturates the result of the subtraction and writes a 16-bit signed integer in the range \( -2^{15} \leq x \leq 2^{15} - 1 \), where \( x \) equals 16, to the top halfword of the destination register.

Restrictions

Do not use SP and do not use PC.
Condition flags
These instructions do not affect the condition code flags.

Examples
QASX R7, R4, R2 ; Adds top halfword of R4 to bottom halfword of R2,
; saturates to 16 bits, writes to top halfword of
; R7, Subtracts top highword of R2 from bottom
; halfword of R4, saturates to 16 bits and writes
; to bottom halfword of R7
QSAX R0, R3, R5 ; Subtracts bottom halfword of R5 from top halfword
; of R3, saturates to 16 bits, writes to top
; halfword of R0
; Adds bottom halfword of R3 to top halfword of R5,
; saturates to 16 bits, writes to bottom halfword
; of R0.

3.7.5 QDADD and QDSUB
Saturating Double and Add and Saturating Double and Subtract, signed.

Syntax
\( op\{cond\} \{Rd\}, Rm, Rn \)

Where:
\( op \) Is one of:
- QDADD Saturating Double and Add.
- QDSUB Saturating Double and Subtract.
\( cond \) Is an optional condition code. See Conditional execution on page 68.
\( Rd \) Is the destination register.
\( Rm, Rn \) Are registers holding the first and second operands.

Operation
The QDADD instruction:
- Doubles the second operand value.
- Adds the result of the doubling to the signed saturated value in the first operand.
- Writes the result to the destination register.

The QDSUB instruction:
- Doubles the second operand value.
- Subtracts the doubled value from the signed saturated value in the first operand.
- Writes the result to the destination register.

Both the doubling and the addition or subtraction have their results saturated to the 32-bit signed integer range \(-2^{31} \leq x \leq 2^{31} - 1\). If saturation occurs in either operation, it sets the Q flag in the APSR.

Restrictions
Do not use SP and do not use PC.
Condition flags
If saturation occurs, these instructions set the Q flag to 1.

Examples
QDADD R7, R4, R2 ; Doubles and saturates R4 to 32 bits, adds R2,
                   ; Saturates to 32 bits, writes to R7
QDSUB  R0, R3, R5 ; Subtracts R3 doubled and saturated to 32 bits
                   ; from R5, saturates to 32 bits, writes to R0.

3.7.6 UQASX and UQSAX
Saturating Add and Subtract with Exchange and Saturating Subtract and Add with
Exchange, unsigned.

Syntax
op{cond} {Rd}, Rm, Rn
Where:
type Is one of:
UQASX Add and Subtract with Exchange and Saturate.
UQSAX Subtract and Add with Exchange and Saturate.
cond Is an optional condition code. See Conditional execution on page 68.
Rd Is the destination register.
Rn, Rm Are registers holding the first and second operands.

Operation
The UQASX instruction:
1. Adds the bottom halfword of the source operand with the top halfword of the second
   operand.
2. Subtracts the bottom halfword of the second operand from the top highword of the first
   operand.
3. Saturates the results of the sum and writes a 16-bit unsigned integer in the range
   \(0 \leq x \leq 2^{16} - 1\), where \(x\) equals 16, to the top halfword of the destination register.
4. Saturates the result of the subtraction and writes a 16-bit unsigned integer in the range
   \(0 \leq x \leq 2^{16} - 1\), where \(x\) equals 16, to the bottom halfword of the destination register.

The UQSAX instruction:
1. Subtracts the bottom halfword of the second operand from the top highword of the first
   operand.
2. Adds the bottom halfword of the first operand with the top halfword of the second
   operand.
3. Saturates the result of the subtraction and writes a 16-bit unsigned integer in the range
   \(0 \leq x \leq 2^{16} - 1\), where \(x\) equals 16, to the top halfword of the destination register.
4. Saturates the results of the addition and writes a 16-bit unsigned integer in the range \(0
   \leq x \leq 2^{16} - 1\), where \(x\) equals 16, to the bottom halfword of the destination register.
Restrictions
Do not use SP and do not use PC.

Condition flags
These instructions do not affect the condition code flags.

Examples

UQASX  R7, R4, R2  ; Adds top halfword of R4 with bottom halfword of R2,  
; saturates to 16 bits, writes to top halfword of R7  
; Subtracts top halfword of R2 from bottom halfword of  
; R4, saturates to 16 bits, writes to bottom halfword  
; of R7  
UQSAX  R0, R3, R5  ; Subtracts bottom halfword of R5 from top halfword of  
; R3, saturates to 16 bits, writes to top halfword of  
; R0  
; Adds bottom halfword of R4 to top halfword of R5  
; saturates to 16 bits, writes to bottom halfword of  
; R0.
3.7.7 UQADD and UQSUB

Saturating Add and Saturating Subtract Unsigned.

Syntax

\[
\text{op}\{\text{cond}\} \{\text{Rd}\}, \text{Rn}, \text{Rm}
\]

Where:

- **op** Is one of:
  - UQADD8 Saturating four unsigned 8-bit integer additions.
  - UQADD16 Saturating two unsigned 16-bit integer additions.
  - UDSUB8 Saturating four unsigned 8-bit integer subtractions.
  - UQSUB16 Saturating two unsigned 16-bit integer subtractions.

- **cond** Is an optional condition code. See Conditional execution on page 68.

- **Rd** Is the destination register.

- **Rn, Rm** Are registers holding the first and second operands.

Operation

These instructions add or subtract two or four values and then writes an unsigned saturated value in the destination register.

The UQADD16 instruction:

- Adds the respective top and bottom halfwords of the first and second operands.
- Saturates the result of the additions for each halfword in the destination register to the unsigned range \(0 \leq x \leq 2^{16}-1\), where \(x\) is 16.

The UQADD8 instruction:

- Adds each respective byte of the first and second operands.
- Saturates the result of the addition for each byte in the destination register to the unsigned range \(0 \leq x \leq 2^8-1\), where \(x\) is 8.

The UQSUB16 instruction:

- Subtracts both halfwords of the second operand from the respective halfwords of the first operand.
- Saturates the result of the differences in the destination register to the unsigned range \(0 \leq x \leq 2^{16}-1\), where \(x\) is 16.

The UQSUB8 instructions:

- Subtracts the respective bytes of the second operand from the respective bytes of the first operand.
- Saturates the results of the differences for each byte in the destination register to the unsigned range \(0 \leq x \leq 2^8-1\), where \(x\) is 8.

Restrictions

Do not use SP and do not use PC.
Condition flags

These instructions do not affect the condition code flags.

Examples

UQADD16 R7, R4, R2 ; Adds halfwords in R4 to corresponding halfword in R2, saturates to 16 bits, writes to corresponding halfword of R7
UQADD8 R4, R2, R5 ; Adds bytes of R2 to corresponding byte of R5, saturates to 8 bits, writes to corresponding bytes of R4
UQSUB16 R6, R3, R0 ; Subtracts halfwords in R0 from corresponding halfword in R3, saturates to 16 bits, writes to corresponding halfword in R6
UQSUB8 R1, R5, R6 ; Subtracts bytes in R6 from corresponding byte of R5, saturates to 8 bits, writes to corresponding byte of R1.

3.8 Packing and unpacking instructions

Table 33 shows the instructions that operate on packing and unpacking data:

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>PKH</td>
<td>Pack Halfword</td>
<td>PKHBT and PKHTB on page 138</td>
</tr>
<tr>
<td>SXTAB</td>
<td>Extend 8 bits to 32 and add</td>
<td>SXTA and UXTA on page 140</td>
</tr>
<tr>
<td>SXTAB16</td>
<td>Dual extend 8 bits to 16 and add</td>
<td>SXTA and UXTA on page 140</td>
</tr>
<tr>
<td>SXTAH</td>
<td>Extend 16 bits to 32 and add</td>
<td>SXTA and UXTA on page 140</td>
</tr>
<tr>
<td>SXTB</td>
<td>Sign extend a byte</td>
<td>SXT and UXT on page 144</td>
</tr>
<tr>
<td>SXTB16</td>
<td>Dual extend 8 bits to 16 and add</td>
<td>SXT and UXT on page 144</td>
</tr>
<tr>
<td>SXTH</td>
<td>Sign extend a halfword</td>
<td>SXT and UXT on page 144</td>
</tr>
<tr>
<td>UXTAB</td>
<td>Extend 8 bits to 32 and add</td>
<td>SXTA and UXTA on page 140</td>
</tr>
<tr>
<td>UXTAB16</td>
<td>Dual extend 8 bits to 16 and add</td>
<td>SXTA and UXTA on page 140</td>
</tr>
<tr>
<td>UXTAH</td>
<td>Extend 16 bits to 32 and add</td>
<td>SXTA and UXTA on page 140</td>
</tr>
<tr>
<td>UXTB</td>
<td>Zero extend a byte</td>
<td>SXT and UXT on page 144</td>
</tr>
<tr>
<td>UXTB16</td>
<td>Dual zero extend 8 bits to 16 and add</td>
<td>SXT and UXT on page 144</td>
</tr>
<tr>
<td>UXTH</td>
<td>Zero extend a halfword</td>
<td>SXT and UXT on page 144</td>
</tr>
</tbody>
</table>
3.8.1 PKHBT and PKHTB

Pack Halfword.

Syntax

\[ \text{op(condition) } \{ \text{Rd}, \text{Rn}, \text{Rm } \{, \text{LSL } \#imm} \} \]

Where:

- \text{op} is one of:
  - PKHBT: Pack Halfword, bottom and top with shift.
  - PKHTB: Pack Halfword, top and bottom with shift.
- \text{cond} is an optional condition code. See Conditional execution on page 68.
- \text{Rd} is the destination register.
- \text{Rn} is the first operand register.
- \text{Rm} is the second operand register holding the value to be optionally shifted.
- \text{imm} is the shift length. The type of shift length depends on the instruction:
  - For PKHBT:
    - LSL: A left shift with a shift length from 1 to 31, 0 means no shift.
  - For PKHTB:
    - ASR: An arithmetic shift right with a shift length from 1 to 32, a shift of 32-bits is encoded as 0b00000.

Operation

The PKHBT instruction:
1. Writes the value of the bottom halfword of the first operand to the bottom halfword of the destination register.
2. If shifted, the shifted value of the second operand is written to the top halfword of the destination register.

The PKHTB instruction:
1. Writes the value of the top halfword of the first operand to the top halfword of the destination register.
2. If shifted, the shifted value of the second operand is written to the bottom halfword of the destination register.

Restrictions

- \text{Rd} must not be SP and must not be PC.

Condition flags

This instruction does not change the flags.
Examples

PKHBT    R3, R4, R5 LSL #0  ; Writes bottom halfword of R4 to bottom
        ; halfword of R3, writes top halfword of R5,
        ; unshifted, to top halfword of R3
PKHTB    R4, R0, R2 ASR #1  ; Writes R2 shifted right by 1 bit to bottom
        ; halfword of R4, and writes top halfword of R0
        ; to top halfword of R4.

3.8.2  SXT and UXT

Sign extend and Zero extend.

Syntax

\[ \text{op}\{\text{cond}\}\ {Rd}\ {,}\ Rm\ {,}\ \text{ROR}\ #n} \]

Where:

- **op**
  - Is one of:
    - SXTB  Sign extends an 8-bit value to a 32-bit value.
    - SXTH  Sign extends a 16-bit value to a 32-bit value.
    - SXTB16 Sign extends two 8-bit values to two 16-bit values.
    - UXTB  Zero extends an 8-bit value to a 32-bit value.
    - UXTH  Zero extends a 16-bit value to a 32-bit value.
    - UXTB16 Zero extends two 8-bit values to two 16-bit values.

- **cond**
  - Is an optional condition code. See Conditional execution on page 68.

- **Rd**
  - Is the destination register.

- **Rm**
  - Is the register holding the value to extend.

- **ROR #n**
  - Is one of:
    - ROR #8 Value from \( Rm \) is rotated right 8 bits.
    - ROR #16 Value from \( Rm \) is rotated right 16 bits.
    - ROR #24 Value from \( Rm \) is rotated right 24 bits.
  - If ROR \( #n \) is omitted, no rotation is performed.

Operation

These instructions do the following:

1. Rotate the value from \( Rm \) right by 0, 8, 16 or 24 bits.
2. Extract bits from the resulting value:
   - SXTB extracts bits[7:0] and sign extends to 32 bits.
   - UXTB extracts bits[7:0] and zero extends to 32 bits.
   - SXTH extracts bits[15:0] and sign extends to 32 bits.
   - UXTH extracts bits[15:0] and zero extends to 32 bits.
   - SXTB16 extracts bits[7:0] and sign extends to 16 bits, and extracts bits [23:16] and sign extends to 16 bits.
   - UXTB16 extracts bits[7:0] and zero extends to 16 bits, and extracts bits [23:16] and zero extends to 16 bits.
Restrictions
Do not use SP and do not use PC.

Condition flags
These instructions do not affect the flags.

Examples
SXTH R4, R6, ROR #16 ; Rotates R6 right by 16 bits, obtains bottom
; halfword of result, sign extends to 32 bits and
; writes to R4
UXTB R3, R10 ; Extracts lowest byte of value in R10, zero
; extends, and writes to R3.

3.8.3 SXTA and UXTA
Signed and Unsigned Extend and Add.

Syntax
\[ \text{op} \{\text{cond}\} \{\text{Rd,}\} \text{Rn, Rm }\{, \text{ ROR } #n\} \]
\[ \text{op} \{\text{cond}\} \{\text{Rd,}\} \text{Rn, Rm }\{, \text{ ROR } #n\} \]

Where:
- \text{op} \quad \text{Is one of:}
  - SXTAB \quad \text{Sign extends an 8-bit value to a 32-bit value and add.}
  - SXTAH \quad \text{Sign extends a 16-bit value to a 32-bit value and add.}
  - SXTAB16 \quad \text{Sign extends two 8-bit values to two 16-bit values and add.}
  - UXTAB \quad \text{Zero extends an 8-bit value to a 32-bit value and add.}
  - UXTAH \quad \text{Zero extends a 16-bit value to a 32-bit value and add.}
  - UXTAB16 \quad \text{Zero extends two 8-bit values to two 16-bit values and add.}
- \text{cond} \quad \text{Is an optional condition code. See } \text{Conditional execution on page 68.}
- \text{Rd} \quad \text{Is the destination register.}
- \text{Rn} \quad \text{Is the first operand register.}
- \text{Rm} \quad \text{Is the register holding the value to rotate and extend.}
- \text{ROR } #n \quad \text{Is one of:}
  - \text{ROR } #8 \quad \text{Value from } Rm \text{ is rotated right 8 bits.}
  - \text{ROR } #16 \quad \text{Value from } Rm \text{ is rotated right 16 bits.}
  - \text{ROR } #24 \quad \text{Value from } Rm \text{ is rotated right 24 bits.}
  \text{If } \text{ROR } #n \text{ is omitted, no rotation is performed.}

Operation
These instructions do the following:
1. Rotate the value from \text{Rm} right by 0, 8, 16 or 24 bits.
2. Extract bits from the resulting value:
   - \text{SXTAB} extracts bits[7:0] from \text{Rm} and sign extends to 32 bits.
   - \text{UXTAB} extracts bits[7:0] from \text{Rm} and zero extends to 32 bits.
• SXTAH extracts bits[15:0] from $Rm$ and sign extends to 32 bits.
• UXTAH extracts bits[15:0] from $Rm$ and zero extends to 32 bits.
• SXTAB16 extracts bits[7:0] from $Rm$ and sign extends to 16 bits, and extracts bits [23:16] from $Rm$ and sign extends to 16 bits.
• UXTAB16 extracts bits[7:0] from $Rm$ and zero extends to 16 bits, and extracts bits [23:16] from $Rm$ and zero extends to 16 bits.

3. Adds the signed or zero extended value to the word or corresponding halfword of $Rn$ and writes the result in $Rd$.

Restrictions
Do not use SP and do not use PC.

Condition flags
These instructions do not affect the flags.

Examples

```
SXTAH  R4, R8, R6, ROR #16 ; Rotates R6 right by 16 bits, obtains bottom halfword, sign extends to 32 bits, adds R8, and writes to R4
UXTAB  R3, R4, R10  ; Extracts bottom byte of R10 and zero extends to 32 bits, adds R4, and writes to R3.
```

3.9 Bit field instructions

_Table 34_ shows the instructions that operate on adjacent sets of bits in registers or bit fields:

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>BFC</td>
<td>Bit Field Clear</td>
<td><em>BFC and BFI on page 142</em></td>
</tr>
<tr>
<td>BFI</td>
<td>Bit Field Insert</td>
<td><em>BFC and BFI on page 142</em></td>
</tr>
<tr>
<td>SBFX</td>
<td>Signed Bit Field Extract</td>
<td><em>SBFX and UBFX on page 143</em></td>
</tr>
<tr>
<td>SXTB</td>
<td>Sign extend a byte</td>
<td><em>SXT and UXT on page 144</em></td>
</tr>
<tr>
<td>SXTH</td>
<td>Sign extend a halfword</td>
<td><em>SXT and UXT on page 144</em></td>
</tr>
<tr>
<td>UBFX</td>
<td>Unsigned Bit Field Extract</td>
<td><em>SBFX and UBFX on page 143</em></td>
</tr>
<tr>
<td>UXTB</td>
<td>Zero extend a byte</td>
<td><em>SXT and UXT on page 144</em></td>
</tr>
<tr>
<td>UXTH</td>
<td>Zero extend a halfword</td>
<td><em>SXT and UXT on page 144</em></td>
</tr>
</tbody>
</table>
3.9.1 BFC and BFI

Bit Field Clear and Bit Field Insert.

Syntax

\[
\begin{align*}
\text{BFC}(\text{cond}) & \quad \text{Rd}, \ #\text{lsb}, \ #\text{width} \\
\text{BFI}(\text{cond}) & \quad \text{Rd}, \ \text{Rn}, \ #\text{lsb}, \ #\text{width}
\end{align*}
\]

Where:

\begin{itemize}
\item \text{cond} is an optional condition code. See Conditional execution on page 68.
\item \text{Rd} is the destination register.
\item \text{Rn} is the source register.
\item \text{lsb} is the position of the least significant bit of the bit field. \text{lsb} must be in the range 0 to 31.
\item \text{width} is the width of the bit field and must be in the range 1 to 32-\text{lsb}.
\end{itemize}

Operation

BFC clears a bit field in a register. It clears \text{width} bits in \text{Rd}, starting at the low bit position \text{lsb}. Other bits in \text{Rd} are unchanged.

BFI copies a bit field into one register from another register. It replaces \text{width} bits in \text{Rd} starting at the low bit position \text{lsb}, with \text{width} bits from \text{Rn} starting at bit[0]. Other bits in \text{Rd} are unchanged.

Restrictions

Do not use SP and do not use PC.

Condition flags

These instructions do not affect the flags.

Examples

\[
\begin{align*}
\text{BFC} & \quad \text{R4}, \ #8, \ #12 \quad \text{; Clear bit 8 to bit 19 (12 bits) of R4 to 0} \\
\text{BFI} & \quad \text{R9}, \ \text{R2}, \ #8, \ #12 \quad \text{; Replace bit 8 to bit 19 (12 bits) of R9 with} \\
& \quad \text{; bit 0 to bit 11 from R2.}
\end{align*}
\]
3.9.2 SBFX and UBFX
Signed Bit Field Extract and Unsigned Bit Field Extract.

Syntax

\[ SBFX(\text{cond}) \ R_d, \ R_n, \ #_\text{lsb}, \ #_\text{width} \]
\[ UBFX(\text{cond}) \ R_d, \ R_n, \ #_\text{lsb}, \ #_\text{width} \]

Where:
- \( \text{cond} \) is an optional condition code. See Conditional execution on page 68.
- \( R_d \) is the destination register.
- \( R_n \) is the source register.
- \( \text{lsb} \) is the position of the least significant bit of the bit field. \( \text{lsb} \) must be in the range 0 to 31.
- \( \text{width} \) is the width of the bit field and must be in the range \( 1 \) to \( 32 - \text{lsb} \).

Operation
SBFX extracts a bit field from one register, sign extends it to 32 bits, and writes the result to the destination register.

UBFX extracts a bit field from one register, zero extends it to 32 bits, and writes the result to the destination register.

Restrictions
Do not use SP and do not use PC.

Condition flags
These instructions do not affect the flags.

Examples

\[ SBFX \ R_0, \ R_1, \ #20, \ #4 \] ; Extract bit 20 to bit 23 (4 bits) from \( R_1 \) and sign extend to 32 bits and then write the result to \( R_0 \).

\[ UBFX \ R_8, \ R_{11}, \ #9, \ #10 \] ; Extract bit 9 to bit 18 (10 bits) from \( R_{11} \) and zero extend to 32 bits and then write the result to \( R_8 \).
### 3.9.3 SXT and UXT

Sign extend and Zero extend.

#### Syntax

SXT\(\text{extend}(\text{cond})\) \(\{\text{Rd},\}\) \(\text{Rm}\) \(\{\), \(\text{ROR}\ #n\}\)

UXT\(\text{extend}(\text{cond})\) \(\{\text{Rd}\},\) \(\text{Rm}\) \(\{\), \(\text{ROR}\ #n\}\)

Where:

- **extend** Is one of:
  - B Extends an 8-bit value to a 32-bit value.
  - H Extends a 16-bit value to a 32-bit value.
- **cond** Is an optional condition code. See *Conditional execution on page 68*.
- **Rd** Is the destination register.
- **Rm** Is the register holding the value to extend.
- **ROR \#n** Is one of:
  - ROR \#8 Value from \(Rm\) is rotated right 8 bits.
  - ROR \#16 Value from \(Rm\) is rotated right 16 bits.
  - ROR \#24 Value from \(Rm\) is rotated right 24 bits.

If **ROR \#n** is omitted, no rotation is performed.

#### Operation

These instructions do the following:

1. Rotate the value from \(Rm\) right by 0, 8, 16 or 24 bits.
2. Extract bits from the resulting value:
   - SXTB extracts bits[7:0] and sign extends to 32 bits.
   - UXTB extracts bits[7:0] and zero extends to 32 bits.
   - SXTH extracts bits[15:0] and sign extends to 32 bits.
   - UXTH extracts bits[15:0] and zero extends to 32 bits.

#### Restrictions

Do not use SP and do not use PC.

#### Condition flags

These instructions do not affect the flags.

#### Examples

```
SXTB R4, R6, ROR #16 ; Rotate R6 right by 16 bits, then obtain the lower halfword of the result and then sign extend to 32 bits and write the result to R4.
UXTH R3, R10 ; Extract lowest byte of the value in R10 and zero extend it, and write the result to R3.
```
3.10 Branch and control instructions

Table 35 shows the branch and control instructions:

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>B</td>
<td>Branch</td>
<td>B, BL, BX, and BLX on page 145</td>
</tr>
<tr>
<td>BL</td>
<td>Branch with Link</td>
<td>B, BL, BX, and BLX on page 145</td>
</tr>
<tr>
<td>BLX</td>
<td>Branch indirect with Link</td>
<td>B, BL, BX, and BLX on page 145</td>
</tr>
<tr>
<td>BX</td>
<td>Branch indirect</td>
<td>B, BL, BX, and BLX on page 145</td>
</tr>
<tr>
<td>CBNZ</td>
<td>Compare and Branch if Non Zero</td>
<td>CBZ and CBNZ on page 147</td>
</tr>
<tr>
<td>CBZ</td>
<td>Compare and Branch if Zero</td>
<td>CBZ and CBNZ on page 147</td>
</tr>
<tr>
<td>IT</td>
<td>If-Then</td>
<td>IT on page 148</td>
</tr>
<tr>
<td>TBB</td>
<td>Table Branch Byte</td>
<td>TBB and TBH on page 150</td>
</tr>
<tr>
<td>TBH</td>
<td>Table Branch Halfword</td>
<td>TBB and TBH on page 150</td>
</tr>
</tbody>
</table>

3.10.1 B, BL, BX, and BLX

Branch instructions.

Syntax

\[
B(\text{cond}) \text{ label} \\
BL(\text{cond}) \text{ label} \\
BX(\text{cond}) \text{ Rm} \\
BLX(\text{cond}) \text{ Rm}
\]

Where:

- B: Is branch (immediate).
- BL: Is branch with link (immediate).
- BX: Is branch indirect (register).
- BLX: Is branch indirect with link (register).
- label: Is a PC-relative expression. See PC-relative expressions on page 68.
- Rm: Is a register that indicates an address to branch to. Bit[0] of the value in Rm must be 1, but the address to branch to is created by changing bit[0] to 0.

Operation

All these instructions cause a branch to label, or to the address indicated in Rm. In addition:
- The BL and BLX instructions write the address of the next instruction to LR (the link register, R14).
- The BX and BLX instructions result in a UsageFault exception if bit[0] of Rm is 0.
Bcond label is the only conditional instruction that can be either inside or outside an IT block. All other branch instructions must be conditional inside an IT block, and must be unconditional outside the IT block, see IT on page 148.

Table 36 shows the ranges for the various branch instructions:

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Branch range</th>
</tr>
</thead>
<tbody>
<tr>
<td>B label</td>
<td>-16 MB to +16 MB</td>
</tr>
<tr>
<td>Bcond label (outside IT block)</td>
<td>-1 MB to +1 MB</td>
</tr>
<tr>
<td>Bcond label (inside IT block)</td>
<td>-16 MB to +16 MB</td>
</tr>
<tr>
<td>BL(cond) label</td>
<td>-16 MB to +16 MB</td>
</tr>
<tr>
<td>BX(cond) Rm</td>
<td>Any value in register</td>
</tr>
<tr>
<td>BLX(cond) Rm</td>
<td>Any value in register</td>
</tr>
</tbody>
</table>

The user might have to use the .W suffix to get the maximum branch range. See Instruction width selection on page 71.

Restrictions
The restrictions are:

- Do not use PC in the BLX instruction.
- For BX and BLX, bit[0] of Rm must be 1 for correct execution but a branch occurs to the target address created by changing bit[0] to 0.
- When any of these instructions is inside an IT block, it must be the last instruction of the IT block.

Bcond is the only conditional instruction that is not required to be inside an IT block. However, it has a longer branch range when it is inside an IT block.

Condition flags
These instructions do not change the flags.

Examples

B loopA ; Branch to loopA
BLX ng  ; Conditionally branch to label ng
B.W target ; Branch to target within 16MB range
BEQ target ; Conditionally branch to target
BEQ.W target ; Conditionally branch to target within 1MB
BL funC ; Branch with link (Call) to function funC, return ; address stored in LR
BX LR   ; Return from function call
BXNE R0 ; Conditionally branch to address stored in R0
BLX R0 ; Branch with link and exchange (Call) to a address ; stored in R0.
3.10.2 **CBZ and CBNZ**

Compare and Branch on Zero, Compare and Branch on Non-Zero.

**Syntax**

CBZ Rn, label

CBNZ Rn, label

Where:

- Rn is the register holding the operand.
- label is the branch destination.

**Operation**

Use the CBZ or CBNZ instructions to avoid changing the condition code flags and to reduce the number of instructions.

`CBZ Rn, label` does not change condition flags but is otherwise equivalent to:

```assembly
CMP Rn, #0
BEQ label
```

`CBNZ Rn, label` does not change condition flags but is otherwise equivalent to:

```assembly
CMP Rn, #0
BNE label
```

**Restrictions**

The restrictions are:

- Rn must be in the range of R0 to R7.
- The branch destination must be within 4 to 130 bytes after the instruction.
- These instructions must not be used inside an IT block.

**Condition flags**

These instructions do not change the flags.

**Examples**

```assembly
CBZ R5, target ; Forward branch if R5 is zero
CBNZ R0, target ; Forward branch if R0 is not zero
```
3.10.3 IT

If-Then condition instruction.

**Syntax**

\[
\text{IT}\{x(y(z))\} \quad \text{cond}
\]

Where:

- \(x\) specifies the condition switch for the second instruction in the IT block.
- \(y\) specifies the condition switch for the third instruction in the IT block.
- \(z\) specifies the condition switch for the fourth instruction in the IT block.
- \(\text{cond}\) specifies the condition for the first instruction in the IT block.

The condition switch for the second, third, and fourth instruction in the IT block can be either:

- **T** Then. Applies the condition \(\text{cond}\) to the instruction.
- **E** Else. Applies the inverse condition of \(\text{cond}\) to the instruction.

It is possible to use \(\text{AL}\) (the *always* condition) for \(\text{cond}\) in an IT instruction. If this is done, all of the instructions in the IT block must be unconditional, and each of \(x\), \(y\), and \(z\) must be **T** or omitted but not **E**.

**Operation**

The IT instruction makes up to four following instructions conditional. The conditions can be all the same, or some of them can be the logical inverse of the others. The conditional instructions following the IT instruction form the **IT block**.

The instructions in the IT block, including any branches, must specify the condition in the \{\text{cond}\} part of their syntax.

The assembler might be able to generate the required IT instructions for conditional instructions automatically, so it is not needed to write them yourself. See the assembler documentation for details.

A BKPT instruction in an IT block is always executed, even if its condition fails.

Exceptions can be taken between an IT instruction and the corresponding IT block, or within an IT block. Such an exception results in entry to the appropriate exception handler, with suitable return information in LR and stacked PSR.

Instructions designed for use for exception returns can be used as normal to return from the exception, and execution of the IT block resumes correctly. This is the only way that a PC-modifying instruction is permitted to branch to an instruction in an IT block.

**Restrictions**

The following instructions are not permitted in an IT block:

- **IT**.
- **CBZ** and **CBNZ**.
- **CPSID** and **CPSIE**.
Other restrictions when using an IT block are:

- A branch or any instruction that modifies the PC must either be outside an IT block or must be the last instruction inside the IT block. These are:
  - ADD PC, PC, Rm.
  - MOV PC, Rm.
  - B, BL, BX, BLX.
  - Any LDM, LDR, or POP instruction that writes to the PC.
  - TBB and TBH.
- Do not branch to any instruction inside an IT block, except when returning from an exception handler.
- All conditional instructions except $Bcond$ must be inside an IT block. $Bcond$ can be either outside or inside an IT block but has a larger branch range if it is inside one.
- Each instruction inside the IT block must specify a condition code suffix that is either the same or logical inverse as for the other instructions in the block.

The assembler might place extra restrictions on the use of IT blocks, such as prohibiting the use of assembler directives within them.

**Condition flags**

This instruction does not change the flags.

**Example**

```
ITTE NE               ; Next 3 instructions are conditional
ANDNE R0, R0, R1     ; ANDNE does not update condition flags
ADDSNE R2, R2, #1    ; ADDSNE updates condition flags
MOVEQ R2, R3         ; Conditional move

CMP R0, #9           ; Convert R0 hex value (0 to 15) into ASCII
                   ; ('0'-'9', 'A'-'F')
ITE GT               ; Next 2 instructions are conditional
ADDT GT R1, R0, #55   ; Convert 0xA -> 'A'
ADDLLE R1, R0, #48    ; Convert 0x0 -> '0'

IT GT                ; IT block with only one conditional instruction
ADDT GT R1, R1, #1    ; Increment R1 conditionally

ITTEE EQ             ; Next 4 instructions are conditional
MOVEQ R0, R1         ; Conditional move
ADDEQ R2, R2, #10     ; Conditional add
ANDNE R3, R3, #1      ; Conditional AND
BNE.W dloop          ; Branch instruction can only be used in the last
                   ; instruction of an IT block

IT NE                ; Next instruction is conditional
ADD R0, R0, R1       ; Syntax error: no condition code used in IT block
```
3.10.4 TBB and TBH

Table Branch Byte and Table Branch Halfword.

Syntax

TBB \[Rn, Rm\]
TBH \[Rn, Rm, LSL \#1\]

Where:

- **Rn** is the register containing the address of the table of branch lengths. If \( Rn \) is PC, then the address of the table is the address of the byte immediately following the **TBB** or **TBH** instruction.
- **Rm** is the index register. This contains an index into the table. For halfword tables, \( LSL \#1 \) doubles the value in \( Rm \) to form the right offset into the table.

Operation

These instructions cause a PC-relative forward branch using a table of single byte offsets for **TBB**, or halfword offsets for **TBH**. **Rn** provides a pointer to the table, and **Rm** supplies an index into the table. For **TBB** the branch offset is twice the unsigned value of the byte returned from the table, and for **TBH** the branch offset is twice the unsigned value of the halfword returned from the table. The branch occurs to the address at that offset from the address of the byte immediately after the **TBB** or **TBH** instruction.

Restrictions

The restrictions are:
- **Rn** must not be SP.
- **Rm** must not be SP and must not be PC.
- When any of these instructions is used inside an IT block, it must be the last instruction of the IT block.

Condition flags

These instructions do not change the flags.

Examples

```assembly
ADR.W R0, BranchTable(Byte
TBB [R0, R1] ; R1 is the index, R0 is the base address of
; the branch table

Case1
; an instruction sequence follows
Case2
; an instruction sequence follows
Case3
; an instruction sequence follows
BranchTable(Byte
DCB 0 ; Case1 offset calculation
DCB ((Case2-Case1)/2) ; Case2 offset calculation
DCB ((Case3-Case1)/2) ; Case3 offset calculation

TBH [PC, R1, LSL \#1] ; R1 is the index, PC is used as base of the
```
; branch table

BranchTable_H
  DCW  ((CaseA - BranchTable_H)/2) ; CaseA offset calculation
  DCW  ((CaseB - BranchTable_H)/2) ; CaseB offset calculation
  DCW  ((CaseC - BranchTable_H)/2) ; CaseC offset calculation

CaseA
  ; an instruction sequence follows
CaseB
  ; an instruction sequence follows
CaseC
  ; an instruction sequence follows

3.11 Floating-point instructions

This section provides the instruction set that the single-precision and double-precision FPU uses.

Table 37 shows the floating-point instructions.

These instructions are only available if the FPU is included, and enabled, in the system.
See Enabling the FPU on page 238 for information about enabling the floating-point unit.

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>VABS</td>
<td>Floating-point Absolute</td>
<td>VABS on page 153</td>
</tr>
<tr>
<td>VADD</td>
<td>Floating-point Add</td>
<td>VADD on page 153</td>
</tr>
<tr>
<td>VCMP</td>
<td>Compares two floating-point registers, or one floating-point register and zero</td>
<td>VCMP, VCMPE on page 154</td>
</tr>
<tr>
<td>VCMPE</td>
<td>Compares two floating-point registers, or one floating-point register and zero with Invalid Operation check</td>
<td>VCMP, VCMPE on page 154</td>
</tr>
<tr>
<td>VCVT</td>
<td>Converts between floating-point and integer</td>
<td>VCVT between floating-point and fixed-point on page 156</td>
</tr>
<tr>
<td>VCVT</td>
<td>Converts between floating-point and fixed point</td>
<td>VCVT between floating-point and fixed-point on page 156</td>
</tr>
<tr>
<td>VCVTR</td>
<td>Converts between floating-point and integer with rounding</td>
<td>VCVT, VCVTR between floating-point and integer on page 155</td>
</tr>
<tr>
<td>VCVTB</td>
<td>Converts half-precision value to single-precision</td>
<td>VCVTB, VCVTT on page 157</td>
</tr>
<tr>
<td>VCVTT</td>
<td>Converts single-precision register to half-precision</td>
<td>VCVTB, VCVTT on page 157</td>
</tr>
<tr>
<td>VDIV</td>
<td>Floating-point Divide</td>
<td>VDIV on page 157</td>
</tr>
<tr>
<td>VFMA</td>
<td>Floating-point Fused Multiply Accumulate</td>
<td>VFMA, VFMS on page 158</td>
</tr>
<tr>
<td>VFNMA</td>
<td>Floating-point Fused Negate Multiply Accumulate</td>
<td>VFNMA, VFNMS on page 159</td>
</tr>
<tr>
<td>VFMS</td>
<td>Floating-point Fused Multiply Subtract</td>
<td>VFMA, VFMS on page 158</td>
</tr>
<tr>
<td>VFNMS</td>
<td>Floating-point Fused Negate Multiply Subtract</td>
<td>VFNMA, VFNMS on page 159</td>
</tr>
<tr>
<td>VLDM</td>
<td>Loads Multiple extension registers</td>
<td>VLDM on page 159</td>
</tr>
<tr>
<td>Mnemonic</td>
<td>Brief description</td>
<td>See</td>
</tr>
<tr>
<td>--------------</td>
<td>------------------------------------------------------------</td>
<td>------------------------------------------------</td>
</tr>
<tr>
<td>VLDR</td>
<td>Loads an extension register from memory</td>
<td>VLDR on page 160</td>
</tr>
<tr>
<td>VMLA</td>
<td>Floating-point Multiply Accumulate</td>
<td>VMLA, VMLS on page 161</td>
</tr>
<tr>
<td>VMLS</td>
<td>Floating-point Multiply Subtract</td>
<td>VMLA, VMLS on page 161</td>
</tr>
<tr>
<td>VMOV</td>
<td>Floating-point Move Immediate</td>
<td>VMOV immediate on page 162</td>
</tr>
<tr>
<td>VMOV</td>
<td>Floating-point Move register</td>
<td>VMOV register on page 162</td>
</tr>
<tr>
<td>VMOV</td>
<td>Copies Arm core register to single-precision</td>
<td>VMOV Arm core register to single-precision on page 163</td>
</tr>
<tr>
<td>VMOV</td>
<td>Copies 2 Arm core registers to 2 single-precision</td>
<td>VMOV two Arm core registers to two single-precision registers on page 164</td>
</tr>
<tr>
<td>VMOV</td>
<td>Copies between Arm core register to scalar</td>
<td>VMOV Arm core register to scalar on page 165</td>
</tr>
<tr>
<td>VMOV</td>
<td>Copies between Scalar to Arm core register</td>
<td>VMOV scalar to Arm core register on page 163</td>
</tr>
<tr>
<td>VMRS</td>
<td>Moves to Arm core register from floating-point System register</td>
<td>VMRS on page 165</td>
</tr>
<tr>
<td>VMSR</td>
<td>Moves to floating-point System register from Arm core register</td>
<td>VMSR on page 166</td>
</tr>
<tr>
<td>VMUL</td>
<td>Multiplies floating-point</td>
<td>VMUL on page 166</td>
</tr>
<tr>
<td>VNEG</td>
<td>Floating-point negate</td>
<td>VNEG on page 167</td>
</tr>
<tr>
<td>VNMLA</td>
<td>Floating-point multiply and add</td>
<td>VNMLA, VNMLS, VNMUL on page 167</td>
</tr>
<tr>
<td>VNMLS</td>
<td>Floating-point multiply and subtract</td>
<td>VNMLA, VNMLS, VNMUL on page 167</td>
</tr>
<tr>
<td>VNMUL</td>
<td>Floating-point multiply</td>
<td>VNMLA, VNMLS, VNMUL on page 167</td>
</tr>
<tr>
<td>VPOP</td>
<td>Pop extension registers</td>
<td>VPOP on page 168</td>
</tr>
<tr>
<td>VPUSH</td>
<td>Pushes extension registers</td>
<td>VPUSH on page 169</td>
</tr>
<tr>
<td>VSQRT</td>
<td>Floating-point square root</td>
<td>VSQRT on page 169</td>
</tr>
<tr>
<td>VSTM</td>
<td>Stores Multiple extension registers</td>
<td>VSTM on page 170</td>
</tr>
<tr>
<td>VSTR</td>
<td>Stores an extension register to memory</td>
<td>VSTR on page 170</td>
</tr>
<tr>
<td>VSUB</td>
<td>Floating-point Subtract</td>
<td>VSUB on page 171</td>
</tr>
<tr>
<td>VSEL</td>
<td>Selects register, alternative to a pair of conditional VMOV</td>
<td>VSEL on page 172</td>
</tr>
<tr>
<td>VMAXNM, VMINNM</td>
<td>Maximum, Minimum with IEEE754-2008 NaN handling</td>
<td>VMAXNM, VMINNM on page 172</td>
</tr>
<tr>
<td>VCVTA, VCVTN, VCVTP, VCVTM</td>
<td>Float to integer conversion with directed rounding</td>
<td>VCVTA, VCVTN, VCVTP, VCVTM on page 173</td>
</tr>
</tbody>
</table>
Table 37. Floating-point instructions (continued)

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>VRINTR, VRINTX</td>
<td>Float to integer (in floating-point format) conversion</td>
<td>VRINTR, VRINTX on page 173</td>
</tr>
<tr>
<td>VRINTA, VRINTN, VRINTP, VRINTM</td>
<td>Float to integer (in floating-point format) conversion with directed rounding</td>
<td>VRINTA, VRINTN, VRINTP, VRINTM, VRINTZ on page 174</td>
</tr>
</tbody>
</table>

### 3.11.1 VABS

Floating-point Absolute.

**Syntax**

VABS\((\text{cond})\).F<32|64> \(<\text{Sd}|\text{Dd}>\), \(<\text{Sm}|\text{Dm}>\)

**Where:**

- \(\text{cond}\) Is an optional condition code. See Conditional execution on page 68.
- \(<\text{Sd}|\text{Dd}>\), \(<\text{Sm}|\text{Dm}>\)

Are the destination floating-point value and the operand floating-point value.

**Operation**

This instruction:
1. Takes the absolute value of the operand floating-point register.
2. Places the results in the destination floating-point register.

**Restrictions**

There are no restrictions.

**Condition flags**

This instruction does not change the flags.

**Examples**

VABS.F32 S4, S6

### 3.11.2 VADD

Floating-point Add.

**Syntax**

VADD\((\text{cond})\).F<32|64> \((<\text{Sd}|\text{Dd}>), (<\text{Sn}|\text{Dn}>), (<\text{Sm}|\text{Dm}>)\)

VADD\((\text{cond})\).F64 \(\{\text{Dd}\}, \{\text{Dn}, \text{Dm}\}\)

**Where:**

- \(\text{cond}\) Is an optional condition code. See Conditional execution on page 68.
- \(<\text{Sd}|\text{Dd}>\) Is the destination floating-point value.
<Sn|Dn>, <Sm|Dm>

Are the operand floating-point values.

**Operation**

This instruction:
1. Adds the values in the two floating-point operand registers.
2. Places the results in the destination floating-point register.

**Restrictions**

There are no restrictions.

**Condition flags**

This instruction does not change the flags.

**Examples**

VADD.F32 S4, S6, S7

### 3.11.3 VCMP, VCMPE

Compares two floating-point registers, or one floating-point register and zero.

**Syntax**

VCMP(E){cond}.F<32|64> <Sd|Dd>, <Sm|Dm>

VCMP(E){cond}.F<32|64> <Sd|Dd>, #0.0

Where:
- **cond** is an optional condition code. See *Conditional execution on page 68*.
- **E** If present, any NaN operand causes an Invalid Operation exception. Otherwise, only a signaling NaN causes the exception.
- **<Sd|Dd>** is the floating-point operand to compare.
- **<Sm|Dm>** is the floating-point operand that is compared with.

**Operation**

This instruction:
1. Compares either:
   - Two floating-point registers.
   - Or one floating-point register and zero.
2. Writes the result to the FPSCR flags.

**Restrictions**

This instruction can optionally raise an Invalid Operation exception if either operand is any type of NaN. It always raises an Invalid Operation exception if either operand is a signaling NaN.

**Condition flags**

When this instruction writes the result to the FPSCR flags, the values are normally transferred to the Arm flags by a subsequent VMRS instruction, see *VMRS on page 165*. 
Examples

VCMP.F32 S4, #0.0
VCMP.F32 S4, S2

3.11.4 VCVT, VCVTR between floating-point and integer

Converts a value in a register from floating-point to and from a 32-bit integer.

Syntax

VCVT(R){cond}.Tm.F<32|64> <Sd|Dd>, <Sm|Dm>
VCVT(cond).F<32|64>.Tm <Sd|Dd>, <Sm|Dm>

Where:

- **R**
  - If R is specified, the operation uses the rounding mode specified by the FPSCR.
  - If R is omitted, the operation uses the *Round towards Zero* rounding mode.

- **cond**
  - Is an optional condition code. See Conditional execution on page 68.

- **Tm**
  - Is the data type for the operand. It must be one of:
    - **S32** signed 32-bit value.
    - **U32** unsigned 32-bit value.

- **<Sd|Dd>, <Sm|Dm>**
  - Are the destination register and the operand register.

Operation

These instructions:

1. Either:
   - Convert a value in a register from floating-point value to a 32-bit integer.
   - Convert from a 32-bit integer to floating-point value.

2. Place the result in a second register.

The floating-point to integer operation normally uses the *Round towards Zero* rounding mode, but can optionally use the rounding mode specified by the FPSCR.

The integer to floating-point operation uses the rounding mode specified by the FPSCR.

Restrictions

There are no restrictions.

Condition flags

These instructions do not change the flags.
3.11.5 VCVT between floating-point and fixed-point

Converts a value in a register from floating-point to and from fixed-point.

**Syntax**

\[
\text{VCVT} \{\text{cond}\}.\text{Td}.F<32|64>. <Sd|Dd>, <Sd|Dd>, \#\text{fbits} \\
\text{VCVT} \{\text{cond}\}.F<32|64>.\text{Td} <Sd|Dd>, <Sd|Dd>, \#\text{fbits}
\]

Where:

- **cond** is an optional condition code. See *Conditional execution on page 68*.
- **Td** is the data type for the fixed-point number. It must be one of:
  - S16 signed 16-bit value.
  - U16 unsigned 16-bit value.
  - S32 signed 32-bit value.
  - U32 unsigned 32-bit value.
- **<Sd|Dd>** is the destination register and the operand register.
- **fbits** is the number of fraction bits in the fixed-point number:
  - If Td is S16 or U16, fbits must be in the range 0-16.
  - If Td is S32 or U32, fbits must be in the range 1-32.

**Operation**

This instruction:

1. Either
   - Converts a value in a register from floating-point to fixed-point.
   - Converts a value in a register from fixed-point to floating-point.
2. Places the result in a second register.

The floating-point values are single-precision or double-precision.

The fixed-point value can be 16-bit or 32-bit. Conversions from fixed-point values take their operand from the low-order bits of the source register and ignore any remaining bits.

Signed conversions to fixed-point values sign-extend the result value to the destination register width.

Unsigned conversions to fixed-point values zero-extend the result value to the destination register width.

The floating-point to fixed-point operation uses the Round towards Zero rounding mode.

The fixed-point to floating-point operation uses the Round to Nearest rounding mode.

**Restrictions**

There are no restrictions.

**Condition flags**

These instructions do not change the flags.
3.11.6 VCVTB, VCVTT

Converts between half-precision and single-precision or double-precision without intermediate rounding.

Syntax

VCVT(y)(cond).F<32|64>.F16 <Sd|Dd>, Sm
VCVT(y)(cond).F16.F<32|64> Sd, <Sm|Dm>

Where:

- y Specifies which half of the operand register Sm or destination register Sd is used for the operand or destination:
  - If y is B, then the bottom half, bits [15:0], of Sm or Sd is used.
  - If y is T, then the top half, bits [31:16], of Sm or Sd is used.

- cond Is an optional condition code. See Conditional execution on page 68.

- <Sd|Dd> Is the destination register.
- <Sm|Dm> Is the operand register.

Operation

This instruction with the .F16.F<32|64> suffix:
1. Converts the half-precision value in the top or bottom half of a single-precision register to single-precision or double-precision value.
2. Writes the result to a single-precision or double-precision register.

This instruction with the .F<32|64>.F16 suffix:
1. Converts the value in a double-precision or single-precision register to half-precision value.
2. Writes the result into the top or bottom half of a single-precision register, preserving the other half of the target register.

Restrictions

There are no restrictions.

Condition flags

These instructions do not change the flags.

3.11.7 VDIV

Divides floating-point values.

Syntax

VDIV(cond).F<32|64> {<Sd|Dd>,} <Sn|Dn>, <Sm|Dm>

Where:

- cond Is an optional condition code. See Conditional execution on page 68.
- <Sd|Dd> Is the destination register.
- <Sn|Dn>, <Sm|Dm> Are the operand registers.
Operation

This instruction:
1. Divides one floating-point value by another floating-point value.
2. Writes the result to the floating-point destination register.

Restrictions

There are no restrictions.

Condition flags

These instructions do not change the flags.

3.11.8 VFMA, VFMS

Floating-point Fused Multiply Accumulate and Subtract.

Syntax

\[
\text{VFMA}\{\text{cond}\}.F<32|64>\ {<Sd|Dd>,} \ {<Sn|Dn>,} \ {<Sm|Dm>}
\]

\[
\text{VFMS}\{\text{cond}\}.F<32|64>\ {<Sd|Dd>,} \ {<Sn|Dn>,} \ {<Sm|Dm>}
\]

Where:
- \text{cond} is an optional condition code. See Conditional execution on page 68.
- \text{<Sd|Dd>} is the destination register.
- \text{<Sn|Dn>, <Sm|Dm>} are the operand registers.

Operation

The VFMA instruction:
1. Multiplies the floating-point values in the operand registers.
2. Accumulates the results into the destination register.

The result of the multiply is not rounded before the accumulation.

The VFMS instruction:
1. Negates the first operand register.
2.Multiplies the floating-point values of the first and second operand registers.
3. Adds the products to the destination register.
4. Places the results in the destination register.

The result of the multiply is not rounded before the addition.

Restrictions

There are no restrictions.

Condition flags

These instructions do not change the flags.
3.11.9 VFNMA, VFNMS

Floating-point Fused Negate Multiply Accumulate and Subtract.

Syntax

VFNMA(\text{cond}).F<32|64> \{<Sd|Dd>, \} \langle Sn|Dn>, \langle Sm|Dm> \\
VFNMS(\text{cond}).F<32|64> \{<Sd|Dd>, \} \langle Sn|Dn>, \langle Sm|Dm>

Where:
\text{cond} \quad \text{Is an optional condition code. See Conditional execution on page 68.} \\
\langle Sd|Dd \rangle \quad \text{Is the destination register.} \\
\langle Sn|Dn \rangle, \langle Sm|Dm \rangle \quad \text{Are the operand registers.}

Operation

The VFNMA instruction:
1. Negates the first floating-point operand register.
2. Multiplies the first floating-point operand with second floating-point operand.
3. Adds the negation of the floating-point destination register to the product.
4. Places the result into the destination register.

The result of the multiply is not rounded before the addition.

The VFNMS instruction:
1. Multiplies the first floating-point operand with second floating-point operand.
2. Adds the negation of the floating-point value in the destination register to the product.
3. Places the result in the destination register.

The result of the multiply is not rounded before the addition.

Restrictions

There are no restrictions.

Condition flags

These instructions do not change the flags.

3.11.10 VLDM

Floating-point Load Multiple.

Syntax

VLDM(\text{mode})\{cond\}\{.size\} \text{Rn(!)}, \text{list}

Where:
\text{mode} \quad \text{Is the addressing mode:} \\
\text{IA} \quad \text{Increment after. The consecutive addresses start at the address specified in Rn.} \\
\text{DB} \quad \text{Decrement before. The consecutive addresses end just before the address specified in Rn.} \\
\text{cond} \quad \text{Is an optional condition code. See Conditional execution on page 68.}
size  Is an optional data size specifier.
Rn   Is the base register. The SP can be used.
!    Is the command to the instruction to write a modified value back to Rn. This is required if mode == DB, and is optional if mode == IA.
list Is the list of extension registers to be loaded, as a list of consecutively numbered doubleword or singleword registers, separated by commas and surrounded by brackets.

Operation
This instruction loads multiple extension registers from consecutive memory locations using an address from an Arm core register as the base address.

Restrictions
The restrictions are:
- If size is present, it must be equal to the size in bits, 32 or 64, of the registers in list.
- For the base address, the SP can be used. In the Arm instruction set, if ! is not specified the PC can be used.
- list must contain at least one register. If it contains doubleword registers, it must not contain more than 16 registers.
- If using the Decrement Before addressing mode, the write back flag, !, must be appended to the base register specification.

Condition flags
These instructions do not change the flags.

Example
VLDMIA.F64 r1, {d3,d4,d5}

3.11.11 VLDR
Loads a single extension register from memory.

Syntax
VLDR{cond}{.F<32|64>} <sd|Dd>, [Rn[#imm]]
VLDR{cond}{.F<32|64>} <sd|Dd>, label
VLDR{cond}{.F<32|64>} <sd|Dd>, [PC, #imm]

Where:
cond  Is an optional condition code. See Conditional execution on page 68.
32, 64 Are the optional data size specifiers.
Dd    Is the destination register for a doubleword load.
Sd    Is the destination register for a singleword load.
Rn    Is the base register. The SP can be used.
imm   Is the + or - immediate offset used to form the address. Permitted address values are multiples of 4 in the range 0 to 1020.
label Is the label of the literal data item to be loaded.
Operation
This instruction loads a single extension register from memory, using a base address from an Arm core register, with an optional offset.

Restrictions
There are no restrictions.

Condition flags
These instructions do not change the flags.

3.11.12 VMLA, VMLS
Multiplies two floating-point values, and accumulates or subtracts the result.

Syntax
VMLA\{(cond)\}.F<32|64> <Sd|Dd>, <Sn|Dn>, <Sm|Dm>
VMLS\{(cond)\}.F<32|64> <Sd|Dd>, <Sn|Dn>, <Sm|Dm>

Where:
\(cond\) Is an optional condition code. See Conditional execution on page 68.
\(<Sd|Dd>\) Is the destination floating-point value.
\(<Sn|Dn>, <Sm|Dm>\) Are the operand floating-point values.

Operation
The floating-point Multiply Accumulate instruction:
1. Multiplies two floating-point values.
2. Adds the results to the destination floating-point value.

The floating-point Multiply Subtract instruction:
1. Multiplies two floating-point values.
2. Subtracts the products from the destination floating-point value.
3. Places the results in the destination register.

Restrictions
There are no restrictions.

Condition flags
These instructions do not change the flags.
3.11.13 VMOV immediate

Moves floating-point Immediate.

Syntax

\[ \text{VMOV}\,(\text{cond})\,F<32|64>\,\,<Sd|Dd>,\,\,#imm \]

Where:
- \( \text{cond} \) is an optional condition code. See Conditional execution on page 68.
- \(<Sd|Dd>\) is the destination register.
- \( \text{imm} \) is a floating-point constant.

Operation

This instruction copies a constant value to a floating-point register.

Restrictions

There are no restrictions.

Condition flags

These instructions do not change the flags.

3.11.14 VMOV register

Copies the contents of one register to another.

Syntax

\[ \text{VMOV}\,(\text{cond})\,F<32|64>\,\,<Sd|Dd>,\,\,<Sm|Dm> \]

Where:
- \( \text{cond} \) is an optional condition code. See Conditional execution on page 68.
- \( \text{Dd} \) is the destination register, for a doubleword operation.
- \( \text{Dm} \) is the source register, for a doubleword operation.
- \( \text{Sd} \) is the destination register, for a singleword operation.
- \( \text{Sm} \) is the source register, for a singleword operation.

Operation

This instruction copies the contents of one floating-point register to another.

Restrictions

There are no restrictions.

Condition flags

These instructions do not change the flags.
3.11.15 \textbf{VMOV scalar to Arm core register}
Transfers one word of a doubleword floating-point register to an Arm core register.

\textbf{Syntax}
\begin{verbatim}
VMOV(cond) Rt, Dn[x]
\end{verbatim}
Where:
\begin{itemize}
  \item \textit{cond} Is an optional condition code. See \textit{Conditional execution on page 68}.
  \item \textit{Rt} Is the destination Arm core register.
  \item \textit{Dn} Is the 64-bit doubleword register.
  \item \textit{x} Specifies which half of the doubleword register to use:
    \begin{itemize}
      \item If \textit{x} is 0, use lower half of doubleword register
      \item If \textit{x} is 1, use upper half of doubleword register.
    \end{itemize}
\end{itemize}

\textbf{Operation}
This instruction transfers one word from the upper or lower half of a doubleword floating-point register to an Arm core register.

\textbf{Restrictions}
\textit{Rt} cannot be PC or SP.

\textbf{Condition flags}
These instructions do not change the flags.

3.11.16 \textbf{VMOV Arm core register to single-precision}
Transfers a single-precision register to and from an Arm core register.

\textbf{Syntax}
\begin{verbatim}
VMOV(cond) Sn, Rt
\end{verbatim}
Where:
\begin{itemize}
  \item \textit{cond} Is an optional condition code. See \textit{Conditional execution on page 68}.
  \item \textit{Sn} Is the single-precision floating-point register.
  \item \textit{Rt} Is the Arm core register.
\end{itemize}

\textbf{Operation}
This instruction transfers:
\begin{itemize}
  \item The contents of a single-precision register to an Arm core register.
  \item The contents of an Arm core register to a single-precision register.
\end{itemize}

\textbf{Restrictions}
\textit{Rt} cannot be PC or SP.

\textbf{Condition flags}
These instructions do not change the flags.
3.11.17 **VMOV two Arm core registers to two single-precision registers**

Transfers two consecutively numbered single-precision registers to and from two Arm core registers.

**Syntax**

\[
\text{VMOV}(\text{cond}) \ Sm, \ Sm1, \ Rt, \ Rt2 \\
\text{VMOV}(\text{cond}) \ Rt, \ Rt2, \ Sm, \ Sm1 \\
\]

Where:

- \( \text{cond} \) is an optional condition code. See *Conditional execution on page 68*.
- \( Sm \) is the first single-precision register.
- \( Sm1 \) is the second single-precision register. This is the next single-precision register after \(<Sm>\).
- \( Rt \) is the Arm core register that \(<Sm>\) is transferred to or from.
- \( Rt2 \) is the Arm core register that \(<Sm1>\) is transferred to or from.

**Operation**

This instruction transfers:

- The contents of two consecutively numbered single-precision registers to two Arm core registers.
- The contents of two Arm core registers to a pair of single-precision registers.

**Restrictions**

The restrictions are:

- The floating-point registers must be contiguous, one after the other.
- The Arm core registers do not have to be contiguous.
- \( Rt \) cannot be PC or SP.

**Condition flags**

These instructions do not change the flags.

3.11.18 **VMOV two Arm core registers and a double-precision register**

Transfers two words from two Arm core registers to a doubleword register, or from a doubleword register to two Arm core registers.

**Syntax**

\[
\text{VMOV}(\text{cond}) \ Dm, \ Rt, \ Rt2 \\
\text{VMOV}(\text{cond}) \ Rt, \ Rt2, \ Dm \\
\]

Where:

- \( \text{cond} \) is an optional condition code. See *Conditional execution on page 68*.
- \( Dm \) is the double-precision register.
- \( Rt, \ Rt2 \) are the two Arm core registers.
**Operation**

This instruction:
- Transfers two words from two Arm core registers to a doubleword register.
- Transfers a doubleword register to two Arm core registers.

**Restrictions**

There are no restrictions.

**Condition flags**

These instructions do not change the flags.

### 3.11.19 VMOV Arm core register to scalar

Transfers one word to a floating-point register from an Arm core register.

**Syntax**

\[ \text{VMOV}\{\text{cond}\}\{.32\}\ Dd\{x\},\ Rt \]

Where:
- `cond` is an optional condition code. See *Conditional execution on page 68*.
- `.32` is an optional data size specifier.
- `Dd\{x\}` is the destination, where \(x\) defines which half of the doubleword is transferred, as follows:
  - If \(x\) is 0, the lower half is extracted.
  - If \(x\) is 1, the upper half is extracted.
- `Rt` is the source Arm core register.

**Operation**

This instruction transfers one word to the upper or lower half of a doubleword floating-point register from an Arm core register.

**Restrictions**

`Rt` cannot be PC or SP.

**Condition flags**

These instructions do not change the flags.

### 3.11.20 VMRS

Moves to Arm core register from floating-point System register.

**Syntax**

\[ \text{VMRS}\{\text{cond}\}\ Rt,\ FPSCR \]
\[ \text{VMRS}\{\text{cond}\}\ APSR_{nzcv},\ FPSCR \]

Where:
- `cond` is an optional condition code. See *Conditional execution on page 68*.
- `Rt` is the destination Arm core register. This register can be R0-R14.
APSR_nzcv  Transfer floating-point flags to the APSR flags.

Operation
This instruction performs one of the following actions:
- Copies the value of the FPSCR to a general-purpose register.
- Copies the value of the FPSCR flag bits to the APSR N, Z, C, and V flags.

Restrictions
Rt cannot be PC or SP.

Condition flags
These instructions optionally change the N, Z, C, and V flags.

3.11.21 VMSR
Moves to floating-point System register from Arm core register.

Syntax
VMSR{cond} FPSCR, Rt
Where:
cond Is an optional condition code. See Conditional execution on page 68.
Rt Is the general-purpose register to be transferred to the FPSCR.

Operation
This instruction moves the value of a general-purpose register to the FPSCR. See Floating-point status control register on page 236 for more information.

Restrictions
Rt cannot be PC or SP.

Condition flags
This instruction updates the FPSCR.

3.11.22 VMUL
Floating-point Multiply.

Syntax
VMUL{cond}.F<32|64> (<Sd|Dd>,) <Sn|Dn>, <Sm|Dm>
Where:
cond Is an optional condition code. See Conditional execution on page 68.
<Sd|Dd> Is the destination floating-point value.
<Sn|Dn>, <Sm|Dm> Are the operand floating-point values.
Operation
This instruction:
1. Multiplies two floating-point values.
2. Places the results in the destination register.

Restrictions
There are no restrictions.

Condition flags
These instructions do not change the flags.

3.11.23 VNEG
Floating-point Negate.

Syntax
\[ \text{VNEG} \{\text{cond}\}.F<32|64> \ <Sd|Dd>, \ <Sm|Dm} \]

Where:
\[
\text{cond} \quad \text{Is an optional condition code. See Conditional execution on page 68.}
\]
\[
\text{<Sd|Dd>} \quad \text{Is the destination floating-point value.}
\]
\[
\text{<Sm|Dm>} \quad \text{Is the operand floating-point value.}
\]

Operation
This instruction:
1. Negates a floating-point value.
2. Places the results in a second floating-point register.
The floating-point instruction inverts the sign bit.

Restrictions
There are no restrictions.

Condition flags
These instructions do not change the flags.

3.11.24 VNMLA, VNMLS, VNMUL
Floating-point multiply with negation followed by add or subtract.

Syntax
\[ \text{VNMLA} \{\text{cond}\}.F<32|64> \ <Sd|Dd>, \ <Sn|Dn>, \ <Sm|Dm}\]
\[ \text{VNMLS} \{\text{cond}\}.F<32|64> \ <Sd|Dd>, \ <Sn|Dn>, \ <Sm|Dm}\]
\[ \text{VNMUL} \{\text{cond}\}.F<32|64> \ {<Sd|Dd>}, \ <Sn|Dn>, \ <Sm|Dm}\]

Where:
\[
\text{cond} \quad \text{Is an optional condition code. See Conditional execution on page 68.}
\]
\[
\text{<Sd|Dd>} \quad \text{Is the destination floating-point register.}
\]
<Sn|Dn>, <Sm|Dm> Are the operand floating-point registers.

**Operation**

The VNMLA instruction:
1. Multiplies two floating-point register values.
2. Adds the negation of the floating-point value in the destination register to the negation of the product.
3. Writes the result back to the destination register.

The VNMLS instruction:
1. Multiplies two floating-point register values.
2. Adds the negation of the floating-point value in the destination register to the product.
3. Writes the result back to the destination register.

The VNMUL instruction:
1. Multiplies two floating-point register values.
2. Writes the negation of the result to the destination register.

**Restrictions**

There are no restrictions.

**Condition flags**

These instructions do not change the flags.

### 3.11.25 VPOP

Floating-point extension register Pop.

**Syntax**

VPOP{cond}{.size} list

Where:
- **cond** Is an optional condition code. See *Conditional execution on page 68*.
- **size** Is an optional data size specifier. If present, it must be equal to the size in bits, 32 or 64, of the registers in *list*.
- **list** Is a list of extension registers to be loaded, as a list of consecutively numbered doubleword or singleword registers, separated by commas and surrounded by brackets.

**Operation**

This instruction loads multiple consecutive extension registers from the stack.

**Restrictions**

The list must contain at least one register, and not more than sixteen registers.

**Condition flags**

These instructions do not change the flags.
3.11.26  **VPUSH**

Floating-point extension register Push.

**Syntax**

\[
\text{VPUSH}\{\text{cond}\}{.\text{size}} \text{ list}
\]

Where:

- **cond** Is an optional condition code. See *Conditional execution on page 68*.
- **size** Is an optional data size specifier. If present, it must be equal to the size in bits, 32 or 64, of the registers in list.
- **list** Is a list of the extension registers to be stored, as a list of consecutively numbered doubleword or singleword registers, separated by commas and surrounded by brackets.

**Operation**

This instruction stores multiple consecutive extension registers to the stack.

**Restrictions**

list must contain at least one register, and not more than sixteen.

**Condition flags**

These instructions do not change the flags.

3.11.27  **VSQRT**

Floating-point Square Root.

**Syntax**

\[
\text{VSQRT}\{\text{cond}\}.F<32|64> \text{ <Sd|Dd> }, \text{ <Sm|Dm>}
\]

Where:

- **cond** Is an optional condition code. See *Conditional execution on page 68*.
- **<Sd|Dd>** Is the destination floating-point value.
- **<Sm|Dm>** Is the operand floating-point value.

**Operation**

This instruction:

- Calculates the square root of the value in a floating-point register.
- Writes the result to another floating-point register.

**Restrictions**

There are no restrictions.

**Condition flags**

These instructions do not change the flags.
3.11.28 VSTM
Floating-point Store Multiple.

Syntax

[VSTM(mode)(cond)(.size) Rn(!), list]

Where:
- **mode**
  - IA Increment After. The consecutive addresses start at the address specified in Rn. This is the default and can be omitted.
  - DB Decrement Before. The consecutive addresses end just before the address specified in Rn.
- **cond**
  - Is an optional condition code. See Conditional execution on page 68.
- **size**
  - Is an optional data size specifier. If present, it must be equal to the size in bits, 32 or 64, of the registers in list.
- **Rn**
  - Is the base register. The SP can be used.
- **!**
  - Is the function that causes the instruction to write a modified value back to Rn. Required if mode == DB.
- **list**
  - Is a list of the extension registers to be stored, as a list of consecutively numbered doubleword or singleword registers, separated by commas and surrounded by brackets.

Operation
This instruction stores multiple extension registers to consecutive memory locations using a base address from an Arm core register.

Restrictions
The restrictions are:
- list must contain at least one register. If it contains doubleword registers it must not contain more than 16 registers.
- Use of the PC as Rn is deprecated.

Condition flags
These instructions do not change the flags.

3.11.29 VSTR
Floating-point Store.

Syntax

[VSTR(cond)(.32) Sd, [Rn(, #imm)]]
[VSTR(cond)(.64) Dd, [Rn(, #imm)]]

Where:
- **cond**
  - Is an optional condition code. See Conditional execution on page 68.
- **32, 64**
  - Are the optional data size specifiers.
- **Sd**
  - Is the source register for a singleword store.
- **Dd**
  - Is the source register for a doubleword store.
\( Rn \)  
Is the base register. The SP can be used.

\( \text{imm} \)  
Is the + or - immediate offset used to form the address. Values are multiples of 4 in the range 0-1020. \( \text{imm} \) can be omitted, meaning an offset of +0.

**Operation**

This instruction stores a single extension register to memory, using an address from an Arm core register, with an optional offset, defined in \( \text{imm} \):

**Restrictions**

The use of PC for \( Rn \) is deprecated.

**Condition flags**

These instructions do not change the flags.

### 3.11.30 VSUB

Floating-point Subtract.

**Syntax**

\[
\text{VSUB}(\text{cond}) . F<32|64> \ (\langle Sd | Dd \rangle, \ <Sn | Dn >, \ <Sm | Dm >)
\]

Where:

- \( \text{cond} \) Is an optional condition code. See [Conditional execution on page 68](#).
- \( \langle Sd | Dd \rangle \) Is the destination floating-point value.
- \( \langle Sn | Dn >, \ <Sm | Dm > \)  
  Are the operand floating-point values.

**Operation**

This instruction:

1. Subtracts one floating-point value from another floating-point value.
2. Places the results in the destination floating-point register.

**Restrictions**

There are no restrictions.

**Condition flags**

These instructions do not change the flags.
3.11.31 VSEL

Provides an alternative to a pair of conditional VMOV instructions.

Encoding

\[
\text{VSEL(\text{cond}).F}_{32|64} <\text{Sd}|\text{Dd}>, <\text{Sn}|\text{Dn}>, <\text{Sm}|\text{Dm}>
\]

Where:
- \text{cond} is an optional condition code. See \textit{Conditional execution on page 68}. VSEL has a subset of the condition codes. The condition codes for VSEL are limited to GE, GT, EQ and VS, with the effect that LT, LE, NE and VC is achievable by exchanging the source operands.
- \text{<Sd}|\text{Dd}> is the destination single-precision or double-precision floating-point value.
- \text{<Sn}|\text{Dn}>, \text{<Sm}|\text{Dm}> are the operand single-precision or double-precision floating-point values.

Operation

Depending on the result of the condition code, this instruction moves either:
- \text{<Sn}|\text{Dn}> source register to the destination register.
- \text{<Sm}|\text{Dm}> source register to the destination register.

Restrictions

The VSEL instruction must not occur inside an IT block.

Condition flags

These instructions do not change the flags.

3.11.32 VMAXNM, VMINNM

Return the minimum or the maximum of two floating-point numbers with NaN handling as specified by IEEE754-2008.

Encoding

\[
\text{VMAXNM.F}_{32|64} <\text{Sd}|\text{Dd}>, <\text{Sn}|\text{Dn}>, <\text{Sm}|\text{Dm}>
\]
\[
\text{VMINNM.F}_{32|64} <\text{Sd}|\text{Dd}>, <\text{Sn}|\text{Dn}>, <\text{Sm}|\text{Dm}>
\]

Where:
- \text{<Sd}|\text{Dd}> is the destination single-precision or double-precision floating-point value.
- \text{<Sn}|\text{Dn}>, \text{<Sm}|\text{Dm}> are the operand single-precision or double-precision floating-point values.

Operation

The VMAXNM instruction compares two source registers, and moves the largest to the destination register.

The VMINNM instruction compares two source registers, and moves the lowest to the destination register.
Restrictions
There are no restrictions.

Condition flags
These instructions do not change the flags.

3.11.33 VCVTA, VCVTN, VCVTP, VCVTM
Floating-point to integer conversion with directed rounding.

Syntax
VCVT<rmode>.S32.F<32|64> <Sd>, <Sm|Dm>
VCVT<rmode>.U32.F<32|64> <Sd>, <Sm|Dm>

Where:
<Sd|Dd> Is the destination single-precision or double-precision floating-point value.
<Sn|Dn>, <Sm|Dm> Are the operand single-precision or double-precision floating-point values.

<rmode> Is one of:
A Round to nearest ties away.
M Round to nearest even.
N Round towards plus infinity.
P Round towards minus infinity.

Operation
These instructions:
1. Read the source register.
2. Convert to integer with directed rounding.
3. Write to the destination register.

Restrictions
There are no restrictions.

Condition flags
These instructions do not change the flags.

3.11.34 VRINTR, VRINTX
Round a floating-point value to an integer in floating-point format.

Encoding
VRINT(R,X)(cond).F<32|64> <Sd|Dd>, <Sm|Dm>

Where:
cond Is an optional condition code. See Conditional execution on page 68.
<Sd|Dd> Is the destination floating-point value.
<Sm|Dm> Are the operand floating-point values.
Operation

These instructions:
1. Read the source register.
2. Round to the nearest integer value in floating-point format using the rounding mode specified by the FPSCR.
3. Write the result to the destination register.
4. For the VRINTZX instruction only. Generate a floating-point exception if the result is not exact.

Restrictions

There are no restrictions.

Condition flags

These instructions do not change the flags.

3.11.35 VRINTA, VRINTN, VRINTP, VRINTM, VRINTZ

Round a floating-point value to an integer in floating-point format using directed rounding.

Encoding

VRINT<rmode>.F<32|64> <Sd|Dd>, <Sm|Dm>

Where:
<Sm|Dm> Is the destination single-precision or double-precision floating-point value.
<Sn|Dm>, <Sm|Dm> Are the operand single-precision or double-precision floating-point values.
<rmode> Is one of:
A Round to nearest ties away.
M Round to Nearest Even.
N Round towards Plus Infinity.
P Round towards Minus Infinity.
Z Round towards Zero.

Operation

These instructions:
1. Read the source register.
2. Round to the nearest integer value with a directed rounding mode specified by the instruction.
3. Write the result to the destination register.

Restrictions

These instructions cannot be conditional. These instructions cannot generate an inexact exception even if the result is not exact.
Condition flags
These instructions do not change the flags.

## 3.12 Miscellaneous instructions

*Table 38* shows the remaining Cortex®-M7 instructions:

<table>
<thead>
<tr>
<th>Mnemonic</th>
<th>Brief description</th>
<th>See</th>
</tr>
</thead>
<tbody>
<tr>
<td>BKPT</td>
<td>Breakpoint</td>
<td>BKPT on page 175</td>
</tr>
<tr>
<td>CPSID</td>
<td>Change Processor State, Disable Interrupts</td>
<td>CPS on page 176</td>
</tr>
<tr>
<td>CPSIE</td>
<td>Change Processor State, Enable Interrupts</td>
<td>CPS on page 176</td>
</tr>
<tr>
<td>DMB</td>
<td>Data Memory Barrier</td>
<td>DMB on page 177</td>
</tr>
<tr>
<td>DSB</td>
<td>Data Synchronization Barrier</td>
<td>DSB on page 177</td>
</tr>
<tr>
<td>ISB</td>
<td>Instruction Synchronization Barrier</td>
<td>ISB on page 178</td>
</tr>
<tr>
<td>MRS</td>
<td>Move from special register to register</td>
<td>MRS on page 178</td>
</tr>
<tr>
<td>MSR</td>
<td>Move from register to special register</td>
<td>MSR on page 179</td>
</tr>
<tr>
<td>NOP</td>
<td>No Operation</td>
<td>NOP on page 180</td>
</tr>
<tr>
<td>SEV</td>
<td>Send Event</td>
<td>SEV on page 180</td>
</tr>
<tr>
<td>SVC</td>
<td>Supervisor Call</td>
<td>SVC on page 181</td>
</tr>
<tr>
<td>WFE</td>
<td>Wait For Event</td>
<td>WFE on page 181</td>
</tr>
<tr>
<td>WFI</td>
<td>Wait For Interrupt</td>
<td>WFI on page 182</td>
</tr>
</tbody>
</table>

### 3.12.1 BKPT

Breakpoint.

**Syntax**

`BKPT #imm`

Where:

- `imm` is an expression evaluating to an integer in the range 0-255 (8-bit value).

**Operation**

The BKPT instruction causes the processor to enter Debug state. Debug tools can use this to investigate system state when the instruction at a particular address is reached.

`imm` is ignored by the processor. If required, a debugger can use it to store additional information about the breakpoint.

The BKPT instruction can be placed inside an IT block, but it executes unconditionally, unaffected by the condition specified by the IT instruction.
Condition flags
This instruction does not change the flags.

Examples

BKPT #0x3 ; Breakpoint with immediate value set to 0x3 (debugger can
; extract the immediate value by locating it using the PC)

Arm does not recommend the use of the BKPT instruction with an immediate value set
to 0xAB for any purpose other than Semi-hosting.

3.12.2 CPS
Change Processor State.

Syntax

CPSeffect iflags
Where:

effect Is one of:
IE Clears the special purpose register.
ID Sets the special purpose register.

iflags Is a sequence of one or more flags:
i Set or clear PRIMASK.
f Set or clear FAULTMASK.

Operation

CPS changes the PRIMASK and FAULTMASK special register values. See Exception mask
registers on page 25 for more information about these registers.

Restrictions

The restrictions are:

• Use CPS only from privileged software. It has no effect if used in unprivileged software.
• CPS cannot be conditional and so must not be used inside an IT block.

Condition flags
This instruction does not change the condition flags.

Examples

CPSID i ; Disable interrupts and configurable fault handlers (set
; PRIMASK)
CPSID f ; Disable interrupts and all fault handlers (set FAULTMASK)
CPSIE i ; Enable interrupts and configurable fault handlers (clear
; PRIMASK)
CPSIE f ; Enable interrupts and fault handlers (clear FAULTMASK)
3.12.3 DMB
Data Memory Barrier.

Syntax
DMB(cond)
Where:
cond Is an optional condition code. See Conditional execution on page 68.

Operation
DMB acts as a data memory barrier. It ensures that all explicit memory accesses that appear, in program order, before the DMB instruction are completed before any explicit memory accesses that appear, in program order, after the DMB instruction. DMB does not affect the ordering or execution of instructions that do not access memory.

Condition flags
This instruction does not change the flags.

Examples
DMB ; Data Memory Barrier

3.12.4 DSB
Data Synchronization Barrier.

Syntax
DSB(cond)
Where:
cond Is an optional condition code. See Conditional execution on page 68.

Operation
DSB acts as a special data synchronization memory barrier. Instructions that come after the DSB, in program order, do not execute until the DSB instruction completes. The DSB instruction completes when all explicit memory accesses before it complete.

Condition flags
This instruction does not change the flags.

Examples
DSB ; Data Synchronisation Barrier
3.12.5  ISB

Instruction Synchronization Barrier.

Syntax

ISB{(cond)}

Where:

cond  Is an optional condition code. See Conditional execution on page 68.

Operation

ISB acts as an instruction synchronization barrier. It flushes the pipeline of the processor, so that all instructions following the ISB are fetched from cache or memory again, after the ISB instruction has been completed.

Condition flags

This instruction does not change the flags.

Examples

ISB ; Instruction Synchronisation Barrier

3.12.6  MRS

Move the contents of a special register to a general-purpose register.

Syntax

MRS{(cond) } Rd, spec_reg

Where:

cond  Is an optional condition code. See Conditional execution on page 68.

Rd  Is the destination register.

spec_reg  Can be any of: APSR, IPSR, EPSR, IEPSR, IAPSR, EAPSR, PSR, MSP, PSP, PRIMASK, BASEPRI, BASEPRI_MAX, FAULTMASK, or CONTROL.

All the EPSR and IPSR fields are zero when read by the MRS instruction.

Operation

Use MRS in combination with MSR as part of a read-modify-write sequence for updating a PSR, for example to clear the Q flag.

In process swap code, the programmers model state of the process being swapped out must be saved, including relevant PSR contents. Similarly, the state of the process being swapped in must also be restored. These operations use MRS in the state-saving instruction sequence and MSR in the state-restoring instruction sequence.

BASEPRI_MAX is an alias of BASEPRI when used with the MRS instruction.

See MSR on page 179.
Restrictions

*Rd* must not be SP and must not be PC.

Condition flags

This instruction does not change the flags.

Examples

MRS R0, PRIMASK; Read PRIMASK value and write it to R0

### 3.12.7 MSR

Move the contents of a general-purpose register into the specified special register.

**Syntax**

\[
\text{MSR\{cond\} spec\_reg, Rn}
\]

Where:

- *cond* is an optional condition code. See Conditional execution on page 68.
- *Rn* is the source register.
- *spec\_reg* can be any of: APSR_nzcvq, APSR_g, APSR_nzcvqg, MSP, PSP, PRIMASK, BASEPRI, BASEPRI\_MAX, FAULTMASK, or CONTROL.
  
  APSR can be used to refer to APSR_nzcvq.

**Operation**

The register access operation in MSR depends on the privilege level. Unprivileged software can only access the APSR, see Table 4 on page 23. Privileged software can access all special registers.

In unprivileged software writes to unallocated or execution state bits in the PSR are ignored.

When writing to BASEPRI\_MAX, the instruction writes to BASEPRI only if either:

- *Rn* is non-zero and the current BASEPRI value is 0.
- *Rn* is non-zero and less than the current BASEPRI value.

See MRS on page 178.

**Restrictions**

*Rn* must not be SP and must not be PC.

**Condition flags**

This instruction updates the flags explicitly based on the value in *Rn*.

**Examples**

MSR CONTROL, R1; Read R1 value and write it to the CONTROL register
3.12.8 **NOP**

No Operation.

**Syntax**

NOP\(\text{(cond)}\)

Where:

\(\text{cond}\) is an optional condition code. See *Conditional execution on page 68*.

**Operation**

NOP does nothing. NOP is not necessarily a time-consuming NOP. The processor might remove it from the pipeline before it reaches the execution stage.

Use NOP for padding, for example to place the following instruction on a 64-bit boundary.

**Condition flags**

This instruction does not change the flags.

**Examples**

\[
\text{NOP} \; \text{; No operation}
\]

3.12.9 **SEV**

Send Event.

**Syntax**

SEV\(\text{(cond)}\)

Where:

\(\text{cond}\) is an optional condition code. See *Conditional execution on page 68*.

**Operation**

SEV is a hint instruction that causes an event to be signaled to all processors within a multiprocessor system. It also sets the local event register to 1, see *Power management on page 50*.

**Condition flags**

This instruction does not change the flags.

**Examples**

\[
\text{SEV} \; \text{; Send Event}
\]
3.12.10 SVC
Supervisor Call.

Syntax

```
SVC{cond} #imm
```

Where:

- `cond` is an optional condition code. See Conditional execution on page 68.
- `imm` is an expression evaluating to an integer in the range 0-255 (8-bit value).

Operation

The SVC instruction causes the SVC exception.

`imm` is ignored by the processor. If required, it can be retrieved by the exception handler to determine what service is being requested.

Condition flags

This instruction does not change the flags.

Examples

```
SVC #0x32 ; Supervisor Call (SVCall handler can extract the immediate value by locating it through the stacked PC)
```

3.12.11 WFE
Wait For Event.

Syntax

```
WFE{cond}
```

Where:

- `cond` is an optional condition code. See Conditional execution on page 68.

Operation

WFE is a hint instruction.

If the event register is 0, WFE suspends execution until one of the following events occurs:

- An exception, unless masked by the exception mask registers or the current priority level.
- An exception enters the Pending state, if SEVONPEND in the System Control register is set.
- A Debug Entry request, if Debug is enabled.
- An event signaled by a peripheral or another processor in a multiprocessor system using the SEV instruction.

If the event register is 1, WFE clears it to 0 and returns immediately.

For more information see Power management on page 50.
Condition flags
This instruction does not change the flags.

Examples
WFE ; Wait for event

3.12.12 WFI
Wait for Interrupt.

Syntax
WFI(cond)
Where:
cond Is an optional condition code. See Conditional execution on page 68.

Operation
WFI is a hint instruction that suspends execution until one of the following events occurs:
• A non-masked interrupt occurs and is taken.
• An interrupt masked by PRIMASK becomes pending.
• A Debug Entry request.

Condition flags
This instruction does not change the flags.

Examples
WFI ; Wait for interrupt
Cortex-M7 peripherals

4.1 About the Cortex-M7 peripherals

The address map of the Private peripheral bus (PPB) is:

Table 39. Core peripheral register regions

<table>
<thead>
<tr>
<th>Address</th>
<th>Core peripheral</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000E008-0xE000E00F</td>
<td>System control block</td>
<td>Table 50 on page 192</td>
</tr>
<tr>
<td>0xE000E010-0xE000E01F</td>
<td>System timer</td>
<td>Table 71 on page 213</td>
</tr>
<tr>
<td>0xE000E100-0xE000E4EF</td>
<td>Nested Vectored Interrupt Controller</td>
<td>Table 40 on page 184</td>
</tr>
<tr>
<td>0xE000ED00-0xE000ED3F</td>
<td>System control block</td>
<td>Table 50 on page 192</td>
</tr>
<tr>
<td>0xE000ED78-0xE000ED84</td>
<td>Processor features</td>
<td>Table 77 on page 217</td>
</tr>
<tr>
<td>0xE000ED90-0xE000EDB8</td>
<td>Memory Protection Unit</td>
<td>Table 84 on page 222</td>
</tr>
<tr>
<td>0xE000EF00-0xE000EF03</td>
<td>Nested Vectored Interrupt Controller</td>
<td>Table 40 on page 184</td>
</tr>
<tr>
<td>0xE000EF30-0xE000EF44</td>
<td>Floating-Point Unit</td>
<td>Table 94 on page 233</td>
</tr>
<tr>
<td>0xE000EF50-0xE000EF78</td>
<td>Cache maintenance operations</td>
<td>Table 100 on page 240</td>
</tr>
<tr>
<td>0xE000EF90-0xE000EFA8</td>
<td>Access control</td>
<td>Table 104 on page 245</td>
</tr>
</tbody>
</table>

In the register descriptions:

- the register type is described as follows:
  RW  Read and write.
  RO  Read-only.
  WO  Write-only.

- the required privilege gives the privilege level required to access the register, as follows:
  Privileged
  Only privileged software can access the register.
  Unprivileged
  Both unprivileged and privileged software can access the register.

Attempting to access a privileged register from unprivileged software results in a BusFault.
4.2 Nested Vectored Interrupt Controller

This section describes the NVIC and the registers it uses. The NVIC supports:

- 1 to 240 interrupts.
- A programmable priority level of 0-255 for each interrupt. A higher level corresponds to a lower priority, so level 0 is the highest interrupt priority.
- Level and pulse detection of interrupt signals.
- Dynamic reprioritization of interrupts.
- Grouping of priority values into group priority and subpriority fields.
- Interrupt tail-chaining.
- An external Non Maskable Interrupt (NMI)

The processor automatically stacks its state on exception entry and unstacks this state on exception exit, with no instruction overhead. This provides low latency exception handling.

The hardware implementation of the NVIC registers is:

<table>
<thead>
<tr>
<th>Address</th>
<th>Name</th>
<th>Type</th>
<th>Required privilege</th>
<th>Reset value</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000E100-0xE000E11C</td>
<td>NVIC_ISER0-NVIC_ISER7</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Interrupt set-enable registers on page 185</td>
</tr>
<tr>
<td>0xE000E180-0xE000E19C</td>
<td>NVIC_ICER0-NVIC_ICER7</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Interrupt clear-enable registers on page 186</td>
</tr>
<tr>
<td>0xE000E200-0xE000E21C</td>
<td>NVIC_ISPR0-NVIC_ISPR7</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Interrupt set-pending registers on page 187</td>
</tr>
<tr>
<td>0xE000E280-0xE000E29C</td>
<td>NVIC_ICPR0-NVIC_ICPR7</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Interrupt clear-pending registers on page 188</td>
</tr>
<tr>
<td>0xE000E300-0xE000E31C</td>
<td>NVIC_IABR0-NVIC_IABR7</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Interrupt active bit registers on page 188</td>
</tr>
<tr>
<td>0xE000E400-0xE000E4EF</td>
<td>NVIC_IPR0-NVIC_IPR59</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Interrupt priority registers on page 188</td>
</tr>
<tr>
<td>0xE000EF00</td>
<td>STIR</td>
<td>WO</td>
<td>Configurable(1)</td>
<td>0x00000000</td>
<td>Software trigger interrupt register on page 189</td>
</tr>
</tbody>
</table>

1. See the register description for more information.
4.2.1 Accessing the Cortex®-M7 NVIC registers using CMSIS

CMSIS functions enable the software portability between different Cortex®-M profile processors. To access the NVIC registers when using CMSIS, use the following functions:

<table>
<thead>
<tr>
<th>CMSIS function</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>void NVIC_EnableIRQ(IRQn_Type IRQn)</td>
<td>Enables an interrupt or exception.</td>
</tr>
<tr>
<td>void NVIC_DisableIRQ(IRQn_Type IRQn)</td>
<td>Disables an interrupt or exception.</td>
</tr>
<tr>
<td>void NVIC_SetPendingIRQ(IRQn_Type IRQn)</td>
<td>Sets the pending status of interrupt or exception to 1.</td>
</tr>
<tr>
<td>void NVIC_ClearPendingIRQ(IRQn_Type IRQn)</td>
<td>Clears the pending status of interrupt or exception to 0.</td>
</tr>
<tr>
<td>uint32_t NVIC_GetPendingIRQ(IRQn_Type IRQn)</td>
<td>Reads the pending status of interrupt or exception. This function returns non-zero value if the pending status is set to 1.</td>
</tr>
<tr>
<td>void NVIC_SetPriority(IRQn_Type IRQn, uint32_t priority)</td>
<td>Sets the priority of an interrupt or exception with configurable priority level to 1.</td>
</tr>
<tr>
<td>uint32_t NVIC_GetPriority(IRQn_Type IRQn)</td>
<td>Reads the priority of an interrupt or exception with configurable priority level. This function return the current priority level.</td>
</tr>
</tbody>
</table>

1. The input parameter IRQn is the IRQ number, see Table 19 on page 40 for more information.

4.2.2 Interrupt set-enable registers

The NVIC_ISER0-NVIC_ISER7 registers enable interrupts, and show which interrupts are enabled. See the register summary in Table 40 on page 184 for the register attributes.

The bit assignments are:

**Figure 17. ISER bit assignments**

**Table 42. ISER bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
</table>
| [31:0] | SETENA | Interrupt set-enable bits.  
| Write: | | 0: No effect.  
| | | 1: Enable interrupt.  
| Read: | | 0: Interrupt disabled.  
| | | 1: Interrupt enabled. |
If a pending interrupt is enabled, the NVIC activates the interrupt based on its priority. If an interrupt is not enabled, asserting its interrupt signal changes the interrupt state to pending, but the NVIC never activates the interrupt, regardless of its priority.

4.2.3 Interrupt clear-enable registers

The NVIC_ICER0-NVIC_ICER7 registers disable interrupts, and show which interrupts are enabled. See the register summary in Table 40 on page 184 for the register attributes.

The bit assignments are:

![Figure 18. ICER bit assignment](MSv39677V1)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:0]</td>
<td>CLRENA</td>
<td>Interrupt clear-enable bits.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Write:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No effect.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Disable interrupt.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: Interrupt disabled.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Interrupt enabled.</td>
</tr>
</tbody>
</table>

4.2.4 Interrupt set-pending registers

The NVIC_ISPR0-NVIC_ISPR7 registers force interrupts into the pending state, and show which interrupts are pending. See the register summary in Table 40 on page 184 for the register attributes.

The bit assignments are:

![Figure 19. ISPR bit assignments](MSv39677V1)
Writing 1 to the ISPR bit corresponding to:
- An interrupt that is pending has no effect.
- A disabled interrupt sets the state of that interrupt to pending.

### 4.2.5 Interrupt clear-pending registers

The NVIC_ICPR0-NCVIC_ICPR7 registers remove the pending state from interrupts, and show which interrupts are pending. See the register summary in Table 40 on page 184 for the register attributes.

The bit assignments are:

**Table 44. ISPR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:0]</td>
<td>SETPEND</td>
<td>Interrupt set-pending bits. Write:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No effect. 1: Changes interrupt state to pending.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: Interrupt is not pending. 1: Interrupt is pending.</td>
</tr>
</tbody>
</table>

Writing 1 to an ICPR bit does not affect the active state of the corresponding interrupt.
4.2.6  Interrupt active bit registers

The NVIC_IABR0-NVIC_IABR7 registers indicate which interrupts are active. See the register summary in Table 40 on page 184 for the register attributes.

The bit assignments are:

**Figure 21. IABR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:0]</td>
<td>ACTIVE</td>
<td>Interrupt active flags: 0: Interrupt not active. 1: Interrupt active.</td>
</tr>
</tbody>
</table>

A bit is read as one if the status of the corresponding interrupt is active or active and pending.

4.2.7  Interrupt priority registers

The NVIC_IPR0-NVIC_IPR59 registers provide an 8-bit priority field for each interrupt. These registers are byte-accessible. See the register summary in Table 40 on page 184 for their attributes. Each register holds four priority fields as shown:

**Figure 22. IPR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>IPR59</td>
<td>PRI_N239</td>
<td>PRI_N238</td>
</tr>
<tr>
<td>IPR0</td>
<td>PRI_N4n+3</td>
<td>PRI_N4n+2</td>
</tr>
<tr>
<td>IPR0</td>
<td>PRI_N3</td>
<td>PRI_N2</td>
</tr>
</tbody>
</table>
Find the IPR number and byte offset for interrupt \( m \) as follows:

- the corresponding IPR number, see Table 46 on page 188
- the byte offset of the required Priority field in this register is \( m \) DIV 4, where:
  - Byte offset 0 refers to register bits[7:0].
  - Byte offset 1 refers to register bits[15:8].
  - Byte offset 2 refers to register bits[23:16].
  - Byte offset 3 refers to register bits[31:24].

### 4.2.8 Software trigger interrupt register

Write to the STIR to generate an interrupt from software. See the register summary in Table 40 on page 184 for the STIR attributes.

When the USERSETMPEEND bit in the SCR is set to 1, unprivileged software can access the STIR, see System control register on page 199.

Only privileged software can enable unprivileged access to the STIR.

The bit assignments are:

#### Table 47. IPR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:24]</td>
<td>Priority, byte offset 3</td>
<td>Each priority field holds a priority value, 0-255. The lower the value, the greater the priority of the corresponding interrupt. The processor implements only bits[7:n] of each field, bits[n-1:0] read as zero and ignore writes.</td>
</tr>
<tr>
<td>[23:16]</td>
<td>Priority, byte offset 2</td>
<td></td>
</tr>
<tr>
<td>[15:8]</td>
<td>Priority, byte offset 1</td>
<td></td>
</tr>
<tr>
<td>[7:0]</td>
<td>Priority, byte offset 0</td>
<td></td>
</tr>
</tbody>
</table>

#### Table 48. STIR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Field</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:9]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[8:0]</td>
<td>INTID</td>
<td>Interrupt ID of the interrupt to trigger, in the range 0-239. For example, a value of 0x03 specifies interrupt IRQ3.</td>
</tr>
</tbody>
</table>
4.2.9 Level-sensitive and pulse interrupts

The processor supports both level-sensitive and pulse interrupts. The pulse interrupts are also described as edge-triggered interrupts.

A level-sensitive interrupt is held asserted until the peripheral deasserts the interrupt signal. Typically this happens because the ISR accesses the peripheral, causing it to clear the interrupt request. A pulse interrupt is an interrupt signal sampled synchronously on the rising edge of the processor clock. To ensure the NVIC detects the interrupt, the peripheral must assert the interrupt signal for at least one clock cycle, during which the NVIC detects the pulse and latches the interrupt.

When the processor enters the ISR, it automatically removes the pending state from the interrupt, see Hardware and software control of interrupts on page 190. For a level-sensitive interrupt, if the signal is not deasserted before the processor returns from the ISR, the interrupt becomes pending again, and the processor must execute its ISR again. This means that the peripheral can hold the interrupt signal asserted until it no longer requires servicing.

Hardware and software control of interrupts

- The Cortex®-M7 latches all interrupts. A peripheral interrupt becomes pending for one of the following reasons:
  - The NVIC detects that the interrupt signal is HIGH and the interrupt is not active.
  - The NVIC detects a rising edge on the interrupt signal.
  - Software writes to the corresponding interrupt set-pending register bit, see Interrupt set-pending registers on page 186, or to the STIR to make an interrupt pending, see Software trigger interrupt register on page 189.

A pending interrupt remains pending until one of the following:

- The processor enters the ISR for the interrupt. This changes the state of the interrupt from pending to active. Then:
  - For a level-sensitive interrupt, when the processor returns from the ISR, the NVIC samples the interrupt signal. If the signal is asserted, the state of the interrupt changes to pending, which might cause the processor to immediately re-enter the ISR. Otherwise, the state of the interrupt changes to inactive.
  - For a pulse interrupt, the NVIC continues to monitor the interrupt signal, and if this is pulsed the state of the interrupt changes to pending and active. In this case, when the processor returns from the ISR the state of the interrupt changes to pending, which might cause the processor to immediately re-enter the ISR.
  - If the interrupt signal is not pulsed while the processor is in the ISR, when the processor returns from the ISR the state of the interrupt changes to inactive.

- The software writes to the corresponding interrupt clear-pending register bit.
  For a level-sensitive interrupt, if the interrupt signal is still asserted, the state of the interrupt does not change. Otherwise, the state of the interrupt changes to inactive.
  For a pulse interrupt, the state of the interrupt changes to:
    - Inactive, if the state was pending.
    - Active, if the state was active and pending.
4.2.10 NVIC design hints and tips

Ensure that the software uses correctly aligned register accesses. The processor does not support unaligned accesses to NVIC registers. See the individual register descriptions for the supported access sizes.

An interrupt can enter pending state even if it is disabled. Disabling an interrupt only prevents the processor from taking that interrupt.

Before programming VTOR to relocate the vector table, ensure the vector table entries of the new vector table are set up for fault handlers, NMI, and all enabled exception-like interrupts. For more information see Vector table offset register on page 197.

NVIC programming hints

The software uses the CPSIE I and CPSID I instructions to enable and disable interrupts. The CMSIS provides the following intrinsic functions for these instructions:

void __disable_irq(void) // Disable Interrupts
void __enable_irq(void) // Enable Interrupts

In addition, the CMSIS provides a number of functions for NVIC control, including:

<table>
<thead>
<tr>
<th>CMSIS interrupt control function</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>void NVIC_SetPriorityGrouping(uint32_t priority_grouping)</td>
<td>Set the priority grouping</td>
</tr>
<tr>
<td>void NVIC_EnableIRQ(IRQn_t IRQn)</td>
<td>Enable IRQn</td>
</tr>
<tr>
<td>void NVIC_DisableIRQ(IRQn_t IRQn)</td>
<td>Disable IRQn</td>
</tr>
<tr>
<td>uint32_t NVIC_GetPendingIRQ (IRQn_t IRQn)</td>
<td>Return true (IRQ-Number) if IRQn is pending</td>
</tr>
<tr>
<td>void NVIC_SetPendingIRQ (IRQn_t IRQn)</td>
<td>Set IRQn pending</td>
</tr>
<tr>
<td>void NVIC_ClearPendingIRQ (IRQn_t IRQn)</td>
<td>Clear IRQn pending status</td>
</tr>
<tr>
<td>uint32_t NVIC_GetActive (IRQn_t IRQn)</td>
<td>Return the IRQ number of the active interrupt</td>
</tr>
<tr>
<td>void NVIC_SetPriority (IRQn_t IRQn, uint32_t priority)</td>
<td>Set priority for IRQn</td>
</tr>
<tr>
<td>uint32_t NVIC_GetPriority (IRQn_t IRQn)</td>
<td>Read priority of IRQn</td>
</tr>
</tbody>
</table>

The input parameter IRQn is the IRQ number, see Table 19 on page 40. For more information about these functions see the CMSIS documentation.
4.3 System control block

The System Control Block (SCB) provides system implementation information, and system control. This includes configuration, control, and reporting of the system exceptions. The system control block registers are:

Table 50. Summary of the system control block registers

<table>
<thead>
<tr>
<th>Address</th>
<th>Name</th>
<th>Type</th>
<th>Required privilege</th>
<th>Reset value</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000E008</td>
<td>ACTLR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Auxiliary control register on page 193</td>
</tr>
<tr>
<td>0xE000ED00</td>
<td>CPUID</td>
<td>RO</td>
<td>Privileged</td>
<td>0x410FC270</td>
<td>CPUID base register on page 194</td>
</tr>
<tr>
<td>0xE000ED04</td>
<td>ICSR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Interrupt control and state register on page 194</td>
</tr>
<tr>
<td>0xE000ED08</td>
<td>VTOR</td>
<td>RW</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Vector table offset register on page 197</td>
</tr>
<tr>
<td>0xE000ED0C</td>
<td>AIRCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0xFA050000</td>
<td>Application interrupt and reset control register on page 197</td>
</tr>
<tr>
<td>0xE000ED10</td>
<td>SCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>System control register on page 199</td>
</tr>
<tr>
<td>0xE000ED14</td>
<td>CCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000200</td>
<td>Configuration and control register on page 200</td>
</tr>
<tr>
<td>0xE000ED18</td>
<td>SHPR1</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>System handler priority register 1 on page 202</td>
</tr>
<tr>
<td>0xE000ED1C</td>
<td>SHPR2</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>System handler priority register 2 on page 203</td>
</tr>
<tr>
<td>0xE000ED20</td>
<td>SHPR3</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>System handler priority register 3 on page 203</td>
</tr>
<tr>
<td>0xE000ED24</td>
<td>SHCRS</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>System handler control and state register on page 204</td>
</tr>
<tr>
<td>0xE000ED28</td>
<td>CFSR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Configurable fault status register on page 205</td>
</tr>
<tr>
<td>0xE000ED28</td>
<td>MMSR(2)</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00</td>
<td>MemManage fault address register on page 211</td>
</tr>
<tr>
<td>0xE000ED29</td>
<td>BFSR(2)</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00</td>
<td>BusFault status register on page 207</td>
</tr>
<tr>
<td>0xE000ED2A</td>
<td>UFSR(2)</td>
<td>RW</td>
<td>Privileged</td>
<td>0x0000</td>
<td>Auxiliary control register on page 193</td>
</tr>
<tr>
<td>0xE000ED2C</td>
<td>HFSR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>HardFault status register on page 210</td>
</tr>
<tr>
<td>0xE000ED34</td>
<td>MMAR</td>
<td>RW</td>
<td>Privileged</td>
<td>Unknown</td>
<td>MemManage fault address register on page 211</td>
</tr>
<tr>
<td>0xE000ED38</td>
<td>BFAR</td>
<td>RW</td>
<td>Privileged</td>
<td>Unknown</td>
<td>BusFault address register on page 212</td>
</tr>
<tr>
<td>0xE000ED3C</td>
<td>AFSR</td>
<td>RAZ/WI</td>
<td>Privileged</td>
<td>-</td>
<td>Auxiliary Fault Status register not implemented</td>
</tr>
</tbody>
</table>

1. See the register description for more information.
2. A subregister of the CFSR.
### 4.3.1 Auxiliary control register

The ACTLR provides disable bits for the following processor functions:

- FPU exception outputs.
- Dual-issue functionality.
- Flushing of the trace output from the ITM and DWT.
- Dynamic read allocate mode.

By default this register is set to provide optimum performance from the Cortex®-M7 processor, and does not normally require modification.

See the register summary in Table 50 on page 192 for the ACTLR attributes. The bit assignments are:

#### Figure 24. ACTLR bit assignments

<table>
<thead>
<tr>
<th>Bit Location</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:13]</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>[12]</td>
<td>DISITMATBFLUSH</td>
<td>Disables ITM and DWT ATB flush:</td>
</tr>
<tr>
<td></td>
<td>0: Normal operation.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>1: ITM and DWT ATB flush disabled.</td>
<td></td>
</tr>
<tr>
<td>[11]</td>
<td>DISRAMODE</td>
<td>Disables dynamic read allocate mode for Write-Back Write-Allocate memory regions:</td>
</tr>
<tr>
<td></td>
<td>0: Normal operation.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>1: Dynamic read allocate mode disabled.</td>
<td></td>
</tr>
<tr>
<td>[10]</td>
<td>FPEXCODIS</td>
<td>Disables FPU exception outputs:</td>
</tr>
<tr>
<td></td>
<td>0: Normal operation.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>1: FPU exception outputs are disabled.</td>
<td></td>
</tr>
<tr>
<td>[9:3]</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>[2]</td>
<td>DISFOLD</td>
<td>Disables dual-issue functionality:</td>
</tr>
<tr>
<td></td>
<td>0: Normal operation.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>1: Dual-issue functionality is disabled. Setting this bit decreases performance.</td>
<td></td>
</tr>
<tr>
<td>[1:0]</td>
<td>Reserved</td>
<td></td>
</tr>
</tbody>
</table>
4.3.2 CPUID base register

The CPUID register contains the processor part number, version, and implementation information. See the register summary in Table 50 on page 192 for its attributes. The bit assignments are:

![Figure 25. CPUID bit assignments](MSv39661V1)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:24]</td>
<td>Implementer</td>
<td>Implementer code: 0x41 Arm</td>
</tr>
<tr>
<td>[23:20]</td>
<td>Variant</td>
<td>Variant number, the r value in the rnpn product revision identifier: 0x0 Revision 0</td>
</tr>
<tr>
<td>[19:16]</td>
<td>Constant</td>
<td>Reads as 0xF</td>
</tr>
<tr>
<td>[15:4]</td>
<td>PartNo</td>
<td>Part number of the processor: 0xC27: Cortex®-M7</td>
</tr>
<tr>
<td>[3:0]</td>
<td>Revision</td>
<td>Revision number, the p value in the rnpn product revision identifier: 0x0 Patch 0</td>
</tr>
</tbody>
</table>

4.3.3 Interrupt control and state register

The ICSR:

- provides:
  - A set-pending bit for the Non Maskable Interrupt (NMI) exception.
  - Set-pending and clear-pending bits for the PendSV and SysTick exceptions.

- indicates:
  - The exception number of the exception being processed.
  - Whether there are preempted active exceptions.
  - The exception number of the highest priority pending exception
  - Whether any interrupts are pending.

See the register summary in Table 50 on page 192, and the type descriptions in Table 53 on page 195, for the ICSR attributes. The bit assignments are:
Table 53. ICSR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31]</td>
<td>NMIPENDSET</td>
<td>RW</td>
<td>NMI set-pending bit. Write: 0: No effect, 1: Changes NMI exception state to pending. Read: 0: NMI exception is not pending, 1: NMI exception is pending. Because NMI is the highest-priority exception, normally the processor enters the NMI exception handler as soon as it registers a write of 1 to this bit, and entering the handler clears this bit to 0. A read of this bit by the NMI exception handler returns 1 only if the NMI signal is reasserted while the processor is executing that handler.</td>
</tr>
<tr>
<td>[30:29]</td>
<td>-</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[28]</td>
<td>PENDSVSET</td>
<td>RW</td>
<td>PendSV set-pending bit. Write: 0: No effect, 1: Changes PendSV exception state to pending. Read: 0: PendSV exception is not pending, 1: PendSV exception is pending. Writing 1 to this bit is the only way to set the PendSV exception state to pending.</td>
</tr>
<tr>
<td>[27]</td>
<td>PENDSVCLR</td>
<td>WO</td>
<td>PendSV clear-pending bit. Write: 0: No effect, 1: Removes the pending state from the PendSV exception.</td>
</tr>
<tr>
<td>[26]</td>
<td>PENDSTSET</td>
<td>RW</td>
<td>SysTick exception set-pending bit. Write: 0: No effect, 1: Changes SysTick exception state to pending. Read: 0: SysTick exception is not pending, 1: SysTick exception is pending.</td>
</tr>
</tbody>
</table>
When writing to the ICSR, the effect is unpredictable if the user:

- Writes 1 to the PENDSVSET bit and writes 1 to the PENDSVCLR bit.
- Writes 1 to the PENDSTSET bit and writes 1 to the PENDSTCLR bit.

---

**Table 53. ICSR bit assignments (continued)**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td></td>
<td></td>
<td>Write:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: No effect.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1: Removes the pending state from the SysTick exception.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>This bit is WO. On a register read its value is <strong>Unknown</strong>.</td>
</tr>
<tr>
<td>[23]</td>
<td>Reserved for Debug</td>
<td>RO</td>
<td>This bit is reserved for Debug use and reads-as-zero when the processor</td>
</tr>
<tr>
<td></td>
<td>use</td>
<td></td>
<td>is not in Debug.</td>
</tr>
<tr>
<td>[22]</td>
<td>ISRPENDING</td>
<td>RO</td>
<td>Interrupt pending flag, excluding NMI and Faults:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: Interrupt not pending.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1: Interrupt pending.</td>
</tr>
<tr>
<td>[20:12]</td>
<td>VECTPENDING</td>
<td>RO</td>
<td>Indicates the exception number of the highest priority pending enabled</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>exception:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: No pending exceptions.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>Nonzero: The exception number of the highest priority pending enabled</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>exception.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>The value indicated by this field includes the effect of the BASEPRI and</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>FAULTMASK registers, but not any effect of the PRIMASK register.</td>
</tr>
<tr>
<td>[11]</td>
<td>RETTOBASE</td>
<td>RO</td>
<td>Indicates whether there are preempted active exceptions:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: There are preempted active exceptions to execute.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1: There are no active exceptions, or the currently-executing exception</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>is the only active exception.</td>
</tr>
<tr>
<td>[8:0]</td>
<td>VECTACTIVE(^{(1)})</td>
<td>RO</td>
<td>Contains the active exception number:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: Thread mode</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1: The exception number(^{(1)}) of the currently active exception.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>Subtract 16 from this value to obtain the CMSIS IRQ number required to</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>index into the Interrupt Clear-Enable, Set-Enable, Clear-Pending, Set-</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>Pending, or Priority Registers, see <strong>Table 5 on page 24</strong>.</td>
</tr>
</tbody>
</table>

---

1. This is the same value as IPSR bits[8:0], see Interrupt program status register on page 23.
### 4.3.4 Vector table offset register

The VTOR indicates the offset of the vector table base address from memory address 0x00000000. See the register summary in *Table 50 on page 192* for its attributes. The bit assignments are:

![Figure 27. VTOR bit assignments](MSv39699V1)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[8:0]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
</tbody>
</table>

When setting TBLOFF, the user must align the offset to the number of exception entries in the vector table.

The table alignment requirements mean that bits [8:0] of the table offset are always zero.

### 4.3.5 Application interrupt and reset control register

The AIRCR provides priority grouping control for the exception model, endian status for data accesses, and reset control of the system. See the register summary in *Table 50 on page 192* and *Table 55 on page 198* for its attributes.

To write to this register, the user must write 0x5FA to the VECTKEY field, otherwise the processor ignores the write.

The bit assignments are:

![Figure 28. AIRCR bit assignments](MSv39651V1)
Table 55. AIRCR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
</table>
| [31:16]| Read: VECTKEYSTAT, Write: VECTKEY | RW     | Register key:  
|        |                               |        | Reads as 0xFA05  
|        |                               |        | On writes, write 0x5FA to VECTKEY, otherwise the write is ignored.                                                                      |
| [15]   | ENDIANNESS                    | RO     | Data endianness bit:  
|        |                               |        | 0: Little-endian.                                                                                                                      |
| [10:8] | PRIGROUP                      | RW     | Interrupt priority grouping field. This field determines the split of group priority from subpriority, see Binary point.               |
| [7:3]  | -                             | -      | Reserved.                                                                                                                               |
| [2]    | SYSRESETREQ                   | WO     | System reset request:  
|        |                               |        | 0: No system reset request.  
|        |                               |        | 1: Asserts a signal to the outer system that requests a reset. This is intended to force a large system reset of all major components except for debug.  
|        |                               |        | This bit reads as 0.                                                                                                                    |
| [1]    | VECTCLRACTIVE                 | WO     | Reserved for Debug use. This bit reads as 0. When writing to the register the user must write 0 to this bit, otherwise behavior is Unpredictable. |
| [0]    | VECTRESET                     | WO     | Reserved for Debug use. This bit reads as 0. When writing to the register the user must write 0 to this bit, otherwise behavior is Unpredictable. |

Binary point

The PRIGROUP field indicates the position of the binary point that splits the PRI_n fields in the Interrupt Priority Registers into separate group priority and subpriority fields. Table 56 shows how the PRIGROUP value controls this split.

If the user implements fewer than 8 priority bits he might require more explanation here, and want to remove invalid rows from the table, and modify the entries in the number of columns.

Table 56. Priority grouping

<table>
<thead>
<tr>
<th>PRIGROUP</th>
<th>Binary point(1)</th>
<th>Group priority bits</th>
<th>Subpriority bits</th>
<th>Group priorities</th>
<th>Subpriorities</th>
</tr>
</thead>
<tbody>
<tr>
<td>0b000</td>
<td>bxxxxxy</td>
<td>[7:1]</td>
<td>[0]</td>
<td>128</td>
<td>2</td>
</tr>
<tr>
<td>0b001</td>
<td>bxxxxxyy</td>
<td>[7:2]</td>
<td>[1:0]</td>
<td>64</td>
<td>4</td>
</tr>
<tr>
<td>0b010</td>
<td>bxxxxyyy</td>
<td>[7:3]</td>
<td>[2:0]</td>
<td>32</td>
<td>8</td>
</tr>
<tr>
<td>0b011</td>
<td>bxxxxyyyy</td>
<td>[7:4]</td>
<td>[3:0]</td>
<td>16</td>
<td>16</td>
</tr>
<tr>
<td>0b100</td>
<td>bxxxYYYYY</td>
<td>[7:5]</td>
<td>[4:0]</td>
<td>8</td>
<td>32</td>
</tr>
</tbody>
</table>
Table 56. Priority grouping (continued)

<table>
<thead>
<tr>
<th>PRIGROUP</th>
<th>Binary point(1)</th>
<th>Group priority bits</th>
<th>Subpriority bits</th>
<th>Number of Group priorities</th>
<th>Number of Subpriorities</th>
</tr>
</thead>
<tbody>
<tr>
<td>0b101</td>
<td>bxx.yyyyyy</td>
<td>[7:6]</td>
<td>[5:0]</td>
<td>4</td>
<td>64</td>
</tr>
<tr>
<td>0b110</td>
<td>bx.yyyyyyy</td>
<td>[7]</td>
<td>[6:0]</td>
<td>2</td>
<td>128</td>
</tr>
<tr>
<td>0b111</td>
<td>b.yyyyyyy</td>
<td>None</td>
<td>[7:0]</td>
<td>1</td>
<td>256</td>
</tr>
</tbody>
</table>

1. PRI_{n}[7:0] field showing the binary point. x denotes a group priority field bit, and y denotes a subpriority field bit.

Note: Determining preemption of an exception uses only the group priority field, see Interrupt priority grouping on page 43.

4.3.6 System control register

The SCR controls features of entry to and exit from Low-power state. See the register summary in Table 50 on page 192 for its attributes. The bit assignments are

Figure 29. SCR bit assignments:

Table 57. SCR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:5]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
</tbody>
</table>
| [4]   | SEVONPEND  | Send Event on Pending bit:  
0: Only enabled interrupts or events can wakeup the processor, disabled interrupts are excluded.  
1: Enabled events and all interrupts, including disabled interrupts, can wakeup the processor.  
When an event or interrupt enters pending state, the event signal wakes up the processor from WFE. If the processor is not waiting for an event, the event is registered and affects the next WFE.  
The processor also wakes up on execution of an SEV instruction or an external event. |
4.3.7 Configuration and control register

The CCR controls entry to Thread mode and enables:
- The handlers for NMI, hard fault and faults escalated by FAULTMASK to ignore BusFaults.
- Trapping of divide by zero and unaligned accesses.
- Access to the STIR by unprivileged software, see *Software trigger interrupt register on page 189*.
- Instruction and data cache enable control.

See the register summary in *Table 50 on page 192* for the CCR attributes.

The bit assignments are:

**Table 57. SCR bit assignments (continued)**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
</table>
| [2]  | SLEEPDEEP    | Controls whether the processor uses sleep or deep sleep as its Low-power mode:  
|      |              | 0: Sleep.  
|      |              | 1: Deep sleep.               |
| [1]  | SLEEPONEXIT  | Indicates sleep-on-exit when returning from Handler mode to Thread mode:  
|      |              | 0: Do not sleep when returning to Thread mode.  
|      |              | 1: Enter sleep, or deep sleep, on return from an ISR.  
|      |              | Setting this bit to 1 enables an interrupt driven application to avoid returning to an empty main application. |
| [0]  | -            | Reserved.                                                                |

**Figure 30. CCR bit assignments**
<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:19]</td>
<td>-</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[18]</td>
<td>BP</td>
<td>RO</td>
<td>Always reads-as-one. It indicates branch prediction is enabled.</td>
</tr>
<tr>
<td>[17]</td>
<td>IC</td>
<td>RW</td>
<td>Enables L1 instruction cache:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: L1 instruction cache disabled.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1: L1 instruction cache enabled.</td>
</tr>
<tr>
<td>[16]</td>
<td>DC</td>
<td>RW</td>
<td>Enables L1 data cache:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: L1 data cache disabled.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1: L1 data cache enabled.</td>
</tr>
<tr>
<td>[15:10]</td>
<td>-</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[9]</td>
<td>STKALIGN</td>
<td>RO</td>
<td>Always reads-as-one. It indicates stack alignment on exception entry is 8-byte aligned. On exception entry, the processor uses bit[9] of the stacked PSR to indicate the stack alignment. On return from the exception it uses this stacked bit to restore the correct stack alignment.</td>
</tr>
<tr>
<td>[8]</td>
<td>BFHFNMIGN</td>
<td>RW</td>
<td>Enables handlers with priority -1 or -2 to ignore data BusFaults caused by load and store instructions. This applies to the hard fault, NMI, and FAULTMASK escalated handlers:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: Data bus faults caused by load and store instructions cause a lock-up.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1: Handlers running at priority -1 and -2 ignore data bus faults caused by load and store instructions. Set this bit to 1 only when the handler and its data are in absolutely safe memory. The normal use of this bit is to probe system devices and bridges to detect control path problems and fix them.</td>
</tr>
<tr>
<td>[7:5]</td>
<td>-</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[4]</td>
<td>DIV_0_TRP</td>
<td>RW</td>
<td>Enables faulting or halting when the processor executes an SDIV or UDIV instruction with a divisor of 0:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: Do not trap divide by 0.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1: Trap divide by 0.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>When this bit is set to 0, a divide by zero returns a quotient of 0.</td>
</tr>
<tr>
<td>[3]</td>
<td>UNALIGN_TRP</td>
<td>RW</td>
<td>Enables unaligned access traps:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>0: Do not trap unaligned halfword and word accesses.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1: Trap unaligned halfword and word accesses.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>If this bit is set to 1, an unaligned access generates a UsageFault.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>Unaligned LDM, STM, LDRD, and STRD instructions always fault irrespective of whether UNALIGN_TRP is set to 1.</td>
</tr>
</tbody>
</table>
4.3.8 System handler priority registers

The SHPR1-SHPR3 registers set the priority level, 0 to 255 of the exception handlers that have configurable priority.

SHPR1-SHPR3 are byte accessible. See the register summary in Table 50 on page 192 for their attributes.

The system fault handlers and the priority field and register for each handler are:

<table>
<thead>
<tr>
<th>Handler</th>
<th>Field</th>
<th>Register description</th>
</tr>
</thead>
<tbody>
<tr>
<td>MemManage</td>
<td>PRI_4</td>
<td>System handler priority register 1</td>
</tr>
<tr>
<td>BusFault</td>
<td>PRI_5</td>
<td>System handler priority register 2</td>
</tr>
<tr>
<td>UsageFault</td>
<td>PRI_6</td>
<td>System handler priority register 3</td>
</tr>
<tr>
<td>SVC1Cal</td>
<td>PRI_11</td>
<td></td>
</tr>
<tr>
<td>PendSV</td>
<td>PRI_14</td>
<td></td>
</tr>
<tr>
<td>SysTick</td>
<td>PRI_15</td>
<td></td>
</tr>
</tbody>
</table>

Each PRI_n field is 8 bits wide, but the processor implements only bits[7:M] of each field, and bits[M-1:0] read as zero and ignore writes.

System handler priority register 1

The bit assignments are:

![Figure 31. SHPR1 bit assignments](attachment:MSv395689V1)
### System handler priority register 2

The bit assignments are:

**Figure 32. SHPR2 bit assignments**

```
<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:24]</td>
<td>PRI_7</td>
<td>Reserved</td>
</tr>
<tr>
<td>[23:16]</td>
<td>PRI_6</td>
<td>Priority of system handler 6, UsageFault</td>
</tr>
<tr>
<td>[15:8]</td>
<td>PRI_5</td>
<td>Priority of system handler 5, BusFault</td>
</tr>
<tr>
<td>[7:0]</td>
<td>PRI_4</td>
<td>Priority of system handler 4, MemManage</td>
</tr>
</tbody>
</table>
```

### System handler priority register 3

The bit assignments are:

**Figure 33. SHPR3 bit assignments**

```
<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:24]</td>
<td>PRI_11</td>
<td>Priority of system handler 11, SVCall</td>
</tr>
<tr>
<td>[23:0]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[31:24]</td>
<td>PRI_15</td>
<td>Priority of system handler 15, SysTick exception</td>
</tr>
<tr>
<td>[23:16]</td>
<td>PRI_14</td>
<td>Priority of system handler 14, PendSV</td>
</tr>
<tr>
<td>[15:0]</td>
<td>-</td>
<td>Reserved</td>
</tr>
</tbody>
</table>
```
4.3.9 System handler control and state register

The SHCSR enables the system handlers, and indicates:

- The pending status of the BusFault, MemManage fault, and SVC exceptions.
- The active status of the system handlers.

See the register summary in Table 50 on page 192 for the SHCSR attributes. The bit assignments are:

**Table 63. SHCSR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:19]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[18]</td>
<td>USGFAULTENA</td>
<td>UsageFault enable bit, set to 1 to enable(^{(1)})</td>
</tr>
<tr>
<td>[17]</td>
<td>BUSFAULTENA</td>
<td>BusFault enable bit, set to 1 to enable(^{(1)})</td>
</tr>
<tr>
<td>[16]</td>
<td>MEMFAULTENA</td>
<td>MemManage enable bit, set to 1 to enable(^{(1)})</td>
</tr>
<tr>
<td>[15]</td>
<td>SVCALLPENDED</td>
<td>SVCall pending bit, reads as 1 if exception is pending(^{(2)})</td>
</tr>
<tr>
<td>[14]</td>
<td>BUSFAULTPENDED</td>
<td>BusFault exception pending bit, reads as 1 if exception is pending(^{(2)})</td>
</tr>
<tr>
<td>[13]</td>
<td>MEMFAULTPENDED</td>
<td>MemManage exception pending bit, reads as 1 if exception is pending(^{(2)})</td>
</tr>
<tr>
<td>[12]</td>
<td>USGFAULTPENDED</td>
<td>UsageFault exception pending bit, reads as 1 if exception is pending(^{(2)})</td>
</tr>
<tr>
<td>[11]</td>
<td>SYSTICKACT</td>
<td>SysTick exception active bit, reads as 1 if exception is active(^{(3)})</td>
</tr>
<tr>
<td>[10]</td>
<td>PENDSVACT</td>
<td>PendSV exception active bit, reads as 1 if exception is active</td>
</tr>
<tr>
<td>[9]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[8]</td>
<td>MONITORACT</td>
<td>Debug monitor active bit, reads as 1 if Debug monitor is active</td>
</tr>
<tr>
<td>[7]</td>
<td>SVCALLACT</td>
<td>SVCall active bit, reads as 1 if SVC call is active</td>
</tr>
<tr>
<td>[6:4]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[3]</td>
<td>USGFAULTACT</td>
<td>UsageFault exception active bit, reads as 1 if exception is active</td>
</tr>
</tbody>
</table>
If the user disables a system handler and the corresponding fault occurs, the processor treats the fault as a hard fault.

The user can write to this register to change the pending or active status of system exceptions. An OS kernel can write to the active bits to perform a context switch that changes the current exception type.

- A software that changes the value of an active bit in this register without correct adjustment to the stacked content can cause the processor to generate a fault exception. Ensure a software that writes to this register retains and subsequently restores the current active status.
- After having enabled the system handlers, if the user has to change the value of a bit in this register he must use a read-modify-write procedure to ensure that only the required bit is changed.

### 4.3.10 Configurable fault status register

The CFSR indicates the cause of a MemManage fault, BusFault, or UsageFault. See the register summary in Table 50 on page 192 for its attributes. The bit assignments are:

![Figure 35. CFSR bit assignments](image)

The following subsections describe the subregisters that make up the CFSR:

- MemManage fault status register on page 206.
- BusFault status register on page 207.
- UsageFault status register on page 209.
The CFSR is byte accessible. The CFSR or its subregisters can be accessed as follows:

- Access the complete CFSR with a word access to 0xE000ED28.
- Access the MMFSR with a byte access to 0xE000ED28.
- Access the MMFSR and BFSR with a halfword access to 0xE000ED28.
- Access the BFSR with a byte access to 0xE000ED29.
- Access the UFSR with a halfword access to 0xE000ED2A.

**MemManage fault status register**

The flags in the MMFSR indicate the cause of memory access faults. The bit assignments are:

**Figure 36. MMFSR bit assignments**

```
<table>
<thead>
<tr>
<th>Bit</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[7]</td>
<td>MMARVALID</td>
<td>MemManage Fault Address register (MMFAR) valid flag:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: Value in MMAR is not a valid fault address.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: MMAR holds a valid fault address.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>If a MemManage fault occurs and is escalated to a HardFault because of</td>
</tr>
<tr>
<td></td>
<td></td>
<td>priority, the HardFault handler must set this bit to 0. This prevents</td>
</tr>
<tr>
<td></td>
<td></td>
<td>problems on return to a stacked active MemManage fault handler whose</td>
</tr>
<tr>
<td></td>
<td></td>
<td>MMAR value has been overwritten.</td>
</tr>
<tr>
<td>[5]</td>
<td>MLSPERR</td>
<td>0: No MemManage fault occurred during floating-point lazy state</td>
</tr>
<tr>
<td></td>
<td></td>
<td>preservation.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: A MemManage fault occurred during floating-point lazy state</td>
</tr>
<tr>
<td></td>
<td></td>
<td>preservation.</td>
</tr>
<tr>
<td>[4]</td>
<td>MSTKERR</td>
<td>MemManage fault on stacking for exception entry:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No stacking fault.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Stacking for an exception entry has caused one or more access violations.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When this bit is 1, the SP is still adjusted but the values in the</td>
</tr>
<tr>
<td></td>
<td></td>
<td>context area on the stack might be incorrect. The processor has not</td>
</tr>
<tr>
<td></td>
<td></td>
<td>written a fault address to the MMAR.</td>
</tr>
</tbody>
</table>
```
The MMFSR bits are sticky. This means as one or more fault occurs, the associated bits are set to 1. A bit that is set to 1 is cleared to 0 only by writing 1 to that bit, or by a reset.

**BusFault status register**

The flags in the BFSR indicate the cause of a bus access fault. The bit assignments are:

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
</table>
| [3]  | MUNSTKERR      | MemManage fault on unstacking for a return from exception:  
0: No unstacking fault.  
1: Unstack for an exception return has caused one or more access violations.  
This fault is chained to the handler. This means that when this bit is 1, the original return stack is still present. The processor has not adjusted the SP from the failing return, and has not performed a new save. The processor has not written a fault address to the MMAR. |
| [2]  | -              | Reserved                                                                                                                                                                                                 |
| [1]  | DACCVIOL       | Data access violation flag:  
0: No data access violation fault.  
1: The processor attempted a load or store at a location that does not permit the operation.  
When this bit is 1, the PC value stacked for the exception return points to the faulting instruction. The processor has loaded the MMAR with the address of the attempted access. |
| [0]  | IACCVIOL       | Instruction access violation flag:  
0: No instruction access violation fault.  
1: The processor attempted an instruction fetch from a location that does not permit execution.  
This fault occurs on any access to an XN region, even when the MPU is disabled or not present.  
When this bit is 1, the PC value stacked for the exception return points to the faulting instruction. The processor has not written a fault address to the MMAR. |

**Figure 37. BFSR bit assignments**
**Table 65. BFSR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>7</td>
<td>BFARVALID</td>
<td><em>BusFault Address register (BFAR) valid flag:</em></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: Value in BFAR is not a valid fault address.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: BFAR holds a valid fault address.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>The processor sets this bit to 1 after a BusFault where the address is known. Other faults can set this bit to 0, such as a MemManage fault occurring later.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>If a BusFault occurs and is escalated to a hard fault because of priority, the hard fault handler must set this bit to 0. This prevents problems if returning to a stacked active BusFault handler whose BFAR value has been overwritten.</td>
</tr>
<tr>
<td>6</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>5</td>
<td>LSPERR</td>
<td>0: No bus fault occurred during floating-point lazy state preservation.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: A bus fault occurred during floating-point lazy state preservation.</td>
</tr>
<tr>
<td>4</td>
<td>STKERR</td>
<td>BusFault on stacking for exception entry:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No stacking fault.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Stacking for an exception entry has caused one or more BusFaults.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When the processor sets this bit to 1, the SP is still adjusted but the values in the context area on the stack might be incorrect. The processor does not write a fault address to the BFAR.</td>
</tr>
<tr>
<td>3</td>
<td>UNSTKERR</td>
<td>BusFault on unstacking for a return from exception:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No unstacking fault.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Unstack for an exception return has caused one or more BusFaults.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>This fault is chained to the handler. This means that when the processor sets this bit to 1, the original return stack is still present. The processor does not adjust the SP from the failing return, does not performed a new save, and does not write a fault address to the BFAR.</td>
</tr>
<tr>
<td>2</td>
<td>IMPRECISERR</td>
<td>Imprecise data bus error:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No imprecise data bus error.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: A data bus error has occurred, but the return address in the stack frame is not related to the instruction that caused the error.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When the processor sets this bit to 1, it does not write a fault address to the BFAR. This is an asynchronous fault. Therefore, if it is detected when the priority of the current process is higher than the BusFault priority, the BusFault becomes pending and becomes active only when the processor returns from all higher priority processes. If a precise fault occurs before the processor enters the handler for the imprecise BusFault, the handler detects both IMPRECISERR set to 1 and one of the precise fault status bits set to 1.</td>
</tr>
<tr>
<td>1</td>
<td>PRECISERR</td>
<td>Precise data bus error:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No precise data bus error.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: A data bus error has occurred, and the PC value stacked for the exception return points to the instruction that caused the fault.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When the processor sets this bit to 1, it writes the faulting address to the BFAR.</td>
</tr>
<tr>
<td>0</td>
<td>IBUSERR</td>
<td>Instruction bus error:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No instruction bus error.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Instruction bus error.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>The processor detects the instruction bus error on prefetching an instruction, but it sets the IBUSERR flag to 1 only if it attempts to issue the faulting instruction.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When the processor sets this bit to 1, it does not write a fault address to the BFAR.</td>
</tr>
</tbody>
</table>
PM0253 Cortex-M7 peripherals

The BFSR bits are sticky. This means as one or more fault occurs, the associated bits are set to 1. A bit that is set to 1 is cleared to 0 only by writing 1 to that bit, or by a reset.

**UsageFault status register**

The UFSR indicates the cause of a UsageFault. The bit assignments are:

---

**Figure 38. UFSR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[15:10]</td>
<td>Reserved</td>
<td>Divide by zero UsageFault:</td>
</tr>
<tr>
<td>[9]</td>
<td>DIVBYZERO</td>
<td>0: No divide by zero fault, or divide by zero trapping not enabled.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: The processor has executed an SDIV or UDIV instruction with a divisor of 0.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When the processor sets this bit to 1, the PC value stacked for the exception return points to the instruction that performed the divide by zero. Enable trapping of divide by zero by setting the DIV_0_TRP bit in the CCR to 1, see <em>Configuration and control register on page 200</em>.</td>
</tr>
<tr>
<td>[8]</td>
<td>UNALIGNED</td>
<td>Unaligned access UsageFault:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No unaligned access fault, or unaligned access trapping not enabled.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: The processor has made an unaligned memory access.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Enable trapping of unaligned accesses by setting the UNALIGN_TRP bit in the CCR to 1, see <em>Configuration and control register on page 200</em>.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Unaligned LDM, STM, LDRD, and STRD instructions always fault irrespective of the setting of UNALIGN_TRP.</td>
</tr>
<tr>
<td>[3]</td>
<td>NOCP</td>
<td>No coprocessor UsageFault:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No UsageFault caused by attempting to access a coprocessor.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: The processor has attempted to access a coprocessor.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>The processor does not support coprocessor instructions:</td>
</tr>
<tr>
<td>[2]</td>
<td>INVPC</td>
<td>Invalid PC load UsageFault, caused by an invalid PC load by EXC_RETURN:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: No invalid PC load UsageFault.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: The processor has attempted an illegal load of EXC_RETURN to the PC, as a result of an invalid context, or an invalid EXC_RETURN value.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When this bit is set to 1, the PC value stacked for the exception return points to the instruction that tried to perform the illegal load of the PC.</td>
</tr>
</tbody>
</table>
---

MSv39698V1
The UFSR bits are sticky. This means as one or more fault occurs, the associated bits are set to 1. A bit that is set to 1 is cleared to 0 only by writing 1 to that bit, or by a reset.

### 4.3.11 HardFault status register

The HFSR gives information about events that activate the HardFault handler. See the register summary in Table 50 on page 192 for its attributes.

This register is read, write to clear. This means that bits in the register read normally, but writing 1 to any bit clears that bit to 0. The bit assignments are:

#### Table 66. UFSR bit assignments (continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
</table>
| [1]  | INVSTATE | Invalid state UsageFault:  
0: No invalid state UsageFault.  
1: The processor has attempted to execute an instruction that makes illegal use of the EPSR.  
When this bit is set to 1, the PC value stacked for the exception return points to the instruction that attempted the illegal use of the EPSR.  
This bit is not set to 1 if an undefined instruction uses the EPSR. |
| [0]  | UNDEFINSTR | Undefined instruction UsageFault:  
0: No undefined instruction UsageFault.  
1: The processor has attempted to execute an undefined instruction.  
When this bit is set to 1, the PC value stacked for the exception return points to the undefined instruction.  
An undefined instruction is an instruction that the processor cannot decode. |
The HFSR bits are sticky. This means as one or more fault occurs, the associated bits are set to 1. A bit that is set to 1 is cleared to 0 only by writing 1 to that bit, or by a reset.

### 4.3.12 MemManage fault address register

The MMFAR contains the address of the location that generated a MemManage fault. See the register summary in Table 50 on page 192 for its attributes. The bit assignments are:

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:0]</td>
<td>ADDRESS</td>
<td>When the MMARVALID bit of the MMFSR is set to 1, this field holds the address of the location that generated the MemManage fault</td>
</tr>
</tbody>
</table>

When an unaligned access faults, the address is the actual address that faulted. Because a single read or write instruction can be split into multiple aligned accesses, the fault address can be any address in the range of the requested access size.

Flags in the MMFSR indicate the cause of the fault, and whether the value in the MMFAR is valid. See Configuration and control register on page 200.
4.3.13 BusFault address register

The BFAR contains the address of the location that generated a BusFault. See the register summary in Table 50 on page 192 for its attributes. The bit assignments are:

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:0]</td>
<td>ADDRESS</td>
<td>When the BFARVALID bit of the BFSR is set to 1, this field holds the address of the location that generated the BusFault</td>
</tr>
</tbody>
</table>

When an unaligned access faults the address in the BFAR is the one requested by the instruction, even if it is not the address of the fault.

Flags in the BFSR indicate the cause of the fault, and whether the value in the BFAR is valid. See Table 50 on page 192.

4.3.14 System control block design hints and tips

Ensure software uses aligned accesses of the correct size to access the system control block registers:

- Except for the CFSR and SHPR1-SHPR3, it must use aligned word accesses.
- For the CFSR and SHPR1-SHPR3 it can use byte or aligned halfword or word accesses.

The processor does not support unaligned accesses to system control block registers.

In a fault handler, to determine the true faulting address:

1. Read and save the MMFAR or BFAR value.
2. Read the MMARVALID bit in the MMFSR, or the BFARVALID bit in the BFSR. The MMFAR or BFAR address is valid only if this bit is 1.

The software must follow this sequence because another higher priority exception might change the MMFAR or BFAR value. For example, if a higher priority handler preempts the current fault handler, the other fault might change the MMFAR or BFAR value.

In addition, the CMSIS provides a number of functions for system control, including:

<table>
<thead>
<tr>
<th>CMSIS system control function</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>void NVIC_SystemReset (void)</td>
<td>Reset the system</td>
</tr>
</tbody>
</table>

4.4 System timer, SysTick

The processor has a 24-bit system timer, SysTick, that counts down from the reload value to zero, reloads, that is wraps to, the value in the SYST_RVR register on the next clock edge, then counts down on subsequent clocks.

When the processor is halted for debugging the counter does not decrement.

The system timer registers are:
4.4.1 SysTick control and status register

The SysTick SYST_CSR register enables the SysTick features. See the register summary in Table 71 for its attributes. The bit assignments are:

**Table 71. System timer registers summary**

<table>
<thead>
<tr>
<th>Address</th>
<th>Name</th>
<th>Type</th>
<th>Required privilege</th>
<th>Reset value</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000E010</td>
<td>SYST_CSR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000004</td>
<td>SysTick control and status register</td>
</tr>
<tr>
<td>0xE000E014</td>
<td>SYST_RVR</td>
<td>RW</td>
<td>Privileged</td>
<td>UNKNOWN</td>
<td>SysTick reload value register</td>
</tr>
<tr>
<td>0xE000E018</td>
<td>SYST_CVR</td>
<td>RW</td>
<td>Privileged</td>
<td>UNKNOWN</td>
<td>SysTick current value register</td>
</tr>
<tr>
<td>0xE000E01C</td>
<td>SYST_CALIB</td>
<td>RO</td>
<td>Privileged</td>
<td>0xC0000000</td>
<td>SysTick calibration value register</td>
</tr>
</tbody>
</table>

**Figure 40. SysTick SYST_CSR bit assignments**

![SysTick SYST_CSR bit assignments](MSv39696V1)

**Table 72. SysTick SYST_CSR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:17]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[16]</td>
<td>COUNTFLAG</td>
<td>Returns 1 if timer counted to 0 since last time this was read.</td>
</tr>
<tr>
<td>[15:3]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[2]</td>
<td>CLKSOURCE</td>
<td>Indicates the clock source:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>– 0: External clock.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>– 1: Processor clock.</td>
</tr>
<tr>
<td>[1]</td>
<td>TICKINT</td>
<td>Enables SysTick exception request:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: Counting down to zero does not assert the SysTick exception request.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Counting down to zero asserts the SysTick exception request.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Software can use COUNTFLAG to determine if SysTick has ever counted to 0</td>
</tr>
<tr>
<td>[0]</td>
<td>ENABLE</td>
<td>Enables the counter:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: Counter disabled.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Counter enabled.</td>
</tr>
</tbody>
</table>
When ENABLE is set to 1, the counter loads the RELOAD value from the SYST_RVR register and then counts down. On reaching 0, it sets the COUNTFLAG to 1 and optionally asserts the SysTick depending on the value of TICKINT. It then loads the RELOAD value again, and begins counting.

### 4.4.2 SysTick reload value register

The SYST_RVR register specifies the start value to load into the SYST_CVR register. See the register summary in *Table 71 on page 213* for its attributes. The bit assignments are:

**Figure 41. SYST_RVR bit assignments**

<table>
<thead>
<tr>
<th>Bit</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[23:0]</td>
<td>RELOAD</td>
<td>Value to load into the SYST_CVR register when the counter is enabled and when it reaches 0, see <em>Calculating the RELOAD value</em>.</td>
</tr>
</tbody>
</table>

**Calculating the RELOAD value**

The RELOAD value can be any value in the range 0x00000001-0x00FFFFFF. A start value of 0 is possible, but has no effect because the SysTick exception request and COUNTFLAG are activated when counting from 1 to 0.

The RELOAD value is calculated according to its use. For example, to generate a multi-shot timer with a period of N processor clock cycles, use a RELOAD value of N-1. If the SysTick interrupt is required every 100 clock pulses, set RELOAD to 99.

### 4.4.3 SysTick current value register

The SYST_CVR register contains the current value of the SysTick counter. See the register summary in *Table 71 on page 213* for its attributes. The bit assignments are:

**Figure 42. SYST_CVR bit assignments**

<table>
<thead>
<tr>
<th>Bit</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[23:0]</td>
<td>CURRENT</td>
<td></td>
</tr>
</tbody>
</table>
4.4.4 SysTick calibration value register

The SYST_CALIB register indicates the SysTick calibration properties. See the register summary in Table 71 on page 213 for its attributes. The bit assignments are:

Table 74. SYST_CVR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[23:0]</td>
<td>CURRENT</td>
<td>Reads return the current value of the SysTick counter.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>A write of any value clears the field to 0, and also clears</td>
</tr>
<tr>
<td></td>
<td></td>
<td>the SYST_CSR COUNTFLAG bit to 0.</td>
</tr>
</tbody>
</table>

If the calibration information is not known, calculate the calibration value required from the frequency of the processor clock or external clock.

![Figure 43. SYST_CALIB bit assignments](image)

Table 75. SYST_CALIB bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31]</td>
<td>NOREF</td>
<td>Indicates whether the device provides a reference clock to the processor:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: Reference clock provided.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: No reference clock provided.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>If the device does not provide a reference clock, the SYST_CSR.CLKSOURCE bit reads-as-one and ignores writes.</td>
</tr>
<tr>
<td>[30]</td>
<td>SKEW</td>
<td>Indicates whether the TENMS value is exact:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: TENMS value is exact.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: TENMS value is inexact, or not given.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>An inexact TENMS value can affect the suitability of SysTick as a software real time clock.</td>
</tr>
<tr>
<td>[23:0]</td>
<td>TENMS</td>
<td>Reload value for 10ms (100Hz) timing, subject to system clock skew errors. If the value reads as zero, the calibration value is not known.</td>
</tr>
</tbody>
</table>
4.4.5 SysTick design hints and tips

The SysTick counter runs on the processor clock. If this clock signal is stopped for Low-power mode, the SysTick counter stops.

Ensure the software uses aligned word accesses to access the SysTick registers.

The SysTick counter reload and current value are undefined at reset, the correct initialization sequence for the SysTick counter is:

1. Program reload value.
2. Clear current value.
3. Program Control and Status register.

In addition, the CMSIS provides a number of functions for SysTick control, including:

<table>
<thead>
<tr>
<th>CMSIS SysTick control function</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>uint32_t SysTick_Config(uint32_t ticks)</td>
<td>Creates a periodic SysTick interrupt using the SysTick timer, with a interval defined by the ticks parameter.</td>
</tr>
</tbody>
</table>
4.5 Processor features

The processor features registers provide a software with cache configuration information. The identification space registers are:

Table 77. Identification space summary

<table>
<thead>
<tr>
<th>Address</th>
<th>Name</th>
<th>Type</th>
<th>Required privilege</th>
<th>Reset value</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000ED78</td>
<td>CLIDR</td>
<td>RO</td>
<td>Privileged</td>
<td>0x09000003</td>
<td>Cache level ID register</td>
</tr>
<tr>
<td>0xE000ED7C</td>
<td>CTR</td>
<td>RO</td>
<td>Privileged</td>
<td>0x8303C003</td>
<td>Cache type register on page 218</td>
</tr>
<tr>
<td>0xE000ED80</td>
<td>CCSIDR</td>
<td>RO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Cache size ID register on page 219</td>
</tr>
<tr>
<td>0xE000ED84</td>
<td>CSSELR</td>
<td>RW</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Cache size selection register on page 220</td>
</tr>
</tbody>
</table>

All the registers are only accessible by privileged loads and stores. Unprivileged accesses to these registers result in a BusFault.

4.5.1 Cache level ID register

The CLIDR identifies the type of cache, or caches, implemented at each level, and the level of coherency and unification. See the register summary in Table 77 on page 217 for its attributes. The bit assignments are:

Table 78. CLIDR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:30]</td>
<td>-</td>
<td>SBZ.</td>
</tr>
<tr>
<td>[29:27]</td>
<td>LoU</td>
<td>Level of Unification.  0b001: Level 2, if either cache is implemented. 0b000: Level 1, if neither instruction nor data cache is implemented.</td>
</tr>
<tr>
<td>[26:24]</td>
<td>LoC</td>
<td>Level of Coherency.  0b001: Level 2, if either cache is implemented. 0b000: Level 1, if neither instruction nor data cache is implemented.</td>
</tr>
<tr>
<td>[23:21]</td>
<td>LoUIS</td>
<td>RAZ.</td>
</tr>
<tr>
<td>[20:18]</td>
<td>CL 7</td>
<td>0b000: No cache at CL 7.</td>
</tr>
</tbody>
</table>
The CTR provides information about the cache architecture. See the register summary in Table 77 on page 217 for its attributes. The bit assignments are:

### Table 78. CLIDR bit assignments (continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[14:12]</td>
<td>CL 5</td>
<td>0b000: No cache at CL 5.</td>
</tr>
<tr>
<td>[8:6]</td>
<td>CL 3</td>
<td>0b000: No cache at CL 3.</td>
</tr>
<tr>
<td>[5:3]</td>
<td>CL 2</td>
<td>0b000: No cache at CL 2.</td>
</tr>
<tr>
<td>[1]</td>
<td>CL 1</td>
<td>1: Data cache is implemented. 0: No data cache is implemented.</td>
</tr>
<tr>
<td>[0]</td>
<td>CL 1</td>
<td>1: An instruction cache is implemented. 0: No instruction cache is implemented.</td>
</tr>
</tbody>
</table>

### 4.5.2 Cache type register

The CTR provides information about the cache architecture. See the register summary in Table 77 on page 217 for its attributes. The bit assignments are:

### Table 79. CTR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:29]</td>
<td>Format</td>
<td>Register format. 0b100: Armv7 register format.</td>
</tr>
<tr>
<td>[28]</td>
<td>-</td>
<td>Reserved, RAZ.</td>
</tr>
<tr>
<td>[23:20]</td>
<td>ERG</td>
<td>Exclusives Reservation Granule. 0b0000: The local monitor within the processor does not hold any physical address. It treats any STREX instruction access as matching the address of the previous LDREX instruction. This means that the implemented exclusive reservation granule is the entire memory address range.</td>
</tr>
<tr>
<td>[19:16]</td>
<td>DMinLine</td>
<td>Smallest cache line of all the data and unified caches under the core control. 0b0011: 8 words for the Cortex®-M7 processor.</td>
</tr>
<tr>
<td>[15:14]</td>
<td>-</td>
<td>All bits RAO.</td>
</tr>
</tbody>
</table>
4.5.3 Cache size ID register

The CCSIDR identifies the configuration of the cache currently selected by the CSSELR. If no instruction or data cache is configured, the corresponding CCSIDR is RAZ. See the register summary in Table 77 on page 217 for its attributes. The bit assignments are:

```
<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>[3:0]</td>
<td>IminLine</td>
<td>Smallest cache line of all the instruction caches under the control of the processor. 0b0011: 8 words for the Cortex®-M7 processor.</td>
</tr>
</tbody>
</table>
```

![Figure 46. CCSIDR bit assignments](MSv39657V1)

Table 79. CTR bit assignments (continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:12]</td>
<td>NumSet</td>
<td>Number of sets as: (number of sets) - 1.</td>
</tr>
<tr>
<td>[11:0]</td>
<td>Assoc</td>
<td>Number of ways as: (number of ways) - 1.</td>
</tr>
<tr>
<td>[31:0]</td>
<td>LineSize</td>
<td>Number of words in each cache line.</td>
</tr>
</tbody>
</table>

![Table 80. CCSIDR bit assignments](MSv39657V1)

The LineSize field is encoded as 2 less than \( \log(2) \) of the number of words in the cache line. For example, a value of 0x0 indicates there are four words in a cache line, that is the minimum size for the cache. A value of 0x1 indicates there are eight words in a cache line.
Table 81 shows the individual bit field and complete register encodings for the CCSIDR. Use this to determine the cache size for the L1 data or instruction cache selected by the Cache Size Selection Register (CSSELR). See Cache size selection register.

Table 81. CCSIDR encodings

<table>
<thead>
<tr>
<th>CSSELR</th>
<th>Cache</th>
<th>Size</th>
<th>Complete register encoding</th>
<th>Register bit field encoding</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td></td>
<td></td>
<td>WT</td>
<td>WB</td>
</tr>
<tr>
<td>0x0</td>
<td>Data cache</td>
<td>4 Kbytes</td>
<td>0xF003E019</td>
<td>1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>8 Kbytes</td>
<td>0xF007E019</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>16 Kbytes</td>
<td>0xF00FE019</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>32 Kbytes</td>
<td>0xF01FE019</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>64 Kbytes</td>
<td>0xF03FE019</td>
<td></td>
</tr>
<tr>
<td>0x1</td>
<td>Instruction cache</td>
<td>4 Kbytes</td>
<td>0xF007E009</td>
<td>1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>8 Kbytes</td>
<td>0xF00FE009</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>16 Kbytes</td>
<td>0xF01FE009</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>32 Kbytes</td>
<td>0xF03FE009</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>64 Kbytes</td>
<td>0xF07FE009</td>
<td></td>
</tr>
</tbody>
</table>

4.5.4 Cache size selection register

The CSSELR selects the cache whose configuration is currently visible in the CCSIDR. See the register summary in Table 77 on page 217 for its attributes. The bit assignments are:

Figure 47. CSSELR bit assignments

<table>
<thead>
<tr>
<th>Bit</th>
<th>Name</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:4]</td>
<td>-</td>
<td>RESERVED</td>
</tr>
<tr>
<td>[3:1]</td>
<td>Level</td>
<td>Identifies the cache level selected. 0b000: Level 1 cache. This field is read only, writes are ignored.</td>
</tr>
<tr>
<td>[0]</td>
<td>InD</td>
<td>Enables selection of instruction or data cache: 0: Data cache. 1: Instruction cache.</td>
</tr>
</tbody>
</table>
4.6 Memory protection unit

The Memory Protection Unit (MPU) divides the memory map into a number of regions, and defines the location, size, access permissions, and memory attributes of each region. It supports:

- Independent attribute settings for each region.
- Overlapping regions.
- Export of memory attributes to the system.

The memory attributes affect the behavior of memory accesses to the region. The Cortex®-M7 MPU defines:

- 8 or 16 separate memory regions, 0-7 or 0-15.
- A background region.

When memory regions overlap, a memory access is affected by the attributes of the region with the highest number. For example, the attributes for region 7 take precedence over the attributes of any region that overlaps region 7.

The background region has the same memory access attributes as the default memory map, but is accessible from privileged software only.

The Cortex®-M7 MPU memory map is unified. This means instruction accesses and data accesses have same region settings.

If a program accesses a memory location that is prohibited by the MPU, the processor generates a MemManage fault. This causes a fault exception, and might cause termination of the process in an OS environment. In an OS environment, the kernel can update the MPU region setting dynamically based on the process to be executed. Typically, an embedded OS uses the MPU for memory protection.

The configuration of MPU regions is based on memory types, see Memory regions, types and attributes on page 33.

Table 83 shows the possible MPU region attributes. These include Shareability and cache behavior attributes that are generally only relevant when the processor is configured with caches.

<table>
<thead>
<tr>
<th>Memory type</th>
<th>Shareability</th>
<th>Other attributes</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Strongly-ordered</td>
<td>-</td>
<td>-</td>
<td>All accesses to Strongly-ordered memory occur in program order. All Strongly-ordered regions are assumed to be shared.</td>
</tr>
<tr>
<td>Device</td>
<td>Shared</td>
<td>-</td>
<td>Memory-mapped peripherals that several processors share.</td>
</tr>
<tr>
<td>-</td>
<td>Non-shared</td>
<td>-</td>
<td>Memory-mapped peripherals that only a single processor uses.</td>
</tr>
</tbody>
</table>
Use the MPU registers to define the MPU regions and their attributes. The MPU registers are:

<table>
<thead>
<tr>
<th>Address</th>
<th>Name</th>
<th>Type</th>
<th>Required privilege</th>
<th>Reset value</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000ED90</td>
<td>MPU_TYPE</td>
<td>RO</td>
<td>Privileged</td>
<td>0x00000800</td>
<td>MPU type register on page 223</td>
</tr>
<tr>
<td>0xE000ED94</td>
<td>MPU_CTRL</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>MPU control register on page 223</td>
</tr>
<tr>
<td>0xE000ED98</td>
<td>MPU_RNR</td>
<td>RW</td>
<td>Privileged</td>
<td>Unknown</td>
<td>MPU region number register on page 225</td>
</tr>
<tr>
<td>0xE000ED9C</td>
<td>MPU_RBAR</td>
<td>RW</td>
<td>Privileged</td>
<td>Unknown</td>
<td>MPU region base address register on page 225</td>
</tr>
<tr>
<td>0xE000EDA0</td>
<td>MPU_RASR</td>
<td>RW</td>
<td>Privileged</td>
<td>..(1)</td>
<td>MPU region attribute and size register on page 226</td>
</tr>
<tr>
<td>0xE000EDA4</td>
<td>MPU_RBAR_A1</td>
<td>RW</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Alias of RBAR, see MPU region base address register on page 225</td>
</tr>
<tr>
<td>0xE000EDA8</td>
<td>MPU_RASR_A1</td>
<td>RW</td>
<td>Privileged</td>
<td>..(1)</td>
<td>Alias of RASR, see MPU region attribute and size register on page 226</td>
</tr>
<tr>
<td>0xE000EDAC</td>
<td>MPU_RBAR_A2</td>
<td>RW</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Alias of RBAR, see MPU region base address register on page 225</td>
</tr>
<tr>
<td>0xE000EDB0</td>
<td>MPU_RASR_A2</td>
<td>RW</td>
<td>Privileged</td>
<td>..(1)</td>
<td>Alias of RASR, see MPU region attribute and size register on page 226</td>
</tr>
<tr>
<td>0xE000EDB4</td>
<td>MPU_RBAR_A3</td>
<td>RW</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Alias of RBAR, see MPU region base address register on page 225</td>
</tr>
<tr>
<td>0xE000EDB8</td>
<td>MPU_RASR_A3</td>
<td>RW</td>
<td>Privileged</td>
<td>..(1)</td>
<td>Alias of RASR, see MPU region attribute and size register on page 226</td>
</tr>
</tbody>
</table>

1. Unknown apart from the ENABLE field, which is reset to 0.
### 4.6.1 MPU type register

The MPU_TYPE register indicates whether the MPU is present, and if so, how many regions it supports. If the MPU is not present the MPU_TYPE register is RAZ. See the register summary in Table 84 for its attributes. The bit assignments are:

**Figure 48. TYPE bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[23:16]</td>
<td>IREGION</td>
<td>Indicates the number of supported MPU instruction regions. Always contains 0x00. The MPU memory map is unified and is described by the DREGION field.</td>
</tr>
<tr>
<td>[15:8]</td>
<td>DREGION</td>
<td>Indicates the number of supported MPU data regions: 0x08: 8 MPU regions. 0x10: 16 MPU regions.</td>
</tr>
<tr>
<td>[7:1]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[0]</td>
<td>SEPARATE</td>
<td>Indicates support for unified or separate instruction and data memory maps: 0: Unified.</td>
</tr>
</tbody>
</table>

**Table 85. TYPE bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[23:16]</td>
<td>IREGION</td>
<td>Indicates the number of supported MPU instruction regions. Always contains 0x00. The MPU memory map is unified and is described by the DREGION field.</td>
</tr>
<tr>
<td>[15:8]</td>
<td>DREGION</td>
<td>Indicates the number of supported MPU data regions: 0x08: 8 MPU regions. 0x10: 16 MPU regions.</td>
</tr>
<tr>
<td>[7:1]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[0]</td>
<td>SEPARATE</td>
<td>Indicates support for unified or separate instruction and data memory maps: 0: Unified.</td>
</tr>
</tbody>
</table>

### 4.6.2 MPU control register

The MPU_CTRL register:
- Enables the MPU.
- Enables the default memory map background region.
- Enables use of the MPU when in the hard fault, Non Maskable Interrupt (NMI), and FAULTMASK escalated handlers.

See the register summary in Table 84 on page 222 for the MPU_CTRL attributes. The bit assignments are:

**Figure 49. MPU_CTRL bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:24]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[23:22]</td>
<td>PRIVDEFENA</td>
<td></td>
</tr>
<tr>
<td>[15:14]</td>
<td>HFNMIEENA</td>
<td></td>
</tr>
<tr>
<td>[13:12]</td>
<td>ENABLE</td>
<td></td>
</tr>
</tbody>
</table>
When ENABLE and PRIVDEFENA are both set to 1:

- For privileged accesses, the default memory map is as described in Cortex®-M7 configurations on page 30. Any access by privileged software that does not address an enabled memory region behaves as defined by the default memory map. If the MPU is disabled, the processor ignores this bit.

- Any access by unprivileged software that does not address an enabled memory region causes a MemManage fault.

XN and Strongly-ordered rules always apply to the System Control Space regardless of the value of the ENABLE bit.

When the ENABLE bit is set to 1, at least one region of the memory map must be enabled for the system to function unless the PRIVDEFENA bit is set to 1. If the PRIVDEFENA bit is set to 1 and no regions are enabled, then only privileged software can operate.

When the ENABLE bit is set to 0, the system uses the default memory map. This has the same memory attributes as if the MPU is not implemented, see Table 77 on page 217. The default memory map applies to accesses from both privileged and unprivileged software.

When the MPU is enabled, accesses to the System Control Space and vector table are always permitted. Other areas are accessible based on regions and whether PRIVDEFENA is set to 1.

Unless HFNMIENA is set to 1, the MPU is not enabled when the processor is executing the handler for an exception with priority –1 or –2. These priorities are only possible when handling a hard fault or NMI exception, or when FAULTMASK is enabled. Setting the HFNMIENA bit to 1 enables the MPU when operating with these two priorities.

## Table 86. MPU_CTRL bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:3]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
</tbody>
</table>
| [2]  | PRIVDEFENA | Enables privileged software access to the default memory map:  
0: If the MPU is enabled, disables use of the default memory map. Any memory access to a location not covered by any enabled region causes a fault.  
1: If the MPU is enabled, enables use of the default memory map as a background region for privileged software accesses. When enabled, the background region acts as if it is region number -1. Any region that is defined and enabled has priority over this default map. If the MPU is disabled, the processor ignores this bit. |
| [1]  | HFNMIENA  | Enables the operation of MPU during hard fault, NMI, and FAULTMASK handlers.  
When the MPU is enabled:  
0: MPU is disabled during hard fault, NMI, and FAULTMASK handlers, regardless of the value of the ENABLE bit.  
1: The MPU is enabled during hard fault, NMI, and FAULTMASK handlers. When the MPU is disabled, if this bit is set to 1 the behavior is Unpredictable. |
| [0]  | ENABLE    | Enables the MPU:  
0: MPU disabled.  
1: MPU enabled. |
4.6.3 MPU region number register

The MPU_RNR selects which memory region is referenced by the MPU_RBAR and MPU_RASR registers. See the register summary in Table 77 on page 217 for its attributes. The bit assignments are:

![Figure 50. MPU_RNR bit assignments](image1)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:8]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[7:0]</td>
<td>REGION</td>
<td>Indicates the MPU region referenced by the MPU_RBAR and MPU_RASR registers. The MPU supports 8 or 16 memory regions, so the permitted values of this field are 0-7 or 0-15.</td>
</tr>
</tbody>
</table>

Normally, the user writes the required region number to this register before accessing the MPU_RBAR or MPU_RASR. However the region number can be changed by writing to the MPU RBAR with the VALID bit set to 1, see MPU region base address register. This write updates the value of the REGION field.

4.6.4 MPU region base address register

The MPU_RBAR defines the base address of the MPU region selected by the MPU_RNR, and can update the value of the MPU_RNR. See the register summary in Table 84 on page 222 for its attributes.

Write MPU_RBAR with the VALID bit set to 1 to change the current region number and update the MPU_RNR. See the register summary in Table 84 on page 222 for its attributes. The bit assignments are

![Figure 51. MPU_RBAR bit assignments](image2)

If the region size is 32B, the ADDR field is bits [31:5] and there is no Reserved field.
The ADDR field

The ADDR field is bits[31:N] of the MPU_RBAR. The region size, as specified by the SIZE field in the MPU_RASR, defines the value of N:

\[ N = \log_2(\text{Region size in bytes}) \]

If the region size is configured to 4 Gbytes, in the MPU_RASR, there is no valid ADDR field. In this case, the region occupies the complete memory map, and the base address is 0x00000000.

The base address is aligned to the size of the region. For example, a 64KB region must be aligned on a multiple of 64KB, for example, at 0x00010000 or 0x00020000.

### 4.6.5 MPU region attribute and size register

The MPU_RASR defines the region size and memory attributes of the MPU region specified by the MPU_RNR, and enables that region and any subregions. See the register summary in Table 84 on page 222 for its attributes.

MPU_RASR is accessible using word accesses:
- The most significant halfword holds the region attributes.
- The least significant halfword holds the region size and the region and subregion enable bits.

The bit assignments are:

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:N]</td>
<td>ADDR</td>
<td>Region base address field. The value of N depends on the region size. For more information see The ADDR field.</td>
</tr>
<tr>
<td>[4]</td>
<td>VALID</td>
<td>MPU Region Number valid bit: Write: 0: MPU_RNR not changed, and the processor: Updates the base address for the region specified in the MPU_RNR Ignores the value of the REGION field 1: The processor: Updates the value of the MPU_RNR to the value of the REGION field Updates the base address for the region specified in the REGION field. Always reads as zero.</td>
</tr>
<tr>
<td>[3:0]</td>
<td>REGION</td>
<td>MPU region field: For the behavior on writes, see the description of the VALID field. On reads, returns the current region number, as specified by the RNR.</td>
</tr>
</tbody>
</table>

Table 88. MPU_RBAR bit assignments
Figure 52. MPU_RASR bit assignments

Table 89. MPU_RASR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:29]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
</tbody>
</table>
| [28]   | XN     | Instruction access disable bit:  
0: Instruction fetches enabled.  
1: Instruction fetches disabled. |
| [27]   | -      | Reserved.|
| [26:24]| AP     | Access permission field, see Table 93 on page 229. |
| [21:19, 17, 16]| TEX, C, B | Memory access attributes, see Table 91 on page 228. |
| [18]   | S      | Shareable bit, see Table 91 on page 228. |
| [15:8] | SRD    | Subregion disable bits. For each bit in this field:  
0: Corresponding sub-region is enabled.  
1: Corresponding sub-region is disabled.  
See Subregions on page 231 for more information.  
Region sizes of 128 bytes and less do not support subregions. When writing the attributes for such a region, write the SRD field as 0x00. |
| [5:1]  | SIZE   | Specifies the size of the MPU protection region. The minimum permitted value is 4 (0b000100), see SIZE field values for more information. |
| [0]    | ENABLE | Region enable bit. |

For information about access permission, see MPU access permission attributes on page 228.

SIZE field values

The SIZE field defines the size of the MPU memory region specified by the RNR. as follows:  
(Region size in bytes) = 2^{(SIZE+1)}

The smallest permitted region size is 32B, corresponding to a SIZE value of 4. Table 90 gives example SIZE values, with the corresponding region size and value of N in the MPU_RBAR.
4.6.6 MPU access permission attributes

This section describes the MPU access permission attributes. The access permission bits, TEX, C, B, S, AP, and XN, of the RASR, control access to the corresponding memory region. If an access is made to an area of memory without the required permissions, then the MPU generates a permission fault. Table 91 shows encodings for the TEX, C, B, and S access permission bits.

Table 90. Example SIZE field values

<table>
<thead>
<tr>
<th>SIZE value</th>
<th>Region size</th>
<th>Value of N(1)</th>
<th>Note</th>
</tr>
</thead>
<tbody>
<tr>
<td>0b00100 (4)</td>
<td>32 Bytes</td>
<td>5</td>
<td>Minimum permitted size</td>
</tr>
<tr>
<td>0b01000 (9)</td>
<td>1 Kbyte</td>
<td>10</td>
<td>-</td>
</tr>
<tr>
<td>0b10011 (19)</td>
<td>1 Mbyte</td>
<td>20</td>
<td>-</td>
</tr>
<tr>
<td>0b11101 (29)</td>
<td>1 Gbyte</td>
<td>30</td>
<td>-</td>
</tr>
<tr>
<td>0b11111 (31)</td>
<td>4 Gbytes</td>
<td>32</td>
<td>Maximum possible size</td>
</tr>
</tbody>
</table>

1. In the MPU_RBAR, see MPU region base address register on page 225.

Table 91. TEX, C, B, and S encoding

<table>
<thead>
<tr>
<th>TEX</th>
<th>C</th>
<th>B</th>
<th>S</th>
<th>Memory type</th>
<th>Shareability</th>
<th>Other attributes</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>x(1)</td>
<td>Strongly-ordered</td>
<td>Shareable</td>
<td>-</td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>x(1)</td>
<td>Device</td>
<td>Shareable</td>
<td>-</td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>Normal</td>
<td>Not shareable</td>
<td>Outer and inner write-through. No write allocate.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>Normal</td>
<td>Not shareable</td>
<td>Shareable</td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>Normal</td>
<td>Not shareable</td>
<td>Outer and inner write-back. No write allocate.</td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>Normal</td>
<td>Shareable</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>Normal</td>
<td>Not shareable</td>
<td>Outer and inner noncacheable.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>0</td>
<td>Normal</td>
<td>Not shareable</td>
<td>Shareable</td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>0</td>
<td>Reserved encoding</td>
<td>-</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>Implementation defined attributes.</td>
<td>-</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>Normal</td>
<td>Not shareable</td>
<td>Outer and inner write-back. Write and read allocate.</td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>Normal</td>
<td>Shareable</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>x(1)</td>
<td>Device</td>
<td>Not shareable</td>
<td>Nonshared Device.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>Reserved encoding</td>
<td>-</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>Reserved encoding</td>
<td>-</td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
Table 91. TEX, C, B, and S encoding (continued)

<table>
<thead>
<tr>
<th>TEX</th>
<th>C</th>
<th>B</th>
<th>S</th>
<th>Memory type</th>
<th>Shareability</th>
<th>Other attributes</th>
</tr>
</thead>
<tbody>
<tr>
<td>0b1BB</td>
<td>A</td>
<td>A</td>
<td>0</td>
<td>Normal</td>
<td>Not shareable</td>
<td>Cached memory, BB = outer policy, AA = inner policy. See Table 92 on page 229 for the encoding of the AA and BB bits.</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>1</td>
<td></td>
<td>Shareable</td>
<td></td>
</tr>
</tbody>
</table>

1. The MPU ignores the value of this bit.

Table 92 shows the cache policy for memory attribute encodings with a TEX value is in the range 4-7.

Table 92. Cache policy for memory attribute encoding

<table>
<thead>
<tr>
<th>Encoding, AA or BB</th>
<th>Corresponding cache policy</th>
</tr>
</thead>
<tbody>
<tr>
<td>00</td>
<td>Non-cacheable</td>
</tr>
<tr>
<td>01</td>
<td>Write back, write and read allocate</td>
</tr>
<tr>
<td>10</td>
<td>Write through, no write allocate</td>
</tr>
<tr>
<td>11</td>
<td>Write back, no write allocate</td>
</tr>
</tbody>
</table>

Table 93 shows the AP encodings that define the access permissions for privileged and unprivileged software.

Table 93. AP encoding

<table>
<thead>
<tr>
<th>AP[2:0]</th>
<th>Privileged permissions</th>
<th>Unprivileged permissions</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>000</td>
<td>No access</td>
<td>No access</td>
<td>All accesses generate a permission fault</td>
</tr>
<tr>
<td>001</td>
<td>RW</td>
<td>No access</td>
<td>Access from privileged software only</td>
</tr>
<tr>
<td>010</td>
<td>RW</td>
<td>RO</td>
<td>Writes by unprivileged software generate a permission fault</td>
</tr>
<tr>
<td>011</td>
<td>RW</td>
<td>RW</td>
<td>Full access</td>
</tr>
<tr>
<td>100</td>
<td>Unpredictable</td>
<td>Unpredictable</td>
<td>Reserved</td>
</tr>
<tr>
<td>101</td>
<td>RO</td>
<td>No access</td>
<td>Reads by privileged software only</td>
</tr>
<tr>
<td>110</td>
<td>RO</td>
<td>RO</td>
<td>Read only, by privileged or unprivileged software</td>
</tr>
<tr>
<td>111</td>
<td>RO</td>
<td>RO</td>
<td>Read only, by privileged or unprivileged software</td>
</tr>
</tbody>
</table>
4.6.7 MPU mismatch

When an access violates the MPU permissions, the processor generates a MemManage fault, see Exceptions and interrupts on page 28. The MMFSR indicates the cause of the fault. See MemManage fault status register on page 206 for more information.

4.6.8 Updating an MPU region

To update the attributes for an MPU region, update the MPU_RNR, MPU_RBAR and MPU_RASR registers. It is possible to program each register separately, or use a multiple-word write to program all of these registers. The MPU_RBAR and MPU_RASR aliases can be used to program up to four regions simultaneously using an STM instruction.

Updating an MPU region using separate words

Simple code to configure one region:

; R1 = region number
; R2 = size/enable
; R3 = attributes
; R4 = address
LDR R0,=MPU_RNR      ; 0xE000ED98, MPU region number register
STR R1, [R0, #0x0]   ; Region Number
STR R4, [R0, #0x4]   ; Region Base Address
STRH R2, [R0, #0x8]  ; Region Size and Enable
STRH R3, [R0, #0xA]  ; Region Attribute

Disable a region before writing new region settings to the MPU if the region being changed, has been previously enabled. For example:

; R1 = region number
; R2 = size/enable
; R3 = attributes
; R4 = address
LDR R0,=MPU_RNR      ; 0xE000ED98, MPU region number register
STR R1, [R0, #0x0]   ; Region Number
BIC R2, R2, #1       ; Disable
STRH R2, [R0, #0x8]  ; Region Size and Enable
STR R4, [R0, #0x4]   ; Region Base Address
STRH R3, [R0, #0xA]  ; Region Attribute
ORR R2, #1           ; Enable
STRH R2, [R0, #0x8]  ; Region Size and Enable

The software must use memory barrier instructions:
- Before MPU setup if there might be outstanding memory transfers, such as buffered writes, that might be affected by the change in MPU settings.
- After MPU setup if it includes memory transfers that must use the new MPU settings.

The software does not require any memory barrier instructions during MPU setup, because it accesses the MPU through the PPB, which is a Strongly-ordered memory region.

For example, if it is required that all of the memory access behavior to take effect immediately after the programming sequence, use a DSB instruction and an ISB instruction. A DSB is required after changing MPU settings, such as at the end of context switch. An ISB is required if the code that programs the MPU region or regions is entered using a branch or call. If the programming sequence is entered by taking an exception and the programming sequence is exited by using a return from exception then an ISB instruction is not required.
Updating an MPU region using multi-word writes

The user can program directly using multi-word writes, depending on how the information is divided. Consider the following reprogramming:

; R1 = region number
; R2 = address
; R3 = size, attributes in one
LDR R0, =MPU_RNR ; 0xE000ED98, MPU region number register
STR R1, [R0, #0x0] ; Region Number
STR R2, [R0, #0x4] ; Region Base Address
STR R3, [R0, #0x8] ; Region Attribute, Size and Enable

Use an STM instruction to optimize this:

; R1 = region number
; R2 = address
; R3 = size, attributes in one
LDR R0, =MPU_RNR ; 0xE000ED98, MPU region number register
STM R0, {R1-R3} ; Region Number, address, attribute, size and enable

The user can do this in two words for pre-packed information. This means that the MPU_RBAR contains the required region number and had the VALID bit set to 1, see MPU region base address register on page 225. Use this when the data is statically packed, for example in a boot loader:

; R1 = address and region number in one
; R2 = size and attributes in one
LDR R0, =MPU_RBAR ; 0xE000ED9C, MPU Region Base register.
STR R1, [R0, #0x0] ; Region base address and region number combined
; with VALID (bit 4) set to 1.
STR R2, [R0, #0x4] ; Region Attribute, Size and Enable.

Subregions

Regions of 256 bytes or more are divided into eight equal-sized subregions. Set the corresponding bit in the SRD field of the MPU_RASR to disable a subregion, see MPU region attribute and size register on page 226. The least significant bit of SRD controls the first subregion, and the most significant bit controls the last subregion. Disabling a subregion means another region overlapping the disabled range matches instead. If no other enabled region overlaps the disabled subregion, and the access is unprivileged or the background region is disabled, the MPU issues a fault.

Regions of 32, 64, and 128 bytes do not support subregions. With regions of these sizes, The user must set the SRD field to 0x00, otherwise the MPU behavior is Unpredictable.

Example of SRD use

Two regions with the same base address overlap. Region one is 128 KB, and region two is 512 KB. To ensure the attributes from region one apply to the first 128 KB region, set the SRD field for region two to 0b00000011 to disable the first two subregions, as the figure shows.
4.6.9 MPU design hints and tips

To avoid an unexpected behavior, disable the interrupts before updating the attributes of a region that the interrupt handlers might access.

The processor does not support unaligned accesses to MPU registers.

The MPU registers support aligned word accesses only. The byte and halfword accesses are unpredictable.

When setting up the MPU, and if the MPU has previously been programmed, disable the unused regions to prevent any previous region settings from affecting the new MPU setup.
4.7 Floating-point unit

The Cortex®-M7 Floating-Point Unit (FPU) implements the FPv5 floating-point extensions. The FPU fully supports single-precision and double-precision add, subtract, multiply, divide, multiply and accumulate, and square root operations. It also provides conversions between fixed-point and floating-point data formats, and floating-point constant instructions.

The FPU provides floating-point computation functionality that is compliant with the ANSI/IEEE Std 754-2008, IEEE Standard for Binary Floating-Point Arithmetic, referred to as the IEEE 754 standard.

The silicon vendor should also include the following text when implementations support single-precision FPU only. The FPU contains 32 single-precision extension registers, which can be also accessed as 16 doubleword registers for load, store, and move operations. Table 94 shows the floating-point system registers in the Cortex®-M7 processor with FPU.

<table>
<thead>
<tr>
<th>Address</th>
<th>Name</th>
<th>Type</th>
<th>Required privilege</th>
<th>Reset</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000ED88</td>
<td>CPACR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Coprocessor access control register on page 233</td>
</tr>
<tr>
<td>0xE000EF34</td>
<td>FPCCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0xC0000000</td>
<td>Floating-point context control register on page 234</td>
</tr>
<tr>
<td>0xE000EF38</td>
<td>FPCAR</td>
<td>RW</td>
<td>Privileged</td>
<td>-</td>
<td>Floating-point context address register on page 236</td>
</tr>
<tr>
<td>-</td>
<td>FPSCR (1)</td>
<td>RW</td>
<td>Unprivileged</td>
<td>-</td>
<td>Floating-point status control register on page 236</td>
</tr>
<tr>
<td>0xE000EF3C</td>
<td>FPDSCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Floating-point default status control register on page 237</td>
</tr>
</tbody>
</table>

1. The FPSCR register is not memory-mapped, it can be accessed using the VMSR and VMRS instructions, see VMRS on page 165 and VMSR on page 166. The software can only access the FPSCR when the FPU is enabled, see Enabling the FPU on page 238.

The following sections describe the floating-point system registers whose implementation is specific to this processor.

4.7.1 Coprocessor access control register

The CPACR register specifies the access privileges for coprocessors. See the register summary in Table 94 on page 233 for its attributes. The bit assignments are:

<table>
<thead>
<tr>
<th>Bit</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>31</td>
<td>Reserved</td>
</tr>
<tr>
<td>24</td>
<td>23 22 21 20 19</td>
</tr>
<tr>
<td>18</td>
<td>CP11</td>
</tr>
<tr>
<td>17</td>
<td>CP10</td>
</tr>
<tr>
<td>0</td>
<td>Reserved</td>
</tr>
</tbody>
</table>

Table 94. Cortex®-M7 floating-point system registers

The silicon vendor should also include the following text when implementations support single-precision FPU only. The FPU contains 32 single-precision extension registers, which can be also accessed as 16 doubleword registers for load, store, and move operations. Table 94 shows the floating-point system registers in the Cortex®-M7 processor with FPU.

<table>
<thead>
<tr>
<th>Address</th>
<th>Name</th>
<th>Type</th>
<th>Required privilege</th>
<th>Reset</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000ED88</td>
<td>CPACR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Coprocessor access control register on page 233</td>
</tr>
<tr>
<td>0xE000EF34</td>
<td>FPCCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0xC0000000</td>
<td>Floating-point context control register on page 234</td>
</tr>
<tr>
<td>0xE000EF38</td>
<td>FPCAR</td>
<td>RW</td>
<td>Privileged</td>
<td>-</td>
<td>Floating-point context address register on page 236</td>
</tr>
<tr>
<td>-</td>
<td>FPSCR (1)</td>
<td>RW</td>
<td>Unprivileged</td>
<td>-</td>
<td>Floating-point status control register on page 236</td>
</tr>
<tr>
<td>0xE000EF3C</td>
<td>FPDSCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Floating-point default status control register on page 237</td>
</tr>
</tbody>
</table>

1. The FPSCR register is not memory-mapped, it can be accessed using the VMSR and VMRS instructions, see VMRS on page 165 and VMSR on page 166. The software can only access the FPSCR when the FPU is enabled, see Enabling the FPU on page 238.

The following sections describe the floating-point system registers whose implementation is specific to this processor.

4.7.1 Coprocessor access control register

The CPACR register specifies the access privileges for coprocessors. See the register summary in Table 94 on page 233 for its attributes. The bit assignments are:

<table>
<thead>
<tr>
<th>Bit</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>31</td>
<td>Reserved</td>
</tr>
<tr>
<td>24</td>
<td>23 22 21 20 19</td>
</tr>
<tr>
<td>18</td>
<td>CP11</td>
</tr>
<tr>
<td>17</td>
<td>CP10</td>
</tr>
<tr>
<td>0</td>
<td>Reserved</td>
</tr>
</tbody>
</table>
4.7.2 Floating-point context control register

The FPCCR register sets or returns FPU control data. See the register summary in Table 94 on page 233 for its attributes. The bit assignments are:

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[2n+1:2n] for n values 10 and 11</td>
<td>CPn</td>
<td>Access privileges for coprocessor n. The possible values of each field are: 0b00: Access denied. Any attempted access generates a NOCP UsageFault. 0b01: Privileged access only. An unprivileged access generates a NOCP fault. 0b10: Reserved. The result of any access is Unpredictable. 0b11: Full access.</td>
</tr>
<tr>
<td>[19:0]</td>
<td>-</td>
<td>Reserved. Read as Zero, Write Ignore.</td>
</tr>
</tbody>
</table>

Table 95. CPACR bit assignments

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31]</td>
<td>ASPEN</td>
<td>Enables CONTROL.FPCA setting on execution of a floating-point instruction. This results in automatic hardware state preservation and restoration, for floating-point context, on exception entry and exit. 0: Disable CONTROL.FPCA setting on execution of a floating-point instruction. 1: Enable CONTROL.FPCA setting on execution of a floating-point instruction.</td>
</tr>
</tbody>
</table>
Table 96. FPCCR bit assignments (continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[8]</td>
<td>MONRDY</td>
<td>0: DebugMonitor is disabled or priority did not permit setting MON_PEND when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: DebugMonitor is enabled and priority permits setting MON_PEND when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td>[6]</td>
<td>BFRDY</td>
<td>0: BusFault is disabled or priority did not permit setting the BusFault handler to the pending state when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: BusFault is enabled and priority permitted setting the BusFault handler to the pending state when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td>[5]</td>
<td>MMRDY</td>
<td>0: MemManage is disabled or priority did not permit setting the MemManage handler to the pending state when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: MemManage is enabled and priority permitted setting the MemManage handler to the pending state when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td>[4]</td>
<td>HFRDY</td>
<td>0: Priority did not permit setting the HardFault handler to the pending state when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Priority permitted setting the HardFault handler to the pending state when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td>[3]</td>
<td>THREAD</td>
<td>0: Mode was not Thread Mode when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Mode was Thread Mode when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td>[1]</td>
<td>USER</td>
<td>0: Privilege level was not user when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Privilege level was user when the floating-point stack frame was allocated.</td>
</tr>
<tr>
<td>[0]</td>
<td>LSPACT</td>
<td>0: Lazy state preservation is not active.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Lazy state preservation is active. Floating-point stack frame has been allocated but saving state to it has been deferred.</td>
</tr>
</tbody>
</table>
4.7.3 Floating-point context address register

The FPCAR register holds the location of the unpopulated floating-point register space allocated on an exception stack frame. See the register summary in Table 94 on page 233 for its attributes. The bit assignments are:

**Figure 56. FPCAR bit assignments**

![FPCAR bit assignments](image)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:3]</td>
<td>ADDRESS</td>
<td>The location of the unpopulated floating-point register space allocated on an exception stack frame.</td>
</tr>
</tbody>
</table>

4.7.4 Floating-point status control register

The FPSCR register provides all necessary User level control of the floating-point system. The bit assignments are:

**Figure 57. FPSCR bit assignments**

![FPSCR bit assignments](image)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31]</td>
<td>N</td>
<td>Condition code flags. Floating-point comparison operations update these flags.</td>
</tr>
<tr>
<td></td>
<td>Z</td>
<td>Z: Zero condition code flag.</td>
</tr>
<tr>
<td>[29]</td>
<td>C</td>
<td>C: Carry condition code flag.</td>
</tr>
<tr>
<td>[27]</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[26]</td>
<td>AHP</td>
<td>Alternative half-precision control bit:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0: IEEE half-precision format selected.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1: Alternative half-precision format selected.</td>
</tr>
</tbody>
</table>
4.7.5 Floating-point default status control register

The FPDSCR register holds the default values for the floating-point status control data. See the register summary in Table 94 on page 233 for its attributes. The bit assignments are:

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
</table>
| [25] | DN   | Default NaN mode control bit:  
|      |      | 0: NaN operands propagate through to the output of a floating-point operation.  
|      |      | 1: Any operation involving one or more NaNs returns the Default NaN. |
| [24] | FZ   | Flush-to-zero mode control bit:  
|      |      | 0: Flush-to-zero mode disabled. Behavior of the floating-point system is fully compliant with the IEEE 754 standard.  
|      |      | 1: Flush-to-zero mode enabled.  
| [23:22] | RMode | Rounding Mode control field. The encoding of this field is:  
|      |      | 0b00: Round to Nearest (RN) mode  
|      |      | 0b01: Round towards Plus Infinity (RP) mode  
|      |      | 0b10: Round towards Minus Infinity (RM) mode  
|      |      | 0b11: Round towards Zero (RZ) mode.  
|      |      | The specified rounding mode is used by almost all floating-point instructions.  
| [7] | IDC | Input Denormal cumulative exception bit, see bits [4:0]. |
| [4] | IXC | Cumulative exception bits for floating-point exceptions, see also bit [7]. Each of these bits is set to 1 to indicate that the corresponding exception has occurred since 0 was last written to it. |
|      | XC: Inexact cumulative exception bit.  
|      | UFC: Underflow cumulative exception bit.  
|      | OFC: Overflow cumulative exception bit.  
|      | DZC: Division by Zero cumulative exception bit.  
|      | IOC: Invalid Operation cumulative exception bit. |
| [2] | DZC |  
| [1] | OFC |  
| [0] | IOC |  

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:27]</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[26]</td>
<td>AHP</td>
<td>Default value for FPSCR.AHP</td>
</tr>
</tbody>
</table>
4.7.6 Enabling the FPU

The FPU is disabled from reset. The user must enable it before using any floating-point instructions. Example 4-1: Enabling the FPU shows an example code sequence for enabling the FPU in privileged mode. The processor must be in privileged mode to read from and write to the CPACR.

Example 4-1: Enabling the FPU

```
CPACR  EQU  0xE000ED88
LDR    R0, =CPACR  ; Read CPACR
LDR    r1, [R0]  ; Set bits 20-23 to enable CP10 and CP11 coprocessors
ORR    R1, R1, #(0xF << 20)
STR     R1, [R0]  ; Write back the modified value to the CPACR
DSB
ISB                          ; Reset pipeline now the FPU is enabled.
```

Table 99. FPSCR bit assignments (continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[25]</td>
<td>DN</td>
<td>Default value for FPSCR.DN</td>
</tr>
<tr>
<td>[24]</td>
<td>FZ</td>
<td>Default value for FPSCR.FZ</td>
</tr>
<tr>
<td>[23:22]</td>
<td>RMode</td>
<td>Default value for FPSCR.RMode</td>
</tr>
<tr>
<td>[21:0]</td>
<td>-</td>
<td>Reserved</td>
</tr>
</tbody>
</table>

[column1] [column2] [column3]
4.7.7 Enabling and clearing FPU exception interrupts

The FPU exception flags are generating an interrupt through the interrupt controller. The FPU interrupt is globally controlled through the interrupt controller.

There is no individual mask and the enable/disable of the FPU interrupts is done at interrupt controller level. As it occurs very frequently, the IXC exception flag is not connected to the interrupt controller, and cannot generate an interrupt. If needed, it must be managed by polling.

Clearing the FPU exception flags depends on the FPU context save/restore configuration:

- **No floating-point register saving:** when Floating-point context control register (FPCCR) Bit 30 LSPEN=0 and Bit 31 ASPEN=0. You must clear interrupt source in Floating-point Status and Control Register (FPSCR).
  Example:
  ```c
  register uint32_t fpscr_val = 0;
  fpscr_val = __get_FPSCR();
  { check exception flags }
  fpscr_val &= (uint32_t)~0x8F; // Clear all exception flags
  __set_FPSCR(fpscr_val);
  ```

- **Lazy save/restore:** when Floating-point context control register (FPCCR) Bit 30 LSPEN=1 and Bit 31 ASPEN=X. In the case of lazy floating-point context save/restore, a dummy read access should be made to Floating-point Status and Control Register (FPSCR) to force state preservation and FPSCR clear. Then handle FPSCR in the stack.
  Example:
  ```c
  register uint32_t fpscr_val = 0;
  register uint32_t reg_val = 0;
  reg_val =  __get_FPSCR();        //dummy access
  fpscr_val=*(__IO uint32_t*)(FPU->FPCAR +0x40);
  { check exception flags }
  fpscr_val &= (uint32_t)~0x8F ;   // Clear all exception flags
  *(__IO uint32_t*)(FPU->FPCAR +0x40)=fpscr_val;
  __DMB() ;
  ```

- **Automatic floating-point registers save/restore:** when Floating-point context control register (FPCCR) Bit 30 LSPEN=0 and Bit 31 ASPEN=1. In the case of automatic floating-point context save/restore, a read access should be made to Floating-point Status and Control Register (FPSCR) to force clear. Then handle FPSCR in the stack.
  Example:
  ```c
  // FPU Exception handler
  void FPU_ExceptionHandler(uint32_t lr, uint32_t sp)
  {
    register uint32_t fpscr_val;
    if(lr == 0xFFFFFFE9)
    {
      sp = sp + 0x60;
    }
    else if(lr == 0xFFFFFFED)
  ```
{  
    sp = __get_PSP() + 0x60;  
}
fpscr_val = *(uint32_t*)sp;
{  
    check exception flags
    fpscr_val &= (uint32_t)-0x8F;  // Clear all exception flags
    *(uint32_t*)sp = fpscr_val;
    __DMB();
}

// FPU IRQ Handler
void __asm FPU_IRQHandler(void)
{
    IMPORT  FPU_ExceptionHandler
    MOV R0, LR  // move LR to R0
    MOV R1, SP  // Save SP to R1 to avoid any modification to
    // the stack pointer from FPU_ExceptionHandler
    VMRS R2, FPSCR  // dummy read access, to force clear
    B  FPU_ExceptionHandler
    BX LR
}

4.8 Cache maintenance operations

The cache maintenance operations are only accessible by privileged loads and stores.
Unprivileged accesses to these registers always generate a BusFault.

### Table 100. Cache maintenance space register summary

<table>
<thead>
<tr>
<th>Address</th>
<th>Name</th>
<th>Type</th>
<th>Required privilege</th>
<th>Reset value</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000EF50</td>
<td>ICIALU</td>
<td>WO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Instruction cache invalidate all to the Point of Unification (PoU)(^1)</td>
</tr>
<tr>
<td>0xE000EF54</td>
<td>-</td>
<td>-</td>
<td>-</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>0xE000EF58</td>
<td>ICIMVAU</td>
<td>WO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Instruction cache invalidate by address to the PoU(^1)</td>
</tr>
<tr>
<td>0xE000EF5C</td>
<td>DCIMVAC</td>
<td>WO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Data cache invalidate by address to the Point of Coherency (PoC)(^2)</td>
</tr>
<tr>
<td>0xE000EF60</td>
<td>DCISW</td>
<td>WO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Data cache invalidate by set/way</td>
</tr>
<tr>
<td>0xE000EF64</td>
<td>DCCMVAU</td>
<td>WO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Data cache by address to the PoU(^1)</td>
</tr>
<tr>
<td>0xE000EF68</td>
<td>DCCMVC</td>
<td>WO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Data cache clean by address to the PoC(^2)</td>
</tr>
<tr>
<td>0xE000EF6C</td>
<td>DCCSW</td>
<td>WO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Data cache clean by set/way</td>
</tr>
<tr>
<td>0xE000EF70</td>
<td>DCCMVAC</td>
<td>WO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Data cache clean and invalidate by address to the PoC(^2)</td>
</tr>
<tr>
<td>0xE000EF74</td>
<td>DCCSW</td>
<td>WO</td>
<td>Privileged</td>
<td>Unknown</td>
<td>Data cache clean and invalidate by set/way</td>
</tr>
<tr>
<td>0xE000EF78</td>
<td>BPIALL</td>
<td>RAZ/WI</td>
<td>Privileged</td>
<td>-</td>
<td>The BPIALL register is not implemented</td>
</tr>
</tbody>
</table>

1. Cache maintenance operations by PoU can be used to synchronize data between the Cortex®-M7 data and instruction Caches, for example when the software uses self-modifying code.
2. Cache maintenance operations by PoC can be used to synchronize data between the Cortex®-M7 data cache and an external agent such as a system DMA.

4.8.1 Full instruction cache operation

The ICIALLU is WO and write data is ignored and reads return 0. Writes to this register perform the requested cache maintenance operation. The BPIALL register is not implemented in the Cortex-M7 processor as branch predictor maintenance is not required. The register is RAZ/WI.

4.8.2 Instruction and data cache operations by address

The cache maintenance operations registers are ICIMVAU, DCIMVAC, DCCMVAU, DCCMVAC, and DCCIMVAC. These registers are WO, reads return 0. See the register summary in Table 100 for their attributes. The bit assignments are:

Table 101. Cache operation registers bit assignments

<table>
<thead>
<tr>
<th>Bit</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:0]</td>
<td>MVA</td>
<td>WO</td>
<td>MVA of requested operation</td>
</tr>
</tbody>
</table>

4.8.3 Data cache operations by set-way

The DCISW, DCCSW and DCCISW registers are WO. Reads return 0. See the register summary in Table 100 on page 240 for their attributes. The bit assignments are:

Figure 59. Cache operation bit assignments

Table 102. Cache operations by set-way bit assignments

<table>
<thead>
<tr>
<th>Bit</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td></td>
<td></td>
<td>For the data cache, values 0, 1, 2 and 3 are supported.</td>
</tr>
<tr>
<td>[29:14]</td>
<td>-</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[13:5]</td>
<td>Set</td>
<td>WO</td>
<td>Set/index that operation applies to. The number of indices in a cache depends on the configured cache size. When this is less than the maximum, use the LSB of this field. The number of sets in the cache can be determined by reading the Cache size ID register on page 219.</td>
</tr>
<tr>
<td>[4:1]</td>
<td>-</td>
<td>-</td>
<td>Reserved</td>
</tr>
<tr>
<td>[0]</td>
<td>-</td>
<td>-</td>
<td>Always reads as zero.</td>
</tr>
</tbody>
</table>
4.8.4  Cortex®-M7 cache maintenance operations using CMSIS

CMSIS functions enable the software portability between different Cortex®-M profile processors. To access cache maintenance operations when using CMSIS, use the following functions:

<table>
<thead>
<tr>
<th>CMSIS function</th>
<th>Descriptions</th>
</tr>
</thead>
<tbody>
<tr>
<td>void SCB_EnableICache(void)</td>
<td>Invalidate and then enable instruction cache</td>
</tr>
<tr>
<td>void SCB_DisableICache(void)</td>
<td>Disable instruction cache and invalidate its contents</td>
</tr>
<tr>
<td>void SCB_InvalidateICache(void)</td>
<td>Invalidate instruction cache</td>
</tr>
<tr>
<td>void SCB_EnableDCache(void)</td>
<td>Invalidate and then enable data cache</td>
</tr>
<tr>
<td>void SCB_DisableDCache(void)</td>
<td>Disable data cache and then clean and invalidate its contents</td>
</tr>
<tr>
<td>void SCB_InvalidateDCache(void)</td>
<td>Invalidate data cache</td>
</tr>
<tr>
<td>void SCB_CleanDCache(void)</td>
<td>Clean data cache</td>
</tr>
<tr>
<td>void SCB_CleanInvalidateDCache(void)</td>
<td>Clean and invalidate data cache</td>
</tr>
</tbody>
</table>

4.8.5  Initializing and enabling the L1-cache

The user can use cache maintenance operations for:
- Cache startup type operations.
- Manipulating the caches so that shared data is visible to other bus masters.
- Enabling data changed by an external DMA agent to be made visible to the Cortex®-M7 processor.

After enabling or disabling the instruction cache, the user must issue an ISB instruction to flush the pipeline. This ensures that all subsequent instruction fetches see the effect of enabling or disabling the instruction cache.

After reset, the user must invalidate each cache before enabling it.

When disabling the data cache, the user must clean the entire cache to ensure that any dirty data is flushed to external memory.

Before enabling the data cache, the user must invalidate the entire data cache if external memory might have changed since the cache was disabled.

Before enabling the instruction cache, the user must invalidate the entire instruction cache if external memory might have changed since the cache was disabled.

L1 data and instruction cache must be invalidated before they are enabled in the software, otherwise unpredictable behavior can occur.

Invalidating the entire data cache

The software can use the following code example to invalidate the entire data cache, if it has been included in the processor. The operation is carried out by iterating over each line of the cache and using the DCISW register in the Private Peripheral Bus (PPB) memory region to
invalidate the line. The number of cache ways and sets is determined by reading the CCSIDR register.

CCSIDR  EQU 0xE000ED80
CSSELR  EQU 0xE000ED84
DCISW   EQU 0xE000EF60
MOV r0, #0x0
LDR r11, =CSSELR
STR r0, [r11]                   ; Select Data Cache size
DSB
LDR r11, =CCSIDR
LDR r2, [r11]                   ; Cache size identification
AND r1, r2, #0x7               ; Number of words in a cache line
ADD r7, r1, #0x4
MOV r1, #0x3ff
ANDS r4, r1, r2, LSR #3
MOV r1, #0x7fff
ANDS r2, r1, r2, LSR #13
CLZ r6, r4
LDR r11, =DCISW
inv_loop1
    MOV r1, r4
inv_loop2
    LSL r3, r1, r6
    LSL r8, r2, r7
    ORRR r3, r3, r8
    STR r3, [r11]                   ; Invalidate D-cache line
    SUBS r1, r1, #0x1
    BGE inv_loop2
    SUBS r2, r2, #0x1
    BGE inv_loop1
    DSB
    ISB

Invalidate instruction cache

The user can use the following code example to invalidate the entire instruction cache, if it has been included in the processor. The operation is carried out by writing to the ICIALLU register in the PPB memory region.

ICIALLU EQU 0xE000EF50
MOV r0, #0x0
LDR r11, =ICIALLU
STR r0, [r11]
DSB
ISB
Enabling data and instruction caches

The user can use the following code example to enable the data and instruction cache after they have been initialized. The operation is carried out by modifying the CCR.IC and CCR.DC fields in the PPB memory region.

```
CCR     EQU 0xE000ED14
LDR r11, =CCR
LDR r0, [r11]
ORR r0, r0, #0x1:SHL:16    ; Set CCR.DC field
ORR r0, r0, #0x1:SHL:17    ; Set CCR.IC field
STR r0, [r11]
DSB
ISB
```

4.8.6 Faults handling considerations

Cache maintenance operations can result in a BusFault. Such fault events are asynchronous.

This type of BusFault:
- Does not cause escalation to HardFault where a BusFault handler is enabled.
- Never causes lockup.

Because the fault event is asynchronous, the software code for cache maintenance operations should use memory barrier instructions, such as DSB, on completion so that the fault event can be observed immediately.

4.8.7 Cache maintenance design hints and tips

The user must always place a DSB and ISB instruction sequence after a cache maintenance operation to ensure that the effect is observed by any following instructions in the software.

When using a cache maintenance operation by address or set/way a DSB instruction must be executed after any previous load or store, and before the maintenance operation, to guarantee that the effect of the load or store is observed by the operation. For example, if a store writes to the address accessed by a DCCMVAC the DSB instruction guarantees that the dirty data is correctly cleaned from the data cache.

When one or more maintenance operations have been executed, use of a DSB instruction guarantees that they have completed and that any following load or store operations executes in order after the maintenance operations.

Cache maintenance operations always complete in-order with respect to each other. This means only one DSB instruction is required to guarantee the completion of a set of maintenance operations.

The following code sequence shows how to use cache maintenance operations to synchronize the data and instruction caches for self-modifying code. The sequence is entered with <Rx> containing the new 32-bit instruction. Use STRH in the first line instead of STR for a 16-bit instruction:

```
STR <Rx>, <inst_address1>
DSB ; Ensure the data has been written to the cache.
```
STR <inst_address1>, DCCMVAU ; Clean data cache by MVA to point of
; unification (PoU).
STR <inst_address1>, ICIMVAU ; Invalidate instruction cache by MVA to
; PoU.
DSB ; Ensure completion of the invalidations.
ISB ; Synchronize fetched instruction stream.

4.9 Access control

Control of the L1-cache ECC and attribute override, the priority of AHB slave traffic, and
whether an access is mapped to TCM interfaces or AXI master interface, is defined by the
access control registers. The access control registers are:

Table 104. Access control register summary

<table>
<thead>
<tr>
<th>Address</th>
<th>Name</th>
<th>Type</th>
<th>Required privilege</th>
<th>Reset value</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0xE000EF90</td>
<td>ITCMCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Instruction and data tightly-coupled memory control</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>registers on page 246</td>
</tr>
<tr>
<td>0xE000EF94</td>
<td>DTCMCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>AHB control register on page 248</td>
</tr>
<tr>
<td>0xE000EF98</td>
<td>AHBPCR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>AHB bus fault status register on page 249</td>
</tr>
<tr>
<td>0xE000EF9C</td>
<td>CACR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Auxiliary cache control register on page 249</td>
</tr>
<tr>
<td>0xE000EFA0</td>
<td>ABFSR</td>
<td>RW</td>
<td>Privileged</td>
<td>0x00000000</td>
<td>Auxiliary bus fault status register on page 251</td>
</tr>
</tbody>
</table>

1. The reset value is implementation and configuration dependent and the silicon vendor changes this. If cache ECC is
configured the reset value is 0x00000000, if cache ECC is not configured the reset value is 0x00000002.
### 4.9.1 Instruction and data tightly-coupled memory control registers

The ITCMCR and DTCMCR control whether access is mapped to the TCM interfaces or the AXI master interface. The bit assignments are:

**Figure 60. ITCMCR and DTCMCR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:7]</td>
<td>-</td>
<td>-</td>
<td>Reserved, RAZ/WI.</td>
</tr>
</tbody>
</table>
| [6:3] | SZ | RO | TCM size. Indicates the size of the relevant TCM:  
  0b0000: No TCM implemented.  
  0b0011: 4KB.  
  0b0100: 8KB.  
  0b0101: 16KB.  
  0b0110: 32KB.  
  0b0111: 64KB.  
  0b1000: 128KB.  
  0b1001: 256KB.  
  0b1010: 512KB.  
  0b1011: 1MB.  
  0b1100: 2MB.  
  0b1101: 4MB.  
  0b1110: 8MB.  
  0b1111: 16MB.  
  All other encodings are reserved. |
| [2] | RETEN(1) | RW | Retry phase enable. When enabled the processor guarantees to honor the retry output on the corresponding TCM interface:  
  0: Retry phase disabled.  
  1: Retry phase enabled. |
| [1] | RMW(2) | RW | Read-Modify-Write (RMW) enable. Indicates that all sub-chunk writes to a given TCM use a RMW sequence:  
  0: RMW disabled.  
  1: RMW enabled. |
| [0] | EN | RW | TCM enable. When a TCM is disabled all accesses are made to the AXI master:  
  0: TCM disabled.  
  1: TCM enabled. |

1. The RETEN field in the ITCMCR and DTCMCR is used to support error detection and correction in the TCM.  
2. The RMW field in the ITCMCR and DTCMCR is used to support error detection and correction in the TCM.
Enabling the TCM

The TCM interfaces can be enabled at reset in the system by an external signal on the processor. If they are disabled at reset then the following code example can be used to enable both the instruction and data TCM interfaces in software:

```
ITCMCR  EQU 0xE000EF90
DTCMCR  EQU 0xE000EF94

LDR r11, =ITCMCR
LDR r0, [r11]
ORR r0, r0, #0x1  ; Set ITCMCR.EN field
STR r0, [r11]

LDR r11, =DTCMCR
LDR r0, [r11]
ORR r0, r0, #0x1  ; Set DTCMCR.EN field
STR r0, [r11]

DSB
ISB
```

Enabling the TCM retry and read-modify-write

If the TCM connected to the processor supports error detection and correction, the TCM interface should be configured to support the retry and read-modify-write features. These can be enabled at reset in the system by external signals on the processor. If they are disabled at reset then the following code example can be used to enable them in software:

```
ITCMCR  EQU 0xE000EF90
DTCMCR  EQU 0xE000EF94

LDR r11, =ITCMCR
LDR r0, [r11]
ORR r0, r0, #0x1:SHL:1  ; Set ITCMCR.RMW field
ORR r0, r0, #0x1:SHL:2  ; Set ITCMCR.RETEN field
STR r0, [r11]

LDR r11, =DTCMCR
LDR r0, [r11]
ORR r0, r0, #0x1:SHL:1  ; Set DTCMCR.RMW field
ORR r0, r0, #0x1:SHL:2  ; Set DTCMCR.RETEN field
STR r0, [r11]

DSB
ISB
```
### 4.9.2 AHBP control register

The AHBPCR controls accesses to the device on the AHBP or AXI master interface. The bit assignments are:

![Figure 61. AHBPCR bit assignments](MSv39649V1)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:4]</td>
<td>-</td>
<td>-</td>
<td>Reserved, RAZ/WI.</td>
</tr>
</tbody>
</table>
| [3:1]  | SZ | RO | AHBP size:  
0b001: 64 MBytes.  
0b010: 128 MBytes.  
0b011: 256 MBytes.  
0b100: 512 MBytes. |
| [0]    | EN  | RW | AHBP enable:  
0: AHBP disabled. When disabled all accesses are made to the AXI master.  
1: AHBP enabled. |

### Enabling the AHBP interface

The AHBP interface can be enabled at reset in the system by an external signal on the processor. If it is disabled at reset then the following code example can be used to enable the AHPB interface from software:

AHBPCR EQU 0xE000EF98

LDR r11, =AHBPCR  
LDR r0, [r11]  
orr r0, r0, #0x1 ; Set AHBPCR.EN field  
str r0, [r11]  

DSB  
ISB
4.9.3 Auxiliary cache control register

The CACR controls the L1-cache ECC and attribute override. The bit assignments are:

**Figure 62. CACR bit assignments**

<table>
<thead>
<tr>
<th>Bit</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:3]</td>
<td>Reserved</td>
<td>-</td>
<td>Reserved, RAZ/WI.</td>
</tr>
</tbody>
</table>
| [2] | FORCEWT | RW | Enables Force Write-through in the data cache:  
0: Disables Force Write-Through.  
1: Enables Force Write-Through. Cacheable Write-Back memory regions are treated as Write-Through.  
This bit is RAZ if the data cache is excluded. |
| [1] | ECCEN | RW | Enables ECC in the instruction and data cache:  
0: Enables ECC in the instruction and data cache.  
1: Disables ECC in the instruction and data cache.  
This bit is WI if both data cache and instruction cache are excluded or if ECC is not configured. If ECC is included in the processor the reset value of ECCEN is 0. If ECC is excluded the reset value of ECCEN is 1 |
| [0] | SIWT | RW | Enables cache coherency usage:  
0: Normal Cacheable Shared locations are treated as being Non-cacheable. Programmed inner cacheability attributes are ignored. This is the default mode of operation for Shared memory. Caches are transparent to software for these locations and therefore no software maintenance is required to maintain coherency.  
1: For the data cache, Normal Cacheable shared locations are treated as Write-through. For the instruction cache, shared locations are treated as being Non-cacheable. Programmed inner cacheability attributes are ignored. All writes are globally visible. Other memory agent updates are not visible to Cortex®-M7 software without suitable cache maintenance.  
Useful for heterogeneous MP-systems where, for example, Cortex®-M7 processor is integrated on the Accelerator Coherency Port (ACP) interface on an MP-capable processor. This bit is RAZ if the data cache is excluded. |
Disabling cache error checking and correction

If the cache error checking and correction is included in the processor it is enabled by default from reset. The following code example can be used to disable the feature. The operation is carried out by modifying the CACR.ECCEN field in the PPB memory region.

```assembly
CACR    EQU 0xE000EF9C
LDR r11, =CACR
LDR r0, [r11]
BFC r0, #0x1, #0x1     ; Clear CACR.ECCEN
STR r0, [r11]

DSB
ISB
```

Care must be taken when the software changes CACR.ECCEN. If CACR.ECCEN changes when the caches contain data, ECC information in the caches might not be correct for the new setting, resulting in unexpected errors and data loss. Therefore the software must only change CACR.ECCEN when both caches are turned off and both caches must be invalidated after the change.

4.9.4 AHB slave control register

The AHBSCR is used by software to control the priority of AHB slave traffic. The bit assignments are:

**Figure 63. AHBSCR bit assignments**

```
<table>
<thead>
<tr>
<th>31</th>
<th>30</th>
<th>21</th>
<th>10</th>
<th>9</th>
<th>8</th>
<th>7</th>
<th>6</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>Reserved</td>
<td>INITCOUNT</td>
<td>TPRI</td>
<td>CTL</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
```

**Table 108. AHBSCR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:16]</td>
<td>-</td>
<td>-</td>
<td>Reserved.</td>
</tr>
<tr>
<td>[15:11]</td>
<td>INITCOUNT</td>
<td>RW</td>
<td>Fairness counter initialization value. Use to demote access priority of the requestor selected by the AHBSCRT.CTL field. The reset value is 0b01. For round-robin mode set INITCOUNT to 0b01 and AHBSCR.CTL to 0b00 or 0b01. INITCOUNT must not be set to 0b00 because the demoted requestor always takes priority when contention occurs, which can lead to livelock. INITCOUNT is not used when AHBSCR.CTL is 0b11.</td>
</tr>
</tbody>
</table>
4.9.5 Auxiliary bus fault status register

The ABFSR stores information on the source of asynchronous bus faults. The ASBFSR bit assignments are:

**Figure 64. ABFSR bit assignments**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Type</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[31:10]</td>
<td>-</td>
<td></td>
<td>Reserved</td>
</tr>
<tr>
<td>[9:8]</td>
<td>AXIMTYPE</td>
<td></td>
<td>Indicates the type of fault on the AXIM interface: b00: OKAY b01: EXOKAY b10: SLVERR b11: DECERR Only valid when AXIM is 1.</td>
</tr>
<tr>
<td>[7:5]</td>
<td>-</td>
<td></td>
<td>Reserved</td>
</tr>
</tbody>
</table>
In the bus-fault handler, the software reads the BFSR, and if an asynchronous fault occurs, the ABFSR is read to determine which interfaces are affected. The ABFSR[4:0] fields remains valid until cleared by writing to the ABFSR with any value.

For more information about the BFSR, see *BusFault status register on page 207.*

### Table 109. ABFSR bit assignments (continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>[0]</td>
<td>ITCM</td>
<td>Asynchronous fault on ITCM interface</td>
</tr>
</tbody>
</table>

[0] ITCM Asynchronous fault on ITCM interface.
## Revision history

Table 110. Document revision history

<table>
<thead>
<tr>
<th>Date</th>
<th>Revision</th>
<th>Changes</th>
</tr>
</thead>
<tbody>
<tr>
<td>18-Dec-2015</td>
<td>1</td>
<td>Initial release.</td>
</tr>
<tr>
<td>22-Apr-2016</td>
<td>2</td>
<td>Updated Table 11: STM32F746xx/STM32F756xx Cortex®-M7 configuration. Added Table 12: STM32F76xxx/STM32F77xxx Cortex®-M7 configuration. In Section 2.3.2: Memory system ordering of memory accesses updated link to section 2.3.4: software ordering of memory accesses.</td>
</tr>
<tr>
<td>02-Feb-2017</td>
<td>3</td>
<td>Added Table 13: STM32F72xxx/STM32F73xxx Cortex®-M7 configuration.</td>
</tr>
<tr>
<td>15-Jun-2019</td>
<td>5</td>
<td>Updated DREGION function description in Table 85: TYPE bit assignments. Added Section 4.7.7: Enabling and clearing FPU exception interrupts.</td>
</tr>
</tbody>
</table>
IMPORTANT NOTICE – PLEASE READ CAREFULLY

STMicroelectronics NV and its subsidiaries (“ST”) reserve the right to make changes, corrections, enhancements, modifications, and improvements to ST products and/or to this document at any time without notice. Purchasers should obtain the latest relevant information on ST products before placing orders. ST products are sold pursuant to ST’s terms and conditions of sale in place at the time of order acknowledgement.

Purchasers are solely responsible for the choice, selection, and use of ST products and ST assumes no liability for application assistance or the design of Purchasers’ products.

No license, express or implied, to any intellectual property right is granted by ST herein.

Resale of ST products with provisions different from the information set forth herein shall void any warranty granted by ST for such product.

ST and the ST logo are trademarks of ST. For additional information about ST trademarks, please refer to www.st.com/trademarks. All other product or service names are the property of their respective owners.

Information in this document supersedes and replaces information previously supplied in any prior versions of this document.

© 2019 STMicroelectronics – All rights reserved