.. SPDX-License-Identifier: GPL-2.0 ========================= Introduction to LoongArch ========================= LoongArch is a new RISC ISA, which is a bit like MIPS or RISC-V. There are currently 3 variants: a reduced 32-bit version (LA32R), a standard 32-bit version (LA32S) and a 64-bit version (LA64). There are 4 privilege levels (PLVs) defined in LoongArch: PLV0~PLV3, from high to low. Kernel runs at PLV0 while applications run at PLV3. This document introduces the registers, basic instruction set, virtual memory and some other topics of LoongArch. Registers ========= LoongArch registers include general purpose registers (GPRs), floating point registers (FPRs), vector registers (VRs) and control status registers (CSRs) used in privileged mode (PLV0). GPRs ---- LoongArch has 32 GPRs ( ``$r0`` ~ ``$r31`` ); each one is 32-bit wide in LA32 and 64-bit wide in LA64. ``$r0`` is hard-wired to zero, and the other registers are not architecturally special. (Except ``$r1``, which is hard-wired as the link register of the BL instruction.) The kernel uses a variant of the LoongArch register convention, as described in the LoongArch ELF psABI spec, in :ref:`References <loongarch-references>`: ================= =============== =================== ============ Name Alias Usage Preserved across calls ================= =============== =================== ============ ``$r0`` ``$zero`` Constant zero Unused ``$r1`` ``$ra`` Return address No ``$r2`` ``$tp`` TLS/Thread pointer Unused ``$r3`` ``$sp`` Stack pointer Yes ``$r4``-``$r11`` ``$a0``-``$a7`` Argument registers No ``$r4``-``$r5`` ``$v0``-``$v1`` Return value No ``$r12``-``$r20`` ``$t0``-``$t8`` Temp registers No ``$r21`` ``$u0`` Percpu base address Unused ``$r22`` ``$fp`` Frame pointer Yes ``$r23``-``$r31`` ``$s0``-``$s8`` Static registers Yes ================= =============== =================== ============ .. Note:: The register ``$r21`` is reserved in the ELF psABI, but used by the Linux kernel for storing the percpu base address. It normally has no ABI name, but is called ``$u0`` in the kernel. You may also see ``$v0`` or ``$v1`` in some old code,however they are deprecated aliases of ``$a0`` and ``$a1`` respectively. FPRs ---- LoongArch has 32 FPRs ( ``$f0`` ~ ``$f31`` ) when FPU is present. Each one is 64-bit wide on the LA64 cores. The floating-point register convention is the same as described in the LoongArch ELF psABI spec: ================= ================== =================== ============ Name Alias Usage Preserved across calls ================= ================== =================== ============ ``$f0``-``$f7`` ``$fa0``-``$fa7`` Argument registers No ``$f0``-``$f1`` ``$fv0``-``$fv1`` Return value No ``$f8``-``$f23`` ``$ft0``-``$ft15`` Temp registers No ``$f24``-``$f31`` ``$fs0``-``$fs7`` Static registers Yes ================= ================== =================== ============ .. Note:: You may see ``$fv0`` or ``$fv1`` in some old code, however they are deprecated aliases of ``$fa0`` and ``$fa1`` respectively. VRs ---- There are currently 2 vector extensions to LoongArch: - LSX (Loongson SIMD eXtension) with 128-bit vectors, - LASX (Loongson Advanced SIMD eXtension) with 256-bit vectors. LSX brings ``$v0`` ~ ``$v31`` while LASX brings ``$x0`` ~ ``$x31`` as the vector registers. The VRs overlap with FPRs: for example, on a core implementing LSX and LASX, the lower 128 bits of ``$x0`` is shared with ``$v0``, and the lower 64 bits of ``$v0`` is shared with ``$f0``; same with all other VRs. CSRs ---- CSRs can only be accessed from privileged mode (PLV0): ================= ===================================== ============== Address Full Name Abbrev Name ================= ===================================== ============== 0x0 Current Mode Information CRMD 0x1 Pre-exception Mode Information PRMD 0x2 Extension Unit Enable EUEN 0x3 Miscellaneous Control MISC 0x4 Exception Configuration ECFG 0x5 Exception Status ESTAT 0x6 Exception Return Address ERA 0x7 Bad (Faulting) Virtual Address BADV 0x8 Bad (Faulting) Instruction Word BADI 0xC Exception Entrypoint Address EENTRY 0x10 TLB Index TLBIDX 0x11 TLB Entry High-order Bits TLBEHI 0x12 TLB Entry Low-order Bits 0 TLBELO0 0x13 TLB Entry Low-order Bits 1 TLBELO1 0x18 Address Space Identifier ASID 0x19 Page Global Directory Address for PGDL Lower-half Address Space 0x1A Page Global Directory Address for PGDH Higher-half Address Space 0x1B Page Global Directory Address PGD 0x1C Page Walk Control for Lower- PWCL half Address Space 0x1D Page Walk Control for Higher- PWCH half Address Space 0x1E STLB Page Size STLBPS 0x1F Reduced Virtual Address Configuration RVACFG 0x20 CPU Identifier CPUID 0x21 Privileged Resource Configuration 1 PRCFG1 0x22 Privileged Resource Configuration 2 PRCFG2 0x23 Privileged Resource Configuration 3 PRCFG3 0x30+n (0≤n≤15) Saved Data register SAVEn 0x40 Timer Identifier TID 0x41 Timer Configuration TCFG 0x42 Timer Value TVAL 0x43 Compensation of Timer Count CNTC 0x44 Timer Interrupt Clearing TICLR 0x60 LLBit Control LLBCTL 0x80 Implementation-specific Control 1 IMPCTL1 0x81 Implementation-specific Control 2 IMPCTL2 0x88 TLB Refill Exception Entrypoint TLBRENTRY Address 0x89 TLB Refill Exception BAD (Faulting) TLBRBADV Virtual Address 0x8A TLB Refill Exception Return Address TLBRERA 0x8B TLB Refill Exception Saved Data TLBRSAVE Register 0x8C TLB Refill Exception Entry Low-order TLBRELO0 Bits 0 0x8D TLB Refill Exception Entry Low-order TLBRELO1 Bits 1 0x8E TLB Refill Exception Entry High-order TLBEHI Bits 0x8F TLB Refill Exception Pre-exception TLBRPRMD Mode Information 0x90 Machine Error Control MERRCTL 0x91 Machine Error Information 1 MERRINFO1 0x92 Machine Error Information 2 MERRINFO2 0x93 Machine Error Exception Entrypoint MERRENTRY Address 0x94 Machine Error Exception Return MERRERA Address 0x95 Machine Error Exception Saved Data MERRSAVE Register 0x98 Cache TAGs CTAG 0x180+n (0≤n≤3) Direct Mapping Configuration Window n DMWn 0x200+2n (0≤n≤31) Performance Monitor Configuration n PMCFGn 0x201+2n (0≤n≤31) Performance Monitor Overall Counter n PMCNTn 0x300 Memory Load/Store WatchPoint MWPC Overall Control 0x301 Memory Load/Store WatchPoint MWPS Overall Status 0x310+8n (0≤n≤7) Memory Load/Store WatchPoint n MWPnCFG1 Configuration 1 0x311+8n (0≤n≤7) Memory Load/Store WatchPoint n MWPnCFG2 Configuration 2 0x312+8n (0≤n≤7) Memory Load/Store WatchPoint n MWPnCFG3 Configuration 3 0x313+8n (0≤n≤7) Memory Load/Store WatchPoint n MWPnCFG4 Configuration 4 0x380 Instruction Fetch WatchPoint FWPC Overall Control 0x381 Instruction Fetch WatchPoint FWPS Overall Status 0x390+8n (0≤n≤7) Instruction Fetch WatchPoint n FWPnCFG1 Configuration 1 0x391+8n (0≤n≤7) Instruction Fetch WatchPoint n FWPnCFG2 Configuration 2 0x392+8n (0≤n≤7) Instruction Fetch WatchPoint n FWPnCFG3 Configuration 3 0x393+8n (0≤n≤7) Instruction Fetch WatchPoint n FWPnCFG4 Configuration 4 0x500 Debug Register DBG 0x501 Debug Exception Return Address DERA 0x502 Debug Exception Saved Data Register DSAVE ================= ===================================== ============== ERA, TLBRERA, MERRERA and DERA are sometimes also known as EPC, TLBREPC, MERREPC and DEPC respectively. Basic Instruction Set ===================== Instruction formats ------------------- LoongArch instructions are 32 bits wide, belonging to 9 basic instruction formats (and variants of them): =========== ========================== Format name Composition =========== ========================== 2R Opcode + Rj + Rd 3R Opcode + Rk + Rj + Rd 4R Opcode + Ra + Rk + Rj + Rd 2RI8 Opcode + I8 + Rj + Rd 2RI12 Opcode + I12 + Rj + Rd 2RI14 Opcode + I14 + Rj + Rd 2RI16 Opcode + I16 + Rj + Rd 1RI21 Opcode + I21L + Rj + I21H I26 Opcode + I26L + I26H =========== ========================== Rd is the destination register operand, while Rj, Rk and Ra ("a" stands for "additional") are the source register operands. I8/I12/I14/I16/I21/I26 are immediate operands of respective width. The longer I21 and I26 are stored in separate higher and lower parts in the instruction word, denoted by the "L" and "H" suffixes. List of Instructions -------------------- For brevity, only instruction names (mnemonics) are listed here; please see the :ref:`References <loongarch-references>` for details. 1. Arithmetic Instructions:: ADD.W SUB.W ADDI.W ADD.D SUB.D ADDI.D SLT SLTU SLTI SLTUI AND OR NOR XOR ANDN ORN ANDI ORI XORI MUL.W MULH.W MULH.WU DIV.W DIV.WU MOD.W MOD.WU MUL.D MULH.D MULH.DU DIV.D DIV.DU MOD.D MOD.DU PCADDI PCADDU12I PCADDU18I LU12I.W LU32I.D LU52I.D ADDU16I.D 2. Bit-shift Instructions:: SLL.W SRL.W SRA.W ROTR.W SLLI.W SRLI.W SRAI.W ROTRI.W SLL.D SRL.D SRA.D ROTR.D SLLI.D SRLI.D SRAI.D ROTRI.D 3. Bit-manipulation Instructions:: EXT.W.B EXT.W.H CLO.W CLO.D SLZ.W CLZ.D CTO.W CTO.D CTZ.W CTZ.D BYTEPICK.W BYTEPICK.D BSTRINS.W BSTRINS.D BSTRPICK.W BSTRPICK.D REVB.2H REVB.4H REVB.2W REVB.D REVH.2W REVH.D BITREV.4B BITREV.8B BITREV.W BITREV.D MASKEQZ MASKNEZ 4. Branch Instructions:: BEQ BNE BLT BGE BLTU BGEU BEQZ BNEZ B BL JIRL 5. Load/Store Instructions:: LD.B LD.BU LD.H LD.HU LD.W LD.WU LD.D ST.B ST.H ST.W ST.D LDX.B LDX.BU LDX.H LDX.HU LDX.W LDX.WU LDX.D STX.B STX.H STX.W STX.D LDPTR.W LDPTR.D STPTR.W STPTR.D PRELD PRELDX 6. Atomic Operation Instructions:: LL.W SC.W LL.D SC.D AMSWAP.W AMSWAP.D AMADD.W AMADD.D AMAND.W AMAND.D AMOR.W AMOR.D AMXOR.W AMXOR.D AMMAX.W AMMAX.D AMMIN.W AMMIN.D 7. Barrier Instructions:: IBAR DBAR 8. Special Instructions:: SYSCALL BREAK CPUCFG NOP IDLE ERTN(ERET) DBCL(DBGCALL) RDTIMEL.W RDTIMEH.W RDTIME.D ASRTLE.D ASRTGT.D 9. Privileged Instructions:: CSRRD CSRWR CSRXCHG IOCSRRD.B IOCSRRD.H IOCSRRD.W IOCSRRD.D IOCSRWR.B IOCSRWR.H IOCSRWR.W IOCSRWR.D CACOP TLBP(TLBSRCH) TLBRD TLBWR TLBFILL TLBCLR TLBFLUSH INVTLB LDDIR LDPTE Virtual Memory ============== LoongArch supports direct-mapped virtual memory and page-mapped virtual memory. Direct-mapped virtual memory is configured by CSR.DMWn (n=0~3), it has a simple relationship between virtual address (VA) and physical address (PA):: VA = PA + FixedOffset Page-mapped virtual memory has arbitrary relationship between VA and PA, which is recorded in TLB and page tables. LoongArch's TLB includes a fully-associative MTLB (Multiple Page Size TLB) and set-associative STLB (Single Page Size TLB). By default, the whole virtual address space of LA32 is configured like this: ============ =========================== ============================= Name Address Range Attributes ============ =========================== ============================= ``UVRANGE`` ``0x00000000 - 0x7FFFFFFF`` Page-mapped, Cached, PLV0~3 ``KPRANGE0`` ``0x80000000 - 0x9FFFFFFF`` Direct-mapped, Uncached, PLV0 ``KPRANGE1`` ``0xA0000000 - 0xBFFFFFFF`` Direct-mapped, Cached, PLV0 ``KVRANGE`` ``0xC0000000 - 0xFFFFFFFF`` Page-mapped, Cached, PLV0 ============ =========================== ============================= User mode (PLV3) can only access UVRANGE. For direct-mapped KPRANGE0 and KPRANGE1, PA is equal to VA with bit30~31 cleared. For example, the uncached direct-mapped VA of 0x00001000 is 0x80001000, and the cached direct-mapped VA of 0x00001000 is 0xA0001000. By default, the whole virtual address space of LA64 is configured like this: ============ ====================== ====================================== Name Address Range Attributes ============ ====================== ====================================== ``XUVRANGE`` ``0x0000000000000000 - Page-mapped, Cached, PLV0~3 0x3FFFFFFFFFFFFFFF`` ``XSPRANGE`` ``0x4000000000000000 - Direct-mapped, Cached / Uncached, PLV0 0x7FFFFFFFFFFFFFFF`` ``XKPRANGE`` ``0x8000000000000000 - Direct-mapped, Cached / Uncached, PLV0 0xBFFFFFFFFFFFFFFF`` ``XKVRANGE`` ``0xC000000000000000 - Page-mapped, Cached, PLV0 0xFFFFFFFFFFFFFFFF`` ============ ====================== ====================================== User mode (PLV3) can only access XUVRANGE. For direct-mapped XSPRANGE and XKPRANGE, PA is equal to VA with bits 60~63 cleared, and the cache attribute is configured by bits 60~61 in VA: 0 is for strongly-ordered uncached, 1 is for coherent cached, and 2 is for weakly-ordered uncached. Currently we only use XKPRANGE for direct mapping and XSPRANGE is reserved. To put this in action: the strongly-ordered uncached direct-mapped VA (in XKPRANGE) of 0x00000000_00001000 is 0x80000000_00001000, the coherent cached direct-mapped VA (in XKPRANGE) of 0x00000000_00001000 is 0x90000000_00001000, and the weakly-ordered uncached direct-mapped VA (in XKPRANGE) of 0x00000000 _00001000 is 0xA0000000_00001000. Relationship of Loongson and LoongArch ====================================== LoongArch is a RISC ISA which is different from any other existing ones, while Loongson is a family of processors. Loongson includes 3 series: Loongson-1 is the 32-bit processor series, Loongson-2 is the low-end 64-bit processor series, and Loongson-3 is the high-end 64-bit processor series. Old Loongson is based on MIPS, while New Loongson is based on LoongArch. Take Loongson-3 as an example: Loongson-3A1000/3B1500/3A2000/3A3000/3A4000 are MIPS-compatible, while Loongson- 3A5000 (and future revisions) are all based on LoongArch. .. _loongarch-references: References ========== Official web site of Loongson Technology Corp. Ltd.: http://www.loongson.cn/ Developer web site of Loongson and LoongArch (Software and Documentation): http://www.loongnix.cn/ https://github.com/loongson/ https://loongson.github.io/LoongArch-Documentation/ Documentation of LoongArch ISA: https://github.com/loongson/LoongArch-Documentation/releases/latest/download/LoongArch-Vol1-v1.02-CN.pdf (in Chinese) https://github.com/loongson/LoongArch-Documentation/releases/latest/download/LoongArch-Vol1-v1.02-EN.pdf (in English) Documentation of LoongArch ELF psABI: https://github.com/loongson/LoongArch-Documentation/releases/latest/download/LoongArch-ELF-ABI-v2.01-CN.pdf (in Chinese) https://github.com/loongson/LoongArch-Documentation/releases/latest/download/LoongArch-ELF-ABI-v2.01-EN.pdf (in English) Linux kernel repository of Loongson and LoongArch: https://git.kernel.org/pub/scm/linux/kernel/git/chenhuacai/linux-loongson.git