Processor device, instruction execution method thereof and computing equipment
A technology of instruction execution and processor, applied in the field of processor device and its instruction execution method, and computing equipment, can solve the problems of system performance loss, small number, predicate registers cannot be shared, etc., and achieve the effect of reducing usage and making full use of it
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
example 1
[0093] set_predicate_base 32
[0094] @P1 FMA R4,R5,R6,R7
[0095] The meaning of the above instruction means that the predicate base address setting special instruction is executed to set the predicate base address register PredicateBase to 32, that is, the general register W32 is used as the predicate register P0 of the current warp, and the default value is no longer 0.
[0096] Assume that general-purpose registers W32=0x 01 FF FFFF, W33=0x 80 FF FFFF, thread mask register LaneMask=0x FFFFFFFF, the above instruction indicates that the instruction corresponding to the current program counter PC is to execute the fusion multiplication and addition FMA instruction, and the fusion is executed based on the predicate register P1 Multiply and add FMA instruction.
[0097] Since the general-purpose register W33 represents the predicate register P1, this FMA instruction is invalid for threads 24-30 in the thread warp, and is valid for threads 0-23, that is, for threads 0-23, perfo...
example 2
[0099] set_predicate_base 8
[0100] @! P0 sub R7, R1, R2
[0101] The meaning of the above instruction means that the predicate base address setting special instruction is executed to set the predicate base address register PredicateBase to 8, that is, the general register W8 is the predicate register P0 of the current warp, use W8 as P0, and execute the SUB instruction after negating P0.
[0102] Assuming general register W8=0x 00 FF FFFF, then! P0=0xFF 00 00 00, then the above instruction SUB is invalid for threads 0-23 and valid for threads 24-31, that is, the subtraction operation of the operands in registers R1 and R2 is performed on threads 24-31, and the result is output to register R7.
example 3
[0104] set_predicate_base 16
[0105] @! P1 sub R7,R1,R2
[0106] The meaning of the above instruction means that the predicate base address register PredicateBase is set to 16 by executing the special predicate base address setting instruction, that is, the general register W16 is the predicate register P0 of the current warp, and W17 is the predicate register P1 of the current warp.
[0107] Assuming general register W17=0x 00 0F FF FF, then! P1=0x FFF0 00 00, then the above instruction SUB is invalid for threads 0-19 and valid for threads 20-31, that is, the subtraction of operands in registers R1 and R2 is performed on threads 20-31, and the result is output to register R7.
[0108] Figure 4 is a schematic flowchart of a method for executing an instruction by a processor device according to an embodiment of the present application. like Figure 4 As shown, the instruction execution method in the embodiment of the present application is applicable to the processor dev...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com