Architecture optimization method and device for loop body

An architecture optimization method applied in the computer field. It addresses the problem of speeding up the fetching of short-loop-body instructions, and achieves the effects of avoiding low processor core performance, reducing the frequency of pipeline flushes, and predicting loop bodies accurately.

Active Publication Date: 2022-03-01
GUANGDONG COMM & NETWORKS INST

AI Technical Summary

Problems solved by technology

[0004] On the other hand, research on front-end instruction fetching generally focuses on how to access the icache quickly and on instruction prefetching. For short loops (loops containing a small number of instructions), reducing the number of cache accesses would save power and speed up the fetching of the loop's instructions, but this type of research has received little attention.


Examples

Embodiment 2

[0140] This embodiment, with reference to Figure 7, provides an architecture optimization device for a loop body. The device is used to implement any of the architecture optimization methods for a loop body in the above method embodiments, and includes at least:

[0141] A short loop body judging module, used to judge whether the current instruction is a short loop body instruction;

[0142] A short loop body cache, used to cache the current instruction when the current instruction is a short loop body instruction;

[0143] A loop body conditional branch prediction module, used to predict whether the current instruction is a conditional branch instruction that controls the loop body. It looks up the PC value of the current instruction in the loop table to determine whether that PC value matches a valid record. The information recorded in the loop table includes the PC value cur_br_pc o...
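The loop table lookup described in [0143] can be sketched roughly as follows. This is only an illustration under stated assumptions: `cur_br_pc` is the one field named in the text, `jump_pc` mirrors the "jumppc" position mentioned in the abstract, and the `valid` flag, the table size, the direct-mapped indexing, and the 4-byte instruction width are all hypothetical choices, not details taken from the patent.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LoopEntry:
    valid: bool      # hypothetical flag: entry holds a usable record
    cur_br_pc: int   # PC of the loop-controlling conditional branch (named in the text)
    jump_pc: int     # assumed field: target from which later instructions are prefetched


class LoopTable:
    """Tiny direct-mapped loop table; size and indexing are assumptions."""

    def __init__(self, entries: int = 8) -> None:
        self.entries = [LoopEntry(False, 0, 0) for _ in range(entries)]

    def _index(self, pc: int) -> int:
        # Assume 4-byte instructions, so drop the two low bits before indexing.
        return (pc >> 2) % len(self.entries)

    def record(self, cur_br_pc: int, jump_pc: int) -> None:
        """Store a record for a branch that has been identified as loop-controlling."""
        self.entries[self._index(cur_br_pc)] = LoopEntry(True, cur_br_pc, jump_pc)

    def lookup(self, pc: int) -> Optional[int]:
        """Return the prefetch address if pc matches a valid record, else None."""
        entry = self.entries[self._index(pc)]
        if entry.valid and entry.cur_br_pc == pc:
            return entry.jump_pc
        return None


table = LoopTable()
table.record(cur_br_pc=0x1010, jump_pc=0x1000)
hit = table.lookup(0x1010)    # PC matches the valid record
miss = table.lookup(0x1014)   # no record for this PC
```

A hit means the current instruction is treated as the loop-controlling branch and the returned address is where prefetching would continue; a miss leaves the normal fetch path unchanged.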

Abstract

The invention provides an architecture optimization method and device for a loop body. The method comprises: judging whether the current instruction is a short loop body instruction and, if so, caching it in a short loop body cache; and looking up the PC value of the current instruction in a table, where, if a matching valid record exists, the current instruction is determined to be a conditional branch instruction that controls the loop body, and subsequent instructions are prefetched from the corresponding jumppc position in the table. With this method and device, the branch instructions of a short loop body can be fetched more efficiently and quickly with lower power consumption, and the probability of pipeline bubbles is reduced, which avoids as far as possible the low core performance caused by slow instruction fetching. The method also achieves more accurate loop body prediction: conditional branch instructions that control a loop body can be detected quickly, the frequency of pipeline flushes is reduced, and processor performance is improved.
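The fetch-side saving claimed for the short loop body cache can be pictured with a toy model. The sketch below is not the patented mechanism: the threshold `SHORT_LOOP_MAX_INSNS`, the fixed instruction width, the rule that a short loop is a backward branch spanning few instructions, and the policy of filling the buffer during the first iteration are all assumptions made for illustration.

```python
SHORT_LOOP_MAX_INSNS = 16  # hypothetical threshold; the text does not state one
INSN_BYTES = 4             # assumed fixed-width instructions


def is_short_loop(branch_pc: int, target_pc: int) -> bool:
    """Treat a backward branch spanning few instructions as a short loop."""
    if target_pc > branch_pc:
        return False  # forward branch: not a loop-closing branch in this model
    length = (branch_pc - target_pc) // INSN_BYTES + 1
    return length <= SHORT_LOOP_MAX_INSNS


def icache_accesses(target_pc: int, branch_pc: int, iterations: int) -> int:
    """Count icache accesses while a loop runs for the given number of iterations."""
    use_buffer = is_short_loop(branch_pc, target_pc)
    loop_buffer = set()  # models the short loop body cache
    accesses = 0
    for _ in range(iterations):
        for pc in range(target_pc, branch_pc + INSN_BYTES, INSN_BYTES):
            if pc in loop_buffer:
                continue          # served from the loop buffer, no icache access
            accesses += 1         # fetched from the icache
            if use_buffer:
                loop_buffer.add(pc)
    return accesses


short_loop = icache_accesses(0x1000, 0x100C, iterations=100)  # 4-instruction body
long_loop = icache_accesses(0x1000, 0x104C, iterations=100)   # 20-instruction body
```

In this model the 4-instruction loop touches the icache only during its first iteration, while the 20-instruction loop exceeds the assumed threshold and fetches from the icache on every iteration.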

Description

Technical field

[0001] The present disclosure relates to the field of computer technology, and in particular to a method and device for optimizing the architecture of a loop body.

Background technique

[0002] A program usually contains a large number of loop statements. In C code, these generally take the form of for, while, and do...while statements. The content of the loop body is fetched and executed repeatedly; the execution results differ only because the contents of the registers differ.

[0003] First of all, in terms of branch prediction, the academic community has proposed a "backward taken, forward not taken" strategy for the branch behavior of loop bodies. A backward jump here means that the jump target address is smaller than the current address; a forward jump means that the jump target address is greater than the current address. But there is no mention of how the loop body is detected. The conditional...
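The static strategy described in [0003] can be written down in a few lines. This is a minimal sketch of that background rule only, not of the invention; the helper `count_mispredictions` and the example addresses are hypothetical.

```python
def predict_taken(branch_pc: int, target_pc: int) -> bool:
    """Static rule: predict taken only when the target address is smaller than the branch PC."""
    return target_pc < branch_pc


def count_mispredictions(branch_pc: int, target_pc: int, outcomes) -> int:
    """Count how often the static prediction disagrees with the actual outcomes."""
    prediction = predict_taken(branch_pc, target_pc)
    return sum(1 for taken in outcomes if taken != prediction)


# A loop-closing branch that is taken nine times and then falls through on exit.
loop_outcomes = [True] * 9 + [False]
backward = count_mispredictions(0x1010, 0x1000, loop_outcomes)
```

For a loop-closing backward branch, the rule mispredicts only the final exit. It uses nothing but the direction of the jump, which is why, as the text notes, it says nothing about how a loop body is actually detected.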

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F9/30, G06F9/38
CPC: G06F9/30065, G06F9/3842
Inventor: 廖述京, 陈钦树, 欧艳凤, 朱晓明, 黄旭松
Owner: GUANGDONG COMM & NETWORKS INST