System structure optimization method and device for processor loop body

A technology of architecture and optimization method, applied in the computer field, can solve problems such as speeding up short-loop body instructions, and achieve the effects of avoiding low kernel performance, improving performance, and fast detection

Active Publication Date: 2022-05-03
GUANGDONG COMM & NETWORKS INST
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] On the other hand, front-end instruction fetching generally focuses on how to quickly access icache fetching of instructions and instruction prefetching, which is useful for reducing the number of accesses to the cache for short loops (the number of instructions in the loop is small) to save power consumption and speed up the execution of instructions in short loops. Taken out, this type of research has received little attention

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System structure optimization method and device for processor loop body
  • System structure optimization method and device for processor loop body
  • System structure optimization method and device for processor loop body

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 2

[0140] In this embodiment, reference is made to the Figure 7 , which provides an architecture optimization device for a processor loop body, which is used to implement any one of the above method embodiments for an architecture optimization method for a processor loop body, an architecture optimization method for a processor loop body The device includes at least:

[0141] The short-loop body judgment module is used to judge whether the current instruction is a short-loop body instruction;

[0142] The short loop body cache is used to cache the current instruction when the current instruction is a short loop body instruction;

[0143] The loop body conditional branch prediction module is used to predict whether the current instruction is a conditional branch instruction that controls the loop body, look up the PC value of the current instruction, and determine whether the current instruction PC value matches with a valid record in the loop table. The information recorded in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present disclosure provides a method and device for optimizing the architecture of a processor loop body. The method includes: judging whether the current instruction is a short loop body instruction, and if so, caching the current instruction in the short loop body cache; If there is a valid record matching in the table, it is determined that the current instruction is a conditional branch instruction of the control loop body, and the subsequent instruction is prefetched from the corresponding jump_pc in the table. The architecture optimization method and device for the processor loop body provided by the present disclosure can take out short loop body branch instructions with lower power consumption and more efficiently and quickly, reducing the probability of cavitation in the pipeline, thereby avoiding as much as possible The pipeline has the problem of low core performance caused by slow instruction fetching; more accurate loop body prediction is realized, and conditional branch instructions for loop body control can be quickly detected, reducing the frequency of pipeline flushing and improving processor performance.

Description

technical field [0001] The present disclosure relates to the field of computer technologies, and in particular, to an architecture optimization method and apparatus for a processor loop body. Background technique [0002] In a program, there are usually a large number of loop statements. In C code, it is generally embodied as a statement in the form of for, while, do...while. The content of the loop body needs to be repeatedly fetched and executed repeatedly. Just because the register contents are different, the result of execution is not the same. [0003] First of all, in terms of branch prediction, for the branch prediction behavior of the loop body, the academic community has proposed the strategy of "jump backward, not jump forward". The backward jump here means that the jump target address is smaller than the current address. Jump forward means that the jump target address is greater than the current address. But there is no mention of how the loop body is detected...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/30G06F9/38
CPCG06F9/30065G06F9/3842
Inventor 廖述京陈钦树欧艳凤朱晓明黄旭松
Owner GUANGDONG COMM & NETWORKS INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products