Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

62 results about "Adder tree" patented technology

Signed multiply-accumulate algorithm method using adder tree structure

InactiveCN102681815AReduce areaRealize multiply-accumulate calculationDigital data processing detailsProgrammable logic deviceDistributed algorithm
The invention relates to a hardware multiply-accumulate algorithm and particularly relates to a signed multiply-accumulate algorithm method using an adder tree structure. According to the method, data are divided into two parts, i.e. coefficient items and data items, by adopting a complement form, the data items are decomposed into bit units according to a binary principle, the coefficient items are subjected to binary multiplication with bits of the data items according to an addition allocation principle, and the products are subjected to binary accumulation so as to obtain a final output result. The signed multiply-accumulate algorithm method using the adder tree structure overcomes the disadvantage of fixed coefficients of the original DA distributed algorithm and does not need a large number of ROMs (Read Only Memories) to serve as a coefficient table, the occupied area of a chip is smaller, the multiply-accumulate calculation of signed numbers is realized, and the extension is convenient, so that the signed multiply-accumulate algorithm method is especially suitable for the realization of a programmable logic device.
Owner:深圳市清友能源技术有限公司

Reconfigurable multi-precision integer dot-product hardware accelerator for machine-learning applications

A configurable integrated circuit to compute vector dot products between a first N-bit vector and a second N-bit vector in a plurality of precision modes. An embodiment includes M slices, each of which calculates the vector dot products between a corresponding segment of the first and the second N-bit vectors. Each of the slices outputs intermediary multiplier results for the lower precision modes, but not for highest precision mode. A plurality of adder trees to sum up the plurality of intermediate multiplier results, with each adder tree producing a respective adder out result. An accumulator to merge the adder out result from a first adder tree with the adder out result from a second adder tree to produce the vector dot product of the first and the second N-bit vector in the highest precision mode.
Owner:INTEL CORP

High efficiency video coding adder tree parallel implementation method

ActiveCN105847810ACalculation speedImprove the efficiency of motion estimation calculationDigital video signal modificationDigital videoComputation process
The invention provides a high efficiency video coding (HEVC) adder tree parallel implementation method, and relates to the technical field of digital video coding and decoding. By utilizing a two-dimensional processing element array structure, an SAD value in a luminance block division mode is computed and is subjected to parallel processing, so that motion estimation computation efficiency is effectively improved; and by utilizing a method for selecting a processing element (PE) for storing the SAD value according to the type of the block division mode, computation speed of an adder tree is increased, and computation efficiency is improved. Compared with the traditional pixel block storage manner (storing a single pixel through the single PE ), a manner of storing 4*4 pixel blocks through the single PE has the advantage that the amount of utilized PEs is reduced to 1/16th of the original amount of the utilized PEs; compared with an adder tree serial structure implementation method, the parallel implementation method has the speed increased to nearly 100 times; and computation of the SAD values in twelve types of the block division modes are obtained through combination of the SAD values in 4*4 block division modes, so that excess computation processes can be reduced, and computation efficiency is improved.
Owner:XIAN UNIV OF POSTS & TELECOMM

Signal detection circuit and method based on Gini Correlation

The invention relates to a signal detection circuit and method based on Gini Correlation. The circuit comprises two comparator arrays, a subtracter array, two multiplier arrays, two adder trees, a divider and a register, wherein the comparator arrays and the subtracter array are n*n matrixes (n is a signal length); each element on the matrixes is applied to comparison computation and subtraction computation respectively; the multiplier arrays are n2 2-input multipliers; the adder trees are n2-input adders; the divider is used for finishing 2-input division computation; and the register is used for registering results of relevant computations. When environmental noise comprises a pulse noise component, matching of a filter with a Pearson's Product Moment Correlation Coefficient basically fails. The Gini Correlation shows excellent robustness, comprising mathematical expectation which is very close to a true value and a small standard different, in a noise environment comprising pulse components.
Owner:中山易美杰智能科技有限公司

Asymmetrical partition mode based high efficiency video coding adder tree parallel realization method

ActiveCN105578189ACalculation speedImprove the efficiency of motion estimation calculationDigital video signal modificationDigital videoProcessing element
The invention discloses an asymmetrical partition mode based high efficiency video coding adder tree parallel realization method, relating to the technical field of digital video coding and decoding. According to the method, a two-dimension processing element array structure is adopted, and an SAD value of a brightness block partition mode is calculated and processed parallelly, so that the motion estimation operation efficiency is improved effectively. A PE for storing the SAD value is selected according to whether the SAD value is used in subsequent process, so that the calculation speed of anadder tree is improved, and the calculation efficiency is improved. In the conventional pixel block storage manner, a single PE stores a single pixel, while in the method, a single PE stores 4*4 pixel blocks, so that the quantity of processing units is reduced to 1 / 16 that of processing units in prior art. Compared with the adder tree serial structure realization method, the parallel structure improves the speed by nearly 92 times. Calculation of the SAD values of 36 partition modes is realized through merging of SAD values of 4*4 partition mode, so that excessive calculation steps are reduced, and the calculation efficiency is improved.
Owner:XIAN UNIV OF POSTS & TELECOMM

Adder tree structure DSP system and method

A Wallace tree structure such as that used in a DSP is arranged to sum vectors. The structure has a number of adder stages (365, 370, 375), each of which may have half adders (300) with two input nodes, and full adders (310) with three input nodes. The structure is designed with reference to the vectors to be summed. The number of full- and half-adders in each stage and the arrangement of vector inputs depends upon their characteristics. An algorithm calculates the possible tree structures and input arrangements, and selects an optimum design having a small final stage ripple adder (380), the design being based upon the characteristics of the vector inputs. This leads to reduced propagation delay and a reduced amount of semiconductor material for implementation of the DSP.
Owner:NXP USA INC

An interleaved matching filtering method suitable for specific integrated circuit design

The invention discloses an interleaved matching filtering method suitable for specific integrated circuit design, and belongs to the technical field of application specific integrated circuits. At first, that delay unit is used to delay the spread spectrum oversampling signal; Then, the delayed data is judged and processed, and the delayed signal is directly output or inverted according to the value of the corresponding position of the PN code, and the inverted operation is accomplished by the way of bit-by-bit inverted compensation. Finally, the pipeline adder tree is used to add and registerthe judged data step by step, and the result of the last addition plus the compensation value is the output result of the odd-even interleaved matched filter. The invention has the following advantages: (1) the design structure of the digitally matched filter can be simplified; (2) the logic sequence is optimized and logic resources are saved; (3) the timing path is shortened to meet the higher timing requirements; (4) The output result is still the sequential matched filter result of each sampling value, so it is easy to process the subsequent signal.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Approximate calculation device and method suitable for neural network data and weight pre-classification

According to the approximate calculation device and method suitable for neural network data and weight pre-classification, weight parameters of a neural network are pre-processed, fine classification is conducted according to the number of continuous' 0 'from the lowest bit, simple classification is conducted on input data according to the same features, and the two parties are combined to serve as an important basis for configuring an approximate scheme; approximate calculation arrays with different approximation degrees are configured by controlling approximate line positions of the full adder adder tree accumulation circuit and the low-order or approximate adder accumulation circuit. Aiming at the characteristic that weight parameters of a neural network model are often known and fixed, fine pre-classification processing is performed on the weight parameters, fine simple dichotomy processing is performed according to the characteristics of input system data, and approximate multiplication operation arrays with different configurations are dynamically selected by combining the fine pre-classification processing and the dichotomy processing. The selection of the approximation scheme is accurate, the characteristics of the corresponding data and weight are matched in real time, and the blindness, hysteresis and complexity of the dynamic configuration scheme of the existing approximation multiplication are overcome.
Owner:NANJING PROCHIP ELECTRONIC TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products