A method and device for controlling an intelligent reflective surface based on deep reinforcement learning

A technology of reinforcement learning and reflective surfaces, applied in the field of control of intelligent reflective surfaces, can solve problems such as low learning efficiency and poor stability

Active Publication Date: 2021-02-19
SUN YAT SEN UNIV
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a control method and device for intelligent reflective surfaces based on deep reinforcement learning to solve the technical problems of low learning efficiency and poor stability of traditional deep reinforcement learning methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and device for controlling an intelligent reflective surface based on deep reinforcement learning
  • A method and device for controlling an intelligent reflective surface based on deep reinforcement learning
  • A method and device for controlling an intelligent reflective surface based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Embodiments of the present invention provide a control method and device for an intelligent reflective surface based on deep reinforcement learning to solve the technical problems of low learning efficiency and poor stability of traditional deep reinforcement learning methods.

[0042] In order to facilitate the understanding of the present invention, the present invention will be described more fully below with reference to the associated drawings. A preferred embodiment of the invention is shown in the drawings. However, the present invention can be embodied in many different forms and is not limited to the embodiments described herein. Rather, these embodiments are provided so that the disclosure of the present invention will be thorough and complete.

[0043] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field of the invention. The terms used herein in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a method and device for controlling an intelligent reflective surface based on deep reinforcement learning, wherein the method includes: a policy network generates a first action according to a first state; the amplitude is fixed and input into an optimization module, and the first action is updated to obtain a second action action, and get the first target value at the same time; apply the second action to the wireless environment to get the second state, get a new sample and store it in the experience pool; the policy network and the value network carry out deep deterministic policy gradient training according to the sample, and the executor uses The deep deterministic policy gradient method updates its parameters; determines the third target value according to the first target value and the second target value generated by the target Q network, trains the DNN of the online Q network according to the third target value and updates its parameters; repeats the above The steps are until the network parameters that minimize the transmit power of the AP are obtained and output. The invention can realize stable and high-efficiency learning in a shorter time, and can converge to the optimal goal faster.

Description

technical field [0001] The invention relates to the technical field of wireless communication networks, in particular to a control method and device for an intelligent reflective surface based on deep reinforcement learning. Background technique [0002] At present, IRS is considered to be a very potential and promising technology. IRS is composed of a large number of passive reflection elements, which are connected to each other and controlled by an embedded IRS controller. It can be used to improve the energy and spectrum of wireless communication. efficiency. Signal strength at the receiver is enhanced through joint control of the complex reflection coefficients of all reflective elements, known as passive beamforming. The passive beamforming technology of the IRS and the transmission control technology of the transceiver can further improve the network performance. IRS has been applied in various scenarios, which play different roles in wireless communication, such as ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04B7/06G06N20/00
CPCH04B7/0617G06N20/00Y02D30/70H04B7/04013
Inventor 龚世民陈希雨林嘉烨谭源正
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products