Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Internet of Things information freshness optimization method based on dual-network deep reinforcement learning

A technology of reinforcement learning and optimization method, applied in the field of deep reinforcement learning, can solve the problem of dimension disaster, reinforcement learning algorithm cannot be directly applied to the average cost problem, etc., to achieve the effect of maximizing service life

Pending Publication Date: 2021-09-24
SUN YAT SEN UNIV
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, most of the common reinforcement learning optimization algorithms are discounted reinforcement learning algorithms, which cannot be directly applied to the optimization of the average cost problem, while the traditional reinforcement learning algorithm suitable for the average cost problem faces the curse of dimensionality problem, there are huge limitations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Internet of Things information freshness optimization method based on dual-network deep reinforcement learning
  • Internet of Things information freshness optimization method based on dual-network deep reinforcement learning
  • Internet of Things information freshness optimization method based on dual-network deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments. For the step numbers in the following embodiments, it is only set for the convenience of illustration and description, and the order between the steps is not limited in any way. The execution order of each step in the embodiments can be adapted according to the understanding of those skilled in the art sexual adjustment.

[0040] like figure 2 As shown, the present invention is applicable to the transmission optimization scenario of the point-to-point communication system model. The characteristics of the communication network scenario in this embodiment include: this is a time-slot system model; The function of information sampling transmission; the wireless channel is not a perfect channel, and the transmission may fail; for the sensor, the channel state information and the source state information are unknown, which means that the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an Internet of Things information freshness optimization method based on dual-network deep reinforcement learning. The method comprises the steps: enabling a sensor to operate a specified experience number, selecting actions according to a strategy in each time step of each period of experience, executing the actions, observing rewards and states, and storing the rewards and states in an experience playback pool; then taking out a batch of experiences from the experience playback pool by the model, calculating a loss function by using a target value calculation formula in an average cost form, updating the current network parameters, and updating the target network parameters every certain time step number; and cyclically executing the steps until the end of the period of experience. According to the method, the sensor can be helped to make an optimal decision in each time step, so that the weighted sum of the average information change age and the energy consumption is minimized. The purpose of maximizing the service life of equipment while the average information change age of the system is minimized is achieved. The Internet of Things information freshness optimization method based on dual-network deep reinforcement learning can be applied to the field of deep reinforcement learning.

Description

technical field [0001] The invention relates to the field of deep reinforcement learning, in particular to a method for optimizing the freshness of Internet of Things information based on dual-network deep reinforcement learning. Background technique [0002] With the deepening of the application scenarios of the Internet of Things system and the implementation of facilities such as the Internet of Vehicles, the real-time performance of the Internet of Things system is becoming more and more important. On the other hand, most of the existing real-time indicators only consider the change of information in the time dimension, and lack the consideration of the change of information in the content dimension, while the age of information change measures the change of information in both time and content dimensions. Variety. Since the age of information change is a random process that changes over time, its average value is often used as a performance indicator. Similar to avera...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): H04L29/08H04L12/24G06F30/27G06N3/08G06F17/18
CPCH04L67/12H04L67/104H04L41/145G06F30/27G06N3/08G06F17/18Y02D30/70
Inventor 王玺钧林文锐陈翔孙兴华詹文
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products