Layered task planning method and system for spatial fine operation

A mission planning and space technology, applied in the space field, can solve problems such as poor algorithm convergence, large number of samples, and large gradient estimation variance, and achieve wide engineering applicability, save space-borne computing resources, and meet actual engineering needs. Effect

Pending Publication Date: 2022-07-22
BEIJING INST OF CONTROL ENG
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In practice, it is difficult to perform the above multiple manipulation tasks with a single policy network optimized by reinforcement learning
In addition, for multi-task / multi-MDP reinforcement learning, a large number of samples needs to be collected, and the parameter dimension of the policy network needs to be increased accordingly, resulting in problems such as large variance of gradient estimation in the optimization process and poor algorithm convergence.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Layered task planning method and system for spatial fine operation
  • Layered task planning method and system for spatial fine operation

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0038] As a further limiting scheme of the present invention, step 3 compares the optimized high-level policy π h and the low-level policy π l The specific implementation of model evaluation is as follows:

[0039] Step 3.1, delineate the square area with the operation object as the center. The size of the square area is determined by the size of the operation object, and it should be ensured that when the camera is arranged at the edge of the area and the camera line of sight is perpendicular to the square area, the camera field of view can be photographed. Operate at least 30% of the object;

[0040] Step 3.2, Evaluate the high-level policy π h , randomly arrange the camera positions in the square area delineated in step 3.1, and run the high-level policy π h , check whether the target signal g is consistent with the position of the operation object in the camera coordinate system;

[0041] Step 3.3, Evaluate the low-level policy π l , with the target signal g given in ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a hierarchical task planning method and system for spatial fine operation, and belongs to the technical field of space. In order to solve the problems of large gradient estimation variance and poor algorithm convergence when a strategy network learns multiple tasks, high-level strategy planning based on a dynamic estimator and Monte Carlo tree search is constructed, and the method specifically comprises the following three steps of offline data collection, asynchronous high / low-level strategy optimization and model evaluation. According to the method, the algorithm convergence is improved, satellite-borne computing resources of on-orbit service space operation can be saved, and actual engineering requirements are met.

Description

technical field [0001] The invention belongs to the field of space technology, and in particular relates to a hierarchical task planning method and system for fine operation in space. Background technique [0002] Spatial fine-tuning requires the ability to learn long, sequential multitasking. For example, in order to realize the on-orbit refueling mission of non-cooperative targets, the service spacecraft needs to first capture the target spacecraft, reposition, cut the wrapping film, trim the wire, open the cover, etc. before performing the refueling operation. In practice, it is difficult to perform the above multiple operational tasks with a single policy network optimized by reinforcement learning. In addition, for multi-task / multi-MDP reinforcement learning, the number of samples to be collected is large, and the parameter dimension of the policy network needs to be increased accordingly, resulting in problems such as large gradient estimation variance in the optimiza...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q10/06G06N3/04G06N3/08
CPCG06Q10/06313G06Q10/06312G06N3/08G06N3/044G06N3/045
Inventor 解永春李林峰王勇陈奥梁红义
Owner BEIJING INST OF CONTROL ENG
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products