Containerization test method and system for reinforcement learning model

A technology that reinforces learning and testing methods, applied in software testing/debugging, instrumentation, error detection/correction, etc., can solve problems such as dependencies, poor isolation of the testing process, and high testing environment requirements, to ensure fairness, achieve visualization, The effect of ensuring security and privacy

Pending Publication Date: 2021-10-22
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The traditional reinforcement learning model testing needs to build a test environment in the test server and connect the test objects to the test server. The problem with this approach is: First, from the requirements of the test environment, the testing process of the reinforcement learning model relies heavily on The test environment and its configuration files generally have high requirements for the test environment; secondly, in terms of the isolation of the test, the traditional test tightly binds the test object to the test software and hardware environment, and the isolation of the test process Poor, once the test environment needs to be replaced during the test process, it will cause repeated workload to build the test environment
[0004] At present, in the testing of traditional reinforcement learning models, the tested reinforcement learning model is heavily dependent on the test environment and the test fairness caused by software and hardware environment dependencies.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Containerization test method and system for reinforcement learning model
  • Containerization test method and system for reinforcement learning model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0083] The present invention will be described in further detail below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0084] In this embodiment, the unmanned vehicle decision-making control algorithm in automatic driving is used as a test agent (reinforcement learning model). The testing party refers to the subject that tests and evaluates the decision-making control algorithm of the unmanned vehicle, and provides the Docker image file including the test environment and the test agent template (decision-making control algorithm of the unmanned vehicle) for the test of the decision-making control algorithm of the unmanned vehicle. Unmanned vehicle decision-making control algorithm and test results; the test platform is a management platform for unmanned vehicle decision-making control algorithm testing. The division of the party is used to r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a containerized testing method and system for a reinforcement learning model. The method comprises the following steps that 1) a test party makes a test environment Docker mirror image, a proxy Docker mirror image and a corresponding connection module, a callback module and an evaluation module according to a to-be-established test task, and then packages the connection module into a proxy Docker mirror image file template; 2) the test party creates a test task on the test platform and uploads a mirror image file to a test party server; 3) the tested party downloads a mirror image file training agent of the test task, integrates the trained agent to an agent Docker mirror image and uploads the Docker mirror image to a server of the test party; 4) the test party server adds or replaces the callback module and the evaluation module in the newly uploaded proxy Docker mirror image file, repackages the newly uploaded proxy Docker mirror image file to obtain a new proxy Docker mirror image, and starts to run the test task; and 5) the test party server transmits the test process data back to the test platform.

Description

technical field [0001] The invention belongs to the technical field of computer software, and in particular relates to a containerized testing method and system for reinforcement learning models. Background technique [0002] Reinforcement learning is a learning method that is closer to life in reality. Unlike "deep learning" technology, it does not use pre-labeled data, but guides behavior through the rewards obtained by the agent interacting with the environment. The goal is to make the agent Get as many rewards from the environment as possible to learn the optimal policy. The testing of reinforcement learning needs to rely on the environment of reinforcement learning. In principle, the difference between reinforcement learning and deep learning is that the former needs to interact with the test environment online in real time, generate corresponding behaviors based on environmental feedback, and then make relevant judgments and evaluations; while the latter is not depend...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F11/36G06F9/455
CPCG06F11/3684G06F11/3688G06F9/45558G06F2009/45562Y02D10/00
Inventor 董乾薛云志孟令中杨光师源王鹏淇武斌
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products