Method of training a natural language search system, search system and corresponding use

A natural language and subsystem technology, applied in natural language data processing, patent retrieval, data processing applications, etc., can solve limited problems and achieve the effect of safe modification and less manual work

Pending Publication Date: 2021-07-30
IPRALLY TECH OY
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, they are relatively limited in, for example, patent novelty searches, because in practice their ability to assess novelty—that is, to find documents that disclose specific content that falls under the general concept defined in the patent claims— is limited

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method of training a natural language search system, search system and corresponding use
  • Method of training a natural language search system, search system and corresponding use
  • Method of training a natural language search system, search system and corresponding use

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] definition

[0038] A "natural language unit" in this context refers to a chunk of text, or after embedding, a vector representation of a chunk of text. The chunks may be single words or multi-word subconcepts that occur one or more times in the original text stored in computer readable form. The natural language unit may be represented as a set of character values ​​(commonly referred to in computer science as a "string"), or numerically as a multidimensional vector value, or a reference to such a value.

[0039] A "natural language chunk" refers to a data instance comprising a linguistically meaningful combination of natural language units, such as one or more complete or incomplete sentences of a language such as English. The natural language chunks may be represented, for example, as a single character string and stored in a file in a file system and / or displayed to a user via a user interface.

[0040] "Document" means a machine-readable entity containing natural...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and system for training a machine learning-based patent search or novelty evaluation system. The method comprises providing a plurality of patent documents each having a computer-identifiable claim block and specification block, the specification block including at least part of the description of the patent document. The method also comprises providing a machine learning model and training the machine learning model using a training data set comprising data from said patent documents for forming a trained machine learning model. According to the invention, the training comprises using pairs of claim blocks and specification blocks originating from the same patent document as training cases of said training data set.

Description

technical field [0001] The present invention relates to natural language processing. In particular, the present invention relates to machine learning-based, such as neural network-based, systems and methods for retrieving, comparing or analyzing documents containing natural language. Said documentation may be technical documentation or scientific documentation. In particular, said document may be a patent document. Background technique [0002] Written comparisons of technical concepts are required in many areas of business, industry, economics, and culture. A specific example is the examination of patent applications, where one purpose is to determine whether a technical concept defined in the claims of a patent application semantically covers another technical concept defined in another document. [0003] Currently, a growing number of retrieval tools exist that can be used to find individual documents, but the analysis and comparison of concepts exposed by documents is...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/205G06F40/279G06N20/00G06N3/08
CPCG06F40/205G06F40/279G06N3/08G06N5/01G06N7/01G06N3/044G06F16/322G06F16/355G06F40/154G06F40/211G06F40/284G06F40/289G06F40/30G06N20/00G06Q50/184G06F2216/11
Inventor S·阿维拉
Owner IPRALLY TECH OY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products