Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for mining mixed templates of Chinese sentences

A technology of mixing templates and templates, applied in natural language data processing, instruments, computing and other directions, can solve problems such as poor template matching ability, and achieve the effect of strong template matching ability, enhanced expression ability, and reduced number of

Active Publication Date: 2021-09-21
BEIJING UNISOUND INFORMATION TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The invention provides a method and device for mining mixed templates of Chinese sentences, which are used to solve the defect of poor matching ability of existing sentence templates

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for mining mixed templates of Chinese sentences
  • Method and device for mining mixed templates of Chinese sentences
  • Method and device for mining mixed templates of Chinese sentences

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The preferred embodiments of the present invention will be described below in conjunction with the accompanying drawings. It should be understood that the preferred embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0045] For the mining method of a Chinese sentence mixed template provided in the embodiment of the present invention, see figure 1 As shown, the method includes steps 101-105:

[0046] Step 101: Obtain preset text, which includes positive example text and negative example text.

[0047] In the embodiment of the present invention, the positive example text and the negative example text are selected in advance, each text contains multiple lines, and one line corresponds to one sentence. For example, when a question template needs to be selected, the positive example text may contain multiple question sentences (such as interrogative sentences, rhetorical questions, et...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a method and device for mining mixed templates of Chinese sentences, wherein the method includes: obtaining preset texts, the preset texts include positive example texts and negative example texts; each sentence in the preset text Perform analysis and processing separately to determine the word parameters of each word in the sentence; generate sentence candidate templates based on the word parameters of all words; combine all candidate templates of all sentences to generate a template list that does not contain repeated candidate templates, and generate positive templates set and negative example template set; select the target candidate template from the template list, and determine the template type of the target candidate template according to the number of positive examples and the number of negative examples of the target candidate template. The sentence template generated by this method is a mixed expression of words, parts of speech, named entities and syntactic dependencies, which can more fully describe the language rules existing in a sentence, and the template matching ability is strong.

Description

technical field [0001] The invention relates to the technical field of sentence template mining, in particular to a method and device for mining mixed templates of Chinese sentences. Background technique [0002] The constructed Chinese sentence templates can be used for sentence matching, classification, information extraction and other tasks. The current template mining method is to rely on human editing or machine automatic statistics to mine templates similar to regular expressions or simple part-of-speech sequences. [0003] The templates mined by the existing sentence template mining methods have limited expressive ability. The sentence templates either use string templates or part-of-speech templates. The matching ability of these sentence templates is relatively limited. Contents of the invention [0004] The invention provides a method and device for mining mixed templates of Chinese sentences, which are used to solve the defect of poor matching ability of existi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/216G06F40/211G06F40/284G06F40/295G06F40/186
CPCG06F40/186G06F40/211G06F40/284G06F40/295
Inventor 任禾
Owner BEIJING UNISOUND INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products