Word and sentence processing method and device and computer storage medium

A technology for computer storage and processing methods, applied in computing, electrical digital data processing, natural language data processing, etc., that achieves accurate representation and is suitable for data analysis.

Active Publication Date: 2020-05-12
ZHEJIANG DAHUA TECH CO LTD
Cites: 5; Cited by: 0

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a word and sentence processing method, device and computer storage medium to solve the prior-art problem of how to obtain the meaning of Chinese characters from the semantic information inside the characters.


Detailed Description of the Embodiments

[0017] The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0018] Refer to Figure 1, a schematic flowchart of the first embodiment of the word and sentence processing method of the present invention. The method in this embodiment includes the following steps.

[0019] S11. Obtain the to-be-processed stroke sequence of the to-be-processed word or sentence.

[0020] The stroke sequence to be processed is acquired, and the sentence to be proce...
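As a rough illustration of step S11, the sketch below flattens a piece of text into a stroke sequence while recording which character each stroke belongs to. The stroke table `STROKES`, the single-letter stroke codes and the function name `to_stroke_sequence` are assumptions made for this example; the source does not specify how strokes are encoded or looked up.

```python
from typing import Dict, List, Tuple

# Hypothetical, hand-written stroke table (not taken from the source).
# h = horizontal, s = vertical, p = left-falling, n = right-falling
STROKES: Dict[str, List[str]] = {
    "人": ["p", "n"],
    "大": ["h", "p", "n"],
    "木": ["h", "s", "p", "n"],
}


def to_stroke_sequence(text: str) -> Tuple[List[str], List[int]]:
    """Return the flat stroke sequence and, for each stroke,
    the index of the character it came from."""
    strokes: List[str] = []
    owners: List[int] = []
    for idx, ch in enumerate(text):
        for stroke in STROKES[ch]:  # raises KeyError for unknown characters
            strokes.append(stroke)
            owners.append(idx)
    return strokes, owners


strokes, owners = to_stroke_sequence("大木")
```

Keeping the `owners` list alongside the strokes is one simple way to map stroke-level results back to characters later; a real system would need a full stroke dictionary rather than this three-entry table.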

Abstract

The invention discloses a word and sentence processing method and device and a computer storage medium. The processing method comprises the steps of: obtaining a to-be-processed stroke sequence of a to-be-processed word or sentence, and inputting the to-be-processed stroke sequence into a trained language model; generating, by the language model, a representation vector of each current to-be-processed stroke, taking the forward and/or backward to-be-processed strokes of that stroke as context information; and determining a representation vector of each character according to the representation vectors of its to-be-processed strokes. In this way, the representation vector of a character is obtained from the representation vectors of its strokes, so that the representation of the character can be determined by using the semantic information inside the character.
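The data flow described in the abstract can be sketched roughly as follows. This is a toy stand-in, not the patented model: the trained language model is replaced by simple averaging, the embedding values in `EMB` are made up, "forward" is interpreted as the strokes preceding the current one and "backward" as the strokes following it, and mean pooling is assumed for turning stroke vectors into a character vector. All function names are hypothetical.

```python
from typing import Dict, List

# Toy 2-dimensional stroke embeddings (assumed values, not trained).
EMB: Dict[str, List[float]] = {
    "h": [1.0, 0.0],
    "s": [0.0, 1.0],
    "p": [1.0, 1.0],
    "n": [-1.0, 1.0],
}


def mean(vectors: List[List[float]], dim: int) -> List[float]:
    """Element-wise mean; a zero vector if the list is empty."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def stroke_vectors(strokes: List[str]) -> List[List[float]]:
    """Stand-in for the language model. Each stroke vector is the
    concatenation of: own embedding, mean of preceding strokes,
    mean of following strokes."""
    embs = [EMB[s] for s in strokes]
    dim = len(embs[0]) if embs else 0
    out = []
    for i, own in enumerate(embs):
        forward = mean(embs[:i], dim)        # context before stroke i
        backward = mean(embs[i + 1:], dim)   # context after stroke i
        out.append(own + forward + backward)  # list concatenation
    return out


def char_vectors(vecs: List[List[float]], owners: List[int]) -> List[List[float]]:
    """Mean-pool the stroke vectors belonging to each character (assumed pooling)."""
    n_chars = max(owners) + 1 if owners else 0
    dim = len(vecs[0]) if vecs else 0
    return [
        mean([v for v, o in zip(vecs, owners) if o == c], dim)
        for c in range(n_chars)
    ]


# Strokes of "大" followed by "木", with the owning character index per stroke.
strokes = ["h", "p", "n", "h", "s", "p", "n"]
owners = [0, 0, 0, 1, 1, 1, 1]
chars = char_vectors(stroke_vectors(strokes), owners)
```

Because the context spans the whole stroke sequence, a character's vector here depends on the strokes of neighbouring characters as well as its own, which mirrors the idea of using surrounding strokes as context information.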

Description

Technical field

[0001] The invention relates to the field of word and sentence processing, and in particular to a word and sentence processing method, device and computer storage medium.

Background

[0002] In recent years, research on language models has mainly trained the models on the semantics of characters and words, and this has achieved very good results in various natural language processing tasks. Many word vector models already exist, but many of them are based on Western languages such as English, Spanish and German, whose internal components are Latin letters. Chinese writing is completely different from Western languages: a Chinese word contains very few characters, but each character carries strong semantic information.

[0003] Therefore, how to effectively use the semantic information inside Chinese characters to obtain the represen...

Application Information

IPC(8): G06F40/284, G06F40/30
Inventor: 刘伟棠, 张浩, 戴泽林, 李保敏, 何林强
Owner ZHEJIANG DAHUA TECH CO LTD