Chinese word segmentation method and device

A Chinese word segmentation and text technology, applied in the field of search engines, can solve the problem of low accuracy of Chinese word segmentation, and achieve the effect of solving the low accuracy and improving the accuracy of word segmentation

Inactive Publication Date: 2018-11-06
DATAGRAND TECH INC
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The main purpose of this application is to provide a Chinese word segmentation method and device to solve the problem of low accuracy of Chinese word segmentation in the related art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese word segmentation method and device
  • Chinese word segmentation method and device
  • Chinese word segmentation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiment of the application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiment of the application. Obviously, the described embodiment is only It is an embodiment of a part of the application, but not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the scope of protection of this application.

[0031] It should be noted that the terms "first" and "second" in the description and claims of the present application and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific sequence or sequence. It should be understood that the data so used may be interchanged under appropriate circumstances for...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese word segmentation method and device. The method comprises the steps of receiving first target text information sent by a user; carrying out data mapping on the firsttarget text information through a first classifier to obtain corresponding first target category information; and performing preset inquiry operation according to the first target category informationand returning an inquiry result to the user. In a mode that the first target text information sent by the user is subjected to the data mapping through the first classifier, the corresponding first target category information is obtained, so that the purpose of performing preset inquiry operation according to the first target category information is achieved, the technical effect of improving theword segmentation accuracy is achieved, and the problem of low accuracy of Chinese word segmentation in related technologies is solved.

Description

technical field [0001] This application relates to the field of search engines, in particular, to a Chinese word segmentation method and device. Background technique [0002] Search engines are based on a structure called an inverted index. The inverted index is a structure of <key, value>, and the key value in this structure directly affects the accuracy, recall rate, and speed of the entire search engine. Let's take a look at what happens if we don't use Chinese word segmentation. [0003] Assuming that Chinese word segmentation is not used, a single Chinese character index can be used. For example, for Daguan, the word 'Da' is first indexed, and then the word 'Guan' is indexed. Similarly, for an article, first index all Chinese characters separately and record their positions. In the search process, first find all the documents of the word 'Da', then find all the documents of the word 'Guan', and then do the cross 'AND' operation, that is, only the documents con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F17/30
CPCG06F40/284
Inventor 王江高翔纪达麒陈运文
Owner DATAGRAND TECH INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products