
Spelling error correction method and system of ES search engine

A search engine spelling error correction technology, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses problems such as low spelling error correction accuracy and inaccurate detection results, expanding the types of errors that can be corrected and improving accuracy.

Status: Inactive
Publication Date: 2016-12-07
广州智索信息科技有限公司
4 Cites, 31 Cited by

AI Technical Summary

Problems solved by technology

However, the above-mentioned prior art relies solely on statistics from the user's search log. If the log is incomplete, or itself contains misspelled words, the detection results will be inaccurate; that is, the accuracy of spelling error correction is low.



Examples


Embodiment Construction

[0022] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0023] Referring to Figure 1, a schematic flow chart of a spelling error correction method for an ES search engine provided by the present invention, the method comprises the following steps:

[0024] 101. Use the ansj tokenizer to divide the spelling content input by the user into several entries.

[0025] Wherein, the spelling content includes pinyin, Chinese characters or English.
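As a rough illustration of what "dividing the spelling content into entries" could look like for mixed pinyin, Chinese, and English input, here is a minimal Python sketch. It does not use the ansj tokenizer: the function `split_into_entries` and its regular expression are hypothetical stand-ins that only group runs of Han characters, Latin letters, and digits, whereas a real tokenizer would also segment Chinese text into words.

```python
import re
from typing import List

# Stand-in splitter, NOT the ansj tokenizer (assumption for illustration):
# it groups runs of Han characters, runs of ASCII letters, and runs of digits.
_ENTRY_PATTERN = re.compile(r"[\u4e00-\u9fff]+|[A-Za-z]+|\d+")


def split_into_entries(spelling_content: str) -> List[str]:
    """Divide user input into entries by script; whitespace and punctuation are dropped."""
    return _ENTRY_PATTERN.findall(spelling_content)


# Mixed pinyin, Chinese characters, and English in one query.
entries = split_into_entries("beijing 天气 weather")
```

Each resulting entry would then be passed individually to the error detection step.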

[0026] Specifically, first obtain the spelling content input by the user, and then call the t...



Abstract

The invention discloses a spelling error correction method and system for an ES search engine, and relates to the technical field of information. The method comprises the following steps: dividing the spelling content input by a user into a plurality of entries using the ansj tokenizer; carrying out error detection on each entry and, if an error entry exists, searching an error model library for the error models that match the error entry, and obtaining the correction candidate words corresponding to the error entry from the matched error models; calculating the score of each correction candidate word under each matched error model, and forming a score vector from those scores; processing the score vectors with an L2R model to generate scores for the error models, and determining a total score for each correction candidate word according to the scores of the error models and a language model; and determining the correction candidate word with the highest total score as the correct candidate word, and displaying it. The disclosed method and system can improve the accuracy of spelling error correction.
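The scoring steps in the abstract can be sketched roughly as follows. This is a toy Python illustration under several assumptions, not the patented implementation: the error model library, the per-model weights (standing in for the output of a trained L2R model, commonly short for learning to rank), the language model scores, and all names are hypothetical, and the weighted sum multiplied by the language model score is only one possible way to combine them, since the abstract does not give the formula. Error detection itself is omitted.

```python
from typing import Dict, List, Optional

# Hypothetical error model library (illustrative data only):
# model name -> {error entry -> {correction candidate -> score under that model}}.
ERROR_MODEL_LIBRARY: Dict[str, Dict[str, Dict[str, float]]] = {
    "pinyin": {"tianqi": {"天气": 0.9, "田七": 0.4}},
    "keyboard": {"tianqi": {"天气": 0.6}},
}

# Assumed per-model scores, standing in for what a trained L2R model would produce.
MODEL_WEIGHTS: Dict[str, float] = {"pinyin": 0.7, "keyboard": 0.3}

# Assumed language model scores for each candidate.
LANGUAGE_MODEL: Dict[str, float] = {"天气": 0.8, "田七": 0.2}


def matched_models(entry: str) -> List[str]:
    """Names of the error models in the library that match the error entry."""
    return sorted(name for name, table in ERROR_MODEL_LIBRARY.items() if entry in table)


def score_vectors(entry: str) -> Dict[str, List[float]]:
    """Score vector per candidate: one score for each matched model (0.0 if absent)."""
    models = matched_models(entry)
    candidates = {c for m in models for c in ERROR_MODEL_LIBRARY[m][entry]}
    return {
        cand: [ERROR_MODEL_LIBRARY[m][entry].get(cand, 0.0) for m in models]
        for cand in candidates
    }


def total_score(entry: str, candidate: str, vector: List[float]) -> float:
    """Assumed combination: weighted sum of model scores times the language model score."""
    weighted = sum(MODEL_WEIGHTS[m] * s for m, s in zip(matched_models(entry), vector))
    return weighted * LANGUAGE_MODEL.get(candidate, 0.0)


def best_correction(entry: str) -> Optional[str]:
    """Candidate with the highest total score, or None if no error model matches."""
    vectors = score_vectors(entry)
    if not vectors:
        return None
    return max(vectors, key=lambda cand: total_score(entry, cand, vectors[cand]))


suggestion = best_correction("tianqi")
```

Keeping the per-model scores as a vector until the final step is what lets a learned ranking model decide how much each error model should count, rather than hard-coding a single similarity measure.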

Description

Technical field

[0001] The invention relates to the field of information technology, in particular to a spelling error correction method and system for an ES search engine.

Background technique

[0002] Elasticsearch (ES for short) is a Lucene-based search server. It provides a distributed, multi-user-capable full-text search engine, is developed in Java, and is released as open source under the Apache license. It is currently a popular enterprise-level search engine. Spelling error correction, also called spell check, is a function widely used by search engines; it returns a corrected query in response to erroneous query content input by the user.

[0003] The spelling error correction method in the prior art is usually as follows: based on statistics from the user's search log, count the number of times the user clicks through to the search results of the query word after the correction word is provided, the number of times the user clicks the error co...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
CPC: G06F16/3331, G06F40/289
Inventor: 刘桂良赖旦冉杨国辉宣明
Owner: 广州智索信息科技有限公司