Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Similarity search method of enterprise names

A technology of enterprise name and similarity, applied in the field of similarity retrieval, can solve the problems of large approximate difference of names, inaccurate sorting of retrieval results, inability to meet business needs, etc., and achieve the effect of improving retrieval efficiency.

Inactive Publication Date: 2017-01-25
GREAT WALL COMP SOFTWARE & SYST CO LTD
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Secondly, general-purpose full-text search engines can also support word-by-word search, but the performance advantage is lost. For example, for a 15-character name string, the conclusion obtained by searching according to the full arrangement of words is: only by " "Full-text search" can't fully realize the business requirements for approximate retrieval of enterprise names. The search time limit will exceed 30 seconds, and the sorting of the retrieval results is not accurate, which is quite different from the approximate name that people usually feel, which is far from meeting the business requirements.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Similarity search method of enterprise names
  • Similarity search method of enterprise names
  • Similarity search method of enterprise names

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts shall fall within the protection scope of the present invention.

[0039] figure 1 A schematic flowchart of an enterprise name similarity retrieval method 100 provided by an embodiment of the present invention is given. Such as figure 1 The shown similarity retrieval method 100 for business names includes:

[0040] 110. Perform decomposing processing on the input search key to obtain the processed search key, wherein the search key is the name of the enterprise to be searched.

[0041] 120. Determine a search phrase ac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a similarity search method of enterprise names. The similarity search method comprises steps as follows: input search keywords are decomposed, and the processed search keywords are obtained, wherein the search keywords are to-be-researched enterprise names; search phrases are determined according to the processed search keywords; similarity search is performed on the determined search phrases and search results are obtained; the enterprise names ranking in the top N of the search results are displayed to be checked by users, and N is an integer larger than 1. With the adoption of the method, the search efficiency is greatly improved, the users can check similarity research results, and the requirement of similarity search business is met.

Description

technical field [0001] The invention relates to the technical field of similarity retrieval, in particular to a method for similarity retrieval of enterprise names. Background technique [0002] The precise query speed of the database is quite fast, but the fuzzy query speed will decrease rapidly after the amount of data exceeds one million, especially for the "contains" relationship, usually exceeding 10 seconds. For example: if you want to hit "Great Wall Computer Software" by entering the keyword "computer", the speed will be very slow. What's more, according to the rules for judging the similarity of names, it is necessary to use circular nested fuzzy queries, and the performance is completely unacceptable. [0003] Usually when encountering this kind of problem, "full-text search technology" will be used, such as: Baidu, Sogou, etc., to achieve fast search results in massive data based on a small number of keywords. However, the inventor found in the process of implem...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/3338G06F16/316G06F16/338
Inventor 仲晓琦刘丰刘镇华
Owner GREAT WALL COMP SOFTWARE & SYST CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products