Method and device for text matching

A text matching method and device, applied in the field of data processing, that address the problem of inaccurate matching results.

Inactive Publication Date: 2017-06-09
ALIBABA GRP HLDG LTD

AI Technical Summary

Problems solved by technology

[0006] The embodiments of the present application provide a text matching method and device to at least solve the technical problem that existing text matching methods produce inaccurate matching results.



Examples


Embodiment 1

[0021] According to the embodiments of the present application, an embodiment of a text matching method is provided. It should be noted that the steps shown in the flowcharts of the accompanying drawings can be executed in a computer system, for example as a set of computer-executable instructions, and that, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in a different order.

[0022] Optionally, in this embodiment, the above text matching method can be applied in the hardware environment shown in Figure 1, composed of the terminal 10 and the server 30, in which the terminal can establish a connection with the server through a network. The network includes but is not limited to a wide area network, a metropolitan area network, or a local area network. Preferably, the network is a local area network.

[0023] According to the embodiment of the present applicatio...

Embodiment 2

[0103] According to the embodiments of the present application, a text matching device is also provided. As shown in Figure 6, the device may include an acquisition unit 20, an extraction unit 40 and a matching unit 60.

[0104] The acquisition unit is used to obtain at least two pieces of word attribute information for each text to be processed among multiple texts to be processed, where the multiple texts to be processed include at least the text to be matched and multiple pre-stored texts in a text library, and each piece of word attribute information records the index relationship between a word contained in a text to be processed and that text.

[0105] The extraction unit is configured to extract, from the word attribute information of the multiple pre-stored texts, the word attribute information that corresponds to the word attribute information of the text to be matched.

[0106] The matching unit is configured to determine the match...

Embodiment 3

[0119] Embodiments of the present application may provide a computer terminal, and the computer terminal may be any computer terminal device in a group of computer terminals. Optionally, in this embodiment, the foregoing computer terminal may also be replaced with a terminal device such as a mobile terminal.

[0120] Optionally, in this embodiment, the foregoing computer terminal may be located in at least one network device among multiple network devices of the computer network.

[0121] Optionally, Figure 7 is a structural block diagram of a computer terminal according to an embodiment of the present application. As shown in Figure 7, the server or terminal includes one or more processors 201 (only one is shown in the figure), a memory 203, and a transmission device 205 (such as the sending device in the above embodiment). As shown in Figure 7, the terminal may also include an input and output device 207.

[0122] Among them, the memory 203 can be u...


Abstract

The application discloses a method and device for text matching. The method comprises the steps that at least two pieces of word attribute information of each to-be-processed text among multiple to-be-processed texts are acquired, wherein multiple to-be-processed texts comprise at least a to-be-matched text and multiple pre-stored texts in a text library, and each piece of word attribute information is used to record an index relation between a word contained by the to-be-processed text and the to-be-processed text; word attribute information corresponding to the word attribute information of the to-be-matched text is extracted from the word attribute information of the multiple pre-stored texts; and based on the index relations recorded in the extracted word attribute information, a matched text matched with the to-be-matched text in the multiple pre-stored texts is determined, wherein the word attribute information of the to-be-matched text and the matched text could be matched entirely or partially. According to the invention, the technical problem that matching results of text matching methods are not accurate can be solved.
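The steps in the abstract can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions, not the applicant's implementation: each piece of word attribute information is modeled as a `(word, text_id)` pair, words are obtained by whitespace splitting, and a pre-stored text counts as matched when it shares at least a `min_overlap` fraction of the query's words. The function names, the tokenization, and the threshold rule are all hypothetical.

```python
from collections import defaultdict


def word_attributes(text_id, text):
    """Model word attribute information as (word, text_id) pairs (an assumption)."""
    return {(word, text_id) for word in text.lower().split()}


def build_index(library):
    """Map each word to the ids of the pre-stored texts that contain it."""
    index = defaultdict(set)
    for text_id, text in library.items():
        for word, tid in word_attributes(text_id, text):
            index[word].add(tid)
    return index


def match(query, library, min_overlap=0.5):
    """Return ids of pre-stored texts sharing enough words with the query.

    min_overlap is a hypothetical threshold: 1.0 demands that every query
    word is found (entire match), lower values allow partial matches.
    """
    index = build_index(library)
    query_words = {word for word, _ in word_attributes("query", query)}
    # Extraction step: only index entries for words in the query are read.
    hits = defaultdict(int)
    for word in query_words:
        for text_id in index.get(word, ()):
            hits[text_id] += 1
    return sorted(
        text_id for text_id, count in hits.items()
        if count / len(query_words) >= min_overlap
    )


library = {
    "a": "cheap red running shoes",
    "b": "red wine glasses",
    "c": "garden hose",
}
print(match("red running shoes", library))
print(match("red running shoes", library, min_overlap=0.3))
```

Because several words of the query contribute to the decision, a single shared word is not enough to declare a match unless the threshold is deliberately set that low.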

Description

Technical field

[0001] The present application relates to the field of data processing, and in particular to a text matching method and device.

Background

[0002] In the prior art, web page deduplication and text information matching can be performed with a hash algorithm. Among existing hash algorithms, locality-sensitive hashing is used to match text information.

[0003] Specifically, locality-sensitive hashing (LSH) is a hashing scheme that places similar states, or adjacent points in a high-dimensional space, into the same bucket, and it is generally used for processing similar texts. MinHash, a locality-sensitive hashing technique, uses the hash value of one word in a text to represent the state of that text. Matching two texts then means matching their states: the two texts are compared through the hash values of their representative words, and if those two hash values are identical, the texts are considered to match. Using this m...
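The single-hash matching described in [0003] can be sketched in Python as follows. This is a simplified illustration, not code from the application: the choice of MD5 as the word hash, the whitespace tokenization, and the function names are assumptions, and practical MinHash implementations normally combine many hash functions rather than one.

```python
import hashlib


def word_hash(word):
    """Hash a word to a 64-bit integer (MD5 is an arbitrary, assumed choice)."""
    digest = hashlib.md5(word.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def min_hash(text):
    """Represent a text by the smallest hash value among its words."""
    return min(word_hash(word) for word in text.lower().split())


def single_minhash_match(text_a, text_b):
    """Treat two texts as matching when their representative hashes are equal."""
    return min_hash(text_a) == min_hash(text_b)


# Word order does not matter: both texts have the same set of words.
print(single_minhash_match("red apple pie", "pie apple red"))
```

Since the whole text is reduced to one word's hash, two otherwise different texts match whenever the word with the smallest hash happens to appear in both, which is one way a single-hash comparison can give inaccurate results.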


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
CPC: G06F16/3334, G06F16/3344
Inventor: 祝啸风, 阙育飞
Owner: ALIBABA GRP HLDG LTD