An address matching method based on statistical word segmentation

An address matching and word segmentation technology applied in the fields of search engines, geographic information, and computing. It addresses the low retrieval speed and accuracy of existing matching algorithms, the low matching success rate for unregistered words, and the large volume of address data. The effects are automatic address matching, a higher accuracy rate, and improved processing efficiency.

Active Publication Date: 2019-01-04
ZHEJIANG INST OF SURVEYING & MAPPING SCI & TECH +1

AI Technical Summary

Problems solved by technology

[0013] In view of the above technical problems in the related art, the present invention proposes an address matching method based on statistical word segmentation. It addresses the large volume of address data and the low matching success rate for unregistered words in existing address matching technology, as well as the complexity of address matching rules, the limited retrieval speed and accuracy of existing matching algorithms, and the resulting low address matching efficiency.

Examples

Embodiment Construction

[0029] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention belong to the protection scope of the present invention.

[0030] As shown in Figures 1 to 3, an address matching method based on statistical word segmentation according to an embodiment of the present invention includes the following steps:

[0031] S1 Establish the administrative division background database based on the five levels of administrative division: province / city / county / block, township, or town / village or community;

[0032] S2 Use place name models including road, street and alley place names, district place ...
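The five-level hierarchy in step S1 could be held as a simple tree, one node per division. The sketch below is illustrative only: the `Division` class, the `LEVELS` labels, and the placeholder names are assumptions, not part of the patent text, and the patent's actual database schema is not given in this excerpt.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# Labels for the five levels in step S1. Level 4 stands for
# "block, township, or town" and level 5 for "village or community";
# this grouping is an assumption about the source text.
LEVELS = ["province", "city", "county", "township", "village"]


@dataclass
class Division:
    """One node of a hypothetical administrative-division tree."""
    name: str
    level: int  # index into LEVELS; the synthetic root uses -1
    children: Dict[str, "Division"] = field(default_factory=dict)

    def add(self, path: List[str]) -> None:
        """Insert a chain of names starting one level below this node."""
        if not path:
            return
        head, rest = path[0], path[1:]
        child = self.children.setdefault(head, Division(head, self.level + 1))
        child.add(rest)

    def resolve(self, path: List[str]) -> Optional["Division"]:
        """Follow a chain of names downward; return None if any is unknown."""
        node: Optional[Division] = self
        for name in path:
            node = node.children.get(name)
            if node is None:
                return None
        return node


# Placeholder names, not real divisions.
root = Division("root", -1)
root.add(["Province A", "City B", "County C", "Town D", "Village E"])
leaf = root.resolve(["Province A", "City B", "County C", "Town D", "Village E"])
```

Here `leaf.level` indexes the last entry of `LEVELS`, and resolving a path with an unknown name returns `None`, which a matcher could treat as a signal to fall back to the place name or address databases.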

Abstract

The invention discloses an address matching method based on statistical word segmentation, which comprises the following steps: S1, establishing an administrative division background database based on the five levels of administrative division: province / city / county / block, township, or town / village or community; S2, establishing a place name background database using place name models that include street names, district names, natural village names, building names, and other natural place names; S3, establishing an address background database using a standard address model; S4, constructing a geocoding index database from the administrative division, place name, and address background databases; S5, establishing the address matching algorithm using word segmentation technology and a search engine. The invention addresses the large volume of address data and the low matching success rate for unregistered words in existing address matching technology, as well as the complexity of address matching rules, the limited retrieval speed and accuracy of existing matching algorithms, and the low address matching efficiency.
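Steps S4 and S5 can be pictured as an inverted index over standard addresses plus a token-overlap lookup. The following is a minimal sketch under stated assumptions: the `GeoIndex` class, the bigram tokenizer, the scoring rule, and the sample addresses and coordinates are all hypothetical. In particular, character bigrams stand in for the statistical word segmenter, whose details are not given in this excerpt.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Set, Tuple

Coord = Tuple[float, float]


def bigrams(text: str) -> List[str]:
    """Stand-in tokenizer (assumption): overlapping character bigrams."""
    text = text.replace(" ", "")
    if len(text) < 2:
        return [text] if text else []
    return [text[i:i + 2] for i in range(len(text) - 1)]


class GeoIndex:
    """Hypothetical geocoding index: token -> ids of addresses containing it."""

    def __init__(self) -> None:
        self.postings: Dict[str, Set[int]] = defaultdict(set)
        self.records: List[Tuple[str, Coord]] = []

    def add(self, address: str, coord: Coord) -> None:
        rec_id = len(self.records)
        self.records.append((address, coord))
        for tok in set(bigrams(address)):
            self.postings[tok].add(rec_id)

    def match(self, query: str) -> Optional[Tuple[str, Coord]]:
        """Return the record sharing the most tokens with the query."""
        scores: Dict[int, int] = defaultdict(int)
        for tok in set(bigrams(query)):
            for rec_id in self.postings.get(tok, ()):
                scores[rec_id] += 1
        if not scores:
            return None
        # Ties go to the record that was indexed first.
        best = max(scores, key=lambda r: (scores[r], -r))
        return self.records[best]


# Made-up addresses and coordinates for illustration.
index = GeoIndex()
index.add("1 Example Road", (0.0, 0.0))
index.add("2 Sample Street", (1.0, 1.0))
hit = index.match("Example Road 1")
```

Because scoring counts shared tokens rather than requiring an exact string match, a reordered query such as `"Example Road 1"` still resolves to the indexed address. A production system would delegate this step to a search engine, as step S5 states.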

Description

Technical field

[0001] The invention relates to the technical fields of computers, search engines and geographic information, in particular to an address matching method based on statistical word segmentation.

Background technique

[0002] There are three commonly used Chinese word segmentation algorithms:

[0003] 1. Dictionary-based word segmentation algorithm

[0004] Also known as the string matching word segmentation algorithm. The algorithm compares the character string to be segmented against the words in an established, "sufficiently large" dictionary according to a certain strategy. If an entry is found, the match succeeds and the word is recognized. Common dictionary-based word segmentation algorithms are the forward maximum matching method, the reverse maximum matching method, and the two-way matching word segmentation method.

[0005] 2. Word segmentation algorithm based on grammar and rules

[0006] Its basic idea is to perfor...
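The forward and reverse maximum matching methods named in [0004] can be sketched as follows. The function names, the toy dictionary, and the sample string are assumptions chosen for illustration. The tie-breaking rule in `two_way_match` (fewer words, then fewer single-character words, otherwise the reverse result) is one common heuristic and is not taken from the source.

```python
from typing import List, Set, Tuple


def forward_max_match(text: str, vocab: Set[str], max_len: int) -> List[str]:
    """Scan left to right, always taking the longest dictionary word."""
    words: List[str] = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or piece in vocab:
                words.append(piece)
                i += size
                break
    return words


def reverse_max_match(text: str, vocab: Set[str], max_len: int) -> List[str]:
    """Scan right to left, always taking the longest dictionary word."""
    words: List[str] = []
    j = len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in vocab:
                words.append(piece)
                j -= size
                break
    words.reverse()
    return words


def two_way_match(text: str, vocab: Set[str], max_len: int) -> List[str]:
    """Run both directions and keep the result the heuristic prefers."""
    fwd = forward_max_match(text, vocab, max_len)
    rev = reverse_max_match(text, vocab, max_len)

    def cost(ws: List[str]) -> Tuple[int, int]:
        return (len(ws), sum(len(w) == 1 for w in ws))

    return fwd if cost(fwd) < cost(rev) else rev


# Toy dictionary (assumption); real systems use a far larger one.
vocab = {"研究", "研究生", "生命", "起源"}
fwd = forward_max_match("研究生命起源", vocab, 3)
rev = reverse_max_match("研究生命起源", vocab, 3)
both = two_way_match("研究生命起源", vocab, 3)
```

On this input the forward pass greedily takes `研究生` and strands `命` as a single character, while the reverse pass yields `研究 / 生命 / 起源`. The two-way method picks the reverse result because it contains fewer single-character words.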

Application Information

IPC(8): G06F16/9035, G06F16/29, G06F17/27
CPC: G06F40/284
Inventor 陈张建, 李晶云, 李爱勤, 王延朝, 祝士杰, 赵飞, 陆泽, 丁宜忠
Owner ZHEJIANG INST OF SURVEYING & MAPPING SCI & TECH