A housing listing deduplication method based on housing listing information similarity and image recognition

A technology of image recognition and similarity, applied in still image data indexing, still image data retrieval, etc., can solve problems such as difficulty for home buyers to identify which information, poor user experience for home buyers, and duplicate listings on the official website

Active Publication Date: 2021-07-06
诸葛启航(苏州)科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Brokerage companies often forget to remove old listings due to changes in the listing information released, such as price adjustments, resulting in duplicate listings on the official website; platform websites even release a large number of duplicate listings in order to obtain traffic
[0003] Existing platforms have a large number of duplicate housing listings, resulting in poor user experience for home buyers, and some duplicate housing information is inconsistent, making it difficult for home buyers to identify which information is reliable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] The present invention will be further described below in conjunction with specific embodiments, and the advantages and characteristics of the present invention will become clearer along with the description. However, these embodiments are only exemplary and do not constitute any limitation to the scope of the present invention. Those skilled in the art should understand that the details and forms of the technical solutions of the present invention can be modified or replaced without departing from the spirit and scope of the present invention, but these modifications and replacements all fall within the protection scope of the present invention.

[0014] The present invention relates to a house source deduplication method based on house source information similarity and picture identification, comprising the following steps:

[0015] Step (1), equal value deduplication of key fields: judge whether the values ​​of the same fields of two housing sources are equal, if the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to a house source deduplication method based on house source information similarity and picture recognition, including the following steps: Step (1), key field equivalence deduplication: judging whether the same field values ​​of two house sources are equal, if The information of the house source is equal, and it is judged as a house source, and the new house source is not included in the warehouse; step (2), according to the picture link, download the house source picture from the source website, and calculate the phash value, and match the same phash value Find out the real estate ID and so on. The advantage of the present invention is: using the fast retrieval of the elasticsearch module and the phash value of the pictures, duplicate pictures can be quickly found from a large number of pictures, thereby screening out suspected duplicate listings, combined with the key attributes of the listings, to achieve accurate deduplication, even if the agent Tampering with information can also be identified.

Description

technical field [0001] The invention relates to a house source deduplication method based on house source information similarity and picture identification. Background technique [0002] There are a large number of false and duplicate listings in existing brokerage companies and real estate platforms. Brokerage companies often forget to remove old listings due to changes in their listing information, such as price adjustments, resulting in duplicate listings on the official website; platform websites even release a large number of duplicate listings in order to obtain traffic. [0003] A large number of duplicate listings on the existing platforms lead to poor user experience for homebuyers, and some duplicate listings have inconsistent information, making it difficult for homebuyers to identify which information is reliable. Contents of the invention [0004] In order to overcome the defects of the prior art, the present invention provides a house source deduplication me...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/51
Inventor 张文战杨丽娟白峻峰刘子耀张凯
Owner 诸葛启航(苏州)科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products