Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Image retrieval method, device, server and storage medium

A technology of pictures and picture groups, applied in the Internet field, can solve problems such as inability to recall pictures and inability to meet query requirements.

Active Publication Date: 2021-08-27
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, if the search term (query) of the query is "AB", that is, the corresponding query expression is "AANDB" (that is, A and B must be hit at the same time), and the source page of the same image has f1 and f2, and f1 only contains the word " A", f2 only contains the word "B", then the above query expression cannot meet its query requirements, and the image cannot be recalled

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image retrieval method, device, server and storage medium
  • Image retrieval method, device, server and storage medium
  • Image retrieval method, device, server and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0030] figure 1 It is a flow chart of the picture retrieval method provided by Embodiment 1 of the present invention. This embodiment is applicable to the situation of picture retrieval, especially the situation of using a long query or a query with multiple limited words for picture retrieval. The method can be searched by pictures device, which may be implemented in software and / or hardware, and may be configured in a server. Such as figure 1 As shown, the method specifically includes:

[0031] S110. Identify multiple picture groups with the same content from pictures on all web pages.

[0032] With the continuous development of network technology, pictures, an important form of information representation, inevitably appear on various web pages, and the scale of network data continues to expand at any time, and it often appears that multiple different web pages contain one or more content at the same time. The same picture, where the picture with the same content includes...

Embodiment 2

[0043] figure 2 It is a flow chart of the image retrieval method provided by Embodiment 2 of the present invention. This embodiment is further optimized on the basis of the foregoing embodiments. Such as figure 2 As shown, the method includes:

[0044] S210. Identify multiple picture groups with the same content from pictures on all web pages.

[0045] S220. Filter and deduplicate all source webpages of each picture in each picture group, and aggregate texts related to the pictures of the remaining source webpages to obtain a text description of each picture group.

[0046] The same image will appear on multiple different web pages. If the duplication is not removed, the indexed content will be redundant. At the same time, the surrounding text of some pictures from low-quality pages is not related to the picture, and if not processed, it will also affect the recall quality. Preferably, all source webpages are screened and deduplicated, the source webpage with the highest...

Embodiment 3

[0061] image 3 It is a flow chart of the image retrieval method provided by Embodiment 3 of the present invention. This embodiment is further optimized on the basis of the above embodiments. Such as image 3 As shown, the method includes:

[0062] S310. Identify multiple picture groups with the same content from pictures on all web pages.

[0063] S320. Filter and de-duplicate all source webpages of each picture in each picture group, aggregate the picture-related texts of the remaining source webpages, and obtain a text description of each picture group.

[0064] S330. Based on the text description of each picture group, establish an inverted index for each picture in each picture group, wherein, for each text description, the inverted index includes at least all The webpage from which the text description corresponds.

[0065] S340. Obtain the input search term.

[0066] S350. Recall at least one picture according to the correlation between the search term and the text...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the present invention discloses a picture retrieval method, device, server, and storage medium, wherein the method includes: identifying a plurality of picture groups with the same content from pictures on all web pages; The image-related texts of the source web pages are aggregated to obtain the text description of each image group; based on the text description of each image group, an inverted index is established for each image in each image group, and for each text description, the inverted index The index at least includes the source webpages corresponding to all the text descriptions in the picture group to which the text description belongs; image retrieval is performed according to the input search term and the inverted index. The embodiment of the present invention can realize the aggregation of relevant source webpages with pictures as the basic unit and use them as picture text description information for building an inverted index, reduce redundant information in picture indexes, and at the same time accurately recall cross-page hit results, and search for long search terms or more limited terms can also be accurately recalled.

Description

technical field [0001] The embodiments of the present invention relate to the technical field of the Internet, and in particular to a picture retrieval method, device, server and storage medium. Background technique [0002] With the development of network information technology, the data on the Internet is growing explosively, which makes there are more and more demands for quickly and accurately finding the picture information you need from Internet data. [0003] In the prior art, the text information describing the picture is usually obtained by parsing the webpage, obtaining the surrounding text of the picture, segmenting the text, and normalizing the text, and building an inverted index for the picture based on the text information. When the system searches for the picture it needs, the picture retrieval system realizes picture retrieval based on the inverted index based on the search term entered by the user. [0004] However, in the prior art, the page where the pic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/58G06F16/9535
CPCG06F16/58G06F16/51G06F16/535G06F16/953G06F16/215G06F16/538G06F16/951G06F16/319
Inventor 邹红建方高林刘海浪
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products