Commodity labeling method suitable for electronic commerce Chinese website

An e-commerce commodity labeling technology, applied in the fields of website content management, network data retrieval, and network data indexing. It addresses problems such as wrongful demotion of products, traffic loss, and incomplete product information, so as to improve recall and precision, ensure accurate word segmentation, and enrich product information.

Active Publication Date: 2016-02-10
FOCUS TECH
4 Cites, 14 Cited by

AI Technical Summary

Problems solved by technology

[0002] On Chinese e-commerce websites, when users search for products by keyword, the search usually runs directly against the products' basic information. Two types of problems nevertheless remain. The first is product information cheating: to increase the exposure and frequency with which their products appear in search results, merchants make their listings eye-catching so that buyers find more of their products, and they abuse brand names or keywords unrelated to the product when describing it, so that buyers cannot accurately find the product they need. The second is incomplete product information: merchants omit key information when describing products, such as the product title, pictures, or description. With this information missing, the website cannot return more relevant search results when users search for products.
[0003] To counter merchant cheating on product information, e-commerce websites usually set rules and demote (lower the ranking weight of) products that violate them. Rules have inherent defects, however: strict rules may demote products that are not cheating, while loose rules may leave the anti-cheating effect too weak. To cope with incomplete information from merchants and to recall as many related products as possible, e-commerce websites choose to widen the retrieval scope over product information at the expense of retrieval quality. That is, they match against multiple product information fields, sometimes even fields such as "product description" that are large in volume but poor in quality. Although this recalls more products, the recalled products do not satisfy users, resulting in significant traffic loss.



Examples


Embodiment Construction

[0036] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0037] The present invention specifically includes a method for building a word segmentation lexicon, a method for collecting labels, and a method for labeling commodities. The lexicon-building method supports word segmentation of commodity names on Chinese e-commerce websites; the label collection method uses commodity names to find the corresponding labels for all commodities on the website; and the commodity labeling method finds, for every commodity on the website, the labels related to it. A commodity name is a short text description of the commodity written by a merchant user of the Chinese e-commerce website.

[003...


Abstract

A commodity labeling method suitable for Chinese e-commerce websites includes three steps: building a word segmentation lexicon, collecting labels, and marking commodities with the labels. In the lexicon-building step, the frequency of every commodity keyword on the website is counted across different commodity descriptions; keywords with a frequency greater than three are retained, and from these, keywords of at most five Chinese characters are selected as lexicon data. In the label collection step, all commodity names on the website are segmented against the built lexicon with a reverse maximum matching word segmentation algorithm; the last word produced by segmenting each commodity name is taken as that commodity's label, and all labels together form a label data set. In the labeling step, relations between commodity attributes and labels are found using a text mining algorithm.
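
The first two steps of the abstract can be sketched in Python. This is a minimal illustration under stated assumptions, not the patent's implementation: the function names (`build_lexicon`, `reverse_max_match`, `collect_label`) are hypothetical, "frequency" is assumed to mean the number of different descriptions a keyword appears in, and any character that matches no lexicon entry is assumed to be emitted as a single-character word. The third step is left out because the abstract does not say which text mining algorithm is used.

```python
from collections import Counter


def build_lexicon(keywords_per_description, min_freq=3, max_chars=5):
    """Keep keywords that occur in more than `min_freq` descriptions
    and are at most `max_chars` characters long."""
    counts = Counter()
    for keywords in keywords_per_description:
        # Assumption: a keyword counts once per description.
        counts.update(set(keywords))
    return {
        word
        for word, freq in counts.items()
        if freq > min_freq and len(word) <= max_chars
    }


def reverse_max_match(text, lexicon, max_chars=5):
    """Segment `text` by scanning from its end and taking, at each step,
    the longest suffix (up to `max_chars`) found in `lexicon`."""
    words = []
    end = len(text)
    while end > 0:
        for size in range(min(max_chars, end), 0, -1):
            candidate = text[end - size:end]
            # Assumption: an unmatched single character becomes its own word.
            if size == 1 or candidate in lexicon:
                words.append(candidate)
                end -= size
                break
    words.reverse()  # words were collected right to left
    return words


def collect_label(commodity_name, lexicon):
    """Return the last segmented word of a commodity name as its label,
    or None for an empty name."""
    words = reverse_max_match(commodity_name, lexicon)
    return words[-1] if words else None
```

For example, with a lexicon containing 手机壳 and 苹果, segmenting the commodity name 苹果透明手机壳 gives 苹果 / 透 / 明 / 手机壳, so the collected label is 手机壳. Taking the last word fits the way Chinese commodity names tend to place the head noun at the end, after its modifiers.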

Description

Technical field

[0001] The invention belongs to the field of computer internet technology, and in particular relates to a method for labeling commodities on Chinese e-commerce websites.


Application Information

IPC(8): G06F17/30
CPC: G06F16/951, G06F16/958
Inventor: 沈华楠, 赵亮亮, 姜平, 何学勇
Owner FOCUS TECH