Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A fast extraction method of online catering master label data based on Gaussian estimation

An extraction method and main label technology, applied in the field of data mining and recommendation system, can solve the problem of not being able to take into account the integrity and effectiveness of the content, and achieve the effect of low computational complexity

Active Publication Date: 2019-11-05
ZHEJIANG UNIV OF TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to overcome the inability of the existing catering data extraction methods to take into account both content integrity and utility, the present invention provides a Gaussian-based estimation method that has a balanced performance in content integrity and utility after denoising data. Fast extraction method of online catering master label data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A fast extraction method of online catering master label data based on Gaussian estimation
  • A fast extraction method of online catering master label data based on Gaussian estimation
  • A fast extraction method of online catering master label data based on Gaussian estimation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The present invention will be further described below in conjunction with the accompanying drawings.

[0019] refer to figure 1 , a method for quickly extracting main label data of online catering based on Gaussian estimation. This invention uses the data officially disclosed by yelp to analyze the scheme of extracting taste labels in the sense of user dining behavior. The original data records the historical behavior information of each user and the details of restaurants. Information, taking this patent research on yelp users as an example, the required behavior data includes information such as the user's dining restaurant, restaurant taste tags, and comment text on restaurants. The text data about users’ comments on restaurants is used here as a verification data set for subsequent testing to predict the reliability of the user behavior model.

[0020] The present invention comprises the following steps:

[0021] S1: Obtain store label data, as well as user rating...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Gaussian estimation-based online catering main tag data quick extraction method. The method comprises the following steps of 1) obtaining store tag data and user score and review data, and preprocessing the store tag data; 2) for each user, calculating a mean value and a variance of Gaussian distribution of a score data set of each tag, namely, score Gaussian distribution of the user under the tag; 3) for each user, performing standardization processing on each tag score of a store each time and the score Gaussian distribution of the tag, and calculating maximum likelihood estimation as a target tag of the user going to the store at that time; and 4) testing an estimation tag, an actual complete tag and the review data, and taking a relative deviation of a matching rate of the tag and the review data as a final evaluation score of a model. According to the method, a maximum likelihood taste tag under the Gaussian distribution is extracted as a mainly selected taste tag of a dining behavior of the user; and the method is relatively high in extraction precision and relatively low in algorithm complexity, and is suitable for actual application scenes.

Description

technical field [0001] The invention relates to the field of data mining and recommendation systems, in particular to a method for quickly extracting main label data of online catering based on Gaussian estimation. Background technique [0002] The data collected in data mining often have various noises, such as missing data or abnormal data. Obviously, noisy data can affect the performance of subsequent modeling. Data denoising is a very important preprocessing step in order to extract data that retains the maximum amount of information. In the process of user data analysis, sometimes using a good data denoising method to improve accuracy is much better than complex algorithm optimization. [0003] The main purpose of designing a recommendation system is to predict user behavior preferences, and analysis materials often come from user historical behavior data. To discover the content of a user's purchase behavior, a common method is to analyze user comments through natura...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/9535G06F16/33G06F16/903G06Q30/02
CPCG06F16/334G06F16/90344G06F16/9535G06F2216/03G06Q30/0255G06Q30/0269
Inventor 宣琦周鸣鸣张致远傅晨波翔云吴哲夫
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products