Menu field seed word automatic extraction realization method and a storage medium
A technology of automatic extraction and implementation method, which is applied in the field of manual interaction of recipes and smart kitchens, can solve a large number of manual problems, and achieve the effect of speeding up and fast extraction, saving labor cost and time, and saving manual labeling
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0026] Please refer to figure 1 As shown, a method for automatically extracting seed words in the field of recipes includes the following steps:
[0027] Step S100: obtain recipe data;
[0028] Preferably, the recipe data can be obtained by using crawler technology to obtain a large amount of available data, but not limited to crawler technology. For example, the recipe data can also be obtained by manually maintaining and entering data: third-party open platform interface to obtain data and other methods.
[0029] Step S200: set up a semantic model based on word vector and document vector;
[0030] Specifically, the step of establishing a semantic model based on word vectors and document vectors specifically includes:
[0031] Using a large amount of recipe data as training samples, the documents in the recipe data are word-segmented, and the words and documents after word-segmentation are obtained;
[0032] Train the words and documents after word segmentation to obtain a...
specific Embodiment
[0043] Step S1000, obtain a large amount of recipe data, specifically, load the document data from the database, and organize the document data into a single sentence; then, use the open source toolkit jieba Chinese word segmentation component or hanlp Chinese language processing component to perform word segmentation on the document data ;
[0044] The jieba Chinese word segmentation component or the hanlp Chinese language processing component can support three word segmentation modes: precise mode, which tries to cut the sentence most accurately and is suitable for text analysis; full mode, which scans all the words that can be formed into words in the sentence Come out; search engine mode, on the basis of the precise mode, segment long words again to improve the recall rate, suitable for word segmentation in search engines.
[0045] The word segmentation process is as follows: "The taste of Cantonese cuisine is generally lighter" -> "Cantonese cuisine", "of", "taste", "gene...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com