Text label extraction method and device, equipment and storage medium
A text label extraction technique, applied in the computer field, that addresses the low efficiency, insufficient personalization, and poor scalability of text labeling. It makes label extraction more efficient and improves the personalization, comprehensiveness, and accuracy of the extracted labels.
Examples
Embodiment 1
[0039] The text label extraction method provided in this embodiment applies to extracting labels from multiple texts under the same topic, and is especially suitable for extracting text labels on an e-commerce platform. The method can be executed by a text label extraction device, which can be implemented in software and/or hardware and integrated into a device such as a personal computer or a server. Referring to Figure 1a, the method of this embodiment includes the following steps:
[0040] S110. Obtain each text of the label to be extracted, and vectorize each text to obtain the text vector corresponding to that text.
[0041] Here, a label is a description of some characteristic of the text; for example, it may be a keyword expressing the focus of the text. On an e-commerce platform, a label can describe a specific attribute of an item, such as the specification attribute of the item, the extended a...
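The vectorization in S110 can be sketched as follows. The excerpt does not say which vectorization technique the method uses, so this sketch substitutes a plain bag-of-words term count with L2 normalization as a stand-in. The helper names (tokenize, build_vocabulary, vectorize), the sample texts, and the whitespace tokenization are all assumptions made for illustration; Chinese item descriptions would need a real word segmenter.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: whitespace tokenization stands in for real word segmentation.
    return text.lower().split()


def build_vocabulary(texts):
    # Map every distinct token across all texts to a fixed vector index.
    tokens = sorted({tok for text in texts for tok in tokenize(text)})
    return {tok: index for index, tok in enumerate(tokens)}


def vectorize(text, vocabulary):
    # Bag-of-words term counts, L2-normalized so text length does not dominate.
    counts = Counter(tokenize(text))
    vector = [0.0] * len(vocabulary)
    for tok, count in counts.items():
        if tok in vocabulary:
            vector[vocabulary[tok]] = float(count)
    norm = math.sqrt(sum(value * value for value in vector))
    return [value / norm for value in vector] if norm else vector


# Hypothetical texts under one topic (item descriptions).
texts = [
    "waterproof hiking backpack with laptop sleeve",
    "lightweight backpack for daily commute",
    "waterproof jacket for hiking",
]
vocabulary = build_vocabulary(texts)
text_vectors = [vectorize(text, vocabulary) for text in texts]
```

Each entry of text_vectors lines up with the text at the same index, which gives the one-vector-per-text correspondence that S110 describes.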
Embodiment 2
[0065] Building on Embodiment 1, this embodiment further refines the step of "obtaining each text of the label to be extracted". It can additionally refine the step of "determining the text label of each text according to each label candidate word corresponding to each text clustering result". Terms that are the same as or correspond to those in the above embodiment are not explained again here.
[0066] For the convenience of subsequent descriptions, the application scenario in this embodiment is extracting text labels from item introduction detail images on the e-commerce platform. In addition to the structured item specification attributes and item extension attributes, an item introduction detail image also contains a large amount of unstructured item information, and this information includes many personalized labels of the item. These personalized labels can be used as corrections and supplem...
Embodiment 3
[0091] Building on the above embodiments, this embodiment describes the steps for automatically tagging a text to be tagged. Terms that are the same as or correspond to those in the above embodiments are not explained again here.
[0092] S310. Obtain the text to be tagged, perform word segmentation and stop word removal on it, and obtain the word segmentation result corresponding to the text to be tagged.
[0093] The text to be tagged is the text that needs to be tagged automatically. It can be ordinary text, or it can be text obtained from an item introduction detail image according to the method of S210-S220 in Embodiment 2. Since automatic tagging relies on the text labels that have already been extracted from a large number of texts, the text to be tagged should belong to the same topic as the texts from which those text labels were extracted. After the text to be tagged is obtained,...
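The preprocessing in S310 can be sketched as follows. The excerpt names neither a segmentation tool nor a stop word list, so the regular-expression segmenter, the STOP_WORDS set, and the function names below are hypothetical stand-ins for illustration only; Chinese text would need a dedicated word segmenter in place of segment.

```python
import re

# Hypothetical stop word list; a real system would load a fuller list.
STOP_WORDS = frozenset({"the", "a", "an", "is", "with", "and", "of", "for", "this"})


def segment(text):
    # Assumption: runs of letters and digits stand in for real word segmentation.
    return re.findall(r"[a-z0-9]+", text.lower())


def remove_stop_words(tokens, stop_words=STOP_WORDS):
    # Keep token order; drop only tokens found in the stop word list.
    return [tok for tok in tokens if tok not in stop_words]


def preprocess(text):
    # Word segmentation followed by stop word removal, as in S310.
    return remove_stop_words(segment(text))


# Hypothetical text to be tagged.
segmentation_result = preprocess(
    "This backpack is waterproof, with a padded laptop sleeve."
)
```

Here segmentation_result holds the remaining content words in their original order, which is the kind of word segmentation result that the later tagging steps would consume.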