Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Topic mining method, system and device based on short text and storage medium

A short text and topic technology, applied in text database query, unstructured text data retrieval, special data processing applications, etc., can solve problems such as difficult to accurately mine high-quality short text topics

Pending Publication Date: 2020-07-28
TSINGHUA UNIV
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In order to solve the technical problem that it is difficult to accurately mine high-quality short text topics, the embodiments of the present invention provide a short text-based topic mining method, system, device, and storage medium

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Topic mining method, system and device based on short text and storage medium
  • Topic mining method, system and device based on short text and storage medium
  • Topic mining method, system and device based on short text and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0045] figure 1 A flowchart of a short text-based topic mining method provided by an embodiment of the present invention, such as figure 1 As shown, the method includes:

[0046] S1, acquiring the short text to be processed.

[0047] S2. Extract topic distribution information in the short text to be processed by using a preset short text topic ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention relates to the technical field of text data processing, and discloses a topic mining method, system and device based on a short text and a storage medium. The method comprises the steps of firstly obtaining a to-be-processed short text; and extracting topic distribution information in the to-be-processed short text through a preset short text topic mining model. Visibly, according to the embodiment of the invention, the topic mining operation of the short text is processed by applying the topic mining model special for short text processing, so that the topic ofthe short text can be accurately mined, and the technical problem that the topic of the high-quality short text is difficult to accurately mine is solved.

Description

technical field [0001] The invention relates to the technical field of text data processing, in particular to a short text-based topic mining method, system, device and storage medium. Background technique [0002] With the rapid development of the Internet today, short texts are becoming more and more popular. Typical short texts are microblogs, comments on shopping sites, and news headlines. [0003] It can be seen that short text is a type of text data with short text length and limited content. As for the word limit of the short text, it can be within 50 characters or within 100 characters. There is no hard limit here. Short text is a kind of text type expression widely used in academic circles. [0004] People tend to use short texts to express opinions and emotions, and the hidden topics mined from short texts have also played an important role in the fields of semantic analysis, user modeling and content recommendation. [0005] However, compared with ordinary long ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/33G06F40/126
CPCG06F16/334Y02D10/00
Inventor 李春平吴小宝
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products