Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Text data semantic space-time mode exploration method based on LDA model and LSTM network

A text data, spatiotemporal technology, applied in the direction of unstructured text data retrieval, text database query, special data processing applications, etc., can solve the problem that users cannot make decisions to add to the results, affect the ability of model extraction semantics, and cannot guarantee topic models. quality and other issues to achieve the effect of reducing errors, improving speed and accuracy, and increasing accuracy

Pending Publication Date: 2020-04-07
TIANJIN UNIV
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The above-mentioned works mainly use the text data of social media, but there are some shortcomings and deficiencies in these works: first, the above-mentioned work is to set the parameters of the topic model in advance, and use the trained topic model to extract semantics from the data of the present invention, because the topic model Sensitive to parameters, this cannot guarantee the quality of the topic model, thus affecting the ability of the model to extract semantics
Secondly, the above work mainly uses static data, and there is no requirement for data processing speed and query speed. When faced with massive text data, it is often impossible to respond in a timely manner.
Finally, the above work only provides a query function when displaying data, and cannot add the user's decision to the result

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text data semantic space-time mode exploration method based on LDA model and LSTM network
  • Text data semantic space-time mode exploration method based on LDA model and LSTM network
  • Text data semantic space-time mode exploration method based on LDA model and LSTM network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0021] The present invention proposes a visual analysis framework to interactively explore the semantic spatiotemporal patterns of massive text data. First, the framework adopts a topic extraction method based on model integration. By projecting the extracted results of topic models with different parameters on the plane, users can intuitively understand the differences between different topics and choose topics of interest from them, thus solving the problem of inability to It is a matter of setting the parameters of the subject accurately. Secondly, the framework integrates a DataCube-based data and task organization structure, by pre-storing two types of indicators of the subject ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text data semantic space-time mode exploration method based on an LDA model and an LSTM network. The text data semantic space-time mode exploration method comprises the following steps: (1) integrating a topic model, including theme generation, theme quality evaluation and theme dimension reduction projection; extracting semantics from the text data by using an LDA topicmodel, generating a topic model by iterating different parameters, performing quality evaluation on the topic model, and then selecting high-quality topics for integration so as to solve the influenceof the parameters on the model quality; (2) constructing a theme space-time body: converting time, space and text theme data in the text data into a cubic data structure; and (3) visual interaction and prediction: including a theme projection view, a time-space projection view and a mode comparison view; for providing visual interaction exploration for a theme space-time body, and being convenient for a user to explore a data result conveniently in a visual mode; and using an LSTM method to predict numerical change in a future time period.

Description

technical field [0001] This patent mainly relates to the fields of natural language processing and data visualization, and specifically relates to a method for structured representation of massive text data and topic model optimization. Background technique [0002] The amount of text data in the world has grown exponentially in recent years, which urgently requires people to mine new knowledge and new ideas from text data. From social media analysis to risk management and cybercrime protection, working with text data has never been more important. Since text data usually contains time and space information, spatio-temporal attributes are often added when text data is processed. [0003] Among the works dealing with the distribution of text data in space and time, there are many works that focus on finding keywords in text data. A common approach is to analyze text data to detect associated events that occur at specific times and locations, identifying events from text gro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06N3/04
CPCG06F16/3344G06N3/044G06N3/045
Inventor 贺一桐张康李杰
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products