Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for extending semantic information of microblogs and selecting features thereof

A semantic feature and feature selection technology, which is applied in special data processing applications, instruments, unstructured text data retrieval, etc., can solve problems such as many irregular texts, sparse semantics, and short microblog information

Inactive Publication Date: 2014-07-09
BEIJING UNIV OF TECH
View PDF2 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, Weibo has the characteristics of short information, many irregular texts, and sparse semantics. It is no longer applicable to directly use traditional feature selection and text classification methods.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for extending semantic information of microblogs and selecting features thereof
  • Method for extending semantic information of microblogs and selecting features thereof
  • Method for extending semantic information of microblogs and selecting features thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0046] according to figure 1 Shown, the method that the present invention proposes is to realize by following steps successively:

[0047] Step (1) Analyze microblog related information and define microblog semantic features.

[0048] Due to the character limit of Weibo itself, the semantic sparseness of the text of Weibo is unavoidable. But because Weibo can display information that other short texts do not have, such as author's personal information, comment content and other information. Therefore, a method for classifying microblog information is proposed here by combining these information with the body part. The analysis of microblog related information is shown in ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method for extending semantic information of microblogs and selecting features thereof, belongs to the field of text information processing, and particularly relates to a method and a system for extending semantic information of microblogs and selecting features thereof. The method has the advantages that the method is used for extracting the features of the microblogs on the basis of modified chi-square statistics; the classification features of the information of the microblogs are extended at first, frequency factors are imported on the basis of the traditional chi-square statistics, and a process for selecting the features is modified; a novel process for modifying the chi-square statistics is based on the traditional feature item weight computation, weight computation results are modified, and accordingly the microblog information classification accuracy can be improved by the method.

Description

technical field [0001] The invention belongs to the field of text information processing, and in particular relates to a microblog semantic information expansion and feature selection method and system. Background technique [0002] Weibo, the abbreviation of microblog, is an information sharing, dissemination and acquisition platform based on user relationships. Users can form personal communities through WEB, WAP and various clients, update information with about 140 characters, and realize real-time share. It has the characteristics of fast release of information and fast transmission speed. [0003] The rapid development of microblogging technology has greatly promoted people's communication and exchanges, and made great contributions to human civilization and development. However, the negative impact brought by the explosive growth of information has become increasingly prominent. Especially with the continuous popularization of major microblog websites and other fac...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/35G06F16/958
Inventor 刘磊
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products