Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for processing text semantics by utilizing image processing technology and semantic vector space

A technology of image processing and semantic processing, which is applied in semantic analysis, electronic digital data processing, special data processing applications, etc. It can solve the problems of not being able to deal with the influence of synonyms and synonyms of word variants, high calculation costs, and unable to meet real-time applications, etc. , to meet real-time application requirements and ensure lightweight effect

Inactive Publication Date: 2014-09-10
SHANGHAI JILIAN NETWORK TECH CO LTD
View PDF6 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the current text semantic processing technology still has some limitations. Taking the text semantic segmentation technology as an example, it basically starts from the perspective of word frequency statistics, by calculating the similarity of the word frequency statistical vectors of repeated words in adjacent text blocks. Degree to achieve semantic segmentation, such as the classic TextTiling algorithm, Dotplotting algorithm, but they do not take into account the semantic space contained in the word, can not deal with the impact of word variants or synonyms, so the robustness is not strong; after that Although some algorithms such as the ESA (Explicit semantic analysis) algorithm have enhanced robustness by introducing semantic vector space, they cannot meet the needs of real-time applications due to the high dimension of the semantic space and the huge computational cost; there is also the TopicTilling algorithm, although through Adding the connection between words and topics improves segmentation performance, but it requires complex topic models to intervene, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for processing text semantics by utilizing image processing technology and semantic vector space
  • Method and system for processing text semantics by utilizing image processing technology and semantic vector space
  • Method and system for processing text semantics by utilizing image processing technology and semantic vector space

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0180] Demonstrate the specific embodiment of the present invention with example below, each module of system is processed as follows successively:

[0181] (1) Text input

[0182] Enter a piece of text, segment the sentences and arrange them in order as follows:

[0183][1] The People's Republic of China (PRC), the third-largest country in the world after the former USSR and Canada and the largest nation in Asia, claims an area of ​​approximately 9.6 million square kilometers.

[0184] [2] China's landscape is vast and diverse, ranging from forest steppes and the Gobi and Taklamakan deserts in the arid north to subtropical forests in the wetter south.

[0185] [3] The Himalaya, Karakoram, Pamir and Tian Shan mountain ranges separate China from South and Central Asia.

[0186] [4] The Yangtze and Yellow Rivers, the third- and sixth-longest in the world, run from the Tibetan Plateau to the densely populated eastern seaboard.

[0187] [5] China's climate is mainly dominated b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of text semantic information processing, and in particular relates to a method and a system for processing text semantics by utilizing an image processing technology and semantic vector space. The system comprises a text input and preprocessing module, a semantic vector construction module, a semantic information processing module and a semantic processing result display module, wherein the semantic information processing module is specifically used for semantic turning sentence extraction, semantic noise sentence detection, semantic range tracking and semantic scene segmentation. According to the method and the system, a text unit is mapped to a pixel in an image, and a semantic vector which describes the text unit is taken as pixel grayscale of the image, so that various technologies and methods in an image processing field can be introduced to process a text flexibly and intuitively without the influence of the diversification of word forms; meanwhile, the semantic vector is constructed by instructing a Word2Vec method, so that the lightweight of algorithms is ensured to meet the requirements on real-time application.

Description

technical field [0001] The invention belongs to the technical field of text semantic information processing, and in particular relates to a lightweight text semantic processing method and system utilizing image processing technology and semantic vector space. Background technique [0002] With the development of computer technology and network, we have now entered the era of information explosion - all kinds of massive data are presented in the form of electronic text. In this case, it is possible to quickly and accurately extract the information that users care about. It is against this background that the text information processing technology emerges as the demand becomes more and more urgent, and the semantic processing of text is the most important thing, which makes us move from language processing to language understanding. Text semantic processing technology has great application value in many fields, such as text semantic segmentation, automatic text summarization e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/345G06F40/30
Inventor 王晓平肖仰华汪卫
Owner SHANGHAI JILIAN NETWORK TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products