
Carrier-free information hiding method for big data Chinese text

An information hiding technique for big data, applicable to digital data protection, electronic digital data processing, text database indexing, and related fields. It addresses the limited room for improving hiding capacity in existing methods and their difficulty in meeting practical needs.

Pending Publication Date: 2020-10-20
CENTRAL SOUTH UNIVERSITY OF FORESTRY AND TECHNOLOGY

AI Technical Summary

Problems solved by technology

Although the methods proposed in prior work have improved hiding capacity, the room for further improvement is small, and these methods still struggle to meet practical needs.



Examples


Embodiment 1

[0073] Text segmentation and word frequency features

[0074] Analyzing sentences in Chinese text requires first segmenting them into words. How to segment sentences into words accurately has long been a research hotspot in natural language processing. HanLP is an open-source Java toolkit made up of a series of models and algorithms. Besides word segmentation, it offers complete functionality for lexical analysis, syntactic analysis, and semantic understanding. In its extreme speed mode, HanLP can segment up to 20 million characters per second.
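To make the idea of word segmentation concrete, here is a minimal sketch in Python. It uses greedy forward maximum matching against a small dictionary, which is a simple stand-in and not HanLP's algorithm. The function name `forward_max_match`, the `max_word_len` parameter, and the toy vocabulary are all assumptions made for illustration.

```python
def forward_max_match(sentence, vocabulary, max_word_len=4):
    """Segment `sentence` greedily, preferring the longest dictionary word."""
    words = []
    i = 0
    while i < len(sentence):
        # Try the longest candidate first, then shrink the window.
        for size in range(min(max_word_len, len(sentence) - i), 0, -1):
            candidate = sentence[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or candidate in vocabulary:
                words.append(candidate)
                i += size
                break
    return words


# Hypothetical toy dictionary; a real toolkit ships a far larger one.
TOY_VOCABULARY = {"信息", "隐藏", "信息隐藏", "技术", "大数据", "文本"}
segmented = forward_max_match("大数据文本信息隐藏技术", TOY_VOCABULARY)
print(segmented)
```

Greedy matching is easy to follow but resolves ambiguity poorly, which is one reason production toolkits rely on statistical models instead.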

[0075] After segmentation, the words in the text usually need to be analyzed. In natural language processing, word frequency statistics and TF-IDF feature extraction are the most commonly used methods. The word frequency method assumes that the subject words of a text tend to appear in it repeatedly, so the word frequency in the text can ...
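The word frequency and TF-IDF features mentioned above can be sketched as follows. This is a minimal illustration that assumes the common definition, relative term frequency multiplied by log(N / document frequency). The source does not show which TF-IDF variant the method uses, and the function names and sample documents are hypothetical.

```python
import math
from collections import Counter


def term_frequencies(words):
    """Relative frequency of each word within one segmented text."""
    counts = Counter(words)
    total = len(words)
    return {word: count / total for word, count in counts.items()}


def tf_idf(documents):
    """TF-IDF score of every word in every segmented text."""
    n_docs = len(documents)
    # Document frequency: the number of texts that contain each word.
    doc_freq = Counter()
    for words in documents:
        doc_freq.update(set(words))
    scores = []
    for words in documents:
        tf = term_frequencies(words)
        scores.append(
            {word: tf[word] * math.log(n_docs / doc_freq[word]) for word in tf}
        )
    return scores


# Hypothetical, already segmented sample texts.
docs = [["信息", "隐藏", "信息"], ["文本", "信息"], ["文本", "检索"]]
scores = tf_idf(docs)
```

In the first sample text, "信息" is the most frequent word, yet "隐藏" receives the higher TF-IDF score because it appears in only one text. This is the usual reason for weighting raw frequency by inverse document frequency.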


Abstract

The invention discloses a carrier-free information hiding method for big data Chinese text. First, the big data Chinese text is preprocessed, which mainly comprises segmenting the text into words, computing word frequency and TF-IDF features for the segmented words, and clustering the texts with an LDA topic model. The sender then segments the secret information, converts it into keyword IDs through a word index table, and searches the big data text for texts containing the secret keywords. Next, an index label is built for each retrieved text from its topic distribution and the TF-IDF features of the keywords it contains, while a random number is introduced to control the order of the secret keywords. Finally, the random number and the index labels are encrypted together and sent to the receiver. Experiments show that the method improves the concealment and security of the secret information while increasing hiding capacity.
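The flow in the abstract can be illustrated with a toy sketch, which is not the patented method. It assumes a label is simply a pair (text number, word offset), whereas the abstract describes labels built from topic distribution and TF-IDF features. It also assumes a seeded shuffle plays the role of the random number, and it omits the encryption step. The names `hide` and `recover` and all sample data are hypothetical.

```python
import random


def hide(keywords, word_index, corpus, seed):
    """Map secret keywords to labels that point into unmodified texts."""
    ids = [word_index[word] for word in keywords]  # keyword -> keyword ID
    id_to_word = {i: word for word, i in word_index.items()}
    # The seed stands in for the random number controlling keyword order.
    order = list(range(len(ids)))
    random.Random(seed).shuffle(order)
    labels = []
    for pos in order:
        word = id_to_word[ids[pos]]
        # Pick the first text containing the keyword; fails if none does.
        text_id = next(t for t, words in enumerate(corpus) if word in words)
        labels.append((text_id, corpus[text_id].index(word)))
    return labels


def recover(labels, corpus, seed):
    """Receiver side: rebuild the keyword sequence from labels and seed."""
    order = list(range(len(labels)))
    random.Random(seed).shuffle(order)
    keywords = [None] * len(labels)
    for (text_id, offset), pos in zip(labels, order):
        keywords[pos] = corpus[text_id][offset]
    return keywords


# Hypothetical segmented corpus, word index table, and secret message.
corpus = [["大数据", "文本"], ["信息", "隐藏", "技术"]]
word_index = {"信息": 0, "隐藏": 1, "文本": 2}
secret = ["隐藏", "文本", "信息"]
labels = hide(secret, word_index, corpus, seed=7)
restored = recover(labels, corpus, seed=7)
```

No text in the corpus is altered. Only the labels and the seed travel to the receiver, which is the sense in which such a scheme is carrier-free.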

Description

Technical field

[0001] The invention relates to a carrier-free information hiding method for big data Chinese text.

Background

[0002] Information hiding, an important branch of information security, mainly exploits the insensitivity of human sensory organs to redundancy in digital information to hide one piece of information inside another information carrier, so that the carrier still shows its original features after hiding. The carrier can be any of various types of data, such as text, image, video, or audio. Although the external characteristics of the carrier are preserved, some of its information must still be changed, which leaves these methods unable to effectively resist steganography detection techniques such as replay attacks, OCR, and statistical analysis.

[0003] In view of the existing information hiding technology that needs to change the carrier information, scholars have proposed the concept of informati...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F21/60, G06F16/31, G06F16/33, G06F16/35, G06F40/289
CPC: G06F21/602, G06F16/316, G06F16/3334, G06F16/35, G06F21/1066
Inventor: 秦姣华, 周卓, 向旭宇, 谭云
Owner: CENTRAL SOUTH UNIVERSITY OF FORESTRY AND TECHNOLOGY