Method for embedding and extracting frequency-domain watermarks in English text

A watermarking technique for English text, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses the problems that small edits can affect the watermark information, that format-based text watermark information is lost on conversion, and that existing schemes resist attacks poorly, and it achieves the effect of enhanced robustness.

Status: Inactive
Publication Date: 2008-04-30
TSINGHUA UNIV

AI Technical Summary

Problems solved by technology

People can design a watermarking scheme for any given file format, but it is difficult to find a single watermarking technology suitable for all file formats.
[0004] (2) Files in different formats can usually be converted into one another, and the plain-text content of a file can even be extracted directly. For example, using Paste Special in Word to copy and paste only the unformatted text completely destroys format-based text watermark information.
The disadvantage of this method is that it is difficult to resist attacks based on meaning-preserving transformations. Sometimes adding or deleting a single word in the text may affect the watermark information.

Embodiment Construction

[0044] The method proposed by the present invention for embedding and extracting a frequency-domain watermark in English text is described below with reference to the accompanying drawings and an example:

[0045] The method proposed by the present invention for embedding and extracting a frequency-domain watermark in text comprises two parts: watermark embedding and watermark extraction. The watermark embedding steps are shown in Figures 1, 2, and 3. First, the English text T is read, and then a vector is extracted from T; the specific process is shown in Figure 2. The first step is to scan the text T from left to right, identify its first adjective or adverb w, and use the WordNet tool to find the synonym set S_w of w. Next, check whether S_w has already been marked. If it has been marked, skip this word, continue scanning onward, and repeat this step; if S_w has not been marked, first mark S_w as processed, which means that th...
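
The scanning step described above can be sketched in Python. This is only an illustration under stated assumptions, not the patent's implementation: a small hand-written thesaurus (`TOY_SYNONYM_SETS`, hypothetical) stands in for WordNet, and membership in that thesaurus stands in for recognizing a word as an adjective or adverb. The function names are likewise hypothetical.

```python
import re

# Hypothetical toy thesaurus standing in for WordNet (assumption: the patent
# looks up real WordNet synonym sets for adjectives and adverbs).
TOY_SYNONYM_SETS = [
    frozenset({"quick", "fast", "rapid"}),
    frozenset({"happy", "glad", "joyful"}),
]


def build_lookup(synonym_sets):
    """Map every word to the synonym set S_w that contains it."""
    return {word: s for s in synonym_sets for word in s}


def collect_synonym_sets(text, lookup):
    """Scan text left to right and return (w, S_w) for each unmarked S_w."""
    marked = set()   # synonym sets already marked as processed
    found = []
    for token in re.findall(r"[a-z]+", text.lower()):
        s_w = lookup.get(token)
        if s_w is None:
            continue          # not a candidate word in this toy setup
        if s_w in marked:
            continue          # S_w already marked: skip and keep scanning
        marked.add(s_w)       # mark S_w as processed
        found.append((token, s_w))
    return found


sample = "The quick fox ran fast and was happy, even glad."
result = collect_synonym_sets(sample, build_lookup(TOY_SYNONYM_SETS))
print([w for w, _ in result])  # prints ['quick', 'happy']
```

In the sample, "fast" and "glad" are skipped because their synonym sets were already marked when "quick" and "happy" were encountered.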

Abstract

The invention relates to a method for embedding and extracting a frequency-domain watermark in English text, and belongs to the technical field of computer file protection. The method comprises: acquiring an adjective or adverb w from the English text T; finding the synonym set S_w of w, which serves as one dimension of the text vector v_c of T; finding an agent word w_d from w; performing a hash operation on the private key information k of the file's copyright holder to obtain a long integer R; dividing R by a preset number of groups n (n is a positive integer) to obtain the group number i of the current S_w; performing a one-way hash operation on each word w_s in S_w together with k, determining the parity of the obtained remainder, and adding the word to set A_i or set B_i accordingly; using the word counts c_i of the sets A_i as the vector v_c of the English text T; and setting a watermark vector v_w corresponding to the text vector v_c as the watermark information to be embedded or extracted. The method also comprises watermark embedding and watermark detection steps. The invention can be used to protect original texts.
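
The grouping and parity steps in the abstract can be illustrated with a short Python sketch. Several details are assumptions, because the abstract does not fix them: HMAC-SHA256 is used as the keyed one-way hash; the agent word w_d is taken to be the alphabetically smallest word of S_w; R is computed from w_d together with k; the group number is R mod n; odd hash values go to A_i and even ones to B_i; and c_i is read literally as the size of A_i (counting occurrences in T would be another possible reading). All function names are hypothetical.

```python
import hashlib
import hmac


def keyed_hash(key: bytes, word: str) -> int:
    """Keyed one-way hash of a word (assumption: HMAC-SHA256) as an integer."""
    digest = hmac.new(key, word.encode("utf-8"), hashlib.sha256).digest()
    return int.from_bytes(digest, "big")


def text_vector(synonym_sets, key: bytes, n: int):
    """Return (v_c, A, B), where v_c[i] = |A_i| for groups i = 0..n-1."""
    groups_a = [set() for _ in range(n)]
    groups_b = [set() for _ in range(n)]
    for s_w in synonym_sets:
        w_d = min(s_w)                    # assumed choice of agent word
        i = keyed_hash(key, w_d) % n      # assumed: group number = R mod n
        for w_s in s_w:
            if keyed_hash(key, w_s) % 2 == 1:
                groups_a[i].add(w_s)      # odd value -> A_i (assumed convention)
            else:
                groups_b[i].add(w_s)      # even value -> B_i
    v_c = [len(a) for a in groups_a]
    return v_c, groups_a, groups_b


sets = [
    frozenset({"quick", "fast", "rapid"}),
    frozenset({"happy", "glad", "joyful"}),
]
v_c, a_sets, b_sets = text_vector(sets, b"owner-private-key", 4)
print(v_c)
```

Because every step depends only on the key and the words themselves, anyone holding the same key k recomputes the same grouping and the same vector, which is what lets extraction mirror embedding.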

Description

Technical field

[0001] The invention belongs to the technical field of computer text protection, and in particular relates to a method for embedding and extracting frequency-domain watermarks in English text.

Background art

[0002] As an effective means of computer text protection, digital watermarking has increasingly become a focus of research. However, most current research on digital watermarking focuses on image, audio, and video data, and relatively little addresses text watermarking. This is mainly due to the particular nature of text, which makes it difficult to watermark:

[0003] (1) Text is composed of content and format. Because document content can be expressed in different ways, text document formats also differ. There are many types of text files and many file formats, such as Word documents (*.doc), Web pages, plain text, and PDF. People can design a watermark scheme for any file format, but...

Application Information

IPC(8): G06F17/28
Inventors: 王建民, 王朝坤, 李德毅, 杨建龙
Owner: TSINGHUA UNIV