Content identification method and device, electronic equipment and readable storage medium

A content recognition and text recognition technology, applied in the field of content recognition, can solve the problems of insecurity, low audit efficiency, endangering network health, etc., and achieve the effect of meeting the requirements of recognition

Pending Publication Date: 2022-01-14
成都颜创启新信息技术有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] With the development of Internet technology, the Internet platform has become the main platform for people to obtain information. On various Internet platforms, there are massive text, image and video data generated all the time. Among these massive data, there are extremely May produce unsafe content that is harmful to online health
[0003] If the review of insecure content relies on traditional manual review methods to meet the review needs of massive Internet content, there will be problems such as high review costs, low review efficiency, and uncontrollable review accuracy.
[0004] Existing technologies often use a single content recognition technology to detect text or images in websites, which can only recognize a single text or image, and cannot meet the requirements for content recognition of multi-modal web content

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Content identification method and device, electronic equipment and readable storage medium
  • Content identification method and device, electronic equipment and readable storage medium
  • Content identification method and device, electronic equipment and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] First, the professional terms that may be involved in the present application will be explained to facilitate understanding.

[0052] Machine study: Machine learning is a multi-discipline cross-professional, covering probability theory knowledge, statistical knowledge, approximate theoretical knowledge and complex algorithm knowledge, using computer as a tool and is committed to simulated human learning methods in real time, and will existing content Knowledge structure is divided to effectively improve learning efficiency.

[0053]Deep learning: Deep learning is the inherent law of study sample data and the level, which has a great help in the information obtained in these learning processes, such as text, images, and sounds. Its ultimate goal is to let the machine have analyzed learning capabilities like people, identify data such as text, images, and sound. Depth study in data mining machine translation, natural language processing, multimedia learning, and other related...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a content identification method and device, electronic equipment and a readable storage medium. The invention relates to the field of content identification, and the method comprises the following steps: obtaining source text data, source image data and video data in webpage information, converting the video data into corresponding image data, and identifying and labeling the source text data, the source image data and the image data converted from the video data through a first learning model and a second learning model; therefore, characters, images and video data in the webpage can be identified at the same time. The requirement for identifying the multi-modal webpage content is met, and meanwhile, the identification efficiency can be greatly improved.

Description

Technical field [0001] The present invention relates to the field of content identification, and in particular, to one content identification method, apparatus, electronic device, and readable storage medium. Background technique [0002] With the development of Internet technology, the Internet platform has become the main platform for people's information. On various Internet platforms, massive texts, images and video data are generated, while in these massive data, very May cause unsafe content that hazards network health. [0003] For unsafe content, if you rely on traditional manual audit methods to address the audit needs of Internet mass content, there is a problem of high audit cost, low audit efficiency, uncontrollable audit accuracy. [0004] The prior art is often detected by a single content identification technique, which can only be identified for a single text or image, and it is impossible to meet the requirements for content recognition of the multi-modal web pag...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06V10/774G06V20/62G06V30/10G06K9/62
CPCG06F18/214
Inventor 宋梓语
Owner 成都颜创启新信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products