Chinese online audio and video subtitle generation method

A technology of audio, video and subtitles, which is applied to the automatic generation of subtitles of audio and video in Chinese online courses, and the field of automatic generation of subtitles. It can solve the problems that the technology has not been widely used, and achieve the effects of reducing maintenance costs, improving quality, and broad application prospects

Active Publication Date: 2021-04-06
NANJING UNIV OF POSTS & TELECOMM
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Google scientist Mike Cohen said that subtitle generation technology integrates speech recognition and translation algorithms, but this technology is not perfect and still needs continuous improvement
Moreover, some scholars have conducted research on the automatic subtitle generation technology of Chinese audio and video in China, and found that this technology has not been widely used in the relevant sites of Chinese online courses.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese online audio and video subtitle generation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] like figure 1 As shown, the present invention discloses a subtitle generation method for Chinese online audio and video, comprising the following steps:

[0032] S1, the audio data extraction step, the server receives the audio and video files uploaded by the user, extracts the audio data from the received audio and video files, and converts the audio data into a standard format.

[0033] Specifically include: the user uploads an audio and video file through the Chinese online course video website, the server receives the audio and video file, extracts the audio data in it, the server reads the parameter information from the audio data, and converts the audio data into a standard format. The parameter information includes at least the number of sound channels, encoding mode and sampling rate.

[0034] The generated audio format processed and analyzed in this step is in wav format. Wav is a coding format developed by Microsoft and IBM for storing audio streams in person...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention discloses a subtitle generation method for Chinese online audio and video, including the following steps: S1, audio data extraction step, the server receives audio and video files, extracts audio data and converts it into a standard format; S2, noise reduction step, audio data Carry out noise reduction processing, obtain audio file; S3, data segmentation step, audio file is carried out endpoint segmentation, obtains audio sample; S4, segment recognition step, is further segmented to obtained audio sample, obtains voice segment, then Recognize the speech segment, sort out and obtain the recognition results of all audio data; S5, the subtitle generation step, integrate and analyze the text and the corresponding time axis, obtain the subtitle file, and match the subtitle with the audio data according to the generated subtitle file. The method of the invention can automatically complete the voice recognition of audio and video information and subtitle generation, effectively making up for the shortcomings of traditional manual shorthand in conversion efficiency in subtitle generation.

Description

technical field [0001] The invention relates to a method for automatically generating subtitles, in particular to a method for automatically generating subtitles for Chinese online course audio and video, and belongs to the technical field of audio recognition. Background technique [0002] With the continuous progress and improvement of Internet technology, various Chinese online audio and video course websites have also been widely popularized and developed rapidly, and the ways and forms of disseminating professional knowledge in various fields have changed. Synchronous subtitles in audio and video information help learners overcome difficulties in understanding new knowledge due to differences in regional culture and language, and also eliminate listening problems caused by lecturers’ inarticulate words, homophones, and non-standard pronunciation. , Obstacles to watching audio and video information. At the same time, adding subtitles to audio and video can also effectiv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04N5/278H04N21/439G10L21/0208G10L15/22G10L15/04
CPCG10L15/04G10L15/22G10L21/0208H04N5/278H04N21/439H04N21/4394
Inventor 薛景陈康扬王宇
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products