Patents

Literature

Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.

336 results about "Cepstrum" patented technology

Filter

Efficacy Topic

Property

Owner

Technical Advancement

Application Domain

Technology Topic

Technology Field Word

Patent Country/Region

Patent Type

Patent Status

Application Year

Inventor

A cepstrum (/ˈkɛpstrʌm, ˈsɛp-, -strəm/) is the result of taking the inverse Fourier transform (IFT) of the logarithm of the estimated spectrum of a signal. It may be pronounced in the two ways given, the second having the advantage of avoiding confusion with "kepstrum", which also exists (see below). There is a complex cepstrum, a real cepstrum, a power cepstrum, and a phase cepstrum. The power cepstrum in particular has applications in the analysis of human speech.

Speech recognition system

InactiveUS7266496B2Improve execution speedLow costSpeech recognitionLinearityLinear prediction coefficient

The present invention discloses a complete speech recognition system having a training button and a recognition button, and the whole system uses the application specific integrated circuit (ASIC) architecture for the design, and also uses the modular design to divide the speech processing into 4 modules: system control module, autocorrelation and linear predictive coefficient module, cepstrum module, and DTW recognition module. Each module forms an intellectual product (IP) component by itself. Each IP component can work with various products and application requirements for the design reuse to greatly shorten the time to market.

Speech recognition system

Speech recognition system

Speech recognition system

Owner:NAT CHENG KUNG UNIV

Speaker verification for authorizing updates to user subscription service received by internet service provider (ISP) using an intelligent peripheral (IP) in an advanced intelligent network (AIN)

InactiveUS6337899B1Multiplex system selection arrangementsSpecial service provision for substationFrequency spectrumMatch algorithms

A method for controlling subscription services delivered to a user by an Internet Service Provider (ISP) coupled to an Automated Intelligent Network (AIN) telephone system with at least one central office switching system. An intelligent peripheral subsystem is connected to the central office switching system, via a call connection channel. The intelligent peripheral subsystem providing at least one auxiliary call processing capability via the call connection channel and provides a telephony speaker authentication method for selectively authorizing updates to user subscription services types provided by the ISP, wherein each of the subscription services types include one or more service options. The method for controlling subscription services including: a) prompting for a voice response; b) authenticating a user's voice response pattern by performing a speaker-dependent cepstrum matching algorithm with previously stored samples of the user's voice response, and if the user's voice response pattern is positively authenticated with previously stored samples, then performing the steps of: c) presenting a menu of user subscription types, each with available AIN service options; and d) responding to a user selection of at least one user subscription types, each with service option, by activating a service option selected by the user, wherein the service option is governed by said ISP by regulating one or more Uniform Resource Locator (URL) addresses received by a user subscriber device from said ISP over a telecommunication line.

Speaker verification for authorizing updates to user subscription service received by internet service provider (ISP) using an intelligent peripheral (IP) in an advanced intelligent network (AIN)

Speaker verification for authorizing updates to user subscription service received by internet service provider (ISP) using an intelligent peripheral (IP) in an advanced intelligent network (AIN)

Speaker verification for authorizing updates to user subscription service received by internet service provider (ISP) using an intelligent peripheral (IP) in an advanced intelligent network (AIN)

Owner:IBM CORP

Symbiotic automatic speech recognition and vocoder

InactiveUS6092039ASpeech recognitionUsabilitySpeech identification

The device and method of the invention receives a digital speech signal, which is processed by an Acoustic Processor to produce a Mel-Cepstrum Vector and Pitch. This is recalibrated and encoded. The encoded signal is transmitted over a narrow-band Channel, then decoded, split and recalibrated. From the split signals, one signal feeds a Statistical Processor which produces Recognized Text. Another signal feeds a Regenerator, which produces Regenerated Speech. The device and method according to the invention achieve simultaneously very perceptive Automatic Speech Recognition and high quality VoCoding, using Speech communicated or stored via a Channel with narrow-bandwidth; very perceptive Automatic Speech Recognition on a Client & Server system without a need to store or to communicate wide-bandwidth Speech signals; very perceptive Automatic Speech Recognition with Deferred Review and Editing without storage of wide-bandwidth Speech signals; better feedback in a system for Automatic Speech Recognition particularly for Deferred Automatic Speech Recognition; and good usability for unified Automatic Speech Recognition and VoCoding.

Owner:IBM CORP

Computer-implemented method and apparatus for audio data hiding

InactiveUS7058570B1Maximize survivabilityLittle changeSpeech analysisSound producing devicesHidden dataSurvivability

A computer-implemented method and apparatus for embedding hidden data in an audio signal. An audio signal is received in a base domain and then transformed into a non-base domain, such as cepstrum domain or LP residue domain. The statistical mean manipulation is employed on selected transform coefficients to embed hidden data. The introduced distortion is controlled by psychoacoustic model to ensure the imperceptibility of the embedded hidden data. Scrambling techniques can be plugged in to further increase the security of the data hiding system. The present new audio data hiding scheme provides transparent audio quality, sufficient embedding capacity, and high survivability over a wide range of common signal processing attacks.

Computer-implemented method and apparatus for audio data hiding

Computer-implemented method and apparatus for audio data hiding

Computer-implemented method and apparatus for audio data hiding

Owner:PANASONIC CORP

Cepstral domain pulse oximetry

InactiveUS7139599B2Improve determinationEliminate artifactsCatheterSensorsTime domainPulse oximetry

Processing of plethysmographic signals via the cepstral domain is provided. In one embodiment, a cepstral domain plethysmographic signal processing method (200) includes the steps of obtaining (210) time domain plethysmographic signals, smoothing (220) the time domain plethysmographic signals, performing (230) a first-stage Fourier transformation of the time domain plethysmographic signals to frequency domain plethysmographic signals, computing (240) power spectrums from the frequency domain plethysmographic signals, scaling (250) the power spectrums with a logarithmic function, performing (260) a second-stage Fourier transformation on log-scaled spectrums to transform the power spectrums into cepstrums, and examining (270) the cepstrums to obtain information therefrom relating to a physiological condition of the patient such as the patient's pulse rate or SPO2 level.

Cepstral domain pulse oximetry

Cepstral domain pulse oximetry

Cepstral domain pulse oximetry

Owner:DATEX OHMEDA

Double-microphone speech enhancer and speech enhancement method thereof

InactiveCN102347028AReduce noiseReduce background noiseSpeech analysisSpeech enhancement algorithmSpeech sound

The invention provides a double-microphone speech enhancer, which comprises a double-microphone array module, a delay compensation module, a cepstrum field dereverberation module, a speech enhancement processing module and an output module, wherein the cepstrum field dereverberation module has the effect of dereverberation. The invention also provides a speech enhancement method based on the double-microphone speech enhancer. The invention adopts the cepstrum field dereverberation module to carry out beam formation and low-pass filtering, and can effectively eliminate the affection of room reverberation, and meanwhile, by using a speech enhancement algorithm, background noise can be further suppressed.

Double-microphone speech enhancer and speech enhancement method thereof

Double-microphone speech enhancer and speech enhancement method thereof

Double-microphone speech enhancer and speech enhancement method thereof

Owner:AAC ACOUSTIC TECH (SHENZHEN) CO LTD +2

Voice enhancing method based on multiresolution auditory cepstrum coefficient and deep convolutional neural network

ActiveCN107845389AReduce complexityCompatible with auditory perception characteristicsSpeech recognitionMasking thresholdHuman ear

The invention discloses a voice enhancing method based on a multiresolution auditory cepstrum system and a deep convolutional neural network. The voice enhancing method comprises the following steps:firstly, establishing new characteristic parameters, namely multiresolution auditory cepstrum coefficient (MR-GFCC), capable of distinguishing voice from noise; secondly, establishing a self-adaptivemasking threshold on based on ideal soft masking (IRM) and ideal binary masking (IBM) according to noise variations; further training an established seven-layer neural network by using new extracted characteristic parameters and first / second derivatives thereof and the self-adaptive masking threshold as input and output of the deep convolutional neural network (DCNN); and finally enhancing noise-containing voice by using the self-adaptive masking threshold estimated by the DCNN. By adopting the method, the working mechanism of human ears is sufficiently utilized, voice characteristic parameters simulating a human ear auditory physiological model are disposed, and not only is a relatively great deal of voice information maintained, but also the extraction process is simple and feasible.

Voice enhancing method based on multiresolution auditory cepstrum coefficient and deep convolutional neural network

Voice enhancing method based on multiresolution auditory cepstrum coefficient and deep convolutional neural network

Voice enhancing method based on multiresolution auditory cepstrum coefficient and deep convolutional neural network

Owner:BEIJING UNIV OF TECH

Fundamental frequency identification method for detecting cord force of cable-stayed bridge

InactiveCN101201282AImprove accuracyHigh precisionFrequency analysisTension measurementEnvironmental noiseVibration acceleration

The invention discloses an identification method of fundamental frequency for detection of the force of the cable of a stayed-cable bridge, which gets a first fundamental frequency by an autopower spectrum module, a second fundamental frequency by a cepstrum module, and then determines whether the quotient of the following two values is less than or equal to the set threshold: 1. the absolute value of the difference of the first fundamental frequency and the second fundamental frequency; 2. a half of the sum of the first fundamental frequency and the second fundamental frequency. In this way, whether the fundamental frequency of the pull cable is a half of the first fundamental frequency and the second fundamental frequency is determined. By adopting the method, the accuracy and the precision of the fundamental frequencies got are significantly improved. In addition, as the vibration acceleration response time interval signal got by an acceleration sensor passes through a signal conditioning module for filtration and smooth processing firstly, the environmental noise in the vibration acceleration response time interval signal is effectively suppressed; the antijamming capability of the stayed-cable bridge is improved so that the fundamental frequency is identified more clearly and accurately.

Fundamental frequency identification method for detecting cord force of cable-stayed bridge

Fundamental frequency identification method for detecting cord force of cable-stayed bridge

Fundamental frequency identification method for detecting cord force of cable-stayed bridge

Owner:NINGBO UNIV

Rolling bearing fault detection method based on vibration detection

InactiveCN103076177AEliminate distractionsRich in detailsMachine bearings testingDiagnosis methodsFrequency characteristic

The invention relates to a fault diagnosis method, in particular to a rolling bearing fault diagnosis method based on the vibration detection. The method comprises the following steps of firstly decomposing the rolling bearing data collected by an acceleration sensor into three layers of wavelet packets, solving the energy of a third layer of wavelet packet coefficient rebuilding signals, selecting a frequency band with centralized energy to rebuild approximate evaluation of an original signal according to the variation of energy values of each frequency bands of the third layer; and utilizing a cepstrum to further analyze the rebuilt signal, and comparing the rebuilt signal with a theoretically-computed fault characteristic frequency and an edge frequency characteristic. By combining multiple resolutions of the wavelet packet and the cepstrum, the periodic component on a power spectrum, a separated-side frequency-band signal and the characteristics which are slightly subjected to the transmission route can be well detected. Meanwhile, the method is strong in manipulability and practicability.

Rolling bearing fault detection method based on vibration detection

Rolling bearing fault detection method based on vibration detection

Rolling bearing fault detection method based on vibration detection

Owner:KUNMING UNIV OF SCI & TECH

Adaptive K-factor-improvement filter for receiver of radio signals subject to multipath distortion

InactiveUS7151797B2Multiple-port networksTelevision system detailsChannel impulse responseWeight coefficient

An adjustable K-factor-improvement (KFI) filter having a sparse kernel composed of non-zero weighting coefficients of substantially equal amplitudes is included in a receiver for digital television signals or the like. The receiver includes apparatus for measuring the channel impulse response (CIR) or cepstrum of the signal as received. The respective polarities of the non-zero weighting coefficients in the sparse kernel of the adjustable KFI filter are adjusted responsive to the measured CIR. The differential delay between each successive pair of non-zero weighting coefficients in the sparse kernel of the adjustable K-factor-improvement filter is adjusted responsive to the measured CIR. The adjustment is such that the KFI filter response has a principal component with substantially more energy than any of its echo components. The KFI filter response is applied as input signal to subsequent adaptive equalization filtering in the receiver.

Adaptive K-factor-improvement filter for receiver of radio signals subject to multipath distortion

Adaptive K-factor-improvement filter for receiver of radio signals subject to multipath distortion

Adaptive K-factor-improvement filter for receiver of radio signals subject to multipath distortion

Owner:LIMBERG ALLEN LEROY

Data-driven filtering of cepstral time trajectories for robust speech recognition

InactiveUS7035797B2Improve Speech Recognition EfficiencyEfficiently filter out the noise component in the speech parametersSpeech recognitionFrequency spectrumSpeech identification

A method and apparatus for speech processing in a distributed speech recognition system having a front-end and a back-end. The speech processing steps in the front-end are as follows: extracting speech features from a speech signal and normalizing the speech features in order to alter the power of the noise component in the modulation spectrum in relation to the power of the signal component, especially with frequencies above 10 Hz. A low-pass filter is then used to filter the normalized modulation spectrum in order to improve the signal-to-noise ratio (SNR) in the speech signal. The combination of feature vector normalization and low-pass filtering is effective in noise removal, especially in a low SNR environment.

Data-driven filtering of cepstral time trajectories for robust speech recognition

Data-driven filtering of cepstral time trajectories for robust speech recognition

Data-driven filtering of cepstral time trajectories for robust speech recognition

Owner:NOKIA CORP

Extraction of heart inter beat interval from multichannel measurements

ActiveUS20100249628A1Accurate and almost unnoticeableImprove measurement qualityCatheterMeasuring/recording heart/pulse rateFrequency spectrumPressure sensing

A monitoring apparatus comprising a multichannel pressure sensing sensor for measuring a ballistocardiographic signal of a human body is disclosed. The monitoring apparatus also comprises means for selecting a time window for heart inter beat interval including two consecutive heart beats to be estimated, defining a spectrum for the ballistocardiographic signal averaging between at least two measurement channels of the multichannel pressure sensing sensor, a cepstrum from the logarithm of a spectrum, and a heart inter beat interval. The invention relates to a method for defining a heart inter beat interval, where a ballistocardiographic signal of a body is measured with a multichannel pressure sensing sensor, a time window for heart inter beat interval Including two consecutive heart beats to be estimated is selected, a spectrum for the ballistocardiographic signal averaging between at least two measurement channels of the multichannel pressure sensing sensor, a cepstrum from the logarithm of a spectrum, and a heart inter beat interval are defined.

Extraction of heart inter beat interval from multichannel measurements

Extraction of heart inter beat interval from multichannel measurements

Extraction of heart inter beat interval from multichannel measurements

Owner:VALTION TEKNILLINEN TUTKIMUSKESKUS

A Robust Speech Feature Extraction Method Based on Sparse Decomposition and Reconstruction

ActiveCN102290047ASpeech recognitionMutual transformationFourier transform on finite groups

The invention discloses a robust speech characteristic extraction method based on sparse decomposition and reconfiguration, relating to a robust speech characteristic extraction method with sparse decomposition and reconfiguration. The robust speech characteristic extraction method solves the problems that 1, the selection of an atomic dictionary has higher the time complexity and is difficult tomeet the sparsity after signal projection; 2, the sparse decomposition of signals has less consideration for time relativity of speech signals and noise signals; and 3, the signal reconfiguration ignores the prior probability of atoms and mutual transformation of all the atoms. The robust speech characteristic extraction method comprises the following detailed steps of: step 1, preprocessing; step 2, conducting discrete Fourier transform and solving a power spectrum; step 3, training and storing the atom dictionary; step 4, conducting sparse decomposition; step 5, reconfiguring the speech spectrum; step 6, adding a Mel triangular filter and taking the logarithm; and step 7, obtaining sparse splicing of Mel cepstrum coefficients and a Mel cepstrum to form the robust characteristic. The robust speech characteristic extraction method is used for the fields of multimedia information processing.

A Robust Speech Feature Extraction Method Based on Sparse Decomposition and Reconstruction

A Robust Speech Feature Extraction Method Based on Sparse Decomposition and Reconstruction

A Robust Speech Feature Extraction Method Based on Sparse Decomposition and Reconstruction

Owner:哈尔滨工业大学高新技术开发总公司

Speaking man recognizing method using base frequency envelope to eliminate emotion voice

InactiveCN101178897AEasy to identifyAvoid inconvenienceSpeech recognitionBase frequencySpeech sound

The invention relates to a speaker recognition method for filtering the emotional tone by using pitch envelop. In the test for speaker recognition, the mutually corresponding cepstrum features and pitch frequency which are firstly extracted from a segment of tone; the gender information is obtained by testing on the gender model which is trained in advance according to the cepstrum features; the thresholds adopted in the method for filtering the emotional tone are determined by gender information; the pitch envelope is picked out according to the thresholds, and then the cepstrum features are filtered according to the serial number of each frame in pitch envelope, thus acquiring the processed cepstrum features; finally, the GMM system test is carried out on the processed cepstrum features.The beneficial effects of the invention are as follows: the inconvenience to the system which needs providing the emotional tone of the speaker in the training or the emotion information of the speech in the test in a traditional method is eliminated, and the recognition performance is increased by 8% compared with traditional ASR system.

Speaking man recognizing method using base frequency envelope to eliminate emotion voice

Speaking man recognizing method using base frequency envelope to eliminate emotion voice

Speaking man recognizing method using base frequency envelope to eliminate emotion voice

Owner:ZHEJIANG UNIV

Device and method for the early recognition and prediction of unit damage

InactiveUS7039557B2Vehicle testingSubsonic/sonic/ultrasonic wave measurementFourier transform on finite groupsOperation mode

In a method and apparatus for early detection and prediction of damage to assemblies in machine plants, including mobile machine plants, structure-borne sound of the machine system is sensed by a sensor, output as an acceleration signal and analyzed in a digital signal processor. For this purpose, the acceleration signal is first transformed into the frequency domain by means of a fast-Fourier transformation, and the data obtained in this manner are then transformed back into the time domain by means of cepstrum analysis, so that resonance data relating to individual shock pulses (a cepstrum) is obtained in the time domain. This cepstrum is then compared with a reference cepstrum that is selected in accordance with load signals and rotational speed signals for the present operating state in a new machine plant in a storage device. When limiting values are exceeded, the diagnostic signal (in particular information relating to the assembly which is diagnosed as damaged and its predicted remaining service life) are displayed for the user and an emergency operating mode is initiated.

Device and method for the early recognition and prediction of unit damage

Device and method for the early recognition and prediction of unit damage

Device and method for the early recognition and prediction of unit damage

Owner:RAMSLE TECH GROUP

Blind restoration method for moving blurred image

InactiveCN101359398AImprove clarityOvercoming blurry smearing problemsImage enhancementComputer graphics (images)Inner loop

Disclosed is a blind restoration method for the motion blurred image; the steps are as follows: (1) the image is converted through the cepstrum method to figure out the blur extent and the blur direction of the blurred image; (2) the blur extent and the blur direction of the blurred image figured out in step (1), and the total variation (TV) method are adopted to process the restoration towards the blurred image. The method based on the total variation (TV) includes: the fixed-point iteration used as the outer-loop and the conjugate gradient method used as the inner-loop are adopted for loop iteration to obtain the restored image. The blind restoration method has the advantages of strong self-adaptation, strong anti-noise ability and good robustness; the blind restoration method has validity and practicality according to the processing effects of the simulation picture and the photographed picture.

Blind restoration method for moving blurred image

Blind restoration method for moving blurred image

Blind restoration method for moving blurred image

Owner:BEIHANG UNIV

Rolling bearing fault feature extraction method based on independent component analysis and cepstrum theory

InactiveCN103471848AEfficient separationEasy to separateMachine bearings testingVibration accelerationFrequency spectrum

The invention provides a rolling bearing fault feature extraction method based on an independent component analysis and cepstrum theory. The rolling bearing fault feature extraction method comprises the steps of acquiring a vibration acceleration testing signal of a rolling bearing by using an acceleration sensor; decoupling and separating the vibration acceleration testing signal by using FastICA based on negentropy maximization; selecting a separated signal capable of representing fault feather information to the maximum extent; carrying out cepstrum analysis on the selected separated signal, and drawing a cepstrum chart; observing whether the cepstrum chart has a fault feature frequency or an obvious peak value at a frequency multiplication position, and furthermore, judging whether the rolling bearing has a fault. By using the rolling bearing fault feature extraction method, the feature information of a fault signal of the rolling bearing can be effectively recognized from a complex sideband signal, a periodical fault component in a sideband can be conveniently extracted, the fault information is remarkably enhanced, the fault diagnosis precision is greatly improved, the fault diagnosis time period is shortened, and the spectral analysis difficulty is simplified; in addition, the rolling bearing fault feature extraction method is easy to realize and good in real-time property.

Rolling bearing fault feature extraction method based on independent component analysis and cepstrum theory

Rolling bearing fault feature extraction method based on independent component analysis and cepstrum theory

Rolling bearing fault feature extraction method based on independent component analysis and cepstrum theory

Owner:HARBIN ENG UNIV

End-to-end speech emotion recognition method and system

ActiveCN110097894AFully reflectGuaranteed basic accuracySpeech analysisFeature extractionSpeech sound

The invention discloses an end-to-end speech emotion recognition method and system. The method comprises the steps: extracting the phoneme features of speech data; extracting cepstrum features of thespeech data; aligning the phoneme vector sequence and the cepstrum feature by taking a file as a unit, taking the phoneme vector sequence and the cepstrum feature as input, and performing end-to-end speech emotion recognition model training by utilizing a deep neural network; when the model is deployed, carrying out resampling and effective speech segment detection on any input speech data. By using the feature extraction process and the recognition model, end-to-end recognition can be performed on the speech data, the efficiency is higher, and the prediction is more accurate.

End-to-end speech emotion recognition method and system

End-to-end speech emotion recognition method and system

End-to-end speech emotion recognition method and system

Owner:FOCUS TECH

Isolated word speech recognition method based on HRSF and improved DTW algorithm

InactiveCN102982803AEasy to identifyRecognition speed is fastSpeech recognitionMel-frequency cepstrumSpeech identification

The invention discloses an isolated word speech recognition method based on an HRSF (Half Raised Sine Function) and an improved DTW (Dynamic Time Warping) algorithm. The isolated word speech recognition method comprises the following steps that (1), a received analog voice signal is preprocessed; preprocessing comprises pre-filtering, sampling, quantification, pre-emphasis, windowing, short-time energy analysis, short-time average zero crossing rate analysis and end-point detection; (2), a power spectrum X(n) of a frame signal is obtained by FFT (Fast Fourier Transform) and is converted into a power spectrum under a Mel frequency; an MFCC (Mel Frequency Cepstrum Coefficient) parameter is calculated; the calculated MFCC parameter is subjected to HRSF cepstrum raising after a first order difference and a second order difference are calculated; and (3), the improved DTW algorithm is adopted to match test templates with reference templates; and the reference template with the maximum matching score serves as an identification result. According to the isolated word speech recognition method, the identification of a single Chinese character is achieved through the improved DTW algorithm, and the identification rate and the identification speed of the single Chinese character are increased.

Isolated word speech recognition method based on HRSF and improved DTW algorithm

Isolated word speech recognition method based on HRSF and improved DTW algorithm

Isolated word speech recognition method based on HRSF and improved DTW algorithm

Owner:SOUTH CHINA NORMAL UNIVERSITY +1

Music voice tone changing method for stabilizing tone quality

InactiveCN106228973AStable soundSimple methodSpeech synthesisFrequency spectrumVocal tract

The invention relates to a music voice tone changing method for stabilizing tone quality, and the method comprises the steps: exporting a spectrum envelope through employing a voice signal, which can be divided into a glottis stimulation component and a channel stimulation response component, and the cepstrum sequence of the voice signal; separating the stimulation component of the voice signal through employing the spectrum envelope; carrying out the processing of the stimulation components of the voice signal through a tone changing algorithm, and changing the pitch of the voice signal; finally enabling the stimulation components to be synthesized again after the spectrum envelope and the pitch are changed, and obtaining a music voice signal with the changed pitch and the stable tone quality. The method provided by the invention is simple, is flexible in implementation, and is higher in practicality.

Music voice tone changing method for stabilizing tone quality

Music voice tone changing method for stabilizing tone quality

Music voice tone changing method for stabilizing tone quality

Owner:FUZHOU UNIV

Audio steganography method and apparatus using cepstrum modification

InactiveUS7555432B1Speech analysisSecret communicationFrequency spectrumMasking threshold

Audio steganography methods and apparatus using cepstral domain techniques to make embedded data in audio signals less perceivable. One approach defines a set of frames for a host audio signal, and, for each frame, determines a plurality of masked frequencies as spectral points with power level below a masking threshold for the frame. The two most commonly occurring masked frequencies f1 and f2 in the set of frames are selected, and a cepstrum of each frame is modified to produce complementary changes of the spectrum at f1 and f2 to correspond to a desired bit value. Another aspect of the invention involves determining a masking threshold for a frame, determining masked frequencies within the frame having a power level below threshold, obtaining a cepstrum of a sinusoid at a selected masked frequency, and modifying the frame by an offset to correspond to an embedded data value, the offset derived from the cepstrum.

Audio steganography method and apparatus using cepstrum modification

Audio steganography method and apparatus using cepstrum modification

Audio steganography method and apparatus using cepstrum modification

Owner:PURDUE RES FOUND INC

Gas pipeline leakage detection and identification method based on optical fiber sensing data excavation

ActiveCN107590516AReal-time online intelligent detectionImprove anti-interference abilityData processing applicationsCharacter and pattern recognitionFeature extractionAcoustic wave

The present invention discloses a gas pipeline leakage detection and identification method based on optical fiber sensing data excavation. The method comprises: employing distributed optical fiber sound waves / a vibration sensor to pick up leakage sound waves / vibration signals propagated along a pipeline, performing Mel cepstrum, AR model feature extraction and feature selection of the collected leakage sound waves / vibration signals of each space point, and establishing an association rule of the selected feature attributes and a type of a leakage event through an improved feature rule excavation method and positive and negative sample excavation to perform real-time online intelligent detection, identification and classification of a gas pipeline leakage event and solve a gas pipeline online leakage detection problem under a complex noise environment. The gas pipeline leakage detection and identification method based on optical fiber sensing data excavation can detect and identify simple pipeline leakage signals, and can detect and identify leakage signals mixed with different interference sources.

Gas pipeline leakage detection and identification method based on optical fiber sensing data excavation

Gas pipeline leakage detection and identification method based on optical fiber sensing data excavation

Gas pipeline leakage detection and identification method based on optical fiber sensing data excavation

Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

High-tension transmission line single-ended traveling wave fault distance detection method combined with time-frequency characteristics

InactiveCN103278747AEfficient separationImprove anti-interference abilityFault locationInformation technology support systemDistance detectionCable fault location

The invention discloses a high-tension transmission line single-ended traveling wave fault distance detection method combined with time-frequency characteristics. According to the method, time of reciprocating propagation between an end point and a fault point and inherent frequency corresponding to the time are fully used, wherein the end point and the fault point are simultaneously obtained in a cepstrum analysis. The advantages of a traveling wave method and the advantages of a traveling wave inherent-frequency distance detection method are centralized, and therefore the problems that in single-ended traveling wave fault location, wave velocity is difficult to calculate, propagation time of a small phase angle is difficult to seek, due to the fact that only the traveling wave inherent-frequency distance method is used, the frequency is not easy to extract, and in the calculation, an initial phase angle of a bus end reflection and an initial phase angle of fault point reflection need to be considered are solved. Meanwhile, due to the fact that other interference signals are irrelevant with traveling wave signals, the high-tension transmission line single-ended traveling wave fault distance detection method combined with the time-frequency characteristics has the advantages of being high in anti-interference capacity, easy to operate, high in distance detection accuracy and very suitable for distance detection on site.

High-tension transmission line single-ended traveling wave fault distance detection method combined with time-frequency characteristics

High-tension transmission line single-ended traveling wave fault distance detection method combined with time-frequency characteristics

High-tension transmission line single-ended traveling wave fault distance detection method combined with time-frequency characteristics

Owner:SOUTHEAST UNIV

Speaker's voice recognition system, method and recording medium using two dimensional frequency expansion coefficients

InactiveUS6934681B1Improve performanceLess effortSpeech recognitionShortest distanceReference patterns

A voice recognition system comprises an analyzer for converting an input voice signal to an input pattern including cepstrum, a reference pattern for storing reference patterns, an elongation / contraction estimating unit for outputting an elongation / contraction parameter in frequency axis direction by using the input pattern and the reference patterns, and a recognizing unit for calculating the distances between the converted input pattern from the converter and the reference patterns and outputting the reference pattern corresponding to the shortest distance as result of recognition. The elongation / contraction unit estimates an elongation / contraction parameter by using cepstrum included in the input pattern. The elongation / contraction unit does not have various values in advance for determining the elongation / contraction parameter, nor is it necessary for the elongation / contraction unit have to execute distance calculation for various values.

Speaker's voice recognition system, method and recording medium using two dimensional frequency expansion coefficients

Speaker's voice recognition system, method and recording medium using two dimensional frequency expansion coefficients

Speaker's voice recognition system, method and recording medium using two dimensional frequency expansion coefficients

Owner:NEC CORP

Audio-reverberation inhibiting device and inhibiting method thereof

ActiveCN103440869AImplement reverberation suppressionImprove auditory perception qualitySpeech analysisSound producing devicesComputation complexitySpectral subtraction

The invention discloses an audio-reverberation inhibiting device and an inhibiting method thereof. The device includes a reverberation-time blind estimation module, a later-period reverberation power spectrum estimation module, a spectral-subtraction module and a complex-cepstrum-domain filtering module. A reverberation voice estimates a reverberation time through the reverberation-time blind estimation module. The later-period reverberation power spectrum estimation module establishes a reverberation statistical model through the estimated reverberation time and carries out analysis processing on the reverberation voice so that a later-period reverberation power spectrum is obtained. The spectral-subtraction module includes a gain function structure and a spectral-subtraction implementation module and a spectral-subtraction gain function is constructed firstly through use of a reverberation-voice power spectrum and a later-period reverberation power spectrum. Then the spectral-subtraction gain function and the reverberation voice are input into the spectral-subtraction implementation module so that an earlier-period voice is obtained. Finally, the earlier-period voice is input into the complex-cepstrum-domain filtering module so that a reverberation-removed voice is obtained. The audio-reverberation inhibiting device and the inhibiting method thereof is low in calculation complexity, convenient to handle in a real-time manner and capable of inhibiting audio reverberation obviously and improving voice quality efficiently.

Audio-reverberation inhibiting device and inhibiting method thereof

Audio-reverberation inhibiting device and inhibiting method thereof

Audio-reverberation inhibiting device and inhibiting method thereof

Owner:DALIAN UNIV OF TECH

Method and apparatus for vibration-based automatic condition monitoring of a wind turbine

ActiveUS20090162186A1Avoid restrictionsAccurate valueVibration measurement in solidsPropellersVibration measurementFrequency spectrum

Method and apparatus for vibration-based automatic condition monitoring of a wind turbine, comprising the steps of: determining a set of vibration measurement values of the wind turbine; calculating a frequency spectrum of the set of vibration measurement values; calculating a cepstrum of the frequency spectrum; selecting at least one quefrency in the cepstrum, and detecting an alarm condition based upon an amplitude of the cepstrum at the selected quefrency, and a wind turbine therefor.

Method and apparatus for vibration-based automatic condition monitoring of a wind turbine

Method and apparatus for vibration-based automatic condition monitoring of a wind turbine

Method and apparatus for vibration-based automatic condition monitoring of a wind turbine

Owner:KK WIND SOLUTIONS VOJENS AS +1

Blurred image detection method fusing frequency spectrum information and cepstrum information

InactiveCN104282028AImage analysisCharacter and pattern recognitionImaging processingFrequency spectrum

The invention discloses a blurred image detection method fusing frequency spectrum information and cepstrum information, belongs to the technical field of image processing, and particularly relates to the detection technology of various blurred images. According to the blurred image detection method, first, an energy frequency spectrum distribution feature and a singularity cepstrum value histogram feature of an image are calculated, and serve as blur features of the image; second, a support vector machine classifier is selected for differentiating sharp image features from the blur image features, and collected images with demarcated blur categories is used for training corresponding parameters of the support vector machine classifier; finally, the trained support vector machine classifier is used for detecting whether an image to be detected is a blurred image. The blurred image detection method has the advantage that as a non-reference blurred image detection method, the blurred image detection method needs no reference image, thereby being wide in application range; meanwhile, the defined blur features have specific physical significance, and therefore the sharp image and the blurred image can be differentiated accurately.

Blurred image detection method fusing frequency spectrum information and cepstrum information

Blurred image detection method fusing frequency spectrum information and cepstrum information

Blurred image detection method fusing frequency spectrum information and cepstrum information

Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Speech enhancement method based on Gaussian mixture model (GMM) noise estimation

InactiveCN104464728AEasy to trackAccurate and pure voice signalSpeech recognitionTime domainFrequency spectrum

The invention discloses a speech enhancement method based on Gaussian mixture model (GMM) noise estimation, wherein the GMM is used for estimating background noise and a spectral subtraction coefficient, spectral subtraction is conducted on noisy speech, and pure speech is recovered. Firstly, the noisy speech is preprocessed so as to obtain the amplitude and phase of the noisy speech, the amplitude is used for noise estimation and spectral subtraction, and the phase is used for recovering a time-domain signal; then, the GMM is used for estimating noise parameters and pure speech cepstrum characteristics from the noisy speech in real time, and the spectral subtraction coefficient is calculated according to the estimated pure speech cepstrum characteristics; finally, spectral subtraction is conducted on the frequency spectrum of the noisy speech, the time-domain signal is recovered, and enhanced speech is obtained according to an overlap-add method. According to the speech enhancement method, the capability of the speech enhancement algorithm to track non-stationary noise can be improved remarkably.

Speech enhancement method based on Gaussian mixture model (GMM) noise estimation

Speech enhancement method based on Gaussian mixture model (GMM) noise estimation

Speech enhancement method based on Gaussian mixture model (GMM) noise estimation

Owner:HOHAI UNIV

Bi-ear time delay estimating method based on frequency division and improved generalized cross correlation

ActiveCN107479030AImprove positioning accuracyGood anti-reverberation performanceSpeech analysisPosition fixationSound sourcesTime delays

The invention provides a bi-ear time delay estimating method based on frequency division and improved generalized cross correlation in reverberation environment, and relates to the field of sound source positioning. A Gammatone filter is used to effectively simulate characteristics of a basal membrane of a human ear, voice signals are subjected to frequency division processing, and two-ear cross-correlation delay is estimated under a reverberation environment. Compared with a generalized cross correlation delay estimating method, the method can estimate time delay more accurately. The sound source positioning system has better robustness. A Gammatone filter is used to conduct frequency dividing processing for bi-ear signals, and each sub-band signal is subjected to inverse transformation to a time domain after reverberation processing of cepstrum and pre-filtering. Each sub-band signal of left and right ears are subjected to generalized cross correlation operation, an improved phase transformation weight function is employed in a generalized cross correlation algorithm to obtain cross correlation value of each sub-band for summing operation, and the bi-ear time difference corresponding to maximal cross correlation value is obtained.

Bi-ear time delay estimating method based on frequency division and improved generalized cross correlation

Bi-ear time delay estimating method based on frequency division and improved generalized cross correlation

Bi-ear time delay estimating method based on frequency division and improved generalized cross correlation

Owner:CHONGQING UNIV OF POSTS & TELECOMM

MFCC heart sound type recognition method based on improvement

ActiveCN104706321AEfficient identificationImprove recognition accuracyDiagnostic recording/measuringSensorsAbnormal heart soundsMedicine

The invention discloses an MFCC heart sound type recognition method based on an improvement. The method includes the following steps of firstly, preprocessing heart sound signals; secondly, segmenting the heart sound signals in a self-correlation mode; thirdly, conducting the MFCC extraction algorithm of the heart sound signals; fourthly, training and recognizing the heart sound signals. Compared with the prior art, the method has the advantages that by improving the cepstrum domain parameter, namely, the MFCC, the deep information capable of representing heart sound features of different types is extracted, normal heart sound signals and several types of abnormal heart sound signals are effectively recognized, the recognition accuracy is high, and the method is quite suitable for clinically assisting in diagnosing cardiovascular diseases.

MFCC heart sound type recognition method based on improvement

MFCC heart sound type recognition method based on improvement

Owner:SICHUAN CHANGHONG ELECTRIC CO LTD

Popular searches

Integrated circuit Recognition system Cepstrum Time to market Modular design Modularity Application-specific integrated circuit Smart products Speech processing Whole systems

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com