Method and device for new word identification

A new word recognition technique, applied in the Internet field, that addresses two problems: frequently occurring multi-fragment collocation errors being misidentified as new words, and low-frequency new words not being recognized in time because of insufficient statistical information.

Active Publication Date: 2015-11-25
北京鸿享技术服务有限公司

AI Technical Summary

Problems solved by technology

[0004] The above technical solution has two deficiencies. First, frequently occurring sequences of fragments that are merely common collocation errors are easily misidentified as new words. Second, new words that appear at low frequency are often not recognized in time because of insufficient statistical information.


Examples

Embodiment Construction

[0025] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0026] As shown in Figure 1, one embodiment of the present invention provides a new word recognition method, which comprises:

[0027] Step 110: extract unmatched consecutive fragments from the search query submitted by the user. In the technical solution of this embodiment, a fragment can be a character, a word, or even a punctuation mark; and whether the fragments in the search query can be matc...
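
Step 110 can be illustrated with a minimal Python sketch. The paragraph above is truncated before it states how a fragment is judged to be matched, so the sketch assumes, purely for illustration, that a fragment counts as matched when it appears in a lexicon of known words. The names `unmatched_runs`, `lexicon`, and `min_len` are hypothetical and do not come from the patent text.

```python
from typing import Iterable, List, Set


def unmatched_runs(
    fragments: Iterable[str],
    lexicon: Set[str],
    min_len: int = 2,
) -> List[List[str]]:
    """Return runs of consecutive fragments that are not in the lexicon.

    Assumption: "unmatched" means "absent from a known-word lexicon".
    Runs shorter than min_len are dropped, since a new-word candidate
    here is built from several consecutive fragments.
    """
    runs: List[List[str]] = []
    current: List[str] = []
    for frag in fragments:
        if frag in lexicon:
            # A matched fragment ends the current unmatched run.
            if len(current) >= min_len:
                runs.append(current)
            current = []
        else:
            current.append(frag)
    # Flush a run that reaches the end of the query.
    if len(current) >= min_len:
        runs.append(current)
    return runs
```

For a query already split into the fragments `["new", "foo", "bar", "movie"]` with a lexicon containing `"new"` and `"movie"`, the only candidate run would be `["foo", "bar"]`.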

Abstract

The invention provides a method and a device for new word identification. The method comprises: extracting a plurality of unmatched consecutive fragments from a search query submitted by a user; collecting statistics on the correspondence between those fragments and the content of the clicked search result web pages for that query; and judging, according to the correspondence, whether the consecutive fragments in the search query should be identified as a new word. By collecting and analyzing the correspondence between the fragments in the search query and the search result web pages, the method and device can determine whether the fragments of the search query form a new word.
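
The statistics and judgment steps in the abstract can be sketched as follows. The abstract does not say how the correspondence is measured or what decision rule is applied, so this sketch makes two labeled assumptions: a clicked page "corresponds" to the fragments when their concatenation appears verbatim in the page text, and the fragments are accepted as a new word when enough clicked pages were observed and a sufficient share of them correspond. The function name `is_new_word` and the parameters `min_pages`, `min_ratio`, and `joiner` are hypothetical.

```python
from typing import Iterable, List


def is_new_word(
    fragments: List[str],
    clicked_pages: Iterable[str],
    min_pages: int = 3,
    min_ratio: float = 0.6,
    joiner: str = "",
) -> bool:
    """Judge whether consecutive fragments should be treated as one new word.

    Assumptions (not from the patent text):
    - correspondence = the joined fragments occur verbatim in a clicked page;
    - decision = at least min_pages clicked pages were seen and at least
      min_ratio of them contain the joined fragments.
    """
    candidate = joiner.join(fragments)
    total = 0
    hits = 0
    for page in clicked_pages:
        total += 1
        if candidate in page:
            hits += 1
    if total < min_pages:
        # Too little click data to make a judgment either way.
        return False
    return hits / total >= min_ratio
```

Because the evidence comes from pages that users actually clicked for the query, a rule of this kind can in principle accept a candidate after only a handful of clicks rather than waiting for corpus-wide frequency counts; the thresholds shown are placeholders, not values from the patent.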

Description

Technical field

[0001] The present invention relates to the technical field of the Internet, and in particular to a new word recognition method and device.

Background

[0002] In the field of search technology, new words are constantly being generated, so discovering them in time is an important problem.

[0003] At present, most technical solutions for discovering new words compute various statistical indicators through statistical analysis of web page content, and then find candidate new words from those indicators.

[0004] The above technical solution has two deficiencies. First, frequently occurring sequences of fragments that are merely common collocation errors are easily misidentified as new words. Second, new words that appear at low frequency are often not recognized in time because of insufficient statistical information.

Summary of the invention

[0005] In view of the above problems, the present invention is proposed to provide a new word recognition method and device that overcome the above problems or at least partly solve the a...

Application Information

IPC(8): G06F17/30
CPC: G06F16/313, G06F16/951
Inventor: 陈进平
Owner: 北京鸿享技术服务有限公司