Corpus generation method and device and man-machine interaction processing method and device
A technology of human-computer interaction and corpus, which is applied in the computer field, can solve problems such as poor use of question answering systems, achieve high retrieval efficiency, good answer accuracy, and improve response speed
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0026] refer to figure 1 , shows a flowchart of steps of a method for generating a corpus according to Embodiment 1 of the present invention.
[0027] The corpus generation method of the present embodiment comprises the following steps:
[0028] Step S102: Generate an initial corpus vector according to the acquired initial corpus.
[0029] The initial corpus can be one or a combination of text data, image data, voice data and other data expressed in natural language. The initial corpus vector may be a vector corresponding to the initial corpus.
[0030] Wherein, those skilled in the art can generate the initial corpus vector according to the acquired initial corpus in an appropriate manner according to actual needs. For example, the initial corpus vector is generated according to the initial corpus through the Word2vec algorithm; the initial corpus vector can also be generated according to the initial corpus through the BOW (bag-of-Word) model; or the initial corpus vector ...
Embodiment 2
[0040] refer to figure 2 , shows a flow chart of steps of a method for generating a corpus according to Embodiment 2 of the present invention.
[0041] The corpus generation method of the present embodiment comprises the following steps:
[0042] Step S202: Generate an initial corpus vector according to the acquired initial corpus.
[0043] The initial corpus may be text data, image data and / or voice data expressed in natural language. The initial corpus vectors may be vectors corresponding to these initial corpus.
[0044] Wherein, those skilled in the art can generate the initial corpus vector according to the acquired initial corpus in an appropriate manner according to actual needs. For example, the initial corpus vector is generated based on the initial corpus through the Word2vec algorithm; the initial corpus vector can also be generated based on the initial corpus through the BOW (bag-of-Word) model.
[0045] In order to ensure the accuracy of the question and answ...
Embodiment 3
[0093] refer to image 3 , shows a structural block diagram of a corpus generation device according to Embodiment 3 of the present invention.
[0094] The corpus generating device of the present embodiment includes: a vector type determining module 301, which is used to generate an initial corpus vector according to the acquired initial corpus, and determines the vector type of each initial corpus vector; an initial corpus generating module 302, which is used to generate an initial corpus vector according to the described The vector type and the initial corpus vectors generate an initial corpus with an inverted chain index.
[0095] The corpus generation device generates an initial corpus with an inverted chain index structure, and clusters and stores the initial corpus vectors with the same vector type, so that the corpus generated by the corpus generation method The storage space occupied by the initial corpus is smaller, the retrieval efficiency is higher during retrieval,...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com