Intelligent opinion clue collection method and system based on data repetition
A technology of data duplication and clues, applied in data processing applications, digital data processing, natural language data processing, etc., can solve the problems of duplication of opinion clue data and long processing cycle, so as to solve the duplication of opinion clue data and reduce the time. , the effect of improving work efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0048] as attached figure 1 As shown, the intelligent collection method of opinion clues based on data repetition degree of the present embodiment, the method is specifically as follows:
[0049] S1. Obtain the key indicators for judging the repetition rate in the opinion clue data, and pre-process the key indicators;
[0050] S2. Use the Levenshtein Distance algorithm to calculate the repetition rate of key indicators;
[0051] S3, batch processing the opinion clue data collected into one category.
[0052] The key indicators in step S1 of this embodiment include the object of the opinion thread, the content of the opinion thread, the location of the opinion thread, and the time of the opinion thread.
[0053] The preprocessing of key indicators in step S1 of this embodiment is as follows:
[0054] Perform word segmentation on the content of opinion threads.
[0055] The repetition rate of using the Levenshtein Distance algorithm to calculate the key index in the step S2 ...
Embodiment 2
[0075] The intelligent collection system of opinions and clues based on data repetition degree of this embodiment, the system includes:
[0076] The acquisition module is used to obtain the key indicators for judging the repetition rate in the opinion clue data, and preprocess the key indicators; wherein, the key indicators include the opinion clue object, the opinion clue content, the opinion clue territory and the opinion clue time;
[0077] The calculation module is used to calculate the repetition rate of key indicators using the Levenshtein Distance algorithm;
[0078] The processing module is used for batch processing the opinion clue data grouped into one category.
Embodiment 3
[0080] Embodiments of the present invention also provide an electronic device, including: a memory and at least one processor;
[0081] wherein, the memory stores computer-executed instructions;
[0082] The at least one processor executes the computer-executable instructions stored in the memory, so that the at least one processor executes the method for intelligent collection of opinion clues based on data repetition degree in any embodiment of the present invention.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com