Method and system for processing unstructured data
A technology of unstructured data and processing methods, which is applied in the fields of unstructured text data retrieval, electronic digital data processing, special data processing applications, etc. question
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0061] Such as figure 1 As shown, the unstructured data processing method of this embodiment includes the following steps:
[0062] S 1 , Setting multiple feature templates, each feature template includes keywords;
[0063] S 2 1. Use each feature template to scan a database storing multiple pieces of unstructured data, respectively judge whether there is content consistent with each feature template for each piece of unstructured data, and use the feature template whose judgment result is yes as Feature template records matched by each piece of unstructured data;
[0064] S 3 1. Generate a plurality of template vectors corresponding to the plurality of unstructured data respectively, each template vector has a plurality of dimensions corresponding to the plurality of feature templates one by one, in the plurality of dimensions, each unstructured The scalar value of the dimension corresponding to the feature template that the data matches is 1, and the scalar value of the...
Embodiment 2
[0069] Such as figure 2 As shown, compared with Embodiment 1, the unstructured data processing method of this embodiment differs only in that the method of this embodiment also includes 3 After performing the following steps:
[0070] S 4 , Read the features to be mined;
[0071] S 5 , judging whether there is a feature template consistent with the feature to be mined in the plurality of feature templates, if so, execute S 6 , otherwise execute S 7 ;
[0072] S 6 1. Select a feature template that is consistent with the feature to be mined to match the multiple template vectors, select the template vector that matches successfully as the vector to be output, and execute S 9 ;
[0073] S 7 , generating a feature template combination to represent the feature to be mined, the feature template combination being a number of feature templates connected by logical operators;
[0074] S 8 , using the feature template combination to match the multiple template vectors, selec...
Embodiment 3
[0081] Compared with Embodiment 2, the unstructured data processing method of this embodiment differs only in the method of this embodiment, S 2 It also includes: recording the number of occurrences of content consistent with each feature template in each piece of unstructured data.
[0082] S 3 by S 3a Substitute, S 3a To: generate a plurality of template vectors corresponding to the plurality of pieces of unstructured data respectively, each template vector has a plurality of dimensions corresponding to the plurality of feature templates one by one, and the labels of the plurality of dimensions of each template vector The magnitudes are respectively the number of occurrences of content consistent with the corresponding plurality of feature templates in the corresponding unstructured data.
[0083] Moreover, part of the plurality of feature templates is a retrieval formula including keywords and logical operators. For example, there is a feature template "European and Ame...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com