Structured model training method, text structuring method and related devices
A text structure and structured technology, applied in the field of data processing, can solve problems such as labor costs
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0120] Please combine figure 1 To understand, the text structuring method provided in this embodiment will be described in detail below. The text structuring method mainly includes two parts. The first part is to train the structured model, and the second part is to structure the text. Representation.
[0121] First, train the structured model;
[0122] The structured model includes an entity extraction model for extracting entities and an extraction model for extracting relationships between the entities. The training method includes the following steps:
[0123] Step 101: Obtain a labeled first corpus set, the first corpus set being obtained by performing entity corpus labeling on each text in the first text set according to a first preset rule.
[0124] The first text collection includes, but is not limited to, technical documents, patents, academic papers, etc. The first text collection in the embodiments of the present application is described by taking a patent as an example. F...
Embodiment 2
[0220] See Figure 5 As shown, the embodiment of the present application also provides a method for determining text similarity. The method in this example is applied to an electronic device. The electronic device may be a server or a terminal. The method may include the following steps:
[0221] Step 301: Obtain a target text and a candidate data set. The candidate data set includes a plurality of arrays, each of the plurality of arrays represents a semantic vector of an entity; the entity is included in the candidate text.
[0222] The server may receive the target text sent by the terminal, for example, the target text may be a patent.
[0223] The specific method for the server to obtain the candidate data set includes at least the following two methods:
[0224] In the first possible implementation:
[0225] First, obtain a text collection. The text collection includes n candidate texts, where n is an integer greater than or equal to 2. It is understandable that the text collection...
Embodiment 3
[0327] See Figure 7 As shown, the embodiment of the present application also provides a method for determining the novelty of a text. The method is applied to an electronic device. The electronic device can be a server or a terminal. In this embodiment, the electronic device can be a terminal As an example, the method specifically includes the following steps:
[0328] Step 401: Determine the target text.
[0329] For example, the target text may be a patent or a paper. In this embodiment, the target text is explained by taking a patent as an example.
[0330] Step 402: Extract multiple target entities in the target text to obtain a set of target entities.
[0331] In this example, multiple target entities in the target text are extracted through the entity extraction model in Embodiment 1. Specifically, the target text is input to the entity extraction model, and the target text is identified through the entity extraction model Multiple target entities in, the multiple target entit...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com