Question and answer content extraction method and system in programming environment, electronic equipment and medium
A programming environment and extraction system technology, applied in the Internet field, can solve problems such as extraction by scholars, achieve the effects of reducing costs, improving development efficiency, and reducing browsing time
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0029] Embodiment 1: The present invention proposes an extraction system for question and answer content under a programming environment, including:
[0030] The data processing module is used to perform: preprocessing the input network question and answer text data, removing useless information and performing word segmentation;
[0031] An entity recognition module, configured to perform: performing entity recognition in the field of software engineering on the text processed by the data processing module;
[0032] The document reading module is used to execute: input the text identified by the entity recognition module into the neural network for document reading;
[0033] The summary extraction module is used for execution: using another neural network to extract the key content in the question and answer text.
[0034] Preferably, the specific execution of the data processing module includes: initial state; processing code segments in the question-and-answer text; processin...
Embodiment 2
[0037] Embodiment 2: the present invention also proposes the extraction method of question and answer content under the programming environment, and the overall framework of the present invention is as follows figure 1 As shown, a method for extracting question and answer content under a programming environment proposed by the present invention includes the following 4 steps:
[0038] Step 1: For Q&A text on the web, first clear all tab, since the code snippet in the Q&A appears in the tab, clear The content in the tag also clears the code segment; then delete all the html tags, for example And so on; then replace the URL that appears in the text with "@u@", replace the expression that appears such as ":)" with "@e@", and replace the "@" that appears with other users' content with "@a@ "Replace; Finally, use the nltk word segmentation tool to segment the text. The word segmentation needs to take the API name as a whole. For example, os.path.join(path) needs to be separat...
Embodiment 3
[0048] Embodiment 3: The present invention also proposes an electronic device, including a memory, a processor, and a computer program stored on the memory and operable on the processor, wherein the method is implemented when the processor executes the program step.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com