System and method for acquiring Web API knowledge based on Stack Overflow website
A website and knowledge technology, applied in character and pattern recognition, special data processing applications, instruments, etc., to achieve the effect of improving prediction accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0044] At the time of data collection and filtering, the data file posts.xml was downloaded from the publicly available data dump on the Stack Overflow website, which contained all questions and answers posted between August 2008 and February 2019. For each Web API, the first step of data collection is carried out by combining keyword search and tag search, such as the YouTube API, collecting all questions containing the keyword "youtube" and tags related to the API; and then eliminating Remove some irrelevant data, which only contain keywords in code segments or HTML hyperlinks; finally select a label that is most relevant to the Web API, and use the corresponding data as a positive sample, combined with the remaining unlabeled samples, Use the semi-supervised learning method of PUL (Positive and Unlabeled Learning) to filter positive sample data from these unlabeled samples, and use all positive sample data as the data set of the Web API.
[0045] When classifying the proble...
Embodiment 2
[0048] A system for obtaining Web API knowledge from the Stack Overflow website provided by the present invention includes: data collection and filtering modules, problem category classification modules and performance measurement and prediction modules, such as figure 1 shown.
[0049] First, the data collection and filtering module downloads the data file posts.xml from the data dump disclosed by the Stack Overflow website, which contains all questions and answers issued between a specific time period, and this embodiment contains data from August 2008 All questions and answers posted between April and February 2019. For each Web API, the first step of data collection is carried out through a combination of keyword search and tag search, such as the YouTube API, which collects all questions containing the keyword "youtube" and tags related to the API; then eliminates Remove some irrelevant data, which only contain keywords in code segments or HTML hyperlinks; finally select...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com