Headline clickbait identification method and device, server and storage medium
A recognition method and title party technology, applied in the Internet field, can solve the problems of poor generalization ability, low recognition accuracy, large accidental injury, etc., and achieve the effect of high accuracy and high recall.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0025] figure 1 The flow chart of the headline party identification method provided by Embodiment 1 of the present invention, this embodiment is applicable to the situation where the title party needs to be identified, and the method can be executed by a title party identification device, which can use software and / or implemented in hardware. Such as figure 1 As shown, the method specifically includes:
[0026] Step 110, extract the text statistical features and semantic features of the title.
[0027] Title party is a type of title with click bait. This type of title usually uses some prominent text features such as exaggeration, phrases or short sentences that have a large gap with reality to attract readers’ attention. In addition, this type of title also has its unique semantic features. Therefore, we can use the textual features, semantic features or a combination of the two to judge whether the title is a title party.
[0028] In this embodiment, in order to accurat...
Embodiment 2
[0045]This embodiment provides a preferred implementation of step 110 on the basis of embodiment one. The text statistical features in embodiment one include the number of punctuation marks, the number of stop words, the number of regional words, and the number of lure words At least one of the number of pronouns, the number of pronouns, or the number of lure segments. In this embodiment, only the number of lure segments in text statistical features is used as an example for illustration. figure 2 It is a flow chart of the title party identification method provided by Embodiment 2 of the present invention, such as figure 2 As shown, the method includes:
[0046] Step 210: Segment the title according to the punctuation marks in the title to obtain at least one segmented phrase.
[0047] The title usually contains punctuation marks. In this embodiment, the title can be divided into at least one short sentence by using the punctuation marks in the title. Exemplarily, the titl...
Embodiment 3
[0071] image 3 It is a structural schematic diagram of the headline party identification device in the third embodiment of the present invention. Such as image 3 As shown, the title party identification devices include:
[0072] The feature extraction module 310 is used to extract the text statistical features and semantic features of the title, wherein the text statistical features may preferably include the number of punctuation marks, the number of stop words, the number of regional words, the number of lure words, and the number of pronouns Or at least one of the number of lure fragments.
[0073] The decision-making scoring module 320 is configured to use a pre-trained decision-making model, take text statistical features and semantic features as input to the decision-making model, and output the decision-making score of the title.
[0074] The score comparison module 330 is configured to compare the decision score with a first preset threshold, and determine whether...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com