Chinese word segmentation method and device, electronic equipment and storage medium
A Chinese word segmentation and text sequence technology, which is applied in electrical digital data processing, instruments, calculations, etc., can solve the problem of high time complexity, and achieve the effect of reducing the amount of calculation, shortening the time consumed, and improving work efficiency.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0052] This embodiment provides a Chinese word segmentation method, figure 1 It is a flowchart illustrating the extraction, mapping and judgment of the text sequence to be processed according to some embodiments of the present invention. Although the processes described below include operations in a particular order, it should be clearly understood that these processes may also include more or fewer operations, which may be performed sequentially or in parallel (e.g., using parallel processors) or multi-threaded environment). Such as figure 1 As shown, the method includes:
[0053] S101. Acquire a text sequence to be processed, where the text sequence to be processed includes a plurality of sequentially arranged characters.
[0054] In the above implementation steps, the word segmentation of the Chinese text sequence is usually to distinguish the word boundary of a sentence in an article or several paragraphs, and extract the continuous characters in the article or several ...
Embodiment 2
[0095] This embodiment provides a Chinese word segmentation device, which is used to perform word segmentation processing on the text sequence to be processed, such as figure 2 shown, including:
[0096] The acquiring module 201 is configured to: acquire a text sequence to be processed, the text sequence to be processed includes a plurality of sequentially arranged characters; for details, please refer to the relevant description of step S101 in Embodiment 1, which will not be repeated here.
[0097] The extraction module 202 is configured to: extract the feature vector corresponding to each character in the text sequence to be processed to obtain a feature vector group; for details, please refer to the relevant description of step S102 in Embodiment 1, which will not be repeated here.
[0098] A mapping module 203, configured to map each eigenvector in the eigenvector group to a two-dimensional vector, wherein the two-dimensional vector includes a first dimension value and a...
Embodiment 3
[0102] This embodiment provides an electronic device, such as image 3 As shown, the device includes a processor 301 and a memory 302, wherein the processor 301 and the memory 302 can be connected through a bus or in other ways, image 3 Take connection via bus as an example.
[0103] The processor 301 may be a central processing unit (Central Processing Unit, CPU). The processor 301 may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), graphics processors (Graphics Processing Unit, GPU), embedded neural network processors (Neural-network Processing Unit, NPU) or other Dedicated deep learning coprocessor, application specific integrated circuit (Application Specific Integrated Circuit, ASIC), field programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components and other chips, or a combination of the above-mentioned ty...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com