The invention relates to a
Chinese word sense disambiguation method based on a graph
convolutional neural network (GCN) fused with a
support vector machine (SVM), in particular to a
Chinese word sense disambiguation method based on the graph
convolutional neural network (GCN) fused with the
support vector machine (SVM) and a
Chinese word sense disambiguation method based on the graph
convolutional neural network (GCN) fused with the Chinese
word sense disambiguation method based on the graph convolutional neural network (GCN) fused with the
support vector machine (SVM). The method comprises the steps of firstly preprocessing corpora; and performing word segmentation, part-of-speech tagging and semantic tagging
processing on statements of the training and testing corpora. A
word sense disambiguation graph is constructed by taking sentences where ambiguous words are located and word forms, part-of-speech and semantic classes of vocabulary units on two sides of the ambiguous words as disambiguation features and taking the disambiguation features as nodes. Weights of nodes and edges in the graph are calculated by using Word2Vec, a Doc2Vec tool, point-by-point
mutual information (PMI) and a TF-IDF
algorithm. And training the GCN model by the training corpus, and optimizing the model. And calculating disambiguation features of training and testing corpora by using the optimized GCN model, inputting the calculated disambiguation features of the training corpora into an
SVM classifier, optimizing the
SVM classifier, and classifying the testing corpora to obtain classification conditions of ambiguous vocabularies under semantic categories. The method has a good
word sense disambiguation effect, and the real meaning of the ambiguous vocabulary is accurately judged.