The invention discloses a cross-media Chinese herbal
medicine plant image searching method based on
deep learning. The method comprises the following steps that 1, on the basis of OCR, text structural
processing is performed, and
plant classification description characters are extracted from books such as '
plant taxonomy'; 2, a
Chinese word segmentation tool is used for performing preprocessing including word segmentation and stop word removal on all plant classification description characters; 3, a word2vec
algorithm is used for generating word vectors according to a description text; 4,
Fisher Vector is used for encoding the description text; 5, a
convolutional neural network is used for performing training on an image set, and the
network structure is made to converge to the best state; 6, the full last but one connection layer output of the
convolutional neural network is extracted as an image
feature vector; 7, textual features and image features are fused; 8, a linear kernel
SVM classifier is used for performing model training; 9, during user retrieval, images can be input, plant texts can be described, and then a final
image retrieval result is obtained through the steps 4, 6, 7 and 8.