The invention relates to a visual feature representation method based on an
autoencoder and a bag-of-words model. The method comprises the following steps: training samples are input to form a
training set; the training samples in the
training set are preprocessed to reduce the influence of illumination,
noise, and the like on the accuracy of the image representation; a
visual dictionary is generated, wherein an
autoencoder extracts features from randomly sampled image blocks, a clustering method then clusters these features into a plurality of visual words, and the
visual dictionary is composed of the visual words; a sliding window is used to sequentially collect image blocks from the images in the
training set, the collected image blocks serve as the input of the
autoencoder, and the output of the autoencoder constitutes the local features of the images; the local features of the images are quantized into visual words according to the
visual dictionary; the frequency of each visual word is counted, a
visual word histogram is generated, and the
visual word histogram serves as the overall visual feature representation of the image. With this visual feature representation method, the autoencoder learns the feature representation autonomously, and the BoVW (bag of visual words) framework reduces the number of training samples required.
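
By way of non-limiting illustration, the steps above may be sketched in Python with NumPy as follows. The sketch is not the claimed implementation; several details are assumptions absent from the abstract. The autoencoder is assumed to have a single sigmoid hidden layer whose activations serve as the local features, the clustering method is assumed to be plain k-means, and the preprocessing is assumed to be zero-mean, unit-variance scaling. The block size, stride, hidden width, dictionary size, and histogram normalization are likewise assumed, and all function names are hypothetical. Synthetic random images stand in for real training samples.

```python
import numpy as np

def preprocess(image):
    # Assumed preprocessing: zero-mean, unit-variance scaling to damp illumination changes.
    return (image - image.mean()) / (image.std() + 1e-8)

def sliding_patches(image, size, stride):
    # Collect size x size blocks in raster order; each block becomes one flattened row.
    h, w = image.shape
    return np.array([image[r:r + size, c:c + size].ravel()
                     for r in range(0, h - size + 1, stride)
                     for c in range(0, w - size + 1, stride)])

def random_patches(images, size, count, rng):
    # Sample `count` blocks at random positions from randomly chosen training images.
    rows = []
    for _ in range(count):
        img = images[rng.integers(len(images))]
        r = rng.integers(img.shape[0] - size + 1)
        c = rng.integers(img.shape[1] - size + 1)
        rows.append(img[r:r + size, c:c + size].ravel())
    return np.array(rows)

def train_encoder(x, hidden, rng, epochs=200, lr=0.1):
    # Assumed architecture: sigmoid encoder, linear decoder, trained by batch
    # gradient descent on the squared reconstruction error.
    n, d = x.shape
    w1, b1 = rng.normal(0, 0.1, (d, hidden)), np.zeros(hidden)
    w2, b2 = rng.normal(0, 0.1, (hidden, d)), np.zeros(d)
    for _ in range(epochs):
        h = 1 / (1 + np.exp(-(x @ w1 + b1)))
        err = (h @ w2 + b2) - x                # reconstruction error
        grad_h = (err @ w2.T) * h * (1 - h)    # backpropagate through the sigmoid
        w2 -= lr * h.T @ err / n
        b2 -= lr * err.mean(0)
        w1 -= lr * x.T @ grad_h / n
        b1 -= lr * grad_h.mean(0)
    # Return only the encoder; its hidden activations are used as features.
    return lambda p: 1 / (1 + np.exp(-(p @ w1 + b1)))

def quantize(feats, words):
    # Index of the nearest visual word (squared Euclidean distance) per feature.
    dist = ((feats[:, None, :] - words[None, :, :]) ** 2).sum(-1)
    return dist.argmin(1)

def build_dictionary(feats, k, rng, iters=20):
    # Assumed clustering method: plain k-means (Lloyd's algorithm).
    words = feats[rng.choice(len(feats), k, replace=False)].copy()
    for _ in range(iters):
        labels = quantize(feats, words)
        for j in range(k):
            if np.any(labels == j):
                words[j] = feats[labels == j].mean(0)
    return words

def bovw_histogram(image, encode, words, size, stride):
    # Sliding-window blocks -> local features -> visual words -> normalized counts.
    labels = quantize(encode(sliding_patches(image, size, stride)), words)
    hist = np.bincount(labels, minlength=len(words)).astype(float)
    return hist / hist.sum()

rng = np.random.default_rng(0)
images = [preprocess(rng.random((32, 32))) for _ in range(5)]  # synthetic stand-in data
sample = random_patches(images, size=8, count=300, rng=rng)
encode = train_encoder(sample, hidden=16, rng=rng)
words = build_dictionary(encode(sample), k=10, rng=rng)
hist = bovw_histogram(images[0], encode, words, size=8, stride=4)
print(hist.shape, hist.sum())
```

In this sketch the same randomly sampled blocks are used both to train the autoencoder and to build the visual dictionary, and the resulting histogram has one bin per visual word. In practice the histogram would typically be passed to a classifier or a retrieval index, which lies outside the steps described above.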