The invention discloses a webpage text extraction method based on logic link blocks. In extracting the webpage template and the text, the method depends only on the current webpage and does not need the support of heuristic rules, so it has good universality. The extraction of the webpage template requires no manual intervention, so the degree of automation is high. The analysis process is simple and no tag parsing of the webpage is required, so the analysis is fast, resistant to interference, and better able to handle Web pages with nonstandard design. The method also extracts well from pages whose text content is very short. Finally, the template extracted by the method is simple in form and easy to use. The method therefore has potential application value in Web page text extraction: it can be used to extract text from news pages, blogs, and webpages with similar structures, and it also has broad application prospects in other Web information processing and mining fields that do not require fine-grained link blocks.