The invention discloses a method and
system for predicting stocks based on
big data published by
the internet. The method comprises the following steps:
crawling related information of the stocks before a
business day; and then performing the
feature extraction using the crawled data, constructing a training dataset, and using a
Group Lasso to perform prediction model training, wherein the evaluation standard of the model is
yield rate in a period of time in the
operation mode of selling stocks purchased in late trading day and
purchasing the stocks recommended at the current trading day at the opening every day; and then constructing a new testing set according to the data crawled at the trading day, predicting using the prediction model trained in former step to obtain the finally recommended stocks. Through the adoption of the method and
system disclosed by the invention, a new, useful and reliable information source is provided for quantitative stock selection or
stock prediction, the adding of above information can more reflect the market in combination with the traditional information; on the basis of method and
system, the
stock prediction model obtained using the
machine learning technique can more capture the internal operation mechanism of the market, and the benefit of the investor can be effectively improved.