The invention discloses a distributed origin guarantee regular path query
algorithm based on Pregel. The distributed origin guarantee regular path query
algorithm based on the Pregel includes the steps that 1, regarding a given regular path query Q=(x, r, y), according to a
regular expression r, first, last and follow sets are calculated; 2, a Glushkov
automaton A=(St, sigma,
delta, q0, F) equivalent to the
regular expression r is further built; 3, a Pregel
information transmission model is matched with the regular path query in
RDF graph data, and through an
information transmission mode, result paths are obtained after calculation is conducted by taking a peak as a center; 4, all the result paths meeting the
regular expression r are counted to serve as query results. According to the distributed origin guarantee regular path query
algorithm based on the Pregel, the Glushkov
automaton can be adopted, the origin guarantee regular path query is conducted on the large-scale
RDF graph data, and by introducing optimizing strategies, the purpose of reducing query time and intermediate results to improve algorithm expansibility is achieved.