The invention relates to a database searching method and a database searching system for open-type protein identification. The
database searching method comprises the following steps: step 1, inputting protein sequences, performing simulated digestion on each protein sequence, sorting all generated subsequences by mass to generate a peptide sequence data list, and establishing an index file from the peptide sequence data list; step 2, inputting mass spectra, extracting spectral peaks from each mass spectrum to generate a search set, searching the index file, and obtaining a sequence set; step 3, generating candidate peptides for each mass spectrum by combining modifications with the sequence set corresponding to that mass spectrum, and scoring the candidate peptides; step 4, integrating the scoring results, inferring proteins from the peptides, and obtaining an identification result. According to the database searching method disclosed by the invention,
protein identification can be carried out without the user specifying the digestion and modification types, or with the user specifying any of these types, so that the database searching method addresses the identification problem for arbitrary digestion and modification types.
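
As an illustration of step 1 only, the following is a minimal sketch and not the disclosed implementation. It assumes non-specific digestion (every contiguous subsequence within a length window is kept), standard monoisotopic residue masses, and an in-memory list in place of the index file. The names `enumerate_subsequences`, `build_index`, and `lookup_by_mass`, and the parameters `min_len`, `max_len`, and `tol`, are hypothetical. The mass-window lookup is likewise an assumption, added to show what a mass-sorted list makes cheap; the abstract does not state how the index is queried.

```python
from bisect import bisect_left, bisect_right

# Standard monoisotopic residue masses in daltons.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # one water is added per free peptide


def peptide_mass(seq):
    """Monoisotopic mass of an unmodified peptide."""
    return sum(MONO[aa] for aa in seq) + WATER


def enumerate_subsequences(protein, min_len, max_len):
    """Assumed non-specific digestion: yield every contiguous subsequence
    whose length lies in [min_len, max_len]."""
    n = len(protein)
    for start in range(n):
        for end in range(start + min_len, min(start + max_len, n) + 1):
            yield protein[start:end]


def build_index(proteins, min_len=6, max_len=30):
    """Return (masses, sequences) as parallel lists sorted by mass."""
    unique = set()
    for protein in proteins:
        for sub in enumerate_subsequences(protein, min_len, max_len):
            # Skip subsequences with residues missing from the mass table.
            if all(aa in MONO for aa in sub):
                unique.add(sub)
    entries = sorted((peptide_mass(s), s) for s in unique)
    masses = [m for m, _ in entries]
    sequences = [s for _, s in entries]
    return masses, sequences


def lookup_by_mass(masses, sequences, target, tol):
    """Hypothetical query: all sequences with mass in target +/- tol."""
    lo = bisect_left(masses, target - tol)
    hi = bisect_right(masses, target + tol)
    return sequences[lo:hi]


if __name__ == "__main__":
    masses, sequences = build_index(["GASPVTK"], min_len=2, max_len=3)
    print(lookup_by_mass(masses, sequences, 176.08, 0.01))
```

Keeping the list sorted by mass means a mass-window query costs two binary searches rather than a full scan, which is one plausible reason for the sorting in step 1.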