A computer implemented method for analyzing an externally generated document for use in a
document management system having a Native Template
database including a
list of templates for one or more types of documents having common characteristics and a Conversion
Database including a
list of one or more data points associated with each listed document type, one or more descriptive text entries associated with each listed
data point, and proximity range information relating to the location of the
data point within the descriptive text. The externally generated document is introduced into the
system. The locations of words, sentences, paragraphs, and sections within the document are recorded. A document type is selected from the Native Template
database that has characteristics in common with the externally generated document. A
data point is selected from the template. The introduced document is searched for Possible Data Points based on the
Data Type of the selected data point in the Conversion
Database. Proximity range information is obtained from the Conversion
Database for the Descriptive Text entries associated with the selected data point. A determination is made as to whether Possible
Data Point values for the selected data point are located within the Proximity range for each Descriptive
Text entry. A cumulative Evaluation
Score is calculated for each Possible
Data Point value based on its proximity to each Descriptive
Text entry. The Possible
Data Point with the highest
score that has been accepted by the user is recorded. Upon user acceptance of a Possible Data Point, additional Descriptive Text entries are stored to apply to other externally generated documents. These steps are repeated until each data point has been selected. The user reviews the recorded data which is approved, modified or rejected.