Skip to content

Structure Comparison

iks github user edited this page Nov 15, 2019 · 8 revisions

Analysing the structure of a document and comparing it with an expected structure is one of the three different approaches of the SysNat DocVal Framework. To follow this approach an Excel file is needed that contains the structure information. At first, document parts (of which a document may consists) must be defined in the following form:

In addition to these part definitions one ore more possible structures can be defined in the following form:

To enable SysNat to find these document structure data, a dat-file must exist with the following properties:

ExcelFile=EBookStructureInfo.xlsx

PartsExcelSheet=Parts
PartsRootCell=B3

StructuresExcelSheet=Structures
StructuresRootCell=C2

A root cell is the upper left cell of a data matrix and a matrix is a square of cells in a Excel sheet that is surrounded by empty cells.

Having prepared these structure data, it can be used in an Executable Example by one of the two following natural language instructions:

Does structure of document 'MyDocument.pdf' in assumption of structure information from "StructureInfo.dat" match structure 'Structure1'?
Does structure of document 'MyDocument.pdf' in assumption of structure information from "StructureInfo.dat" match the content in text file "ExpectedDocumentStructure.txt"?

See also this example.

Back to DocVal

Clone this wiki locally