Files & Directories

  • ./dataset: Contains folders that hold the data in the form of triplets. Update ./data_path_file.txt so that its paths match your local setup.
  • ./inconsistency_validation: Contains wet-lab validation results of the inconsistencies resolved by the inconsistency resolver.
  • ./data_path_file.txt: Lists the locations of all the datasets to integrate. The first column is the file path and the second column is the dataset name. Note that the headers Path and Source must exist.
  • ./data_rules.xml: Data rules in .xml format that list the logical statements used to infer new facts from existing facts in the integrated dataset. Please refer to the section "Rules to add knowledge" in Manuscript.docx.
  • ./domain_range.txt: File that specifies the domain and range information (entity type) of all the relations in the dataset. Note that headers Relation, Domain, and Range must exist.
  • ./inconsistency_rules.xml: Inconsistency rules in .xml format that list the logical statements used to find conflicting facts in the integrated dataset. Please refer to the section "Rules to identify inconsistency" in Manuscript.docx.
  • ./label_rules.xml: File in .xml format used in the postprocessing step to assign labels to the triplets.
  • ./name_map.txt: Name mapping table that lists all the synonyms. Note that headers Source and Target must exist.
  • ./replace_rules.txt: File in .xml format used in the postprocessing step to find and replace certain parts of a triplet.
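
Several of the files above require specific headers, so a quick check before running the integration can catch a misnamed column early. The following is a minimal sketch, not part of the repository: the function names `missing_headers` and `check_config_dir` are hypothetical, and it assumes the three text files are tab-delimited with the headers in the first row. Pass a different `delimiter` if your files use another separator.

```python
import csv
from pathlib import Path

# Required headers, taken from the file descriptions above.
REQUIRED_HEADERS = {
    "data_path_file.txt": ["Path", "Source"],
    "domain_range.txt": ["Relation", "Domain", "Range"],
    "name_map.txt": ["Source", "Target"],
}


def missing_headers(file_path, required, delimiter="\t"):
    """Return the required headers that are absent from the file's first row.

    Assumption: the file is delimited text (tab by default) with a header row.
    """
    with open(file_path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle, delimiter=delimiter)
        header_row = next(reader, [])  # empty file -> no headers present
    present = {cell.strip() for cell in header_row}
    return [name for name in required if name not in present]


def check_config_dir(config_dir=".", delimiter="\t"):
    """Map each file name to its list of missing headers, or None if the file is absent."""
    report = {}
    for file_name, required in REQUIRED_HEADERS.items():
        path = Path(config_dir) / file_name
        if path.is_file():
            report[file_name] = missing_headers(path, required, delimiter)
        else:
            report[file_name] = None
    return report


if __name__ == "__main__":
    for file_name, missing in check_config_dir().items():
        if missing is None:
            print(f"{file_name}: not found")
        elif missing:
            print(f"{file_name}: missing headers {missing}")
        else:
            print(f"{file_name}: OK")
```

Run from the repository root, this prints one status line per file. An empty list of missing headers means the file has every header named in its description.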