CaroHolt/WDIProjekt

----------------------------------------------------------------------------------
This readme describes the folder structure and the files contained in the submission
file for the WDI-Project "Integrated Sightseeing" by Group 7.
----------------------------------------------------------------------------------
1. SchemaMatchingSights: contains the files/data created for Schema Matching

	a. ../Target_Schema: contains the .xsd-target schema as well as an 
			     .xml-example
	b. ../MapForceMatching_Files: contains all .mfd files for translating the
				      data sources into the target schema
	c. ../XML_Output_Files: contains our data sources in .xml-format as
			        resulting from the schema matching step
--------
2. IdentityResolutionSights: Java Maven project which contains the files/data 
			     created for Identity Resolution

	a. ../data/input: contains the deduplicated data sources in .xml-format 
			  created from the XML_Output_Files from SchemaMatching 
			  by applying deduplication measures
	b. ../data/goldstandard: contains a train- & a test-split for each of our
				 three dataset-combinations
	c. ../data/output: contains the .csv-files that result from the identity 
			   resolution (i.e. the correspondences, debugResultsBlocking 
			   and debugResultsMatching)
--------
3. DataFusionSights: Java Maven project which contains the files/data 
	             created for Data Fusion

	a. ../data/correspondences: contains the correspondences .csv-files as
				    generated in Identity Resolution
	b. ../data/goldstandard: contains our gold standard (?)
	c. ../data/input: contains the same files as 2.a.
	d. ../data/output: contains the .csv-files that result from the data fusion
--------
4. FinalReport: contains the final report in .pdf-format
--------
5. TeamContribution: contains the .xlx sheet that lists the contributions of
		     our team members
