Case Study «ERB»: entity-resolution benchmarking

country name	population	urban concentration ratio (CR8)	Gini index	HDI	GDP (USD)	year
Afghanistan	37,172,386	0.184	27.8	0.478	14,786,861,638	2021
…	…	…	…	…	…	…
Zimbabwe	14,439,018	0.267	50.3	0.593	28,371,238,666	2021

www.geonames.org	data.worldbank.org	www.wikidata.org
Congo Republic	Congo, Rep.	Republic of the Congo
DR Congo	Congo, Dem. Rep.	Democratic Republic of the Congo
…	…	…

technology	strategy	data
KNIME	native many-to-many routine, closed-box	country names, no capitals
Python difflib	one-to-one routine, no replacement	country names, no capitals
Elasticsearch fuzzy	one-to-one routine, no replacement	country names, no capitals
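
The one-to-one, no-replacement strategy listed for difflib in the table above could look roughly like the sketch below. It is an illustration under stated assumptions, not the routine used in the benchmark: the function name match_one_to_one, the greedy best-score-first ordering, and the lowercasing step are all hypothetical choices made here. Only SequenceMatcher and its ratio() method come from the standard library. The sample names are taken from the alias table above.

```python
from difflib import SequenceMatcher


def match_one_to_one(left, right):
    """Greedy one-to-one matching without replacement (hypothetical helper).

    Every (left, right) pair is scored, pairs are visited from the highest
    score down, and a name leaves the pool as soon as it has been matched.
    """
    scored = []
    for a in left:
        for b in right:
            # Assumption of this sketch: compare lowercased strings.
            ratio = SequenceMatcher(None, a.lower(), b.lower()).ratio()
            scored.append((ratio, a, b))

    # Best-scoring candidate pairs first.
    scored.sort(key=lambda item: item[0], reverse=True)

    used_left, used_right, pairs = set(), set(), {}
    for ratio, a, b in scored:
        if a in used_left or b in used_right:
            continue  # one side is already taken: no replacement
        pairs[a] = b
        used_left.add(a)
        used_right.add(b)
    return pairs


# Names as they appear in two of the sources listed above.
geonames = ["Congo Republic", "DR Congo"]
worldbank = ["Congo, Rep.", "Congo, Dem. Rep."]

pairs = match_one_to_one(geonames, worldbank)
for source_name, target_name in pairs.items():
    print(source_name, "->", target_name)
```

Because each name can be used only once, an early wrong match removes the correct candidate from the pool for every later name. A many-to-many routine avoids that failure mode but returns ambiguous candidate sets instead.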

¶ Summary