WikiIndaba conference 2018/Program/Wiki resources in semantic technologies: The Tunisian experience

ID: SemanticTech
Speakers: Mohamed Ben Aouicha and Mohamed Ali Hadj Taieb (Faculty of Sciences of Sfax, University of Sfax, Tunisia)	Time block: friday-afternoon	Start: 14:15
	Location: Great room	Duration: 25 min.
Description : The Semantic Relatedness (SR) consists of quantifying any type of relationship between two concepts or two words. In this context, the quantification of this distance is based on the semantic information that can be extracted from huge corpora or structured resources such as the knowledge bases. These corpora can be used to extract the concepts or words that are co-occurring. In fact, these can express a certain relationship between them. This type of information will be very useful to determinate the SR between concepts. Wikipedia is exploited frequently as a resource for extracting the co-occurrence between the different words. In our proposed approach, we are interested in filtering only words having the same part of speech as nouns, verbs, adverbs and adjectives from this encyclopedia. Our approaches are also enriched through the use of Wiktionary to determine words that are in these forms. The process considers the filtering of the articles from Wikipedia and, then, to design and develop an application to render available a set of services offering statistics on co-occurring words. The first part provides a preliminary study including the presentation of the two exploited resources Wikipedia and Wiktionary. The second part is dedicated to the project design, which consists in presenting a collection of functional and technical needs towards the developed system.		Themes: Wikimedia Research and Technology, Interface & Infrastructure
		Tags: Semantic relatedness, Semantic technologies, Wikipedia, Wiktionary
		Notes: #WikiIndaba18_SemanticTech