The terminological tagging of the NooJ Italian compound word dictionary
DOI: https://doi.org/10.35305/an.vi3.20
Keywords: NooJ, Lexicon-Grammar, NooJ Electronic Dictionaries, Compound Words, Terminology
Abstract
In this paper, in relation to the construction of electronic dictionaries for NooJ, we deal with the tagging of Italian compound words and with how it differs from that of simple words in methods, functions, and purposes. We focus especially on the tagging of technical-scientific compound words, demonstrating how this operation, in NooJ, represents a crucial tool for both information extraction and automatic knowledge management and representation. Furthermore, with the aim of producing a complete analysis, we provide definitions of simple word and compound word from both a formal and a linguistic point of view. For the linguistic examination, we adopt two different approaches. For the first, we use the analytic methods of Zellig S. Harris, who first set out, in structuralist terms and in relation to English, the study of how morphemes combine into more complex linguistic units, and hence also into word groups or phrases. For the second, we make extensive reference to the methodological framework of language formalization described by Maurice Gross' Lexicon-Grammar, as well as to its subsequent adaptations to the Italian language. Finally, it will be of fundamental importance to differentiate the definitions of compound words given here from the more generic and less precise notion of multiword expressions (MWE). To justify this differentiation, we provide not only formal indications, but also lexical, morphosyntactic and semantic ones.
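As a rough illustration of how terminological tags on compound entries can support information extraction, the following Python sketch parses dictionary lines written in the general `lemma,CATEGORY+Feature+Key=Value` shape used by NooJ dictionaries. The sample entries, the inflectional codes (`N4`, `C41`, `C12`) and the `DOM` domain tag with its values are hypothetical placeholders chosen for this example and are not taken from the actual Italian dictionary. The sketch also assumes, as a simplification, that a compound word is any entry whose lemma contains a blank, and it ignores escaped commas inside lemmas.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    lemma: str
    category: str
    features: dict = field(default_factory=dict)

    @property
    def is_compound(self) -> bool:
        # Simplifying assumption: a compound word contains a blank separator.
        return " " in self.lemma


def parse_entry(line: str) -> Entry:
    """Parse one 'lemma,CATEGORY+Feature+Key=Value' line."""
    lemma, _, info = line.strip().partition(",")
    category, *raw = info.split("+")
    features = {}
    for item in raw:
        key, sep, value = item.partition("=")
        # Bare features (no '=') are stored as boolean flags.
        features[key] = value if sep else True
    return Entry(lemma, category, features)


# Hypothetical sample entries; codes and tags are illustrative only.
SAMPLE = [
    "casa,N+FLX=N4",
    "carta di credito,N+FLX=C41+DOM=ECON",
    "sistema operativo,N+FLX=C12+DOM=INF",
]

entries = [parse_entry(line) for line in SAMPLE]

# Keep only compound entries that carry a (hypothetical) domain tag.
terms = [e.lemma for e in entries if e.is_compound and "DOM" in e.features]
print(terms)
```

Filtering on a domain tag in this way is only one possible use of such tagging; a real application would rely on the tag set and entry conventions defined in the dictionary itself.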
References
Gross G. et alii (1986), Typologie des noms composés, Rapport A.T.P. du C.N.R.S., Paris.
Gross G., Jung, R., Mathieu-Colas, M. (1987). Noms composés, rapport n° 5 du Programme de recherches coordonnées « informatique linguistique », CNRS, Paris.
Gross, M. (1975). Méthodes en syntaxe, Paris, Cantilène.
Gross, M. (1990). Grammaire transformationnelle du français : syntaxe de l'adverbe. Cantilène, Paris.
Harris, Z. S. (1946). “From Morpheme to Utterance”, Language 22, pp. 161-83.
Harris, Z. S. (1970). Papers in Structural and Transformational Linguistics, D. Reidel Publishing Company, Dordrecht, Holland.
Mathieu-Colas, M. (1987). Variations graphiques de mots composés, rapport n° 4 du Programme de recherches coordonnées « informatique linguistique », CNRS, Paris.
Monteleone M., Elia A., De Bueriis G., Di Maio F. (2005). “Le polirematiche dell'italiano”. In De Bueriis, G. (a cura di) Lessici elettronici e descrizioni semantiche, sintattiche e morfologiche. Risultati del Progetto PRIN 2005. Atlanti Tematici Informatici - ALTI, Collana “Lessici & Combinatorie”, n. 2. Dipartimento di Scienze della Comunicazione dell'Università degli Studi di Salerno. Salerno, Plectica.
Piot, M. (1988). Conjonctions de subordination et figement, Langages n° 90, Les expressions figées, Larousse, Paris.
Silberztein, M. (1989). Dictionnaires électroniques et reconnaissance lexicale automatique, thèse de doctorat, LADL, Université Paris VII.
Silberztein, M. (2003 - ). The NooJ Manual. Available for download at: http://www.nooj-association.org.
Silberztein, M. (2007). An Alternative Approach to Tagging. Proceedings of NLDB 2007. LNCS Series, Springer-Verlag Eds, 1-11.
Silberztein, M. (2007). Les unités linguistiques et leur annotation automatique. Modèles Linguistiques n. 55.
Silberztein, M. (2016). Formalizing Natural Languages: The NooJ Approach. ISTE Ltd and John Wiley & Sons Inc, London.
Wikipedia Encyclopedia, entry Lexicon-Grammar, available at the page https://en.wikipedia.org/wiki/Lexicon-grammar.
Z. S. Harris Web Site Homepage www.zelligharris.org.
License
This work is available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license.