Detection of Loanwords in Angolan Portuguese: A Text Mining Approach
Keywords: Text Mining, Natural Language Processing, Incremental Text Processing, Contrastive Lexicology, Neology.
Angola is characterized by many different languages and by diverse social, cultural and political realities, which have had a marked effect on Angolan Portuguese (AP). Consequently, AP exhibits diatopic variation. One of the marked effects is the presence of loanwords imported from other Angolan languages. Our objective is to analyze different Angolan texts, examine the lexical forms they use, and compare them with European Portuguese in order to identify possible loanwords in AP. This process was automated, as was the identification of the cotexts of all loanwords. In addition, we determine the lexical class of each loanword and its Angolan language of origin. Most loanwords come from Kimbundu, although AP also includes loanwords from some other Angolan languages. Our study serves as a basis for the preparation of a dictionary of Angolan regionalisms. We note that more than 700 of the loanwords identified do not appear in existing dictionaries.
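The comparative step described above can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes that a loanword candidate is simply any token absent from a European Portuguese reference lexicon, and that a cotext is a fixed window of neighbouring tokens. The names `tokenize`, `find_candidates`, `reference_lexicon` and `window` are hypothetical, introduced only for this illustration.

```python
import re
from collections import defaultdict

# Runs of Unicode letters, optionally joined by hyphens (assumed tokenization rule).
TOKEN_RE = re.compile(r"[^\W\d_]+(?:-[^\W\d_]+)*")


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return [m.group(0).lower() for m in TOKEN_RE.finditer(text)]


def find_candidates(texts, reference_lexicon, window=3):
    """Map each token missing from the reference lexicon to its cotexts.

    reference_lexicon: set of lowercase European Portuguese word forms
    (hypothetical input; a real lexicon would also need inflected forms).
    window: number of tokens kept on each side of the candidate.
    """
    cotexts = defaultdict(list)
    for text in texts:
        tokens = tokenize(text)
        for i, tok in enumerate(tokens):
            if tok not in reference_lexicon:
                left = tokens[max(0, i - window):i]
                right = tokens[i + 1:i + 1 + window]
                cotexts[tok].append(" ".join(left + [tok] + right))
    return dict(cotexts)
```

Called on a toy sentence such as "O kota foi ao mercado comprar ginguba." with a reference lexicon containing only the other five words, the function returns the two unlisted forms together with their surrounding tokens. Candidates obtained this way would still need manual or dictionary-based validation, since proper nouns, typos and ordinary neologisms are also absent from a reference lexicon.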
Copyright (c) 2022 Iberamia & The Authors
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Open Access publishing.
Inteligencia Artificial (Ed. IBERAMIA)
ISSN: 1988-3064 (on line).