Detection of Loanwords in Angolan Portuguese: A Text Mining Approach

Authors

  • Timóteo Muhongo Universidade do Porto, Portugal https://orcid.org/0000-0003-1866-3163
  • Pavel Brazdil Universidade do Porto, Portugal
  • Fátima Silva Universidade do Porto, Portugal

DOI:

https://doi.org/10.4114/intartif.vol25iss69pp87-106

Keywords:

Text Mining, Natural Language Processing, Incremental Text Processing, Contrastive Lexicology, Neology.

Abstract

Angola is characterized by many different languages and social, cultural and political realities, which had a marked effect on Angolan Portuguese (AP). Consequently, AP is characterized by diatopic variation. One of the marked effects is in the loanwords imported from other Angolan languages. Our objective is to analyze different Angolan texts, analyze the lexical forms used and conduct a comparative study with European Portuguese, whose aim is to identify the possible loanwords in Angolan. This process was automated, as well as the identification of cotexts of all loanwords. In addition, we determine the lexical class of each loanword and the Angolan language of origin. Most lexical loanwords come from the Kimbundu, although AP includes loanwords from some other Angolan languages, too. Our study serves as a basis for preparation of an Angolan regionalism dictionary. We note that more than 700 loanwords identified do not figure in the existing dictionaries.

Downloads

Download data is not yet available.

Downloads

Published

2022-06-22

How to Cite

Muhongo, T., Brazdil, P., & Silva, F. . (2022). Detection of Loanwords in Angolan Portuguese: A Text Mining Approach. Inteligencia Artificial, 25(69), 87–106. https://doi.org/10.4114/intartif.vol25iss69pp87-106