AN IMPROVED HAUSA WORD STEMMING ALGORITHM
Keywords:
Hausa Language, Information Retrieval, Natural Language Processing, Stemming
Abstract
The explosion of scientific publications across domains, coupled with the introduction and widespread adoption of the internet over the last few decades, has made information more available than ever before. Digital storage capacity has consistently doubled to keep pace with this geometric increase in information. As a result, Information Retrieval (IR), now considered the dominant form of information access, has become even more critical. However, the use of free text in indexing and retrieval remains a problem for IR because of spelling mistakes, alternative spellings, affixes and abbreviations. To mitigate this problem, stemming algorithms were introduced in the 1960s. Stemming is an automated process of stripping inflectional affixes from the derived forms of a word in order to obtain the stem of the word. Because stemming is language specific, stemming algorithms have been designed specifically for most of the major languages of the world. With a speaker population of about 150 million, the Hausa language stands in need of a better stemming algorithm. This research is an attempt to improve upon the existing Hausa word stemming algorithm. The affix stripping method of conflation, with reference lookup, was used. Using Sirsat’s evaluation method, this research achieved a Correctly Stemmed Word Factor (CSWF) of 96.9%, an Index Compression Factor of 74.76%, a Words Stemmed Factor (WSF) of 70.44% and an Average Word Conflation Factor of 59.47%.
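As a rough illustration of affix stripping with reference lookup, the minimal Python sketch below first consults a lookup table and then strips the longest matching suffix. The suffix list, the lookup entries, the minimum stem length and the function names are all hypothetical and chosen only for illustration; they are not the rules used in this research. The index compression formula, (N - S) / N with N unique words and S unique stems, is the commonly used definition and is assumed here rather than taken from the paper.

```python
# Illustrative sketch only: the rules below are assumptions, not the paper's.
SUFFIXES = ("oci", "una", "ai")   # hypothetical plural suffixes
LOOKUP = {"gidaje": "gida"}       # hypothetical reference table for irregular forms
MIN_STEM = 3                      # assumed minimum stem length to avoid over-stemming


def stem(word):
    """Return a stem using reference lookup first, then suffix stripping."""
    w = word.lower()
    # Reference lookup: forms listed in the table bypass the stripping rules.
    if w in LOOKUP:
        return LOOKUP[w]
    # Affix stripping: try the longest suffix first.
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if w.endswith(suffix) and len(w) - len(suffix) >= MIN_STEM:
            return w[: -len(suffix)]
    return w


def index_compression_factor(words):
    """Percentage reduction in unique index terms after stemming."""
    unique_words = {w.lower() for w in words}
    unique_stems = {stem(w) for w in unique_words}
    return 100.0 * (len(unique_words) - len(unique_stems)) / len(unique_words)


if __name__ == "__main__":
    sample = ["motoci", "Motoci", "gida", "gidaje"]
    print([stem(w) for w in sample])
    print(index_compression_factor(sample))
```

Checking the lookup table before applying the stripping rules lets irregular forms be handled explicitly, while the minimum stem length guards against stripping short words down to meaningless fragments.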
FUDMA Journal of Sciences