Une nouvelle méthode de racinisation hybride et statistique pour la langue arabe
No Thumbnail Available
Date
2025
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
university of bordj bou arreridj
Abstract
This research focuses on the process of stemming in Arabic texts, a
fundamental step in Arabic Natural Language Processing (NLP). It aims
to propose a novel hybrid stemming method that combines statistical
techniques, semantic resources, and machine learning models to enhance
the accuracy of root extraction.
The work includes a critical review of existing Arabic stemming
approaches, a comparative evaluation of statistical methods, and the
development of a flexible statistical model based on morphological
rules. The proposed method is tested on a corpus of Arabic texts, and the
results demonstrate its superiority in terms of precision and linguistic
coverage compared to traditional stemmers.
Description
Keywords
Natural Language Processing, Statistical Methods, Stemming, Morphology, Root Extraction, Arabic Language, Arabic Corpora, Lexical Resources.