System for Arabic
Morphological Analysis and Part-Of-Speech tagging, which is aimed to
be the kernel of a large framework that provides APIs for Arabic language processing.
Cliticization parsing of the morphological analysis component at Qutuf uses finite state automata and rules.
Moreover, at Qutuf, some new concepts were
identified and implemented.
For example at the reprocessing phase, The First Normalization
and Second
Normalization text forms, plus the
Premature and Overdue Tagging at the Part-Of-Speech tagging
task.
Furthermore, the POS tagging was designed and implemented as a rule-based
expert system, where the POS tagset was based on a morphological feature tagset.
Most human knowledge does exist in a Natural Language form, not in relational database or pictures. So, we expect that the current century will focus on NLP.
For us, more NLP means more put-to-use knowledge and more spreading of it. Unfortunately, little work has been practised on Arabic.
One reason for that is the sophistication of the Arabic language.
also, We are Arabs and we can carry this heavy work with passion. This
enthuses us to work on our affluence language.
We aim to build a Morphological Analyzer that serves the task of Part-Of-Speech Tagging without being lost into the details of Arabic morphology, and to construct a Part-Of-Speech tagger that assigns POS tags to an input text.
Our view of this project is to start building libraries and a framework to help researchers and developers working on the Arabic Language Processing by enabling them to build their applications on top of our framework without going through the complicated details of Arabic.
In this project, the framework will contain basic and important tools. Meanwhile, we aim to expand this framework in the future with more tools and features. There is no need to say that this project will make life easier for native Arabs or non-Arabic speakers, even we hope so.
Precisely, what we will provide is going to serve Search Engines, Database Engines, Information Extraction applications and any other AI applications that make use of Arabic Language Processing.
All what you download at Qutuf is free for personal and non-commercial use only. If you work with group or you aim to use it commercially, then you need a license. Whether you aim to use the product or the report, or you want to use a part or a section, you will need a license if you want to use it at your business or at your organization, institute, or company. You may not pay anything, if you want to use it at you research lap. But, you have to contact us for that first. For the time being you can download the following: