DIACRITIZATION IN URDU LANGUAGE: CURRENT STATUS AND FUTURE DIRECTIONS

Authors

  • Muhammad Sabih

Abstract

This paper discusses the problem of automatic diacritization of Urdu text and provides an overview of research in Urdu language processing. Because Urdu is written in a script derived from Arabic, a comparative analysis is conducted to highlight research directions that can be carried over to Urdu. Methodologies for automatic diacritization of Urdu are derived from Arabic language research, and one example method from that research is formulated for Urdu text. A simple letter-level method is also devised for cases where only a small diacritized corpus is available.

Published

2021-08-24