Urdu is a beautiful language known for its poetic elegance. Spoken in Pakistan and India, it is also referred as Hindustani or Hindi-Urdu due to its similarity with Hindi. Despite differences in script, lexicon and some linguistic features, the speakers are mutually intelligible and constitute the fourth largest linguistic community in the world. Urdu is one of the 22 scheduled languages in India and the national lingua franca in Pakistan. It is also the official language of many Indian states.

A small Urdu enthusiast group was started by Dr Girish nath Jha in the summer of 2012 when he and his reseach students and project staff started learning the script with the help of a senior research student of the Indian Laguage Center of JNU. The group which includes research students/scholars/freelance translators from various Indian language centers in JNU and outside has been very active in translating Indian English of the web into Urdu for training MT systems. Besides corpora collection, a transliteration engine and other tools are being developed for Urdu and many other Indian languages. In collaboration with Microsoft Research, the group organized a workshop in JNU in July 2012 to sensitize the community in creating resources for the language. Urdu group has been helping Microsoft Translator to develop the English to Urdu Translation system for Bing search, Microsoft Office and Internet Explorer.