Skip to content

Latest commit

 

History

History
42 lines (23 loc) · 1.74 KB

41. Enriching Wikipedia Content in Indian Languages.md

File metadata and controls

42 lines (23 loc) · 1.74 KB

Enriching Wikipedia Content in Indian Languages

Wikipedia is the most widely used resource of the Encyclopaedic Knowledge, Education, and Literacy platform on the Web. In India, the majority (approx. 90%) of the population is proficient only in their native language, refraining them from using English Wikipedia directly. However, creating content in Wikipedia is a non-trivial task and requires the skills of scientific writing. To facilitate this, we are building an automated system named IndicWikiBot to generate an initial version of the full Wikipedia article that humans can later edit or improve. Besides, we are also developing strategies to build a large community of human experts (community building) and advancing technology to enhance the productivity of the Wikipedia editors (content development). IndicWikiBot processes both structured (Knowledge Bases, Databases, etc.) and unstructured information (running texts, webpages, etc.) to generate Wikipedia articles in Indian languages. It involves using cutting-edge AI/ML/NLP and text generation techniques.

Poster Video


Faculty Name

Tushar Abhishek
Anubhav Sharma
Bhavyajeet Singh
Divanshi Gupta
Vasu Singhal
Shivprasad Sagare
Radhika Mamidi Praveen Garimella
Vasudeva Varma

Research Area

Data to Text generation, enrichement of Knowledge graph (KG), Open Information Extraction,.

Type of Work

Software System.

Current State of work

Technology designed but not implemented.

Potential Applications

To help Indian wikipedia users to proficiently access the informative articles in native languages.