Skip to content

Latest commit

 

History

History
21 lines (16 loc) · 1.05 KB

README.md

File metadata and controls

21 lines (16 loc) · 1.05 KB

Physics-Corpus

Project Team Members:

Jaromir Savelka Fattane Jabbari Zhipeng Luo Salim Malakouti

This project contains codes and files for a physics corpus extracted from wikipedia and different webpages and books. This corpus is created as a part of our work on Student Short Answer grading using NLP techniques. To goal was to observe the effectiveness of using a domain specific corpus.

Currently, the corpus contains more than 600 Wikipedia pages on Physics topics or Physics history.

You may find the following useful:

  1. About
  2. XML Structure
  3. How Download Data
  4. Sources
  5. List of Wikipedia Pages
  6. How to Extend Wikipedia Pages