Skip to content

This project contains codes and files for a physics corpus extracted from wikipedia and different webpages and books.

Notifications You must be signed in to change notification settings

salimm/Physics-Corpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Physics-Corpus

Project Team Members:

Jaromir Savelka Fattane Jabbari Zhipeng Luo Salim Malakouti

This project contains codes and files for a physics corpus extracted from wikipedia and different webpages and books. This corpus is created as a part of our work on Student Short Answer grading using NLP techniques. To goal was to observe the effectiveness of using a domain specific corpus.

Currently, the corpus contains more than 600 Wikipedia pages on Physics topics or Physics history.

You may find the following useful:

  1. About
  2. XML Structure
  3. How Download Data
  4. Sources
  5. List of Wikipedia Pages
  6. How to Extend Wikipedia Pages

About

This project contains codes and files for a physics corpus extracted from wikipedia and different webpages and books.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages