Skip to content

prtyspt/PatentAnalysis

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 

Repository files navigation

PatentAnalysis

Team no. - 201712-31

Data:

  1. Please download the following files from http://www.patentsview.org/download/:
  • patents.zip - link
  • assignees.zip - link
  • patent_assignee.zip - link
  1. Unzip the downloaded files to a sub-folder named 'data'.

  2. Ensure that the 'SPARK_HOME' environment variable is set to the Spark installation directory.

  3. Ensure that the Graphene docker images are running for the address localhost:8010

Code Structure

├── data                   # I/O data required for the scripts.
├── src                    # Scripts that comprise the project.

Order of Executing scripts:

  1. src\patents_sanitization.py
  2. src\patents_analysis.py
  3. src\run_all.sh

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.7%
  • Shell 0.3%