AUPatentPredictionProject

This is a repo to analyze data stored in https://bulkdata.uspto.gov/data/patent/grant/redbook/fulltext/2018/. WINDOWS ONLY CURRENTLY

Steps for use:

Download project to any directory
Clear downloads folder
Go to https://bulkdata.uspto.gov/data/patent/grant/redbook/fulltext/2018/ and download as many IPG###### zipped files as needed
Edit lines 120 in parser.py to suit your download directory location
run parser.py
run factory.py
run analyzer.py
run app.py **

parser.py - used to extract downloaded zipped files, convert each IPG######.xml file into individual xml files, and create a single CSV file from each of the individual xml files. Results in a single CSV for all utility patents holding patent number, grant date, filing date, app number, art unit, title, abstract, description, and claims. Saves IndividualFiles in PatentData and GrantData in CSVs.

factory.py - utilized for building art unit feature vector and classifier models based on the created single CSV file. Shows performance on a randomly subsetted test set. Saves TrainTestPrepared Data, TFIDFvectorizers, Classifiers, and CMs.

analyzer.py - used to find top tokens used in each art unit for use in the web app. Saves TopWords.

auxiliary.py - used for storing minor functions

app.py - used for predicting new data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AUPatentPredictionProject

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
CMs		CMs
CSVs		CSVs
Classifiers		Classifiers
PatentApp		PatentApp
PatentData		PatentData
TFIDFvectorizers		TFIDFvectorizers
TopWords		TopWords
TrainTestPreparedData		TrainTestPreparedData
.gitignore		.gitignore
README.md		README.md
analyzer.py		analyzer.py
auxiliary.py		auxiliary.py
factory.py		factory.py
parser.py		parser.py

ermlickw/AUPatentPrediction

Folders and files

Latest commit

History

Repository files navigation

AUPatentPredictionProject

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages