An Augmented Large Language Model for Patent Acceptance Prediction

This project was conducted by Anh Ta, Erin McGowan, and Maksat Kuanyshbay as a part of the curriculum for the Machine Learning (CSCI-GA.2565-001) course at the New York University Courant Institute of Mathematical Sciences.

Files included in this repository:

HUPD_metadata_preliminary_analysis.ipynb: Code for our preliminary analysis of the 25,000 patents we sampled from the larger Harvard USPTO Patent Dataset (HUPD). The analysis checks whether the “filing date,” “examiner art unit,” “ipc label,” “foreign,” “small entity indicator,” and “aia first to file” metadata variables are correlated with patent acceptance rate.
BERT_model_benchmark.ipynb: An implementation of BERT fine-tuned on the Harvard USPTO Patent Dataset, which we used as a benchmark for our PatentLLM model.
PatentLLM.ipynb: Our hierarchical transformer-based model for patent acceptance prediction, trained on a subset of the Harvard USPTO Patent Dataset.
PatentLLM_with_Metadata.ipynb: An augmented version of our hierarchical transformer-based model for patent acceptance prediction that incorporates metadata variables, trained on a subset of the Harvard USPTO Patent Dataset.
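
As a rough illustration of the kind of check the preliminary analysis notebook describes, the sketch below computes the acceptance rate for each value of one metadata variable. It is not taken from the notebooks: the helper `acceptance_rate_by`, the field names `small_entity_indicator` and `decision`, the label value `ACCEPTED`, and the toy records are all assumptions made for illustration.

```python
from collections import defaultdict


def acceptance_rate_by(records, field, label_field="decision", accepted="ACCEPTED"):
    """Return {field value: fraction of records whose label equals `accepted`}.

    Hypothetical helper; field and label names are assumptions, not the
    notebooks' actual column names.
    """
    totals = defaultdict(int)
    accepted_counts = defaultdict(int)
    for rec in records:
        key = rec[field]
        totals[key] += 1
        if rec[label_field] == accepted:
            accepted_counts[key] += 1
    return {key: accepted_counts[key] / totals[key] for key in totals}


# Toy, made-up records (not real HUPD data).
toy_records = [
    {"small_entity_indicator": "SMALL", "decision": "ACCEPTED"},
    {"small_entity_indicator": "SMALL", "decision": "REJECTED"},
    {"small_entity_indicator": "UNDISCOUNTED", "decision": "ACCEPTED"},
    {"small_entity_indicator": "UNDISCOUNTED", "decision": "ACCEPTED"},
]

rates = acceptance_rate_by(toy_records, "small_entity_indicator")
print(rates)
```

A large gap between groups in such a table would only suggest that a variable is worth including as a model input; a proper analysis would also test whether the difference is statistically significant.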
