U.S. Patent Phrase to Phrase Matching

Kaggle Competition - Help Identify Similar Phrases in U.S. Patents

Introduction

Over the past two centuries, the USPTO has amassed nearly 11 million patents, and such massive amounts of data have created difficulties in patent examination and search. How can a patent examiner determine whether a newly-filed patent has previously been described? What happens if a patent searcher finds the subject he is looking for in the vast ocean of data?

We can address the aforementioned issues by training models on a novel semantic similarity dataset to extract relevant information by matching key phrases in patent documents. Specifically, given a pair of phrases, our model can predict the similarity score (0/0.25/0.5/0.75/1) between the two phrases.

Cooperative Patent Classification was added as a technical domain context to assist us in resolving such ambiguities as an additional feature for the disambiguate. For example, if one invention claims to be "strong material" and another uses "steel," they may be equivalent if the domain is steel, but not if the domain is ripstop fabric (you don't want steel for your parachute).

Todo：

Upload Model

Upload review

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
dataset		dataset
.gitignore		.gitignore
README.md		README.md
[USPPPM][EDA].ipynb		[USPPPM][EDA].ipynb
[USPPPM][Feature-Engineering].ipynb		[USPPPM][Feature-Engineering].ipynb
u-s-patent-review.ipynb		u-s-patent-review.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

U.S. Patent Phrase to Phrase Matching

Kaggle Competition - Help Identify Similar Phrases in U.S. Patents

Introduction

About

Releases

Packages

Languages

ym-xu/US-Patent-Phrase-to-Phrase-Matching

Folders and files

Latest commit

History

Repository files navigation

U.S. Patent Phrase to Phrase Matching

Kaggle Competition - Help Identify Similar Phrases in U.S. Patents

Introduction

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages