Skip to content

This Repo contains my projects that I have used to improve SEO and Content Strategy

Notifications You must be signed in to change notification settings

nitinjosephrepo/Using-NLP-Models-To-Improve-SEO

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Similarity Score with BERT

Google Helpful content update has again brought in to focus the importance of writing content that is unique and helpful. Content strategy in the past was often dictated by volume over quality which now requires reevulation to reasses and remove thin/low quality content from the websites. Narrowing down on the problem is a considerable effort in itself. If Google is using BERT to understand content it makes sense to create a pipeline to evaulate the content from BERT's perspective. This is exactly what this project accomplishes.

Similarity score measures how close pages are semantically to each other is a good way to identify pages that need to be revisted.

There are many algorithm to measure content similarity between pages, tools like screaming frog often use older Text Rank based methods or evaluate page simliarity based on frequency of words. Which are great but do not check for Semantic similarity.

I decided to write my own Similarity Score checker with Google BERT using Transformers from Huggingface. You will need to install Goose3 which fetches all urls supplies in the list.

Screenshot 2023-02-24 at 10 32 13 PM

About

This Repo contains my projects that I have used to improve SEO and Content Strategy

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published