Skip to content

Problem Statement : Classify businesses and companies across a standard taxonomy. This dataset comes with pre-classified companies along with data from the website. The main objective is to cluster companies based on their description on the website.

Notifications You must be signed in to change notification settings

Rishabh1928/Company_Classification_Clustering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

Company_Classification_Clustering

Problem Statement : We are given with web scraped data of various businesses and companies. We need to somehow categorize these businesses and companies across a standard taxonomy (consists of term names and labels that are specific to an organization's information and unique to how that business operates). So that, business can leverage this information and target potential companies.

Overview of DATASET

Website: The website of the company/business

Company Name: The company/business name

Homepage Text : Visible homepage text

H1: The heading 1 tags from the html of the home page

H2: The heading 2 tags from the html of the home page

H3: The heading 3 tags from the html of the home page

Navlink text: The visible titles of navigation links on the homepage (Ex: Home, Services, Product, About Us, Contact Us)

Meta keywords: The meta keywords in the header of the page html for SEO

Meta description: The meta description in the header of the page html for SEO

About

Problem Statement : Classify businesses and companies across a standard taxonomy. This dataset comes with pre-classified companies along with data from the website. The main objective is to cluster companies based on their description on the website.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published