Skip to content

Conversation

Roverbk
Copy link
Contributor

@Roverbk Roverbk commented Jun 1, 2024

#153
[clustering movie data from imdb issue]

Description
1.We scrape IMDb data including movie titles, ratings, and summaries using BeautifulSoup
2.We use TF-IDF vectorization to convert the movie summaries into numerical feature vectors.
3.We apply K-means clustering to group the movies into clusters based on their summaries.
4.We use PCA for dimensionality reduction to visualize the clusters in a 2D space.

Type of PR

  • Bug fix
  • Feature enhancement
  • Documentation update
  • Other (specify): _______________

image
image
image

Checklist:

  • I have performed a self-review of my code
  • I have read and followed the Contribution Guidelines.
  • I have tested the changes thoroughly before submitting this pull request.
  • I have provided relevant issue numbers, screenshots, and videos after making the changes.
  • I have commented my code, particularly in hard-to-understand areas.

Co-authored-by: Mercia Jeno mercia.jeno@gmail.com
Co-authored-by: Bhakti Kalyankasture kalyankasturebhakti@gmail.com

@Roverbk
Copy link
Contributor Author

Roverbk commented Jun 1, 2024

@sanjay-kv Can you merge it.
attaching link: #108

@sanjay-kv sanjay-kv merged commit 8a4bb9d into recodehive:main Jun 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants