A Python toolbox for gaining geometric insights into high-dimensional data
-
Updated
Mar 19, 2024 - Python
A Python toolbox for gaining geometric insights into high-dimensional data
The project has text vectorization, handling big data with merging and cleaning the text and getting the required columns while boosting the performance by feature extraction and parameter tuning for NN, compares the Performances through applied different models treating the problem as classification and regression both.
📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
Word Factor Vectors
Comment Sentiment Analysis using Deep Learning
Given a document, identifying the closest documents within the list of documents using tf-idf matrix and cosine similarity
A simple Python script for transforming a corpus of documents into text vectors suitable for visualization
Experiments in the field of Sentiment Analysis using ML Algorithms namely Logistic Regression, Naive Bayes along with tfidf, one hot encoding, bag of words vectorization. Different MLP and RNN models viz. LSTM, GRU, Bidirectional LSTM. Lastly, state of the art BERT model
A diploma project focused on vectorizing scientific texts using the Top2Vec algorithm, with the aim of analyzing thematic groups, identifying trends, and visualizing the dynamics of interest in various topics in the field of computer science.
A content based movie recommender system.
Predictive Text Analysis project! This repository contains code for predicting answers to science exam questions using advanced natural language processing techniques. Check out the code and results!
Using text-vectorization and similarity-based-matrix computation
IMDB movie review classification using neural network (text-vectorization v/s word-embeddings)
🚀 Course Recommendation System is a machine learning-powered web application designed to recommend similar courses from Coursera's vast dataset of over 3,000 courses. Built using Python, Scikit-learn, and Streamlit, the app preprocesses course data, applies text vectorization, and leverages cosine similarity to offer personalized recommendations.
Syracuse University, Masters of Applied Data Science - IST 736 Text Mining
Evaluation of the accuracy of vectorization and text classification methods
Machine Learning & Natural Language Processing: Reads Classic Novels and Predicts the Author of a Phrase
Text Classification of Legitimate and Rogue Online Privacy Policies: A manual analysis and an experimental procedure
Movie Recommender based on Content based filtering.
Homeworks and final project for Infosearch course
Add a description, image, and links to the text-vectorization topic page so that developers can more easily learn about it.
To associate your repository with the text-vectorization topic, visit your repo's landing page and select "manage topics."