Bluesky Trending Topics

Description

This project extracts trending topics from Bluesky posts in real time. It analyzes the posts with natural language processing (NLP) techniques and stores the resulting trends in Supabase for later querying.

Technologies Used

Supabase: Database to store the trends.
Compromise: NLP library for text processing.
Google Gemini: Generative AI API to classify topics.

Features

N-grams Extraction: Extraction of words, phrases, and hashtags from posts.
Content Filtering: Filtering of stopwords, blacklist words, and irrelevant content.
Text Classification: Topic classification using a text classifier.
Trend Storage: Storage of trends in Supabase.

Requirements

Node.js: Install Node.js to run the project.
Supabase (optional): Set up an account on Supabase and obtain the necessary credentials.
Google Gemini API Key (optional): Create a Google Cloud project with the Generative Language API (Gemini API) enabled, then obtain an API key from https://aistudio.google.com/app/apikey.

The use of Supabase is optional and Google Gemini is used only to classify topics. You can replace these services with others of your choice.

Installation

Clone the repository:

git clone https://github.com/Rafael-BD/Bsky-Trends
cd Bsky-Trends

Set up the environment variables in the .env file:

SUPABASE_URL=your_supabase_url
SVC_KEY=your_supabase_key
GOOGLE_API_KEY=your_google_api_key
DEV=true # Set to false in production

Create a Supabase table named trends with the following columns:
- id
- trend (jsonb)
- lang (text)
- updated_at (timestamptz)

Create a Supabase Storage bucket named checkpoints. It stores the trend checkpoints that the server uses to recover its trends after a restart.

Usage

Install the dependencies:
```
npm install
```
or
```
bun install
```
Start the WebSocket client to listen to posts:
```
npm run start
```
or
```
bun server.ts
```
Once the server is running, send a GET request to http://localhost:8003/trending to retrieve the trends.
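
As a rough illustration of querying the endpoint from code, the sketch below uses the global fetch available in recent Node.js and Bun versions. The helper names trendingUrl and fetchTrends are hypothetical, and the response is typed as unknown because its shape is not described here.

```typescript
// Hypothetical helper: builds the endpoint URL from a base address.
function trendingUrl(base: string = "http://localhost:8003"): string {
  return new URL("/trending", base).toString();
}

// Fetches the trends as parsed JSON.
// The response shape is an unknown here, so no fields are assumed.
async function fetchTrends(base?: string): Promise<unknown> {
  const res = await fetch(trendingUrl(base));
  if (!res.ok) {
    throw new Error(`Request failed with status ${res.status}`);
  }
  return res.json();
}
```

With the server running locally, calling fetchTrends().then(console.log) should print whatever the endpoint returns.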

Feature Explanation

N-grams Extraction

The project extracts words, phrases, and hashtags from posts using NLP techniques. The extraction is done through the extractWords, extractSentences, and extractHashtags functions.
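
To make the idea concrete, here is a minimal sketch of n-gram and hashtag extraction in plain TypeScript. It does not reproduce the project's extractWords, extractSentences, or extractHashtags functions and does not use Compromise; the names tokenize, ngrams, and hashtags are hypothetical, and the tokenizer is ASCII-only for brevity.

```typescript
// Splits text into lowercase word tokens (ASCII letters, digits, apostrophes).
// Assumption: a real pipeline would need Unicode-aware tokenization.
function tokenize(text: string): string[] {
  return text.toLowerCase().match(/[a-z0-9']+/g) ?? [];
}

// Returns every run of n consecutive tokens, joined by a single space.
function ngrams(tokens: string[], n: number): string[] {
  const out: string[] = [];
  for (let i = 0; i + n <= tokens.length; i++) {
    out.push(tokens.slice(i, i + n).join(" "));
  }
  return out;
}

// Collects hashtags such as "#Bluesky" from the raw text, preserving case.
function hashtags(text: string): string[] {
  return text.match(/#[A-Za-z0-9_]+/g) ?? [];
}
```

For example, ngrams(tokenize(post), 1) yields single words and ngrams(tokenize(post), 2) yields two-word phrases, which can then be counted across posts.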

Content Filtering

The extracted content is filtered to remove stopwords, blacklist words, and irrelevant content. The filtering is done through the filterWords and filterSentences functions.
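
The filtering step can be sketched as follows. This is an illustration under assumptions, not the project's filterWords or filterSentences: the word lists are tiny placeholders, and the function name filterTerms and the minimum-length rule are hypothetical.

```typescript
// Placeholder lists; a real deployment would use much larger, per-language sets.
const STOPWORDS = new Set(["the", "a", "an", "and", "of", "to", "is"]);
const BLACKLIST = new Set(["spam"]);

// Keeps a term only if it has no blacklisted word, is not made up
// entirely of stopwords, and is at least two characters long.
function filterTerms(terms: string[]): string[] {
  return terms.filter((term) => {
    const words = term.split(" ");
    if (words.some((w) => BLACKLIST.has(w))) return false;
    if (words.every((w) => STOPWORDS.has(w))) return false;
    return term.length >= 2;
  });
}
```

Note that a phrase such as "state of play" survives because only phrases consisting solely of stopwords are dropped.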

Text Classification

The extracted topics are classified using Google Gemini AI. The classification is done through the classifyText function.
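
One possible shape for this step is sketched below. It is not the project's classifyText: the category list, the prompt wording, and the names buildPrompt, parseCategory, and classifyTopic are all hypothetical. The actual request to Gemini is left as an injected generate function so the sketch makes no claims about the API.

```typescript
// Hypothetical category list; the project's real labels are not given here.
const CATEGORIES = ["sports", "politics", "technology", "entertainment", "other"];

// Builds a prompt asking the model to answer with exactly one category.
function buildPrompt(topic: string, categories: string[] = CATEGORIES): string {
  return [
    `Classify the topic "${topic}" into exactly one of these categories:`,
    categories.join(", "),
    "Answer with the category name only.",
  ].join("\n");
}

// Normalizes the model's reply; falls back to "other" for unexpected answers.
function parseCategory(reply: string, categories: string[] = CATEGORIES): string {
  const cleaned = reply.trim().toLowerCase();
  return categories.indexOf(cleaned) !== -1 ? cleaned : "other";
}

// `generate` stands in for the real Gemini call (an assumption, supplied by the caller).
async function classifyTopic(
  topic: string,
  generate: (prompt: string) => Promise<string>
): Promise<string> {
  return parseCategory(await generate(buildPrompt(topic)));
}
```

Keeping the model call behind a function parameter also makes it straightforward to swap Gemini for another service, as the Requirements section allows.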

Trend Storage

The trends are stored in Supabase. The storage is done through the services/saveTrends.ts file.
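
As a hedged sketch of what a stored record might look like, the helper below builds a row matching the columns listed under Installation. The TrendRow type and toTrendRow function are hypothetical and are not taken from services/saveTrends.ts; the id column is assumed to be generated by the database.

```typescript
// Row shape mirroring the trends table columns (id assumed auto-generated).
interface TrendRow {
  trend: unknown;     // stored as jsonb
  lang: string;       // stored as text
  updated_at: string; // ISO 8601 string for the timestamptz column
}

// Builds a row for the given trends payload and language code.
function toTrendRow(trend: unknown, lang: string, now: Date = new Date()): TrendRow {
  return { trend, lang, updated_at: now.toISOString() };
}
```

With the @supabase/supabase-js client, a row like this could then be written with something along the lines of supabase.from("trends").insert(row).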

Public API

The project also has a public API to get the trends. The API documentation is available at https://github.com/Rafael-BD/Bsky-Trends-API.

Contribution

Contributions are welcome! Feel free to open issues and pull requests.
