X (Twitter) Scraper

A Python script to scrape tweets from X (formerly Twitter) using its web API. The script authenticates with your browser session cookies and calls the GraphQL endpoints used by the official web client.

Features

  • Fetch tweets from any public Twitter account
  • Get tweets from your home timeline
  • Get pinned tweets
  • Save tweets to a JSON file
  • Configurable number of tweets to fetch
  • Uses the GraphQL API for better reliability

Requirements

  • Python 3.7+
  • Required packages (install via pip install -r requirements.txt):
    • requests
    • python-dotenv

Setup

  1. Clone the repository:
git clone https://github.com/TheNeuroDeveloper/XScraper.git
cd XScraper
  2. Create a virtual environment and activate it:
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
  3. Install dependencies:
pip install -r requirements.txt
  4. Create a .env file with your Twitter session cookies and tokens:
# Required for scraping tweets
X_AUTH_TOKEN=your_auth_token_here
X_CT0=your_ct0_here

# Required for getting latest GraphQL hashes
X_CSRF_TOKEN=your_csrf_token_here
X_TWITTER_AUTH_TYPE=OAuth2Session
X_TWITTER_CLIENT_LANGUAGE=en
X_TWITTER_ACTIVE_USER=yes

To get these tokens:

  1. Log into Twitter in your browser
  2. Open Developer Tools (F12)
  3. Go to the Application tab
  4. Look for Cookies under Storage
  5. Find and copy the following values:
    • auth_token → X_AUTH_TOKEN
    • ct0 → X_CT0 and X_CSRF_TOKEN (the same value goes in both)
    • lang → X_TWITTER_CLIENT_LANGUAGE
    • X_TWITTER_AUTH_TYPE and X_TWITTER_ACTIVE_USER can normally keep the values shown in the example above (OAuth2Session and yes)
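
For reference, here is a minimal sketch of how these values can be wired into a requests session for X's web API. The function name build_session is illustrative only and is not necessarily how scraper.py is structured:

import os

import requests
from dotenv import load_dotenv

def build_session():
    """Return a requests.Session carrying the X session cookies from .env."""
    load_dotenv()  # reads X_AUTH_TOKEN, X_CT0, X_CSRF_TOKEN, ... into the environment
    session = requests.Session()
    # The two browser cookies that actually authenticate the session
    session.cookies.set("auth_token", os.environ["X_AUTH_TOKEN"], domain=".x.com")
    session.cookies.set("ct0", os.environ["X_CT0"], domain=".x.com")
    # The ct0 value is echoed back as the CSRF header; the rest mirror the web client
    session.headers.update({
        "x-csrf-token": os.environ["X_CSRF_TOKEN"],
        "x-twitter-auth-type": os.getenv("X_TWITTER_AUTH_TYPE", "OAuth2Session"),
        "x-twitter-client-language": os.getenv("X_TWITTER_CLIENT_LANGUAGE", "en"),
        "x-twitter-active-user": os.getenv("X_TWITTER_ACTIVE_USER", "yes"),
    })
    # Note: real GraphQL calls also need the web client's public Bearer token in an
    # "authorization" header, which scraper.py is expected to supply.
    return session

if __name__ == "__main__":
    print("Cookies loaded:", sorted(c.name for c in build_session().cookies))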

Usage

  1. First, get the latest GraphQL hashes:
python get_hashes.py
  2. Then fetch tweets using one of these methods:

Fetch tweets from a specific user:

python scraper.py username [--count NUMBER]

Example:

python scraper.py elonmusk --count 5

Fetch tweets from your home timeline:

python scraper.py --home [--count NUMBER]

Example:

python scraper.py --home --count 10
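
The argument handling inside scraper.py may differ, but the command lines above correspond roughly to an argparse setup like this sketch (the default count of 20 is an assumption):

import argparse

parser = argparse.ArgumentParser(description="Fetch tweets from X")
parser.add_argument("username", nargs="?", help="screen name to fetch tweets from")
parser.add_argument("--home", action="store_true", help="fetch the home timeline instead")
parser.add_argument("--count", type=int, default=20, help="number of tweets to fetch")
args = parser.parse_args()

if not args.home and not args.username:
    parser.error("pass a username or use --home")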

The script will:

  • Fetch the specified number of tweets
  • Display them in the console (with usernames for home timeline tweets)
  • Save them to a JSON file:
    • username_tweets.json for user timeline
    • home_timeline_tweets.json for home timeline
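
A quick way to inspect the output, assuming the saved file is a JSON list of tweet objects (the exact fields depend on what scraper.py writes):

import json

with open("elonmusk_tweets.json", encoding="utf-8") as f:
    tweets = json.load(f)

print(f"Loaded {len(tweets)} tweets")
print(json.dumps(tweets[0], indent=2)[:500])  # peek at the first record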

Notes

  • The script uses Twitter's web API, so it is subject to Twitter's rate limits
  • Keep your session tokens secure and never commit them to version control
  • The GraphQL hashes need to be updated periodically as Twitter updates its web client
  • Home timeline access requires valid authentication tokens
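
Because of those rate limits, it can help to wrap requests in a small backoff loop. This sketch assumes the endpoint answers HTTP 429 when a limit is hit and may send an x-rate-limit-reset header; the helper name and timings are illustrative and not part of scraper.py:

import time

import requests

def get_with_backoff(session, url, params=None, max_retries=5):
    """GET a URL, sleeping and retrying whenever the API answers 429."""
    delay = 5.0
    response = None
    for _ in range(max_retries):
        response = session.get(url, params=params, timeout=30)
        if response.status_code != 429:
            return response
        # Prefer the reset timestamp the API sends back over blind exponential backoff
        reset = response.headers.get("x-rate-limit-reset")
        wait = max(delay, float(reset) - time.time()) if reset else delay
        time.sleep(max(wait, 1.0))
        delay *= 2
    return response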

License

MIT License - feel free to use this code for any purpose.

Disclaimer

This tool is for educational purposes only. Make sure to comply with Twitter's terms of service and rate limits when using this script.
