Xarchive is a free, open-source web application for exploring and downloading social media archive snapshots from the Internet Archive's Wayback Machine. Search, preview, and export historical Twitter and Instagram data in multiple formats.
🔗 Live Demo: https://xarchive.net
- Search by Username: Find all archived tweets for any Twitter account
- Date Range Filtering: Narrow down results to specific time periods
- Wayback Machine Integration: Direct access to Internet Archive CDX API data
- Multiple Export Formats: Download results as HTML, CSV, or JSON
- Tweet Preview: View archived tweet text, timestamps, and status codes
- Unique URL Filtering: Option to collapse duplicate snapshots
- Profile Search: Discover archived Instagram posts by username
- Historical Data Access: Browse Instagram snapshots saved on Wayback Machine
- Flexible Export Options: Export data in HTML, CSV, or JSON formats
- Date Range Control: Filter archives by specific date ranges
- Snapshot Preview: View archived post metadata and URLs
- No Registration Required: Start archiving immediately
- Privacy-Focused: No tracking, no data collection
- Fast & Responsive: Built with Next.js 15 and React 19
- Modern UI: Beautiful interface powered by shadcn/ui and Tailwind CSS
- Export Ready: All data downloadable with timestamped filenames
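The "Unique URL Filtering" option listed above can be pictured as a small deduplication pass over the snapshot list. The following is a minimal sketch, not Xarchive's actual code: the `Snapshot` shape and the `collapseByUrl` helper are hypothetical names, and it assumes that "duplicate" means an exact match on the captured URL.

```typescript
// Hypothetical snapshot shape; the real types in lib/archive/types.ts may differ.
interface Snapshot {
  timestamp: string; // 14-digit Wayback timestamp, e.g. "20200131235959"
  original: string; // URL as it was captured
  statuscode: string; // HTTP status recorded for the capture
}

// Keep only the earliest snapshot for each distinct captured URL.
function collapseByUrl(snapshots: Snapshot[]): Snapshot[] {
  const earliest = new Map<string, Snapshot>();
  for (const snap of snapshots) {
    const seen = earliest.get(snap.original);
    // Fixed-width Wayback timestamps sort chronologically as plain strings.
    if (seen === undefined || snap.timestamp < seen.timestamp) {
      earliest.set(snap.original, snap);
    }
  }
  // A Map preserves first-insertion order, so URLs stay in the order first seen.
  return Array.from(earliest.values());
}
```

Matching on the exact URL is the simplest choice; URLs that differ only in case or query string would still count as distinct under this sketch.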
- Node.js 20 or higher
- Yarn package manager
- Clone the repository:
git clone https://github.com/yourusername/xarchive.git
cd xarchive
- Install dependencies:
yarn install
- Run the development server:
yarn dev
- Open http://localhost:3000 in your browser
To build and run for production:
yarn build
yarn start
- Visit xarchive.net or run locally
- Enter a Twitter username (without @)
- Optionally set date range filters and result limits
- Click "Go" to search the Wayback Machine
- Preview results in the interactive table
- Choose your export format (HTML, CSV, or JSON)
- Download your archive data
- Visit xarchive.net/archive-instagram
- Enter an Instagram username (without @)
- Set your desired date range
- Click "Go" to retrieve snapshots
- Export results in your preferred format
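Behind the search steps above, a username, date range, and result limit have to end up as one Wayback Machine CDX query. The sketch below shows one way that could look for the Twitter case. The `ArchiveQuery` type, the `buildCdxUrl` helper, and the `twitter.com/<username>/status/*` URL pattern are assumptions for illustration, not taken from Xarchive's source. The query parameters themselves (`url`, `output`, `from`, `to`, `limit`, `collapse`) are standard CDX server parameters.

```typescript
// Hypothetical search options mirroring the form fields described above.
interface ArchiveQuery {
  username: string; // with or without a leading "@"
  from?: string; // start of range, as a Wayback timestamp prefix such as "20200101"
  to?: string; // end of range, same format
  limit?: number; // maximum number of rows to return
  uniqueOnly?: boolean; // collapse captures that share the same URL key
}

const CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx";

function buildCdxUrl(query: ArchiveQuery): string {
  // Tolerate a pasted "@handle" even though the UI asks for the bare name.
  const username = query.username.trim().replace(/^@/, "");
  const params = new URLSearchParams({
    // Assumed pattern: a trailing "*" asks the CDX server for a prefix match.
    url: `twitter.com/${username}/status/*`,
    output: "json",
  });
  if (query.from) params.set("from", query.from);
  if (query.to) params.set("to", query.to);
  if (query.limit) params.set("limit", String(query.limit));
  if (query.uniqueOnly) params.set("collapse", "urlkey");
  return `${CDX_ENDPOINT}?${params.toString()}`;
}
```

For example, `buildCdxUrl({ username: "example_user", from: "2020", limit: 100 })` yields a URL that can be passed to `fetch` from an API route.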
- Framework: Next.js 15 with App Router
- UI Library: React 19
- Language: TypeScript
- Styling: Tailwind CSS 4
- Components: shadcn/ui
- Icons: Lucide React
- Data Source: Internet Archive CDX API
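When queried with `output=json`, the CDX API returns an array of arrays whose first row holds the field names (by default `urlkey`, `timestamp`, `original`, `mimetype`, `statuscode`, `digest`, `length`). As a rough sketch of how such a response could be turned into objects and replay links, assuming hypothetical helper names `parseCdxJson` and `waybackUrl` that are not taken from the project:

```typescript
// One CDX record keyed by field name, e.g. row.timestamp, row.original.
type CdxRow = Record<string, string>;

// Convert the header-plus-rows JSON layout into a list of keyed records.
function parseCdxJson(rows: string[][]): CdxRow[] {
  if (rows.length === 0) return []; // the API returns an empty array for no matches
  const [header, ...records] = rows;
  return records.map((record) => {
    const entry: CdxRow = {};
    header.forEach((field, i) => {
      entry[field] = record[i] ?? ""; // guard against short rows
    });
    return entry;
  });
}

// Build the public replay URL for a capture: /web/<timestamp>/<original URL>.
function waybackUrl(row: CdxRow): string {
  return `https://web.archive.org/web/${row.timestamp}/${row.original}`;
}
```

Reading the header row instead of hard-coding column positions keeps the parser working if a query restricts or reorders fields with the `fl` parameter.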
xarchive/
├── app/                          # Next.js App Router pages
│   ├── api/                      # API routes
│   │   ├── archive/              # Twitter archive endpoint
│   │   ├── archive-instagram/    # Instagram archive endpoint
│   │   └── twitter-embed/        # Twitter embed endpoint
│   ├── archive-instagram/        # Instagram archive page
│   ├── layout.tsx                # Root layout
│   └── page.tsx                  # Home page (Twitter archive)
├── components/                   # React components
│   ├── archive-tool.tsx          # Twitter archive component
│   ├── archive-instagram-tool.tsx
│   ├── site-header.tsx           # Header component
│   ├── tabs/                     # Export format tabs
│   └── ui/                       # shadcn/ui components
├── lib/                          # Utility libraries
│   ├── archive/                  # Archive processing logic
│   │   ├── parser.ts             # Twitter CDX parser
│   │   ├── parser-instagram.ts   # Instagram CDX parser
│   │   ├── service.ts            # Twitter service
│   │   ├── service-instagram.ts
│   │   ├── exporter.ts           # Export functionality
│   │   └── types.ts              # TypeScript types
│   └── utils.ts                  # General utilities
└── public/                       # Static assets
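To give a feel for what an export module such as `lib/archive/exporter.ts` might contain, here is a minimal sketch of CSV serialization and timestamped filenames. It is an illustration under assumptions, not the project's code: `toCsv` and `exportFilename` are hypothetical names, and the `<username>-<YYYYMMDD>-<HHMMSS>.<ext>` filename layout is invented for the example.

```typescript
// Serialize records to CSV, quoting fields that contain commas, quotes, or newlines.
function toCsv(rows: Record<string, string>[], columns: string[]): string {
  const escape = (value: string): string =>
    /[",\r\n]/.test(value) ? `"${value.replace(/"/g, '""')}"` : value;
  const lines = [columns.map(escape).join(",")];
  for (const row of rows) {
    lines.push(columns.map((column) => escape(row[column] ?? "")).join(","));
  }
  return lines.join("\r\n");
}

// Build a filename stamped with the UTC export time (assumed layout).
function exportFilename(
  username: string,
  ext: "html" | "csv" | "json",
  now: Date = new Date(),
): string {
  // "2024-01-02T03:04:05.678Z" becomes "20240102-030405".
  const stamp = now
    .toISOString()
    .slice(0, 19)
    .replace(/[-:]/g, "")
    .replace("T", "-");
  return `${username}-${stamp}.${ext}`;
}
```

Passing `now` as a parameter keeps the filename logic deterministic, which makes it easy to unit test.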
Contributions are welcome, whether bug reports, feature requests, or code. To contribute code:
- Fork the repository
- Create a feature branch (git checkout -b feature/amazing-feature)
- Commit your changes using Semantic Commit Messages
- Push to the branch (git push origin feature/amazing-feature)
- Open a Pull Request
- Use TypeScript for all new code
- Follow the existing code style and conventions
- Use Yarn for dependency management
- Test your changes thoroughly before submitting
- Update documentation as needed
This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0) - see the LICENSE file for details.
- Internet Archive for providing the Wayback Machine CDX API
- Vercel for hosting infrastructure
- shadcn/ui for the beautiful component library
- All contributors and users of Xarchive
- Website: xarchive.net
- Issues: GitHub Issues
Made with ❤️ for archival and data preservation