This is a Streamlit-based web application for lead generation: it finds business websites and extracts validated email addresses from them. It uses OpenAI's GPT to generate a list of candidate websites for a search query, then scrapes those sites for email addresses and validates each address's domain.
- Enter a search query (e.g., "AI companies in Canada").
- Uses OpenAI's GPT to generate a list of relevant websites.
- Download the list of websites for further use.
- Scrapes websites for email addresses.
- Validates emails by checking DNS MX records, which confirms that the domain can receive mail (it does not prove that the individual mailbox exists).
- Saves the results as a clean CSV file.
- Built with Streamlit for an easy-to-use and visually appealing interface.
- Includes tabs for finding websites and extracting emails.
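The email steps listed above (scrape, validate by MX record, save as CSV) can be sketched roughly as follows. This is a minimal illustration, not the code in `webscraper.py`: the function names `extract_emails`, `has_mx`, `validate_emails`, and `to_csv` are hypothetical, the regular expression is deliberately simple, and the MX lookup assumes the third-party `dnspython` package is installed.

```python
import csv
import io
import re

# Simple pattern for illustration only; it will miss some valid addresses
# and can match strings that are not deliverable.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(html):
    """Return the unique, lowercased email-like strings in html, sorted."""
    return sorted({match.lower() for match in EMAIL_RE.findall(html)})


def has_mx(domain):
    """Return True if an MX lookup for the domain succeeds.

    Assumption: dnspython is installed. It is imported lazily so the
    other helpers work without it.
    """
    import dns.exception
    import dns.resolver

    try:
        dns.resolver.resolve(domain, "MX")
        return True
    except dns.exception.DNSException:
        return False


def validate_emails(emails, mx_check=has_mx):
    """Keep emails whose domain passes mx_check, with one lookup per domain."""
    cache = {}
    valid = []
    for email in emails:
        domain = email.rsplit("@", 1)[1]
        if domain not in cache:
            cache[domain] = mx_check(domain)
        if cache[domain]:
            valid.append(email)
    return valid


def to_csv(emails):
    """Serialize emails to CSV text with a single 'email' column."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["email"])
    writer.writerows([email] for email in emails)
    return buf.getvalue()
```

Passing `mx_check` as a parameter keeps the DNS lookup swappable, so the filtering logic can be exercised offline with a stub such as `lambda domain: True`.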
- Clone the repository and move into it:
git clone https://github.com/your-username/your-repo-name.git
cd your-repo-name
- Create a virtual environment to manage dependencies, then activate it:
python -m venv .venv
source .venv/bin/activate # On macOS/Linux
# On Windows: .venv\Scripts\activate
- Install the required Python packages:
pip install -r requirements.txt
- Set your OpenAI API key as an environment variable:
export OPENAI_API_KEY="your_openai_api_key"
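On the Python side, the exported key can be read from the environment and checked before any request is made. The helper below is a hypothetical sketch (the name `get_api_key` is not taken from `webscraper.py`, and how the app actually reads the key is not shown here); it only illustrates failing early with a clear message when the variable is missing.

```python
import os


def get_api_key(env=None):
    """Return the OpenAI API key from the environment, or raise a clear error.

    `env` defaults to os.environ; a plain dict can be passed for testing.
    """
    env = os.environ if env is None else env
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set; export it before running the app."
        )
    return key
```

Checking at startup surfaces a missing key immediately instead of as an authentication error on the first search.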
- Start the application from your terminal:
streamlit run webscraper.py
- Open the local URL that Streamlit prints in the terminal (http://localhost:8501 by default), and you're done!