Skip to content

This is a python web-scraper for Internshala ,a website for finding Internships. It is made using BeautifulSoup4 to assist in getting info regarding all the ongoing internships in any field with several filters and easy to use UI.

License

Notifications You must be signed in to change notification settings

het-parekh/Internshala-Web-Scraper-Internshala.com

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web-Scraper-Internshala.com [Python 3.7+]

Last Updated on 19/09/2020

Internshala Logo

Third Party Libraries Required :

  1. requests [To fetch the Url content ]
  2. BeautifulSoup4 [Library used for web scraping]
  3. xlwt [To export the data to a Excel File with multiple sheets]

Different filters are available such as :

  • Include work from home
  • Part-time
  • Internships for women
  • Internships with job offer
  • Starting from (or after)
  • Max Duration
  • select multiple locations
  • select multiple Category

Stores the following data for every internship available based on the selected filters :

  • Title
  • Company Name
  • Category
  • Location
  • Duration
  • Stripend
  • Last Date to apply
  • Number of applicants who have applied
  • Skills Required
  • Perks Provided
  • Number of openings
  • Link to that internship

How to use it :

  1. Download or clone the repository
  2. Install Required Libraries
  3. Run main.py
  4. Provide appropriate input
  5. Obtain the excel file in .xls format

Input Example :

Include Work From home?
Include Part-time?
Internships for women?
Internships with job offer?
(Represent your choice with 1-True or 0-False separated by commas such as 1,0,0,1)

1,0,0,0
Enter different categories separated by commas* (Required)
Web Development
Enter different locations separated by commas* (Required)
Mumbai,Delhi
Enter start date in format (yyyy-mm-dd) or leave empty for current date

Enter maximum duration or leave empty for any duration
3
--------------------------------------------------------------------
How many pages you would like to get? Max Pages (16)
2
Different pages on different sheets?(Default: Yes) | 1: No
#Leave empty if Yes 
--------------Scraping Page 1 -----------------
--------------Scraping Page 2 -----------------

1: Add New Sheet
2: Save and Open the file in Excel
3: Save file
4: Discard file and Exit
2
Enter the name of the file
Web_Dev

Excel File 1 Excel File 2

About

This is a python web-scraper for Internshala ,a website for finding Internships. It is made using BeautifulSoup4 to assist in getting info regarding all the ongoing internships in any field with several filters and easy to use UI.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages