ceroberoz/id-jobs

id-jobs: Indonesian Job Market Data Aggregator 💼🇮🇩

Daily Job Data Update
License: GPL v3
Python 3.12+
Powered by Scrapy
Enhanced by Playwright

🆕 Latest Updates

  • Added TechInAsia spider for job data collection
  • Integrated Algolia API for efficient data retrieval
  • Improved data sanitization and CSV export
  • Enhanced error handling and logging
  • Updated docs with new data source details

📊 Overview

id-jobs gathers job listings from Indonesian job portals and company websites, following each site's terms of service.

View Data on Google Sheets: https://s.id/id-jobs-v2
View Dashboard on Looker Studio: https://s.id/id-jobs-dashboard

🎨 Job Age Colors

Age      Time         Color
New      ≤ 1 day      #00CC00 (Green)
Hot      1-7 days     #FF6600 (Orange)
Recent   8-15 days    #FFFF00 (Yellow)
Aging    16-21 days   #E6E6E6 (Gray)
Old      22-30 days   #CCCCCC (Dark Gray)
Expired  > 30 days    #B3B3B3 (Very Dark Gray)
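
The age buckets above could be applied with a small lookup like the one below. This is a minimal sketch, not the project's actual code: the names `AGE_BUCKETS` and `classify_job_age` are hypothetical, and it assumes a posting that is exactly 1 day old counts as New, since the table lists 1 day under both New and Hot.

```python
from datetime import date

# Thresholds copied from the table above: (max age in days, label, hex color).
# Assumption: an age of exactly 1 day falls in "New", not "Hot".
AGE_BUCKETS = [
    (1, "New", "#00CC00"),
    (7, "Hot", "#FF6600"),
    (15, "Recent", "#FFFF00"),
    (21, "Aging", "#E6E6E6"),
    (30, "Old", "#CCCCCC"),
]
EXPIRED = ("Expired", "#B3B3B3")


def classify_job_age(posted_on: date, today: date) -> tuple[str, str]:
    """Return (label, hex color) for a posting, based on its age in whole days."""
    age_days = (today - posted_on).days
    for max_days, label, color in AGE_BUCKETS:
        if age_days <= max_days:
            return label, color
    # Anything older than the last bucket is treated as expired.
    return EXPIRED
```

For example, `classify_job_age(date(2024, 1, 1), date(2024, 1, 6))` falls in the Hot bucket because the posting is 5 days old.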

🔧 How It Works

id-jobs scrapes job data from multiple sites, cleans it, and compiles it into a single spreadsheet. We use Scrapy for most sites and Playwright for JavaScript-heavy sites.
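
The clean-and-compile step can be pictured roughly as follows. This is an illustrative sketch using only the Python standard library; the function names and the `job_title` and `company` fields are assumptions made for the example, not the project's real schema, and the scraping itself (Scrapy or Playwright) is left out.

```python
import csv
import io


def clean_record(raw: dict) -> dict:
    """Trim each field and collapse runs of whitespace (including newlines)."""
    return {
        key: " ".join(("" if value is None else str(value)).split())
        for key, value in raw.items()
    }


def compile_csv(sources: dict[str, list[dict]], fields: list[str]) -> str:
    """Merge cleaned records from every source into a single CSV string."""
    buffer = io.StringIO()
    # Unknown keys are ignored; missing keys are written as empty cells.
    writer = csv.DictWriter(
        buffer, fieldnames=["source", *fields], extrasaction="ignore"
    )
    writer.writeheader()
    for source, records in sources.items():
        for raw in records:
            writer.writerow({"source": source, **clean_record(raw)})
    return buffer.getvalue()
```

Each row keeps a `source` column so that listings from different sites remain distinguishable after they are merged.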

[Figure: Scraping Process]

👀 Preview

[Image: id-jobs Preview]

🌟 Why Use id-jobs?

id-jobs aggregates job listings in one place, offering insights like work arrangements, job levels, and deadlines.
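
One plausible way to derive an insight such as the work arrangement is keyword matching on the listing text. The sketch below is an assumption for illustration only: the keyword lists, labels, and the function name `detect_work_arrangement` are hypothetical, and the project's real rules may differ.

```python
import re

# Hypothetical keyword patterns, checked in order; "Hybrid" comes first so a
# listing that mentions both hybrid and remote work is labeled Hybrid.
ARRANGEMENT_PATTERNS = [
    ("Hybrid", re.compile(r"\bhybrid\b", re.IGNORECASE)),
    ("Remote", re.compile(r"\b(remote|wfh|work from home)\b", re.IGNORECASE)),
    ("On-site", re.compile(r"\b(on-?site|wfo|work from office)\b", re.IGNORECASE)),
]


def detect_work_arrangement(text: str) -> str:
    """Return the first matching arrangement label, or "Unspecified"."""
    for label, pattern in ARRANGEMENT_PATTERNS:
        if pattern.search(text):
            return label
    return "Unspecified"
```

A listing with no recognizable keyword is reported as "Unspecified" rather than guessed.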

📚 Data Sources

We collect data from several job portals and company websites, including:
Blibli, Dealls, Evermos, Flip, GoTo, Glints (Lite), Jobstreet, Kalibrr, Karir.com, Kredivo, Mekari, SoftwareOne, Tiket, Tech in Asia Jobs, and more.

🚀 Features

  • Daily updates
  • Identifies work arrangements & job levels
  • Tracks application deadlines
  • Cleaned and sanitized data
  • User-friendly Google Sheets interface
  • Tracks job age
  • Handles JavaScript-rendered content
  • Efficient pagination
  • Integrated with Algolia API
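
As a rough illustration of paginated retrieval through Algolia, the sketch below builds a request for one page of an index using Algolia's REST search endpoint and checks whether more pages remain. It is not the project's spider: the function names are hypothetical, and the application ID, API key, and index name are placeholders that would have to come from the target site. The request is only constructed here, not sent.

```python
import json
import urllib.parse
import urllib.request


def build_algolia_request(
    app_id: str, api_key: str, index: str, page: int, hits_per_page: int = 50
) -> urllib.request.Request:
    """Build a POST request for one result page of an Algolia index."""
    url = f"https://{app_id}-dsn.algolia.net/1/indexes/{index}/query"
    # Search parameters are sent URL-encoded inside the "params" field.
    params = urllib.parse.urlencode(
        {"query": "", "page": page, "hitsPerPage": hits_per_page}
    )
    return urllib.request.Request(
        url,
        data=json.dumps({"params": params}).encode("utf-8"),
        method="POST",
        headers={
            "X-Algolia-Application-Id": app_id,
            "X-Algolia-API-Key": api_key,
            "Content-Type": "application/json",
        },
    )


def has_more_pages(response: dict) -> bool:
    """Pages are zero-based, so the last page index is nbPages - 1."""
    return response["page"] + 1 < response["nbPages"]
```

A caller would send each request, read the `hits` list from the JSON response, and keep incrementing `page` while `has_more_pages` returns true.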

🏁 Getting Started

For a quick guide, check the Quickstart Guide.

❓ FAQ

See our FAQ for common questions.

📄 License

id-jobs is open source under the GPL-3.0 license. You can use, modify, and share it, provided that any version you distribute stays under the same license.
We respect website terms of service when collecting data.
