ceroberoz/id-jobs

id-jobs: Indonesian Job Market Data Aggregator 💼🇮🇩

License: GPL v3 | Python 3.12+ | Powered by Scrapy | Enhanced by Playwright

🆕 Latest Updates

  • Added TechInAsia spider for job data collection
  • Integrated Algolia API for efficient data retrieval
  • Improved data sanitization and CSV export
  • Enhanced error handling and logging
  • Updated docs with new data source details
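As a rough illustration of what an Algolia-backed retrieval step can look like, the sketch below builds (but does not send) a search request against Algolia's REST search endpoint. The application ID, API key, index name, and the `build_algolia_request` helper are placeholders made up for this example; they are not taken from the id-jobs code base or from any real site.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request


def build_algolia_request(app_id, api_key, index_name, query="", page=0, hits_per_page=50):
    """Build (but do not send) a search request for one page of an Algolia index."""
    url = f"https://{app_id}-dsn.algolia.net/1/indexes/{index_name}/query"
    # Search parameters are passed as a URL-encoded string under "params".
    body = {"params": urlencode({"query": query, "page": page, "hitsPerPage": hits_per_page})}
    return Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "X-Algolia-Application-Id": app_id,
            "X-Algolia-API-Key": api_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder credentials and index name, for illustration only.
request = build_algolia_request("APP_ID", "SEARCH_KEY", "jobs_index", page=2)
print(request.full_url)
print(json.loads(request.data)["params"])
```

Requesting listings page by page through a search API like this avoids rendering and parsing HTML, which is likely where the efficiency gain comes from.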

📊 Overview

id-jobs gathers job listings from Indonesian job portals and company websites, following each site's terms of service.

View Data on Google Sheets: https://s.id/id-jobs-v2
View Dashboard on Looker Studio: https://s.id/id-jobs-dashboard

🎨 Job Age Colors

| Age | Time | Color |
| --- | --- | --- |
| New | ≤ 1 day | #00CC00 Green |
| Hot | 1-7 days | #FF6600 Orange |
| Recent | 8-15 days | #FFFF00 Yellow |
| Aging | 16-21 days | #E6E6E6 Gray |
| Old | 22-30 days | #CCCCCC Dark Gray |
| Expired | > 30 days | #B3B3B3 Very Dark Gray |
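The age bands above can be expressed as a small lookup. This is a minimal sketch, not the project's actual implementation: the `classify_job_age` name is made up for illustration, and it assumes that a job exactly 1 day old counts as New rather than Hot, since the two ranges overlap at that boundary.

```python
from datetime import date

# (maximum age in days, label, hex color), taken from the table above.
AGE_BANDS = [
    (1, "New", "#00CC00"),
    (7, "Hot", "#FF6600"),
    (15, "Recent", "#FFFF00"),
    (21, "Aging", "#E6E6E6"),
    (30, "Old", "#CCCCCC"),
]
EXPIRED = ("Expired", "#B3B3B3")


def classify_job_age(posted_on, today):
    """Return (label, hex color) for a job posted on `posted_on`."""
    # Assumption: postings dated in the future are treated as 0 days old.
    age_days = max((today - posted_on).days, 0)
    for max_days, label, color in AGE_BANDS:
        if age_days <= max_days:
            return label, color
    return EXPIRED


print(classify_job_age(date(2024, 1, 1), date(2024, 1, 9)))
```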

🔧 How It Works

id-jobs scrapes job data from multiple sites, cleans it, and compiles it into a single spreadsheet. We use Scrapy for most sites and Playwright for JavaScript-heavy sites.
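The clean-and-compile step might look roughly like the sketch below, which uses only the standard library. The field names (`title`, `company`, `source`) and the rule of dropping duplicates by company and title are assumptions for illustration; the real pipeline may use different columns and rules.

```python
import csv
import io


def clean_record(raw):
    """Collapse whitespace in every field and use an empty string for missing values."""
    return {key: " ".join(str(value or "").split()) for key, value in raw.items()}


def compile_csv(records, fieldnames):
    """Clean records from several sources and write them to one CSV string."""
    seen = set()
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=fieldnames, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for record in map(clean_record, records):
        # Assumption: the same title at the same company is the same listing.
        key = (record.get("company", "").lower(), record.get("title", "").lower())
        if key in seen:
            continue
        seen.add(key)
        writer.writerow(record)
    return buffer.getvalue()


# Made-up sample records standing in for the output of two spiders.
sample = [
    {"title": "  Data   Engineer\n", "company": "Acme", "source": "a"},
    {"title": "data engineer", "company": "ACME ", "source": "b"},
    {"title": "QA", "company": None, "source": "c"},
]
print(compile_csv(sample, ["title", "company", "source"]))
```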

[Figure: Scraping Process]

👀 Preview

[Image: id-jobs Preview]

🌟 Why Use id-jobs?

id-jobs aggregates job listings in one place, offering insights like work arrangements, job levels, and deadlines.

📚 Data Sources

We collect data from several job portals and company websites, including:
Blibli, Dealls, Evermos, Flip, GoTo, Glints (Lite), Jobstreet, Kalibrr, Karir.com, Kredivo, Mekari, SoftwareOne, Tiket, Tech in Asia Jobs, and more.

🚀 Features

  • Daily updates
  • Identifies work arrangements & job levels
  • Tracks application deadlines
  • Accurate data
  • User-friendly Google Sheets interface
  • Tracks job age
  • Handles JavaScript-rendered content
  • Efficient pagination
  • Integrated with Algolia API
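One simple way to identify work arrangements and job levels is keyword matching on the listing title or description. The sketch below is a hypothetical heuristic, not the project's actual logic: the keyword lists, the label names, and the `match_label` helper are all assumptions.

```python
import re

# Hypothetical keyword lists; a real classifier would likely need many more.
ARRANGEMENT_KEYWORDS = {
    "Remote": ("remote", "work from home", "wfh"),
    "Hybrid": ("hybrid",),
    "On-site": ("on-site", "onsite", "work from office", "wfo"),
}
LEVEL_KEYWORDS = {
    "Internship": ("intern", "internship", "magang"),
    "Senior": ("senior", "sr", "lead", "principal"),
    "Junior": ("junior", "jr", "fresh graduate"),
}


def match_label(text, keyword_map, default="Unspecified"):
    """Return the first label whose keyword appears in `text` as a whole word."""
    normalized = text.lower()
    for label, keywords in keyword_map.items():
        for keyword in keywords:
            # Word boundaries keep "intern" from matching "international".
            if re.search(rf"\b{re.escape(keyword)}\b", normalized):
                return label
    return default


title = "Senior Backend Engineer (Remote)"
print(match_label(title, ARRANGEMENT_KEYWORDS), match_label(title, LEVEL_KEYWORDS))
```

Labels are checked in dictionary order, so a title that matches several labels gets the first one listed.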

🏁 Getting Started

For a quick guide, check the Quickstart Guide.

❓ FAQ

See our FAQ for common questions.

📄 License

id-jobs is open source under the GPL-3.0 license. You can use, modify, and share it, provided that any copies or derivative works you distribute stay under the same license.
We respect website terms of service when collecting data.