 # id-jobs: Indonesian Job Market Data Aggregator 💼🇮🇩

-[](https://github.com/ceroberoz/id-jobs/actions/workflows/scrape.yml)
+[](https://github.com/ceroberoz/id-jobs/actions/workflows/scrape.yml)

 ## 🆕 Latest Updates

-- Added TechInAsia spider to collect job data from Tech in Asia Jobs portal
-- Implemented Algolia API integration for efficient data retrieval from TechInAsia
-- Enhanced data sanitization to ensure CSV-friendly output
-- Improved error handling and logging for the new spider
-- Updated documentation to reflect the addition of TechInAsia as a data source
+- Added **TechInAsia** spider for job data collection
+- Integrated **Algolia API** for efficient data retrieval
+- Improved data sanitization and CSV export
+- Enhanced error handling and logging
+- Updated docs with new data source details

 ## 📊 Overview

-id-jobs collects job listings from Indonesian job portals and company websites, respecting each site's terms of service.
+id-jobs gathers job listings from Indonesian job portals and company websites, following each site's terms of service.

-**View the Data on Google Sheets:** [https://s.id/id-jobs-v2](https://s.id/id-jobs-v2)
-
-**View the Dashboard on LookerStudio by Google:** [https://s.id/id-jobs-dashboard](https://s.id/id-jobs-dashboard)
+**View Data on Google Sheets:** [https://s.id/id-jobs-v2](https://s.id/id-jobs-v2)
+**View Dashboard on Looker Studio:** [https://s.id/id-jobs-dashboard](https://s.id/id-jobs-dashboard)

 ## 🎨 Job Age Colors

-| Age | Time | Color |
-|-----|------|-------|
-| New | ≤ 1 day | Bright Green |
-| Hot | 1-7 days | Bright Orange |
-| Recent | 8-15 days | Bright Yellow |
-| Aging | 16-21 days | Light Gray |
-| Old | 22-30 days | Medium Gray |
-| Expired | > 30 days | Dark Gray |
+| Age | Time | Color |
+|------------|------------|----------------|
+| New | ≤ 1 day | Green |
+| Hot | 1-7 days | Orange |
+| Recent | 8-15 days | Yellow |
+| Aging | 16-21 days | Gray |
+| Old | 22-30 days | Dark Gray |
+| Expired | > 30 days | Very Dark Gray |

 ## 🔧 How It Works

-id-jobs automatically collects job data from various websites, cleans the information, and compiles it into a single spreadsheet. We use Scrapy for most sites and Playwright for sites with complex JavaScript rendering.
+id-jobs scrapes job data from multiple sites, cleans it, and compiles it into a single spreadsheet. We use **Scrapy** for most sites and **Playwright** for JavaScript-heavy sites.

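The Job Age Colors table in the hunk above can be read as a simple lookup. The following is a minimal sketch, not the project's actual code: the function name `classify_job_age` and the `AGE_BUCKETS` list are hypothetical, the colors follow the updated (`+`) rows, and it assumes a job exactly 1 day old counts as New, since the table lists 1 day under both New and Hot.

```python
from datetime import date

# (upper bound in days, label, color), copied from the updated table rows.
# Assumption: each upper bound is inclusive, so exactly 1 day old is "New".
AGE_BUCKETS = [
    (1, "New", "Green"),
    (7, "Hot", "Orange"),
    (15, "Recent", "Yellow"),
    (21, "Aging", "Gray"),
    (30, "Old", "Dark Gray"),
]


def classify_job_age(posted: date, today: date) -> tuple[str, str]:
    """Return the (label, color) pair for a job posted on `posted`."""
    age_days = (today - posted).days
    if age_days < 0:
        raise ValueError("posted date is in the future")
    for upper_bound, label, color in AGE_BUCKETS:
        if age_days <= upper_bound:
            return label, color
    # Anything older than 30 days falls through to the last table row.
    return "Expired", "Very Dark Gray"
```

Passing `today` explicitly instead of calling `date.today()` inside the function keeps the lookup deterministic, which makes it easier to check against the table.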
@@ -45,36 +44,34 @@ id-jobs automatically collects job data from various websites, cleans the inform

 ## 🌟 Why Use id-jobs?

-id-jobs simplifies job searching by gathering information from multiple sources into one place, providing insights on work arrangements, job levels, and application deadlines.
+id-jobs aggregates job listings in one place, offering insights like work arrangements, job levels, and deadlines.

 ## 📚 Data Sources

-We collect data from various job portals and company websites, including:
+We collect data from several job portals and company websites, including:
 Blibli, Dealls, Evermos, Flip, GoTo, Glints (Lite), Jobstreet, Kalibrr, Karir.com, Kredivo, Mekari, SoftwareOne, Tiket, Tech in Asia Jobs, and more.

 ## 🚀 Features

 - Daily updates
-- Work arrangement identification
-- Job level detection
-- Application deadline calculation
-- Improved data accuracy
+- Identifies work arrangements & job levels
+- Tracks application deadlines
+- Accurate data
 - User-friendly Google Sheets interface
-- Job age tracking
-- JavaScript-rendered content handling with Playwright
-- Efficient pagination across multiple pages
-- Integration with Algolia API for improved data retrieval
+- Tracks job age
+- Handles JavaScript-rendered content
+- Efficient pagination
+- Integrated with Algolia API

 ## 🏁 Getting Started

-For a quick guide, see our [Quickstart Guide](QUICKSTART.md).
+For a quick guide, check the [Quickstart Guide](QUICKSTART.md).

 ## ❓ FAQ

-Check our [FAQ](FAQ.md) for common questions.
+See our [FAQ](FAQ.md) for common questions.

 ## 📄 License

-id-jobs is open source under the GPL-3.0 license. You can use, modify, and share the code, as long as you keep it open source.
-
+id-jobs is open source under the GPL-3.0 license. You can use, modify, and share it, as long as it remains open source.
 We respect website terms of service when collecting data.
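
The Features list mentions pagination and Algolia API integration. As a rough sketch of how paging through such results could work, the generator below walks zero-based pages until the reported page count is reached. It assumes each response is a dict with `hits` and `nbPages` fields, as in Algolia search responses. `fetch_page` and `iter_hits` are hypothetical names, and the real spider may work differently.

```python
from typing import Callable, Iterator


def iter_hits(
    fetch_page: Callable[[int], dict],
    max_pages: int = 50,
) -> Iterator[dict]:
    """Yield every hit across pages of a paged search response.

    `fetch_page` is a hypothetical callable that takes a zero-based page
    number and returns a dict with "hits" (list) and "nbPages" (int).
    `max_pages` is a safety cap so a bad response cannot loop forever.
    """
    page = 0
    while page < max_pages:
        response = fetch_page(page)
        yield from response.get("hits", [])
        page += 1
        # Stop once the server-reported page count has been consumed.
        if page >= response.get("nbPages", 0):
            break
```

Injecting `fetch_page` keeps the paging logic separate from the HTTP client, so it can be exercised with canned responses before pointing it at a live endpoint.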