Setup DB and Add Pagination to Conform to OGC Specs #39

ar-siddiqui · 2023-06-29T22:50:25Z

This PR changes the app to comply with the pagination requirements of the OGC specs: `/req/job-list/limit-definition` and `/req/core/pl-limit-definition`. The implemented solution also overcomes the shortcomings of the cache (as discussed in the paper).

The following enhancements have been implemented:

The /jobs and /processes routes now support offset and limit parameters and include links to the previous and next pages, as required by the OGC specs.
Add GitHub Actions-like symbols to job statuses.
Update HTML pages to be in sync with recent code changes.
Update tests to reduce testing time from ~5 min to ~4 min.
Colorize echo logs so that failed or bad requests are easy to identify.
Use a SQLite database in place of snapshots to persist job records.
Remove the JobsCache hashmap and introduce the ActiveJobs hashmap. ActiveJobs keeps a record of currently active jobs. As soon as a job reaches a terminal state (SUCCESSFUL, FAILED, DISMISSED), it is removed from ActiveJobs and its status, logs, etc. are updated in the database.
Introduce a hit-and-miss strategy in handlers. A job ID is first checked in ActiveJobs; if it is there (hit), a response is returned immediately without checking the database; otherwise (miss), the job is looked up in the database. This is similar to the cache hit-and-miss concept.
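
The hit-and-miss lookup described in the last two items can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the `Job`, `JobStore`, `ActiveJobs`, `Finish`, and `Lookup` names are hypothetical, and a mutex-guarded map stands in for the SQLite database so the example runs with the standard library only.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// Job is a hypothetical, simplified job record.
type Job struct {
	ID     string
	Status string
}

// ErrNotFound is returned when a job is in neither place.
var ErrNotFound = errors.New("job not found")

// JobStore stands in for the SQLite database (assumption: the real
// store is a database table, not a map).
type JobStore struct {
	mu   sync.Mutex
	rows map[string]Job
}

func NewJobStore() *JobStore { return &JobStore{rows: make(map[string]Job)} }

func (s *JobStore) Save(j Job) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.rows[j.ID] = j
}

func (s *JobStore) Get(id string) (Job, bool) {
	s.mu.Lock()
	defer s.mu.Unlock()
	j, ok := s.rows[id]
	return j, ok
}

// ActiveJobs holds only jobs that have not reached a terminal state.
type ActiveJobs struct {
	mu   sync.RWMutex
	jobs map[string]Job
}

func NewActiveJobs() *ActiveJobs { return &ActiveJobs{jobs: make(map[string]Job)} }

func (a *ActiveJobs) Add(j Job) {
	a.mu.Lock()
	defer a.mu.Unlock()
	a.jobs[j.ID] = j
}

// Finish persists the terminal status first and only then removes the job
// from memory, so a concurrent Lookup never misses in both places.
func (a *ActiveJobs) Finish(store *JobStore, id, status string) {
	a.mu.Lock()
	defer a.mu.Unlock()
	j, ok := a.jobs[id]
	if !ok {
		return
	}
	j.Status = status
	store.Save(j)
	delete(a.jobs, id)
}

// Lookup checks ActiveJobs first (hit) and falls back to the store (miss).
// The second return value reports where the job was found.
func Lookup(a *ActiveJobs, store *JobStore, id string) (Job, string, error) {
	a.mu.RLock()
	j, ok := a.jobs[id]
	a.mu.RUnlock()
	if ok {
		return j, "active", nil
	}
	if j, ok := store.Get(id); ok {
		return j, "database", nil
	}
	return Job{}, "", ErrNotFound
}

func main() {
	active, store := NewActiveJobs(), NewJobStore()
	active.Add(Job{ID: "j1", Status: "RUNNING"})

	j, src, _ := Lookup(active, store, "j1")
	fmt.Println(j.Status, src) // RUNNING active

	active.Finish(store, "j1", "SUCCESSFUL")
	j, src, _ = Lookup(active, store, "j1")
	fmt.Println(j.Status, src) // SUCCESSFUL database

	_, _, err := Lookup(active, store, "missing")
	fmt.Println(err) // job not found
}
```

In this sketch the write to the store happens while the ActiveJobs lock is still held; whether the real handlers order these steps the same way is not stated in this PR.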

System design before and after:

Benefits achieved with this PR:

Using a database allows easy pagination. Previously, jobs were stored in a hashmap, which has no particular order.
SQLite is a file-based database and stays in sync with the system at all times. It therefore solves the issue of losing data if the server crashes between snapshots, without adding the complexity of standing up a new database server.
The SQLite file can be viewed and updated with software such as DBeaver. Previously, the only way to look at jobs was through the API, which provided no control over raw data.
JobsCache was never a cache to begin with; it was serving as a data store.
There is no need to calculate and track the size of jobs in memory, as only active jobs are stored in memory. The remaining jobs are in the database (on disk).
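
As a rough illustration of the offset/limit pagination with previous and next links, here is a minimal sketch. It is not the code from this PR: `parsePage`, `pageLinks`, `pageURL`, and the default limit of 10 are assumptions made for the example; the maximum of 100 follows the "Set maximum limit to 100 for job lists" commit. With a database, the page itself would typically come from an ordered query using `LIMIT ? OFFSET ?`.

```go
package main

import (
	"fmt"
	"net/url"
	"strconv"
)

const (
	defaultLimit = 10  // assumption: the actual default is not stated in this PR
	maxLimit     = 100 // per the "Set maximum limit to 100 for job lists" commit
)

// Link is a hypothetical, simplified pagination link.
type Link struct {
	Rel  string
	Href string
}

// parsePage turns raw query values into a sane limit and offset.
// Missing, non-numeric, or non-positive values fall back to defaults.
func parsePage(limitStr, offsetStr string) (limit, offset int) {
	limit = defaultLimit
	if v, err := strconv.Atoi(limitStr); err == nil && v > 0 {
		limit = v
	}
	if limit > maxLimit {
		limit = maxLimit
	}
	if v, err := strconv.Atoi(offsetStr); err == nil && v > 0 {
		offset = v
	}
	return limit, offset
}

// pageURL builds a link to one page of results.
func pageURL(basePath string, limit, offset int) string {
	q := url.Values{}
	q.Set("limit", strconv.Itoa(limit))
	q.Set("offset", strconv.Itoa(offset))
	return basePath + "?" + q.Encode() // Encode sorts keys: limit, then offset
}

// pageLinks returns prev/next links for the current page, given the
// total number of records (for example from a COUNT query).
func pageLinks(basePath string, limit, offset, total int) []Link {
	var links []Link
	if offset > 0 {
		prev := offset - limit
		if prev < 0 {
			prev = 0
		}
		links = append(links, Link{Rel: "prev", Href: pageURL(basePath, limit, prev)})
	}
	if offset+limit < total {
		links = append(links, Link{Rel: "next", Href: pageURL(basePath, limit, offset+limit)})
	}
	return links
}

func main() {
	q, _ := url.ParseQuery("limit=500&offset=20")
	limit, offset := parsePage(q.Get("limit"), q.Get("offset"))
	fmt.Println(limit, offset) // 100 20

	for _, l := range pageLinks("/jobs", limit, offset, 250) {
		fmt.Println(l.Rel, l.Href)
	}
	// prev /jobs?limit=100&offset=0
	// next /jobs?limit=100&offset=120
}
```

Clamping the limit rather than rejecting oversized values is a design choice of this sketch; an implementation could equally return an error for out-of-range parameters.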

Fix #37 Fix #27 Fix #18


mxkpp · 2023-07-03T18:39:20Z

I see there is mutex lock/unlock going on and have some questions:

Does the current Mutex behavior protect the sqlite file or only the in-memory job lists?
Are deadlocks possible with the current Mutexes?
Are deadlocks possible with the current sqlite interactions?
Should we add timeouts?

ar-siddiqui · 2023-07-06T18:08:16Z

> I see there is mutex lock/unlock going on and have some questions:
>
> Does the current Mutex behavior protect the sqlite file or only the in-memory job lists?
>
> Are deadlocks possible with the current Mutexes?
>
> Are deadlocks possible with the current sqlite interactions?
>
> Should we add timeouts?

@mxkpp I will make this an issue to research later.

mxkpp · 2024-01-15T16:56:08Z

> > I see there is mutex lock/unlock going on and have some questions:
> >
> > Does the current Mutex behavior protect the sqlite file or only the in-memory job lists?
> >
> > Are deadlocks possible with the current Mutexes?
> >
> > Are deadlocks possible with the current sqlite interactions?
> >
> > Should we add timeouts?
>
> @mxkpp I will make this an issue to research later.

@ar-siddiqui was this resolved, or an issue opened to track?

EDIT: nvm I just made this one, please lmk if it is duplicated.
#86

ar-siddiqui added 30 commits June 20, 2023 13:12

update readme to include S3_META_DIR

fd6bb2a

Change cache to activeJobs

9bde73b

Handle addition and removal from ActiveJobs

63bd8b9

Clear cache artifacts

f8cbe66

Close #18

create database

89bb5c2

Add docker_job records to database

2a5e387

Update AWSBatchJob to write to DB

8723ed1

Write updates and logs to database

a39c921

Add Pagination to jobs endpoint #37 Close #9

Make API logs colorful

6a2aab5

update tests

73c88f2

Set maximum limit to 100 for job lists

b41b828

#37

Add pagination to process list

4f6b9a8

#37

Update tests for limit parameter

b0b7818

Prettify job logs

2f7b463

Add script to allow Dbeaver connection

cc9e1f0

When shutting down allow db queries to finish

541e316

Fetch AWS logs only when needed

25b22d0

Upsert logs

3407cc9

Replace aepGrid with dfc process in e2e tests

c499163

Update dfc process

caefccb

Update design diagram

1c0a4f2

Conform to /req/job-list/job-list-success

bb00828

Fix HTML Pages

fad7dcd

Fix#15

Delete scratch postman collection

dc8e6f0

Fix prev page link

cfe55f6

Add pagination to processes endpoint

0217906

Deprecate ioutil

0b2a920

Create .data directory if doesn't exist

ea4aa22

Add documentation to database functions

52846db

Update tests for pagination links

b541550

ar-siddiqui requested a review from slawler June 30, 2023 18:27

Update env variables in readme

6b44aad

slawler approved these changes Jul 6, 2023


ar-siddiqui merged commit e3881e3 into main Jul 6, 2023

ar-siddiqui deleted the feature/db-and-pagination branch July 6, 2023 18:07

mxkpp mentioned this pull request Jan 15, 2024

Add test to determine if mutex deadlocks / other race conditions are possible #86

Open
