Official Website: https://relational.fit.cvut.cz/dataset/CORA
Script connects to public database with given credentials and retrieves/gathers regarding data from different tables into CSV files.
- Database connection
pool_sizeandmax_overflowis set to give a custom behavior tosqlachemy.pool.QueuePoolwhich is a connection pool. - There is an extra thread-pool in the script itself to retrieve and process each unique paper with single SQL query. That means all transactions are distributed among underlying CPUs.
Here is the memory and CPU usages:
Host Machine Specs:
- CPU: Apple M1 (8 Core)
- Memory: 16384MiB
