In this project I am trying to build an out-of-core analytics command-line application and gradually evolve it into a web-based application.
- Convert a CSV file to a Parquet file.
- Use UMap for user-space memory mapping.
- Use `thrust::host_vector` as the in-memory data structure.
- Use Thrust parallel algorithms to do the analytics.
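
The map-then-reduce part of the steps above can be sketched with only the C++ standard library and POSIX calls. This is a minimal sketch, not the project's implementation: POSIX `mmap` stands in for UMap's user-space mapping, and sequential `std::accumulate` / `std::minmax_element` stand in for Thrust's `thrust::reduce` / `thrust::minmax_element`. The names `ColumnStats`, `write_column`, and `summarize_mapped` are hypothetical, and the column is assumed to be stored as raw `double` values rather than as Parquet.

```cpp
#include <algorithm>
#include <cstddef>
#include <fstream>
#include <numeric>
#include <stdexcept>
#include <string>
#include <vector>

#include <fcntl.h>     // open
#include <sys/mman.h>  // mmap, munmap
#include <sys/stat.h>  // fstat
#include <unistd.h>    // close

// Hypothetical summary of one numeric column.
struct ColumnStats {
    std::size_t count;
    double sum;
    double min;
    double max;
};

// Hypothetical helper: write a column of doubles as raw bytes so it can be mapped.
void write_column(const std::string& path, const std::vector<double>& values) {
    std::ofstream out(path, std::ios::binary | std::ios::trunc);
    if (!out) throw std::runtime_error("cannot open " + path + " for writing");
    out.write(reinterpret_cast<const char*>(values.data()),
              static_cast<std::streamsize>(values.size() * sizeof(double)));
    if (!out) throw std::runtime_error("failed to write " + path);
}

// Map the file read-only and reduce over it in place. The kernel pages data in
// on demand, so the file may be larger than physical memory (out-of-core).
// Assumption: mmap is a stand-in for UMap, std:: algorithms for Thrust.
ColumnStats summarize_mapped(const std::string& path) {
    int fd = open(path.c_str(), O_RDONLY);
    if (fd < 0) throw std::runtime_error("cannot open " + path);

    struct stat st;
    if (fstat(fd, &st) != 0) {
        close(fd);
        throw std::runtime_error("cannot stat " + path);
    }
    const std::size_t bytes = static_cast<std::size_t>(st.st_size);
    if (bytes == 0 || bytes % sizeof(double) != 0) {
        close(fd);
        throw std::runtime_error("unexpected file size for " + path);
    }

    void* addr = mmap(nullptr, bytes, PROT_READ, MAP_PRIVATE, fd, 0);
    close(fd);  // the mapping remains valid after the descriptor is closed
    if (addr == MAP_FAILED) throw std::runtime_error("mmap failed for " + path);

    const double* first = static_cast<const double*>(addr);
    const double* last = first + bytes / sizeof(double);

    ColumnStats stats;
    stats.count = static_cast<std::size_t>(last - first);
    stats.sum = std::accumulate(first, last, 0.0);
    const auto extremes = std::minmax_element(first, last);
    stats.min = *extremes.first;
    stats.max = *extremes.second;

    munmap(addr, bytes);
    return stats;
}
```

Calling `write_column` with a few values and then `summarize_mapped` on the same path returns the count, sum, minimum, and maximum without copying the file into a container first. In the project itself, the mapped range would come from UMap and the reductions from Thrust.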
Development Stack
- Apache Arrow C++
- Apache parquet-cpp
- LLNL/umap
- NVIDIA/thrust