From 9159eaf6d71e48367994c0af8024abc7f130ef59 Mon Sep 17 00:00:00 2001 From: Savio Dias <91362589+Savio629@users.noreply.github.com> Date: Tue, 7 May 2024 18:54:21 +0530 Subject: [PATCH] Create README.md --- README.md | 13 +++++++++++++ 1 file changed, 13 insertions(+) create mode 100644 README.md diff --git a/README.md b/README.md new file mode 100644 index 0000000..31b7869 --- /dev/null +++ b/README.md @@ -0,0 +1,13 @@ +### We need to scrap the data from [ejalshakti.gov.in](https://ejalshakti.gov.in/JJM/JJMReports/BasicInformation/JJMRep_AbstractData_D.aspx?Istate=9or6Umv%2bgig%3d&IAgency=9or6Umv%2bgig%3d&IDistrict=gMqMutIC0u0%3d&Iblock=gMqMutIC0u0%3d&IFinyear=joOf9Wxy6nf0qdH7vFm42w%3d%3d&ICategory=5C1KxeqUjmo%3d) + +This pull request consists of workflow code and the index.js https://github.com/ChakshuGautam/ejalshakti.gov.in-scraper/pull/3/files + +Successfully runned job in the workflow: https://github.com/Savio629/workflow-testing/actions/runs/8973011035 +(We just deleted the data folder for testing parellization) + +The index.js iteratets through all the dropdown present on the website using puppeteer and it works fine + +The issue right now is to run this jobs parallely as it can take a lot of time to download the data + +Ref of one job that is running right now +https://github.com/Savio629/workflow-testing/actions/runs/8982203324/job/24669283595#step:5:19