Skip to content

Commit 75cd341

Browse files
fixed issues
1 parent 67735b0 commit 75cd341

File tree

3 files changed

+6
-6
lines changed

3 files changed

+6
-6
lines changed

Dockerfile

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,11 +17,11 @@ ADD requirements.txt /home/jovyan/openaire/
1717

1818

1919
# this command is to download the data and set up the requirements
20-
# COMMENT NOT TO DOWNLOAD THE DATA
20+
# COMMENT NOT TO DOWNLOAD THE DATA. UNCOMMENT the last three lines
2121
RUN cd /home/jovyan/openaire && make data
2222

23-
# UNCOMMENT THE FOLLOWING COMMANDS TO COPY THE DATA FROM LOCAL FOLDER
2423
# the following commands are for downloaded dataset stored in the local folder to be added to the image
24+
# UNCOMMENT THE FOLLOWING COMMANDS TO COPY THE DATA FROM LOCAL FOLDER. COMMENT the previous command
2525
# ADD [downloaded_data_folder] /home/jovyan/openaire/[downloaded_data_folder]
2626
# RUN ls /home/jovyan/openaire/[downloaded_data_folder]/*.tar | xargs -i tar xf {} -C /home/jovyan/openaire/data/raw/
2727
# RUN cd /home/jovyan/openaire && make requirements

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ This will take care of everything is needed to create a Docker image. At the end
2020

2121
# How to run the container
2222
Open Docker Engine, locate the Docker image you just built and run it, having care of specifying a port (e.g., 8888).
23-
In the log, you should see an URL promped like `https://127.0.0.1:8888/...`; that's our guy, click on it.
23+
In the log, you should see an URL prompted like `https://127.0.0.1:8888/...`; that's our guy, click on it.
2424
The browser will open a page with JupyterLab; this is where you can start playing with the OpenAIRE Graph.
2525

2626
You can run the container from the Terminal too with the following command

notebooks/beginners_kit.ipynb

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@
6666
"metadata": {},
6767
"outputs": [],
6868
"source": [
69-
"df = pd.read_json('../data/raw/publication/part-00000.txt.gz', compression='gzip', lines=True)\n",
69+
"df = pd.read_json('../data/raw/publication/part-00000.json.gz', compression='gzip', lines=True)\n",
7070
"df.head(2)"
7171
]
7272
},
@@ -250,7 +250,7 @@
250250
"outputs": [],
251251
"source": [
252252
"pretty_print(json.loads(publications\n",
253-
" .where(\"id='78975075580c::2ff84f3173897001283274434e8f3eaa'\")\n",
253+
" .where(\"id='dedup_wf_001::0e3994a1750486fa484062a783fb3cae'\")\n",
254254
" .toJSON()\n",
255255
" .first()), expanded=False)"
256256
]
@@ -721,7 +721,7 @@
721721
"query = \"\"\"\n",
722722
"SELECT legalshortname, legalname\n",
723723
"FROM organizations \n",
724-
"WHERE legalshortname IS NULL and legalshortname IS NOT NULL\n",
724+
"WHERE legalshortname IS NULL and legalname IS NOT NULL\n",
725725
"\"\"\"\n",
726726
"\n",
727727
"spark.sql(query).limit(20).toPandas()"

0 commit comments

Comments
 (0)