Warning and cannot download any submission files #11

Mer9ury · 2023-01-10T08:40:22Z

When I try to download the dataset, I get this warning:
/opt/conda/lib/python3.7/site-packages/pmaw/Request.py:263: UserWarning: 2000 items were not found in Pushshift
warnings.warn(f"{self.limit} items were not found in Pushshift")

and cannot download any images :(

Downloading comments of 0 submission files
Getting images for:
[]

How can I solve this problem?

MartinEthier · 2023-01-26T20:18:49Z

Same here. It would be better if the authors gave us a direct download link for the dataset (e.g., on Google Drive) instead of making us scrape it ourselves.

dveni · 2023-01-30T07:42:18Z

Hi! Sorry for the wait. It's true that the current way of downloading the data is far from convenient, and we are looking for alternatives that also comply with Reddit's terms of service. Let me look into the warning and why it's not downloading images, and I'll get back to you!

dveni · 2023-02-13T13:18:14Z

Hi! I finally got some time to look into this. Apparently Pushshift is migrating to a new infrastructure, and all data from before November is not yet available (see the official thread). They are also updating the API, which caused some issues in pmaw. I've already updated pmaw, but we will have to wait until Pushshift is completely online again.
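Until then, it may help to make the script stop as soon as Pushshift returns nothing, rather than continuing on to "Downloading comments of 0 submission files". Below is a minimal sketch of that idea using only Python's standard `warnings` module. The names `fetch_or_fail` and `fake_pushshift_search` are hypothetical and not part of this repository or of pmaw; the stand-in only mimics the warning text from the report above.

```python
import warnings


def fetch_or_fail(fetch, *args, **kwargs):
    """Call `fetch` and raise if it returned no items.

    `fetch` is any callable returning an iterable of items. Assumption:
    the caller passes in whatever function performs the Pushshift request.
    """
    with warnings.catch_warnings(record=True) as caught:
        # Record every warning raised during the call. Note that this
        # captures all warnings, not only the Pushshift one.
        warnings.simplefilter("always")
        items = list(fetch(*args, **kwargs))

    # Keep only the messages that match the warning seen in this issue.
    missing = [str(w.message) for w in caught
               if "not found in Pushshift" in str(w.message)]
    if not items:
        detail = missing[0] if missing else "no warning was emitted"
        raise RuntimeError(f"Pushshift returned 0 items ({detail})")
    return items


def fake_pushshift_search(limit):
    # Hypothetical stand-in for the real request: it emits the same
    # warning text as in the report and returns an empty result.
    warnings.warn(f"{limit} items were not found in Pushshift")
    return []
```

Calling `fetch_or_fail(fake_pushshift_search, 2000)` raises a `RuntimeError` that carries the warning text, so an empty download fails loudly instead of silently producing an empty image list.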

In the meantime, you can uncomment this line to scrape the latest post in the subreddit (although I still have to check whether the preprocessing scripts will need to be updated) or contact me by email.

I'll post updates in this issue :)

leiyaqi · 2023-05-07T07:58:34Z

Hello, I have commented this line of code, but I still encounter the following problem. How can I solve it?

FBehrad · 2023-06-19T08:35:10Z

> Hello, I have commented this line of code, but I still encounter the following problem. How can I solve this problem?

I have the same problem. :(

1190202328 · 2023-08-15T07:38:59Z

I have run into the same problem. Does anyone know how to solve it?

uyo9ko · 2023-08-15T10:23:36Z

Has anyone successfully downloaded this data? Is there a backup of this data available?

dveni mentioned this issue Feb 28, 2023

Problem with downloading data from Reddit #13

Closed
