Replies: 8 comments 3 replies
-
I vote for "Ability to start and stop the crawler and resume from the previous position" because gui not that much important for me. Other features are nice to have but can be easily implemented / manipulated by scripting languages or excel/google sheets etc except long and lat feature. |
Beta Was this translation helpful? Give feedback.
-
I will try to implement lon and lat customization first. |
Beta Was this translation helpful? Give feedback.
-
When scraping a lot of cities next to each other i get a lot of duplicate results. Scraping all cities for a category that is not available in most cities will cause google to zoom out and collect results from other cities. But these results also appear when scraping the city where the result is actually in. For a small country i scraped 20x more then needed. Which is huge. Maybe adding a identifier to one of the columns in gmaps table and doing upsert based on it helps. For example using the google identifier (not sure if you have it at that time). But if first all queries in the txt file are done, followed by the 'scrape jobs' it will reduce the queries significantly. Imagine doing scraping at scale (200k cities/villages in usa) and then do 20x more then needed. This will drastically decrease the total duration. |
Beta Was this translation helpful? Give feedback.
-
Does it have progressbar ? |
Beta Was this translation helpful? Give feedback.
-
Please select the feature which you would like to see next.
Voting will be open until 25 Jun 2024
23 votes ·
Beta Was this translation helpful? Give feedback.
All reactions