Wechat-Moments-Scraper

Usages

A program used to scrape and collect WeChat Moments data from your friends. Possible use cases include

Data analysis of user activity (Possibly in violation of TOS)
Stalking your friends through their Moments activity
Producing fancy graphs to demonstrate your superior friend circle

Running the program

Important: Can only be used on Windows. Also, please read the Bugs section before making the decision to run the program.

The program was not originally designed to be used by anyone other than myself. Therefore, it is very user-unfriendly. Steps for running with Postgres database follow:

Download and unzip the repository
Set up a PostgreSQL database (Refer to Youtube for the thousands of tutorials available)
Open scraperV2.py and edit user settings with your credentials
- Must Change
  - HOSTNAME: Your database IP
  - DATABASE: Database name
  - USERNAME: Database login username
  - PWD: Database login password
  - PORT_ID: The port of the database
- Optional
  - MAX_PYQ: Number of posts the scraper will aim for. I suggest setting the value to ~500 for the initial scrape and 0 after that.
  - SCRAPER_NAME: Your scraper username, intended to track progress of multiple scraping machines.
  - UPDATE_FREQ: Uploads data to the database every x posts scraped.
  - SCROLL_DIST: How far the program scrolls after each post. Keep it at its default value.
  - PROCESS_ALL: Processes all posts in the database to generate users. Don't change this.
Open WeChat Moments and scroll to the top.
Open a Terminal window and make sure the Moments window is fully visible. Run the command python3 scraperV2.py and let the program run.
The program will (should) automatically stop when it hits the week mark, where the date data is considered too inaccurate to be useful.

Data structure

Posts
Users

Bugs

The program is full of bugs. The approach of reading application memory does not work very well, as the data is provided in lines instead of sections. This makes data processing incredibly tedious, and I am convinced that the raw data of the moments posts cannot be separated into their correct sections without the use of AI. The current code misses or reads but does not index a lot of posts. It also misplaces text in different sections (i.e. content in likes).

A much better approach would be to emulate an android phone. There are many repositories that accomplish the task through that approach already.

As my visual approach appears to be a dead end, there would probably be no more updates.

Credits:

https://github.com/HYLZ-2019/FriendsOfFriends used as the base.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
README.md		README.md
Scraper.py		Scraper.py
generateDialogue.py		generateDialogue.py
scraperV2.py		scraperV2.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wechat-Moments-Scraper

Usages

Running the program

Important: Can only be used on Windows. Also, please read the Bugs section before making the decision to run the program.

Data structure

Bugs

Credits:

About

Releases

Packages

Languages

middleclicker/Wechat-Moments-Scraper

Folders and files

Latest commit

History

Repository files navigation

Wechat-Moments-Scraper

Usages

Running the program

Important: Can only be used on Windows. Also, please read the Bugs section before making the decision to run the program.

Data structure

Bugs

Credits:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages