
A scraping service that fetches jobs on Wellfound by keyword and can send messages remotely from a server. Built by reverse engineering Wellfound's /graphql endpoint.


subbuwu/wellfound_graphqlscout


Wellfound Job + Messaging Service/Scraper - Manual Header/Cookie Injection

Watch the app in action:

1) Job Scraping Service : Click Here

2) Messaging Service : Click Here

Overview

A full-stack job scraping application that lets users search for jobs on Wellfound with custom keywords and send messages through a messaging service. The project uses FastAPI for the backend and React with Vite and Tailwind CSS for the frontend. It scrapes through Wellfound's /graphql endpoint to get past the site's anti-bot security measures.

Screenshots of the app: four screenshots, taken 2024-12-06 and 2024-12-07.

Docs: after starting the local FastAPI backend server, open http://localhost:8000/docs

Tech Stack

Backend

  • Python
  • FastAPI
  • Requests library
  • Other helper libraries

Frontend

  • React
  • Vite
  • Tailwind CSS
  • Shadcn UI Components

Prerequisites

  • Python
  • Node.js
  • pip
  • npm / yarn / pnpm / bun

Backend Cookies/Headers Setup

  1. Navigate to the /backend/app/scraper/companies.py file
  2. Replace all the cookies and headers in that file with the values from your own Wellfound cookie storage
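
As a rough illustration of the setup above, the sketch below shows one way the copied values could be laid out and folded into a single Cookie header. The names `COOKIES`, `HEADERS`, and `build_request_headers`, and every placeholder value, are assumptions for illustration; the actual variables in companies.py may be named and structured differently.

```python
# Hypothetical layout: the real variable names in companies.py may differ.
COOKIES = {
    # Copied from the browser's cookie storage for Wellfound
    "cf_clearance": "<value from your browser>",
    # ...add the remaining cookies from your own session here
}

HEADERS = {
    "user-agent": "<your browser's User-Agent>",
    "content-type": "application/json",
}


def build_request_headers(cookies: dict, headers: dict) -> dict:
    """Return a copy of `headers` with the cookies folded into one Cookie header."""
    merged = dict(headers)  # copy so the module-level template is not mutated
    merged["cookie"] = "; ".join(f"{name}={value}" for name, value in cookies.items())
    return merged
```

Calling `build_request_headers(COOKIES, HEADERS)` yields a single dictionary that can be passed as the `headers` argument of an HTTP client call.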

Running the Application

Start Backend

cd backend/app
uvicorn app.main:app --reload   

Start Frontend

cd frontend
npm run dev

Endpoint Description

/search-jobs Endpoint

Request Payload

{
  "userKeywords" : ["laravel","backend"]
}

Response Payload

[
    {
        "job_title": "Full-Stack Engineer (Remote)",
        "company_name": "Infisical",
        "salary": "$70k – $150k • 0.1% – 0.5%",
        "company_type": "PromotedResult"
    }
]
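
A minimal client sketch for this endpoint, using only the standard library, might look like the following. It assumes the backend is reachable at http://localhost:8000 (the address used for the docs page) and that the endpoint accepts POST, which the README does not state. `Job`, `build_search_request`, `parse_jobs`, and `search_jobs` are hypothetical helper names.

```python
import json
import urllib.request
from dataclasses import dataclass

BASE_URL = "http://localhost:8000"  # assumed: same host and port as the /docs page


@dataclass
class Job:
    """One entry of the response array shown above."""
    job_title: str
    company_name: str
    salary: str
    company_type: str


def build_search_request(keywords: list[str]) -> urllib.request.Request:
    """Build (but do not send) a JSON request for /search-jobs."""
    body = json.dumps({"userKeywords": keywords}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/search-jobs",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",  # assumed: the HTTP method is not documented
    )


def parse_jobs(raw: bytes) -> list[Job]:
    """Turn the JSON array returned by /search-jobs into Job records."""
    return [Job(**item) for item in json.loads(raw)]


def search_jobs(keywords: list[str]) -> list[Job]:
    """Send the request to a running backend and parse the reply."""
    with urllib.request.urlopen(build_search_request(keywords)) as response:
        return parse_jobs(response.read())
```

With the backend running, `search_jobs(["laravel", "backend"])` would return a list of `Job` records. Note that `parse_jobs` raises a `TypeError` if the response carries fields other than the four shown in the sample.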

/send-message Endpoint

Request Payload

{
  "userMessage" : "Hey there , How's it going ?"
}
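
The request body can be built and checked before it is sent. The sketch below is one possible client-side helper. It assumes the backend listens on http://localhost:8000 and accepts POST, and `build_message_request` is a hypothetical name. The response format is not documented here, so the sketch leaves the reply uninterpreted.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # assumed: same host and port as the /docs page


def build_message_request(message: str) -> urllib.request.Request:
    """Build (but do not send) a JSON request for /send-message."""
    text = message.strip()
    if not text:
        # Reject blank messages locally instead of sending them to the backend
        raise ValueError("userMessage must not be empty")
    body = json.dumps({"userMessage": text}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/send-message",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",  # assumed: the HTTP method is not documented
    )
```

To send the message, pass the result to `urllib.request.urlopen` while the backend is running.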

Anti-Bot Bypass Techniques

The application uses the following techniques to get past Wellfound's security:

  • Manual cookie and header injection (for example, the cf_clearance cookie)
  • Apollo GraphQL signature
  • Brotli compression/decompression
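
The sketch below shows how these three pieces could fit together on the request and response side. It is an illustration under stated assumptions, not the project's actual code: the header name `x-apollo-signature` is a guess at how the Apollo GraphQL signature is transmitted, and `build_graphql_headers` and `decode_body` are hypothetical helpers. Brotli decoding relies on the third-party `brotli` package; the sketch falls back to an error if it is not installed.

```python
import gzip
import json

try:
    import brotli  # third-party package, needed only for "br" responses
except ImportError:
    brotli = None


def build_graphql_headers(cf_clearance: str, apollo_signature: str) -> dict:
    """Assemble request headers from values copied out of a browser session."""
    return {
        "content-type": "application/json",
        "accept-encoding": "br, gzip",  # advertise Brotli support to the server
        "cookie": f"cf_clearance={cf_clearance}",
        "x-apollo-signature": apollo_signature,  # assumed header name
    }


def decode_body(raw: bytes, content_encoding: str = "") -> dict:
    """Decompress a response body according to Content-Encoding, then parse JSON."""
    encoding = content_encoding.strip().lower()
    if encoding == "br":
        if brotli is None:
            raise RuntimeError("install the 'brotli' package to decode br responses")
        raw = brotli.decompress(raw)
    elif encoding == "gzip":
        raw = gzip.decompress(raw)
    return json.loads(raw.decode("utf-8"))
```

`decode_body` takes the raw bytes and the value of the response's Content-Encoding header, so it can be tested without any network access.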

Legal Disclaimer

🚨 This tool is for educational purposes. Always respect Wellfound's terms of service and robots.txt. Ensure you have proper authorization before scraping.

Contributing

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Subramanian - nsubbu2004@gmail.com
