rawford-ilderman/pump-fun-crypto-coin-scraper

Pump Fun Crypto Coin Scraper

Pump Fun Crypto Coin Scraper collects rich, real-time data on Solana tokens listed on Pump.fun, giving you a full view of each coin’s market behavior, social footprint, and metadata in one place. By automating discovery and monitoring, it saves hours of manual research while keeping you close to emerging trends. This crypto coin scraper is ideal for analysts, traders, and builders who need structured token intelligence at scale.

Created by Bitbash, built to showcase our approach to Scraping and Automation!

Introduction

Pump Fun Crypto Coin Scraper retrieves detailed information about Solana-based tokens listed on Pump.fun and returns it in a structured dataset. It simplifies exploring the constantly changing token landscape by surfacing key metrics like market caps, creation times, social channels, and activity timestamps.

This project helps:

  • Traders who want a reliable data source for strategy development and backtesting.
  • Market researchers tracking token launches, growth patterns, and community signals.
  • Tool builders feeding live token data into dashboards, bots, and analytics pipelines.

Token Intelligence for Solana Builders

  • Tracks up to 1050 tokens per run, sorted by creation time, last trade, last reply, or market cap.
  • Supports flexible sorting direction (ascending or descending) to match your analysis flow.
  • Filters by NSFW status and keyword search terms to narrow down specific token segments.
  • Returns comprehensive token profiles including metadata, social links, and market metrics.
  • Designed for repeatable runs so you can monitor market evolution and community engagement over time.
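The sorting options above can be pictured as a plain comparator. This is a minimal sketch, not the scraper's actual code: the `sortTokens` helper is hypothetical, and the sort key names are assumptions that reuse field names from the output schema.

```javascript
// Hypothetical sort keys; they reuse output field names and may not match
// the scraper's real `sort` input values.
const SORT_KEYS = ["created_timestamp", "last_trade_timestamp", "last_reply", "market_cap"];

// Return a sorted copy of `tokens`; missing values are treated as 0.
function sortTokens(tokens, sort = "created_timestamp", order = "desc") {
  if (!SORT_KEYS.includes(sort)) {
    throw new Error(`Unsupported sort key: ${sort}`);
  }
  const direction = order === "asc" ? 1 : -1;
  return [...tokens].sort((a, b) => direction * ((a[sort] ?? 0) - (b[sort] ?? 0)));
}

const sample = [
  { symbol: "AAA", market_cap: 50 },
  { symbol: "BBB", market_cap: 200 },
  { symbol: "CCC", market_cap: 120 },
];
console.log(sortTokens(sample, "market_cap", "desc").map((t) => t.symbol));
```

Sorting a copy rather than the input array keeps repeated runs comparable, since the original ordering of a dataset is left untouched.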

Features

| Feature | Description |
| --- | --- |
| High-volume token scraping | Collects data for up to 1050 Solana tokens in a single execution, covering a wide slice of the Pump.fun market. |
| Flexible sorting options | Sort tokens by last trade time, creation timestamp, last reply, or market cap to match your research focus. |
| NSFW filtering | Include or exclude NSFW tokens to keep your datasets aligned with your compliance and research constraints. |
| Search term filtering | Use keyword-based searchTerms to focus on specific narratives or themes (e.g., “doge”, “pepe”). |
| Rich market metrics | Captures market cap, virtual reserves, trading timestamps, and live status for deeper quant and trend analysis. |
| Social & metadata enrichment | Pulls website, Twitter/X, Telegram, and image/metadata URIs to enrich CRM systems or dashboards. |
| Creator profiling | Stores creator wallet, username, and profile image to help map out influential wallets and communities. |
| Ready for analytics | Output is structured for direct use in data warehouses, BI tools, or algorithmic trading engines. |

What Data This Scraper Extracts

| Field Name | Field Description |
| --- | --- |
| mint | Unique token mint address on Solana that identifies the asset on-chain. |
| name | Human-readable display name of the token. |
| symbol | Trading symbol or ticker used to represent the token. |
| description | Free-text description of the token or project. |
| total_supply | Total supply of the token as defined by the project. |
| show_name | Boolean flag indicating whether the name should be publicly displayed. |
| market_cap | Market capitalization in SOL based on current pool data. |
| usd_market_cap | Market capitalization converted into USD. |
| virtual_sol_reserves | Virtual SOL reserves in the associated liquidity pool. |
| virtual_token_reserves | Virtual token reserves in the same pool. |
| market_id | Unique identifier for the market or pool. |
| inverted | Boolean flag showing if the market pair is inverted. |
| is_currently_live | Indicates whether the market is currently active and live. |
| raydium_pool | Address of the Raydium pool associated with the token, if available. |
| bonding_curve | Address of the token’s bonding curve contract. |
| associated_bonding_curve | Address of the associated bonding curve used for pricing/liquidity. |
| image_uri | URI pointing to the token’s logo or image asset. |
| metadata_uri | URI for extended token metadata (e.g., JSON on IPFS). |
| website | Project’s main website URL. |
| twitter | Link to the token or project’s Twitter/X profile. |
| telegram | Link to the Telegram channel or group. |
| created_timestamp | Unix timestamp (milliseconds in the example output) of when the token was created. |
| last_trade_timestamp | Timestamp of the most recent trade event for the token. |
| king_of_the_hill_timestamp | Timestamp associated with “king of the hill” or featured state. |
| last_reply | Timestamp of the latest reply or social interaction associated with the token. |
| reply_count | Number of replies, comments, or social messages linked to the token. |
| creator | Wallet address of the token creator. |
| username | Creator’s username if available. |
| profile_image | URL to the creator’s profile image. |
| complete | Boolean indicating if the token configuration/setup is fully completed. |
| hidden | Boolean or null indicating whether the token is hidden from default views. |
| nsfw | Boolean flag marking whether the token is classified as NSFW. |
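When loading records into a warehouse or a trading model, it can help to check a few of the fields above before trusting a row. The sketch below is illustrative only: `validateToken`, the chosen subset of fields, and the decision to tolerate `null` for the social and creator fields are assumptions, not part of the scraper.

```javascript
// Expected primitive type for a subset of fields, based on the field table.
const FIELD_TYPES = {
  mint: "string",
  name: "string",
  symbol: "string",
  market_cap: "number",
  usd_market_cap: "number",
  created_timestamp: "number",
  complete: "boolean",
  nsfw: "boolean",
};
// Fields assumed to be optional: a string when present, otherwise null or absent.
const OPTIONAL_FIELDS = ["website", "twitter", "telegram", "username", "profile_image"];

// Return a list of human-readable problems; an empty list means the record passed.
function validateToken(token) {
  const problems = [];
  for (const [field, type] of Object.entries(FIELD_TYPES)) {
    if (typeof token[field] !== type) {
      problems.push(`${field}: expected ${type}, got ${typeof token[field]}`);
    }
  }
  for (const field of OPTIONAL_FIELDS) {
    const value = token[field];
    if (value !== null && value !== undefined && typeof value !== "string") {
      problems.push(`${field}: expected string or null`);
    }
  }
  return problems;
}

console.log(validateToken({ mint: "abc", name: "Demo", symbol: "DMO" }));
```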

Example Output

The scraper returns an array of token objects; each entry describes one Pump.fun token with its market, social, and metadata fields.

Example:

[
  {
    "mint": "AxGAbdFtdbj2oNXa4dKqFvwHzgFtW9mFHWmd7vQfpump",
    "name": "TOP HAT",
    "symbol": "HAT",
    "description": "Solana's first no-code AI agent launchpad.",
    "image_uri": "https://ipfs.io/ipfs/QmPPhPbe9t5AKkBgBP4uon7aBdwLzukdS5PJoujC4YjJBj",
    "metadata_uri": "https://ipfs.io/ipfs/QmRGP81WsfLPtw4VaExcsJr2sE2P1qftuiszQx5aHBwce6",
    "twitter": "https://x.com/TopHat_One",
    "telegram": "https://t.me/+DLUgJl8MCDBjNTFk",
    "bonding_curve": "EopMZSxBTEad6Y43zvfd4LntmULkqp5hFYViC5cFn4oT",
    "associated_bonding_curve": "9v7VXvR2rytENnBrYbzAfTpwACNxTdGubD5Wdi7Y8xM8",
    "creator": "BXjEFPEDwgNrAdLRHb9kqLK4DrLtd3uQPcmiQpKPZTd5",
    "created_timestamp": 1730374103343,
    "raydium_pool": "Hoz3sC78FMWSziyyvJuv5kTEamBMSe2j1LUGAMP9SbYU",
    "complete": true,
    "virtual_sol_reserves": 115005359321,
    "virtual_token_reserves": 279900000000000,
    "hidden": null,
    "total_supply": 1000000000000000,
    "website": "tophat.one",
    "show_name": true,
    "last_trade_timestamp": 1730375245000,
    "king_of_the_hill_timestamp": 1730374135000,
    "market_cap": 121100,
    "reply_count": 429,
    "last_reply": 1734886752430,
    "nsfw": false,
    "market_id": "HTJT8npsXVVAYH9QTZ3cJAFpZ33JzaR9QBBkvtxR9NDF",
    "inverted": true,
    "is_currently_live": false,
    "username": null,
    "profile_image": null,
    "usd_market_cap": 22678397
  }
]
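A few derived values can be read straight off the example record. The sketch below assumes the timestamps are Unix milliseconds (they have 13 digits in the example) and that `market_cap` is denominated in SOL, as the field table states; the helper names are hypothetical.

```javascript
// Values copied from the example record above.
const token = {
  created_timestamp: 1730374103343,
  last_trade_timestamp: 1730375245000,
  market_cap: 121100,
  usd_market_cap: 22678397,
};

// Convert a millisecond timestamp to an ISO 8601 string in UTC.
function toIso(timestampMs) {
  return new Date(timestampMs).toISOString();
}

// Implied USD price of one SOL, assuming usd_market_cap is simply
// market_cap (in SOL) converted to USD.
function impliedSolPriceUsd({ market_cap, usd_market_cap }) {
  return market_cap > 0 ? usd_market_cap / market_cap : null;
}

// Seconds elapsed between token creation and the most recent trade.
const secondsToLastTrade = (token.last_trade_timestamp - token.created_timestamp) / 1000;

console.log(toIso(token.created_timestamp));
console.log(secondsToLastTrade);
console.log(impliedSolPriceUsd(token).toFixed(2));
```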

Directory Structure Tree

Below is a representative folder structure for a complete implementation. Adjust file names and languages as needed for your tech stack.

Pump Fun Crypto Coin Scraper/
├── src/
│   ├── index.js
│   ├── config/
│   │   └── defaultConfig.json
│   ├── lib/
│   │   ├── client/
│   │   │   └── pumpfunClient.js
│   │   ├── parsers/
│   │   │   ├── tokenParser.js
│   │   │   └── marketMetricsParser.js
│   │   └── utils/
│   │       ├── log.js
│   │       ├── rateLimiter.js
│   │       └── validation.js
│   ├── services/
│   │   ├── tokenFetcher.js
│   │   └── datasetWriter.js
│   └── runners/
│       └── mainRunner.js
├── data/
│   ├── example-input.json
│   └── sample-output.json
├── tests/
│   ├── tokenParser.test.js
│   ├── marketMetricsParser.test.js
│   └── client.test.js
├── .env.example
├── package.json
├── README.md
└── LICENSE
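One way the pieces in this tree could fit together is a small runner that receives its stages as arguments. This is a hypothetical wiring for illustration: `runPipeline` and the stub stages below are assumptions, and the file names in the comments only indicate where such logic might live.

```javascript
// Hypothetical wiring of the services from the tree; the stage functions are
// injected so each one can be tested or replaced independently.
async function runPipeline({ fetchTokens, parseToken, writeDataset }, input) {
  const rawTokens = await fetchTokens(input); // e.g. services/tokenFetcher.js
  const parsed = rawTokens.map(parseToken);   // e.g. lib/parsers/tokenParser.js
  await writeDataset(parsed);                 // e.g. services/datasetWriter.js
  return parsed.length;
}

// Stub stages standing in for real network and storage code.
const written = [];
const stages = {
  fetchTokens: async ({ maxItems }) =>
    [{ mint: "A", market_cap: "10" }, { mint: "B", market_cap: "20" }].slice(0, maxItems),
  parseToken: (raw) => ({ ...raw, market_cap: Number(raw.market_cap) }),
  writeDataset: async (rows) => { written.push(...rows); },
};

runPipeline(stages, { maxItems: 2 }).then((count) => console.log(`wrote ${count} tokens`));
```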

Use Cases

  • Quant traders use it to feed structured market data into their trading models, so they can backtest and execute strategies on fast-moving Solana tokens.
  • Crypto research teams use it to monitor new token launches and growth signals, so they can spot narratives and high-potential projects earlier.
  • Portfolio managers use it to track market caps and social activity across many tokens, so they can rebalance portfolios based on real-time fundamentals and sentiment.
  • Analytics platform builders use it to populate dashboards with detailed token metrics and social links, so they can deliver richer insights to their end-users.
  • Community managers use it to follow reply counts and engagement metrics, so they can judge community traction and focus outreach on promising ecosystems.

FAQs

Q: What do I need before running this scraper? A: You need a runtime capable of making HTTP requests and handling JSON (for example, a Node.js environment). Configure your environment variables or config file to point at the relevant Pump.fun endpoints and customize inputs like sort, order, includeNsfw, searchTerms, and maxItems.
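As a rough illustration of the inputs named above, the sketch below merges user input with defaults and rejects obviously invalid values. The default values, the `"asc"`/`"desc"` spellings, and the `normalizeInput` helper are assumptions; only the input names and the 1050 ceiling come from this README.

```javascript
// Hypothetical defaults; the real scraper's defaults are not documented here.
const DEFAULT_INPUT = {
  sort: "created_timestamp",
  order: "desc",
  includeNsfw: false,
  searchTerms: [],
  maxItems: 1050,
};
const MAX_ITEMS_LIMIT = 1050; // per-run ceiling stated in this README

// Merge user input over the defaults and validate the result.
function normalizeInput(userInput = {}) {
  const input = { ...DEFAULT_INPUT, ...userInput };
  if (!["asc", "desc"].includes(input.order)) {
    throw new Error(`order must be "asc" or "desc", got ${input.order}`);
  }
  if (!Array.isArray(input.searchTerms)) {
    throw new Error("searchTerms must be an array of strings");
  }
  if (!Number.isInteger(input.maxItems) || input.maxItems < 1) {
    throw new Error("maxItems must be a positive integer");
  }
  input.maxItems = Math.min(input.maxItems, MAX_ITEMS_LIMIT); // clamp to the ceiling
  input.includeNsfw = Boolean(input.includeNsfw);
  return input;
}

console.log(normalizeInput({ searchTerms: ["doge"], maxItems: 5000 }));
```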

Q: How many tokens can I collect in a single run? A: The scraper is designed to collect up to 1050 tokens per execution. You can lower this using the maxItems input if you want smaller, more frequent runs for streaming-style analytics.
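If the tokens are fetched page by page, `maxItems` determines how many requests a run needs. The sketch below assumes offset/limit pagination with a page size of 50; both are assumptions for illustration, and the real endpoint may page differently.

```javascript
// Split a run of `maxItems` tokens into offset/limit pages.
// PAGE_SIZE is an assumption; the real endpoint's page size may differ.
const PAGE_SIZE = 50;

function planPages(maxItems, pageSize = PAGE_SIZE) {
  const pages = [];
  for (let offset = 0; offset < maxItems; offset += pageSize) {
    // The last page may be shorter than pageSize.
    pages.push({ offset, limit: Math.min(pageSize, maxItems - offset) });
  }
  return pages;
}

console.log(planPages(1050).length); // 21 pages of 50
console.log(planPages(120));
```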

Q: Can I focus only on specific types of tokens or memes? A: Yes. Use the searchTerms input to pass an array of keywords (for example, “doge”, “pepe”, “cat”) and the scraper will return only tokens whose metadata matches those terms.

Q: Does it handle NSFW tokens? A: You control this behavior using the includeNsfw flag. When set to false, NSFW tokens are excluded from results; when true, they are included alongside other tokens.
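The two filters described in the last two answers can be sketched as a single predicate. Which fields count as "metadata" for keyword matching is not specified here, so searching `name`, `symbol`, and `description` is an assumption, as are the helper names.

```javascript
// Fields searched for keywords; this choice is an assumption.
const SEARCHED_FIELDS = ["name", "symbol", "description"];

// True when any search term appears (case-insensitively) in the searched fields.
function matchesSearch(token, searchTerms) {
  if (searchTerms.length === 0) return true; // no terms means no keyword filter
  const haystack = SEARCHED_FIELDS.map((f) => token[f] ?? "").join(" ").toLowerCase();
  return searchTerms.some((term) => haystack.includes(term.toLowerCase()));
}

// Drop NSFW tokens unless includeNsfw is true, then apply the keyword filter.
function filterTokens(tokens, { searchTerms = [], includeNsfw = false } = {}) {
  return tokens.filter(
    (token) => (includeNsfw || token.nsfw !== true) && matchesSearch(token, searchTerms)
  );
}

const tokens = [
  { symbol: "DOGE2", name: "Doge Two", description: "much wow", nsfw: false },
  { symbol: "CAT", name: "Cat Coin", description: "meow", nsfw: false },
  { symbol: "PEPEX", name: "Pepe X", description: "frog", nsfw: true },
];
console.log(filterTokens(tokens, { searchTerms: ["doge", "pepe"] }).map((t) => t.symbol));
```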


Performance Benchmarks and Results

Primary Metric: In typical setups, the scraper can retrieve and process between 800 and 1050 token entries within a few minutes, depending on network latency and concurrency limits.

Reliability Metric: With sensible retry and backoff logic, success rates above 95% per run are achievable, even across multiple consecutive executions.
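"Retry and backoff" can take many forms; one common variant is capped exponential backoff, sketched below. The `withRetry` and `backoffDelay` helpers and their default values are assumptions for illustration, not the scraper's actual settings.

```javascript
// Delay before retry number `attempt` (0-based): base * 2^attempt, capped at maxMs.
function backoffDelay(attempt, baseMs = 500, maxMs = 8000) {
  return Math.min(maxMs, baseMs * 2 ** attempt);
}

const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Run `task` until it resolves or `retries` extra attempts are used up.
async function withRetry(task, { retries = 3, baseMs = 500, maxMs = 8000 } = {}) {
  for (let attempt = 0; ; attempt += 1) {
    try {
      return await task(attempt);
    } catch (error) {
      if (attempt >= retries) throw error; // out of attempts: surface the last error
      await sleep(backoffDelay(attempt, baseMs, maxMs));
    }
  }
}

// Usage: a stub task that fails twice, then succeeds.
let calls = 0;
withRetry(async () => {
  calls += 1;
  if (calls < 3) throw new Error("temporary failure");
  return "ok";
}, { baseMs: 10 }).then((result) => console.log(result, "after", calls, "calls"));
```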

Efficiency Metric: By batching requests and reusing connections, the scraper maintains high throughput while keeping CPU and memory usage modest, which makes it suitable for standard cloud instances or local machines.
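A simple way to batch requests without overwhelming the network is to cap how many are in flight at once. The sketch below shows only that concurrency cap (connection reuse is not covered); `mapWithConcurrency` is a hypothetical helper and the stub worker stands in for a real page request.

```javascript
// Run `worker` over `items` with at most `limit` calls in flight at once.
async function mapWithConcurrency(items, limit, worker) {
  const results = new Array(items.length);
  let next = 0;
  async function runLane() {
    while (next < items.length) {
      const index = next++; // claim the next unprocessed item
      results[index] = await worker(items[index], index);
    }
  }
  // Start up to `limit` lanes; each lane pulls items until none remain.
  const lanes = Array.from({ length: Math.min(limit, items.length) }, runLane);
  await Promise.all(lanes);
  return results;
}

// Usage with a stub worker standing in for a page request.
const offsets = [0, 50, 100, 150, 200];
mapWithConcurrency(offsets, 2, async (offset) => {
  await new Promise((resolve) => setTimeout(resolve, 10));
  return `page@${offset}`;
}).then((pages) => console.log(pages));
```

Results are written by index, so the output order matches the input order even when requests finish out of order.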

Quality Metric: The combination of on-chain identifiers, liquidity metrics, and social links yields highly complete token profiles, with key fields such as mint, market_cap, usd_market_cap, and website present for the vast majority of active tokens.
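Profile completeness can be measured on your own dataset rather than taken on trust. The sketch below computes, for each key field, the share of records where it is present; `fieldCompleteness` is a hypothetical helper and the two sample records are made up for illustration.

```javascript
// Share of records where each field is present (not null, undefined, or "").
function fieldCompleteness(tokens, fields) {
  const report = {};
  for (const field of fields) {
    const present = tokens.filter((t) => t[field] != null && t[field] !== "").length;
    report[field] = tokens.length ? present / tokens.length : 0;
  }
  return report;
}

// Made-up sample records, not real scraper output.
const tokens = [
  { mint: "A", market_cap: 10, usd_market_cap: 1870, website: "a.example" },
  { mint: "B", market_cap: 20, usd_market_cap: 3740, website: null },
];
console.log(fieldCompleteness(tokens, ["mint", "market_cap", "usd_market_cap", "website"]));
```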

