Pump Fun Crypto Coin Scraper collects rich, real-time data on Solana tokens listed on Pump.fun, giving you a full view of each coin’s market behavior, social footprint, and metadata in one place. By automating discovery and monitoring, it saves hours of manual research while keeping you close to emerging trends. This crypto coin scraper is ideal for analysts, traders, and builders who need structured token intelligence at scale.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
Pump Fun Crypto Coin Scraper retrieves detailed information about Solana-based tokens listed on Pump.fun and returns it in a structured dataset. It simplifies exploring the constantly changing token landscape by surfacing key metrics like market caps, creation times, social channels, and activity timestamps.
This project helps:
- Traders who want a reliable data source for strategy development and backtesting.
- Market researchers tracking token launches, growth patterns, and community signals.
- Tool builders feeding live token data into dashboards, bots, and analytics pipelines.

Key capabilities:
- Tracks up to 1050 tokens per run, sorted by creation time, last trade, last reply, or market cap.
- Supports flexible sorting direction (ascending or descending) to match your analysis flow.
- Filters by NSFW status and keyword search terms to narrow down specific token segments.
- Returns comprehensive token profiles including metadata, social links, and market metrics.
- Designed for repeatable runs so you can monitor market evolution and community engagement over time.
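
The options above can be sketched as a small input-normalization step. This is a minimal sketch, not the project's actual code: the input names (`sort`, `order`, `includeNsfw`, `searchTerms`, `maxItems`) come from this README, while the accepted sort values, the defaults, and the `normalizeInput` helper itself are assumptions made for illustration.

```javascript
// Hypothetical sort keys; the values the scraper really accepts may differ.
const SORT_FIELDS = ["created_timestamp", "last_trade_timestamp", "last_reply", "market_cap"];
const MAX_ITEMS_CAP = 1050; // per-run ceiling stated in this README

// Validate raw input and fill in (assumed) defaults.
function normalizeInput(input = {}) {
  const sort = input.sort ?? "created_timestamp";
  if (!SORT_FIELDS.includes(sort)) {
    throw new Error(`Unsupported sort field: ${sort}`);
  }

  const order = String(input.order ?? "DESC").toUpperCase();
  if (order !== "ASC" && order !== "DESC") {
    throw new Error(`Unsupported order: ${input.order}`);
  }

  // Clamp maxItems into the range 1..1050.
  const requested = Number.isInteger(input.maxItems) ? input.maxItems : MAX_ITEMS_CAP;
  const maxItems = Math.min(Math.max(requested, 1), MAX_ITEMS_CAP);

  // Trim, lowercase, and drop empty search terms.
  const searchTerms = (input.searchTerms ?? [])
    .map((term) => String(term).trim().toLowerCase())
    .filter((term) => term.length > 0);

  return { sort, order, includeNsfw: Boolean(input.includeNsfw), searchTerms, maxItems };
}

const normalized = normalizeInput({
  sort: "market_cap",
  order: "asc",
  maxItems: 5000,
  searchTerms: [" Doge ", ""],
});
console.log(normalized);
```

Clamping `maxItems` rather than rejecting oversized values keeps scheduled runs from failing on a misconfigured limit; throwing on an unknown sort field surfaces typos early.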
| Feature | Description |
|---|---|
| High-volume token scraping | Collects data for up to 1050 Solana tokens in a single execution, covering a wide slice of the Pump.fun market. |
| Flexible sorting options | Sort tokens by last trade time, creation timestamp, last reply, or market cap to match your research focus. |
| NSFW filtering | Include or exclude NSFW tokens to keep your datasets aligned with your compliance and research constraints. |
| Search term filtering | Use keyword-based searchTerms to focus on specific narratives or themes (e.g., “doge”, “pepe”). |
| Rich market metrics | Captures market cap, virtual reserves, trading timestamps, and live status for deeper quant and trend analysis. |
| Social & metadata enrichment | Pulls website, Twitter/X, Telegram, and image/metadata URIs to enrich CRM systems or dashboards. |
| Creator profiling | Stores creator wallet, username, and profile image to help map out influential wallets and communities. |
| Ready for analytics | Output is structured for direct use in data warehouses, BI tools, or algorithmic trading engines. |
| Field Name | Field Description |
|---|---|
| mint | Unique token mint address on Solana that identifies the asset on-chain. |
| name | Human-readable display name of the token. |
| symbol | Trading symbol or ticker used to represent the token. |
| description | Free-text description of the token or project. |
| total_supply | Total supply of the token as defined by the project. |
| show_name | Boolean flag indicating whether the name should be publicly displayed. |
| market_cap | Market capitalization in SOL based on current pool data. |
| usd_market_cap | Market capitalization converted into USD. |
| virtual_sol_reserves | Virtual SOL reserves in the associated liquidity pool. |
| virtual_token_reserves | Virtual token reserves in the same pool. |
| market_id | Unique identifier for the market or pool. |
| inverted | Boolean flag showing if the market pair is inverted. |
| is_currently_live | Indicates whether the market is currently active and live. |
| raydium_pool | Address of the Raydium pool associated with the token, if available. |
| bonding_curve | Address of the token’s bonding curve contract. |
| associated_bonding_curve | Address of the associated bonding curve used for pricing/liquidity. |
| image_uri | URI pointing to the token’s logo or image asset. |
| metadata_uri | URI for extended token metadata (e.g., JSON on IPFS). |
| website | Project’s main website URL. |
| twitter | Link to the token or project’s Twitter/X profile. |
| telegram | Link to the Telegram channel or group. |
| created_timestamp | Unix timestamp in milliseconds marking when the token was created. |
| last_trade_timestamp | Timestamp of the most recent trade event for the token. |
| king_of_the_hill_timestamp | Timestamp associated with “king of the hill” or featured state. |
| last_reply | Timestamp of the latest reply or social interaction associated with the token. |
| reply_count | Number of replies, comments, or social messages linked to the token. |
| creator | Wallet address of the token creator. |
| username | Creator’s username if available. |
| profile_image | URL to the creator’s profile image. |
| complete | Boolean indicating if the token configuration/setup is fully completed. |
| hidden | Boolean or null indicating whether the token is hidden from default views. |
| nsfw | Boolean flag marking whether the token is classified as NSFW. |
The scraper returns an array of token objects; each entry describes one Pump.fun token with its market, social, and metadata fields.
Example:
```json
[
  {
    "mint": "AxGAbdFtdbj2oNXa4dKqFvwHzgFtW9mFHWmd7vQfpump",
    "name": "TOP HAT",
    "symbol": "HAT",
    "description": "Solana's first no-code AI agent launchpad.",
    "image_uri": "https://ipfs.io/ipfs/QmPPhPbe9t5AKkBgBP4uon7aBdwLzukdS5PJoujC4YjJBj",
    "metadata_uri": "https://ipfs.io/ipfs/QmRGP81WsfLPtw4VaExcsJr2sE2P1qftuiszQx5aHBwce6",
    "twitter": "https://x.com/TopHat_One",
    "telegram": "https://t.me/+DLUgJl8MCDBjNTFk",
    "bonding_curve": "EopMZSxBTEad6Y43zvfd4LntmULkqp5hFYViC5cFn4oT",
    "associated_bonding_curve": "9v7VXvR2rytENnBrYbzAfTpwACNxTdGubD5Wdi7Y8xM8",
    "creator": "BXjEFPEDwgNrAdLRHb9kqLK4DrLtd3uQPcmiQpKPZTd5",
    "created_timestamp": 1730374103343,
    "raydium_pool": "Hoz3sC78FMWSziyyvJuv5kTEamBMSe2j1LUGAMP9SbYU",
    "complete": true,
    "virtual_sol_reserves": 115005359321,
    "virtual_token_reserves": 279900000000000,
    "hidden": null,
    "total_supply": 1000000000000000,
    "website": "tophat.one",
    "show_name": true,
    "last_trade_timestamp": 1730375245000,
    "king_of_the_hill_timestamp": 1730374135000,
    "market_cap": 121100,
    "reply_count": 429,
    "last_reply": 1734886752430,
    "nsfw": false,
    "market_id": "HTJT8npsXVVAYH9QTZ3cJAFpZ33JzaR9QBBkvtxR9NDF",
    "inverted": true,
    "is_currently_live": false,
    "username": null,
    "profile_image": null,
    "usd_market_cap": 22678397
  }
]
```
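
A record like the one above usually needs light post-processing before it lands in a dashboard or warehouse. The sketch below is illustrative only: it assumes the timestamp fields are Unix milliseconds (the 13-digit values in the example suggest this) and that a bare domain such as `tophat.one` should be served over `https`. The helper names `toIso`, `normalizeUrl`, and `summarizeToken` are hypothetical.

```javascript
// Convert a millisecond Unix timestamp to ISO 8601; pass null/undefined through as null.
function toIso(ms) {
  return typeof ms === "number" ? new Date(ms).toISOString() : null;
}

// Add a scheme to bare domains (assumption: https is the right default).
function normalizeUrl(url) {
  if (!url) return null;
  return /^https?:\/\//i.test(url) ? url : `https://${url}`;
}

// Reduce a full token record to a compact summary row.
function summarizeToken(token) {
  return {
    mint: token.mint,
    label: `${token.name} (${token.symbol})`,
    createdAt: toIso(token.created_timestamp),
    lastTradeAt: toIso(token.last_trade_timestamp),
    links: [normalizeUrl(token.website), token.twitter, token.telegram].filter(Boolean),
    usdMarketCap: token.usd_market_cap,
  };
}

// A subset of the example record shown above.
const sample = {
  mint: "AxGAbdFtdbj2oNXa4dKqFvwHzgFtW9mFHWmd7vQfpump",
  name: "TOP HAT",
  symbol: "HAT",
  twitter: "https://x.com/TopHat_One",
  telegram: "https://t.me/+DLUgJl8MCDBjNTFk",
  website: "tophat.one",
  created_timestamp: 1730374103343,
  last_trade_timestamp: 1730375245000,
  usd_market_cap: 22678397,
};

const summary = summarizeToken(sample);
console.log(summary);
```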
Below is a representative folder structure for a complete implementation. Adjust file names and languages as needed for your tech stack.
```
Pump Fun Crypto Coin Scraper/
├── src/
│   ├── index.js
│   ├── config/
│   │   └── defaultConfig.json
│   ├── lib/
│   │   ├── client/
│   │   │   └── pumpfunClient.js
│   │   ├── parsers/
│   │   │   ├── tokenParser.js
│   │   │   └── marketMetricsParser.js
│   │   └── utils/
│   │       ├── log.js
│   │       ├── rateLimiter.js
│   │       └── validation.js
│   ├── services/
│   │   ├── tokenFetcher.js
│   │   └── datasetWriter.js
│   └── runners/
│       └── mainRunner.js
├── data/
│   ├── example-input.json
│   └── sample-output.json
├── tests/
│   ├── tokenParser.test.js
│   ├── marketMetricsParser.test.js
│   └── client.test.js
├── .env.example
├── package.json
├── README.md
└── LICENSE
```
- Quant traders use it to feed structured market data into their trading models, so they can backtest and execute strategies on fast-moving Solana tokens.
- Crypto research teams use it to monitor new token launches and growth signals, so they can spot narratives and high-potential projects earlier.
- Portfolio managers use it to track market caps and social activity across many tokens, so they can rebalance portfolios based on real-time fundamentals and sentiment.
- Analytics platform builders use it to populate dashboards with detailed token metrics and social links, so they can deliver richer insights to their end-users.
- Community managers use it to follow reply counts and engagement metrics, so they can judge community traction and focus outreach on promising ecosystems.
Q: What do I need before running this scraper?
A: You need a runtime capable of making HTTP requests and handling JSON (for example, a Node.js environment). Configure your environment variables or config file to point at the relevant Pump.fun endpoints and customize inputs like sort, order, includeNsfw, searchTerms, and maxItems.
Q: How many tokens can I collect in a single run?
A: The scraper is designed to collect up to 1050 tokens per execution. You can lower this using the maxItems input if you want smaller, more frequent runs for streaming-style analytics.
Q: Can I focus only on specific types of tokens or memes?
A: Yes. Use the searchTerms input to pass an array of keywords (for example, “doge”, “pepe”, “cat”) and the scraper will return only tokens whose metadata matches those terms.
Q: Does it handle NSFW tokens?
A: You control this behavior using the includeNsfw flag. When set to false, NSFW tokens are excluded from results; when true, they are included alongside other tokens.
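
To make the `includeNsfw` and `searchTerms` behavior concrete, here is a minimal sketch of the same filtering applied locally to already-fetched records. It rests on two assumptions that this README does not confirm: a token matches when any term appears (case-insensitively) in its `name`, `symbol`, or `description`, and an empty term list matches everything. The sample tokens are made up for the demonstration.

```javascript
// True when any search term occurs in the token's name, symbol, or description.
function matchesSearch(token, searchTerms) {
  if (searchTerms.length === 0) return true;
  const haystack = [token.name, token.symbol, token.description]
    .filter(Boolean)
    .join(" ")
    .toLowerCase();
  return searchTerms.some((term) => haystack.includes(term.toLowerCase()));
}

// Apply the NSFW flag and keyword filter to a list of token records.
function filterTokens(tokens, { includeNsfw = false, searchTerms = [] } = {}) {
  return tokens.filter(
    (token) => (includeNsfw || !token.nsfw) && matchesSearch(token, searchTerms)
  );
}

// Made-up records, not real Pump.fun tokens.
const tokens = [
  { name: "Doge Hat", symbol: "DHAT", description: "", nsfw: false },
  { name: "Pepe Night", symbol: "PEPN", description: "", nsfw: true },
  { name: "Cat Coin", symbol: "CAT", description: "no memes here", nsfw: false },
];

const safeOnly = filterTokens(tokens, { searchTerms: ["doge", "pepe"] });
const withNsfw = filterTokens(tokens, { includeNsfw: true, searchTerms: ["doge", "pepe"] });
console.log(safeOnly.map((t) => t.symbol)); // [ 'DHAT' ]
console.log(withNsfw.map((t) => t.symbol)); // [ 'DHAT', 'PEPN' ]
```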
Primary Metric: In typical setups, the scraper can retrieve and process between 800 and 1050 token entries within a few minutes, depending on network latency and concurrency limits.
Reliability Metric: With sensible retry and backoff logic, success rates above 95% per run are achievable, even across multiple consecutive executions.
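
One way to implement "sensible retry and backoff logic" is capped exponential backoff. The following is a sketch under stated assumptions, not the project's implementation: the names `backoffDelay` and `withRetry`, the 500 ms base delay, the 8 s cap, and the retry count are all illustrative choices.

```javascript
// Delay before the next attempt: base * 2^attempt, capped at capMs.
function backoffDelay(attempt, baseMs = 500, capMs = 8000) {
  return Math.min(capMs, baseMs * 2 ** attempt);
}

const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Run an async task, retrying on failure up to `retries` additional times.
async function withRetry(task, { retries = 4, baseMs = 500, capMs = 8000 } = {}) {
  let lastError;
  for (let attempt = 0; attempt <= retries; attempt += 1) {
    try {
      return await task(attempt);
    } catch (error) {
      lastError = error;
      // Wait only if another attempt will follow.
      if (attempt < retries) {
        await sleep(backoffDelay(attempt, baseMs, capMs));
      }
    }
  }
  throw lastError;
}

const delays = [0, 1, 2, 3, 4, 5].map((attempt) => backoffDelay(attempt));
console.log(delays); // [ 500, 1000, 2000, 4000, 8000, 8000 ]
```

A call might look like `withRetry(() => fetchPage(offset))`, where `fetchPage` stands in for whatever request function your client exposes. Adding random jitter to each delay is a common refinement when many workers retry at once.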
Efficiency Metric: By batching requests and reusing connections, the scraper maintains a high throughput while keeping CPU and memory usage modest—suitable for running on standard cloud instances or local machines.
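
Batching can be sketched as two steps: plan the page requests, then group them so only a few run at a time. The page size of 50 and batch size of 5 below are assumptions chosen for illustration, and `planPages` and `toBatches` are hypothetical helpers; connection reuse is left to the HTTP client and is not shown.

```javascript
// Split a run of maxItems into { offset, limit } page requests.
function planPages(maxItems, pageSize = 50) {
  const pages = [];
  for (let offset = 0; offset < maxItems; offset += pageSize) {
    pages.push({ offset, limit: Math.min(pageSize, maxItems - offset) });
  }
  return pages;
}

// Group items into consecutive batches of at most batchSize.
function toBatches(items, batchSize) {
  const batches = [];
  for (let i = 0; i < items.length; i += batchSize) {
    batches.push(items.slice(i, i + batchSize));
  }
  return batches;
}

const pages = planPages(1050);
const batches = toBatches(pages, 5);
console.log(pages.length, batches.length); // 21 5
```

Each batch can then be awaited with `Promise.all` before the next one starts, which bounds the number of in-flight requests to the batch size.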
Quality Metric: The combination of on-chain identifiers, liquidity metrics, and social links yields highly complete token profiles, with key fields such as mint, market_cap, usd_market_cap, and website present for the vast majority of active tokens.
