
Bright Data's JavaScript SDK. Use it to call Bright Data's scrape and search tools, bypass bot detection and CAPTCHAs, and extract data from the web.


The Bright Data JavaScript SDK provides easy, scalable methods for scraping, web search, and more.

Installation

To install the package, open your terminal:

npm install @brightdata/sdk

Quick start

1. Sign up and get your API key

2. Initialize the Client

Create a file named pizzaSearch.mjs with the following content:

import { bdclient } from '@brightdata/sdk';

const client = new bdclient({
    apiKey: '[your_api_key_here]', // can also be defined as BRIGHTDATA_API_KEY env variable
});

3. Launch your first request

Add a call to the search method:

import { bdclient } from '@brightdata/sdk';

const client = new bdclient({
    apiKey: '[your_api_key_here]', // can also be defined as BRIGHTDATA_API_KEY env variable
});
const result = await client.search('pizza restaurants');
console.log(result);

And run:

node pizzaSearch.mjs

Features

  • Web Scraping: Scrape any website using anti-bot-detection capabilities and proxy support
  • Search Engine Results: Search Google, Bing, and Yandex by query (including batch searches)
  • Parallel Processing: Concurrent processing for multiple URLs or queries
  • Robust Error Handling: Comprehensive error handling with retry logic
  • Zone Management: Automatic zone creation and management
  • Multiple Output Formats: HTML, JSON, and Markdown
  • Dual build: Both ESM and CommonJS supported (see the CommonJS sketch after this list)
  • TypeScript: Fully typed for different combinations of input and output data
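
Since the package ships both builds, CommonJS projects can load it with require. A minimal sketch, assuming the CommonJS entry exposes the same bdclient named export shown in the ESM imports above:

const { bdclient } = require('@brightdata/sdk');

const client = new bdclient({
    apiKey: process.env.BRIGHTDATA_API_KEY, // or pass the key inline
});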

Usage

Scrape websites

// single URL - returns an HTML string by default
const result = await client.scrape('https://example.com');
console.log(result); // output: web page HTML content

// multiple URLs (parallel processing)
const urls = [
    'https://example1.com',
    'https://example2.com',
    'https://example3.com',
];
const results = await client.scrape(urls);
console.log(results); // returns array of html strings

// different data formats available
const htmlResult = await client.scrape('https://example.com', {
    dataFormat: 'html', // returns raw HTML (default: 'html')
});

const screenshotResult = await client.scrape('https://example.com', {
    dataFormat: 'screenshot', // returns base64 screenshot image
});

// different response formats
const jsonResult = await client.scrape('https://example.com', {
    format: 'json', // returns parsed JSON object (default: 'raw' string)
});

// combined custom options
const result = await client.scrape('https://example.com', {
    format: 'raw', // 'raw' (default) or 'json'
    dataFormat: 'markdown', // 'html' (default), 'markdown', or 'screenshot'
    country: 'gb', // two-letter country code
    method: 'GET', // HTTP method (default: 'GET')
});
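
The screenshot option above returns a base64 string; a minimal sketch for writing it to an image file, assuming the SDK returns plain base64 with no data-URI prefix:

import { writeFile } from 'node:fs/promises';

const screenshot = await client.scrape('https://example.com', {
    dataFormat: 'screenshot',
});
// assumption: screenshot is a plain base64-encoded image string
await writeFile('example.png', Buffer.from(screenshot, 'base64'));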

Search Engine Results

// single search query
const result = await client.search('pizza restaurants');
console.log(result);

// multiple queries (parallel processing)
const queries = ['pizza', 'restaurants', 'delivery'];
const results = await client.search(queries);
console.log(results);

// different search engines
const result = await client.search('pizza', {
    searchEngine: 'google', // can also be 'yandex' or 'bing'
});
console.log(result);

// custom options
const results = await client.search(['pizza', 'sushi'], {
    country: 'gb',
    format: 'raw',
});
console.log(results);
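
For batch queries, a small sketch pairing each query with its result, assuming the returned array preserves the input order:

const queries = ['pizza', 'sushi'];
const results = await client.search(queries);
// assumption: results[i] corresponds to queries[i]
queries.forEach((query, i) => {
    console.log(`${query}: ${results[i].length} characters`);
});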

Saving Results

// save scraped content to a local file
const data = await client.scrape('https://example.com');
const filePath = await client.saveResults(data, {
    filename: 'results.json',
    format: 'json',
});
console.log(`Content saved to: ${filePath}`);
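
saveResults also accepts 'csv' and 'txt' (see the API reference below); a sketch saving batch search output as CSV, assuming the method serializes arrays itself:

const searchResults = await client.search(['pizza', 'sushi']);
const csvPath = await client.saveResults(searchResults, {
    filename: 'searches.csv',
    format: 'csv',
});
console.log(`Content saved to: ${csvPath}`);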

Trigger dataset snapshot collection

const res = await client.datasets.linkedin.discoverCompanyPosts([
    { url: 'https://www.linkedin.com/company/bright-data' },
]);

// polls until the snapshot is ready, then downloads it
const filePath = await client.datasets.snapshot.download(res.snapshot_id, {
    filename: './brd_posts.jsonl',
    format: 'jsonl',
});
console.log(`Content saved to: ${filePath}`);
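
The downloaded snapshot is JSONL, one JSON object per line; a minimal sketch for reading it back:

import { readFile } from 'node:fs/promises';

const raw = await readFile('./brd_posts.jsonl', 'utf8');
const posts = raw
    .split('\n')
    .filter(Boolean) // drop the trailing empty line, if any
    .map((line) => JSON.parse(line));
console.log(`Loaded ${posts.length} posts`);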

Configuration

API Key

  • You can get your API key from your Bright Data account settings

Environment Variables

Set the following env variables (also configurable in the client constructor):

BRIGHTDATA_API_KEY=your_bright_data_api_key
BRIGHTDATA_WEB_UNLOCKER_ZONE=your_web_unlocker_zone  # Optional, if you have a specific zone
BRIGHTDATA_SERP_ZONE=your_serp_zone                  # Optional, if you have a specific zone
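
With these variables set, you can wire them through the constructor options documented below (or rely on the SDK reading BRIGHTDATA_API_KEY itself, as noted in the Quick start):

import { bdclient } from '@brightdata/sdk';

const client = new bdclient({
    apiKey: process.env.BRIGHTDATA_API_KEY,
    webUnlockerZone: process.env.BRIGHTDATA_WEB_UNLOCKER_ZONE, // optional
    serpZone: process.env.BRIGHTDATA_SERP_ZONE, // optional
});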

Manage Zones

const zones = await client.listZones();
console.log(`Found ${zones.length} zones`);

Constants

Constant              Default  Description
DEFAULT_CONCURRENCY   10       Max parallel tasks
DEFAULT_TIMEOUT       30000    Request timeout (milliseconds)
MAX_RETRIES           3        Retry attempts on failure
RETRY_BACKOFF_FACTOR  1.5      Exponential backoff multiplier
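
For intuition, these constants imply a geometric retry schedule. A hypothetical sketch (backoffDelays is illustrative; the SDK's actual internals may differ):

function backoffDelays(baseDelayMs, factor, retries) {
    // delay before retry i: baseDelayMs * factor^i
    return Array.from({ length: retries }, (_, i) => baseDelayMs * factor ** i);
}

console.log(backoffDelays(1000, 1.5, 3)); // with a hypothetical 1s base: [ 1000, 1500, 2250 ]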

API Reference

bdclient Class

const client = new bdclient({
    apiKey: 'string', // Your API key
    autoCreateZones: true, // Auto-create zones if they don't exist
    webUnlockerZone: 'string', // Custom web unlocker zone name
    serpZone: 'string', // Custom SERP zone name
    logLevel: 'INFO', // Log level
    structuredLogging: true, // Use structured JSON logging
    verbose: false, // Enable verbose logging
});

Key Methods

scrape(url, options)

Scrapes a single URL or array of URLs using the Web Unlocker.

Parameters:

  • url (string | string[]): Single URL string or array of URLs
  • options.zone (string): Zone identifier (auto-configured if null)
  • options.format ("json" | "raw"): Response format; default "raw"
  • options.method (string): HTTP method; default "GET"
  • options.country (string): Two-letter country code; default ""
  • options.dataFormat ("markdown" | "screenshot" | "html"): Returned content format; default "html"
  • options.concurrency (number): Max parallel workers; default 10
  • options.timeout (number, ms): Request timeout; default 30000
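
For example, the concurrency and timeout options above can be tuned for large batches:

const urls = ['https://example1.com', 'https://example2.com'];
const results = await client.scrape(urls, {
    concurrency: 5, // cap parallel workers below the default of 10
    timeout: 60000, // allow slow pages a full minute instead of 30s
});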

search(query, options)

Searches a single query or array of queries using the SERP API.

Parameters:

  • query (string | string[]): Search query string or array of queries
  • options.searchEngine ("google" | "bing" | "yandex"): Search engine; default "google"
  • options.zone (string): Zone identifier (auto-configured if null)
  • options.format ("json" | "raw"): Response format; default "raw"
  • options.method (string): HTTP method; default "GET"
  • options.country (string): Two-letter country code; default ""
  • options.dataFormat ("markdown" | "screenshot" | "html"): Returned content format; default "html"
  • options.concurrency (number): Max parallel workers; default 10
  • options.timeout (number, ms): Request timeout; default 30000

saveResults(content, options)

Saves content to a local file.

Parameters:

  • content (any): Content to save
  • options.filename (string): Output filename (auto-generated if null)
  • options.format ("json" | "csv" | "txt"): File format; default "json"

listZones()

List all active zones in your Bright Data account.

Returns: Promise<Array>

Error Handling

The SDK includes built-in input validation and retry logic:

try {
    const result = await client.scrape('https://example.com');
    console.log(result);
} catch (error) {
    if (error.name === 'ValidationError') {
        console.error('Invalid input:', error.message);
    } else {
        console.error('API error:', error.message);
    }
}

Development

For development installation:

git clone https://github.com/brightdata/bright-data-sdk-js.git
cd bright-data-sdk-js
npm install
npm run build:dev

Commit conventions and releases

We use Semantic Release for automated releases and repo housekeeping. To let Semantic Release do its job, we follow some light commit message conventions:

  • use the fix: prefix if the commit fixes an issue (triggers a PATCH release, 0.5.0 => 0.5.1)

  • use the feat: prefix if the commit is part of a new feature (triggers a MINOR release, 0.5.0 => 0.6.0)

  • use the docs: prefix if the commit updates documentation (like the README)

  • use chore: or no prefix for general-purpose changes

  • use BREAKING CHANGE: in the commit footer if you need to release a new MAJOR version (0.5.0 => 1.0.0)

    Examples: fix: correct floating numbers bug, docs: fixed typo

Support

For any issues, contact Bright Data support, or open an issue in this repository.

License

This project is licensed under the MIT License.
