Extract detailed data on Y Combinator companies and their founders. Ideal for market research, lead generation, and tracking startup trends.
With this tool, you can gather vital startup information including company profiles, founder details, and job postings from the Y Combinator directory.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
If you are looking for Y Combinator Scraper you've just found your team — Let’s Chat. 👆👆
Y Combinator Scraper allows you to extract data about companies and founders from the Y Combinator directory: company name, description, batch, status, location, open jobs, website, founder name, founder LinkedIn, and more.
This tool is perfect for:
- Lead generation by identifying startups for investment or collaboration.
- Market research to analyze trends and popular niches in the startup ecosystem.
- Studying successful startup approaches and business models.
- Finding inspiration for new projects and innovations.
- Scrapes comprehensive company data including name, description, batch, status, location, and tags.
- Extracts detailed founder information such as name, LinkedIn, and Twitter.
- Gathers open job positions with title, description, location, salary, and experience requirements.
- Supports exporting data in CSV, JSON, Excel, or via API.
- Can target specific Y Combinator batches for more refined searches.
| Feature | Description |
|---|---|
| Company Info | Extracts company name, description, batch, location, website, LinkedIn, and more. |
| Founder Info | Scrapes detailed information on founders including LinkedIn and Twitter links. |
| Open Jobs | Collects job postings with details like job title, location, salary, and experience. |
| Customizable Search | Filter results by specific Y Combinator batch, status, and other criteria. |
| Export Options | Data can be exported in multiple formats like CSV, JSON, and Excel. |
| Field Name | Field Description |
|---|---|
| company_name | The name of the Y Combinator startup. |
| company_image | The image URL representing the company. |
| company_id | Unique company ID provided by Y Combinator. |
| batch | The Y Combinator batch in which the company participated. |
| status | Current status of the company (e.g., ACTIVE, ACQUIRED). |
| location | The location of the company. |
| year_founded | The year the company was founded. |
| team_size | Number of employees in the company. |
| primary_partner | Mentor or partner assigned to the company by Y Combinator. |
| website | URL of the company's official website. |
| company_linkedin | LinkedIn profile of the company. |
| company_x | Twitter profile of the company. |
| founders | A list of founders with name, LinkedIn, and Twitter links. |
| is_hiring | Boolean indicating whether the company is hiring. |
| open_jobs | A list of open job postings with job titles, descriptions, and requirements. |
[
{
"company_image": "https://bookface-images.s3.amazonaws.com/small_logos/fae29a98d132c4b435b336dbb5d6cf4a1aaf5de7.png",
"company_id": 30545,
"company_name": "StarSling",
"url": "https://www.ycombinator.com/companies/starsling",
"short_description": "Cursor for DevOps",
"long_description": "StarSling is building an agentic developer homepage that automates all the tasks that eat up a developer’s time after they’ve left their code editor...",
"batch": "Spring 2025",
"status": "ACTIVE",
"tags": ["ARTIFICIAL-INTELLIGENCE", "DEVELOPER-TOOLS", "B2B", "DEVOPS", "AI", "SAN FRANCISCO"],
"company_location": "San Francisco",
"year_founded": "2025",
"team_size": "2",
"primary_partner": "Tom Blomfield",
"website": "https://www.starsling.dev/",
"company_linkedin": "https://www.linkedin.com/company/starslingdev",
"company_x": "https://x.com/starslingdev",
"founders": [
{
"id": 7866,
"name": "Yonas Beshawred",
"linkedin": "https://www.linkedin.com/in/yonas-beshawred/",
"x": "https://x.com/yonasbe"
},
{
"id": 751609,
"name": "Daniel Worku",
"linkedin": "https://www.linkedin.com/in/worku",
"x": "https://x.com/dbworku"
}
],
"is_hiring": true,
"number_of_open_jobs": 1,
"open_jobs": [
{
"id": 77003,
"title": "Founding Software Engineer (Full-Stack)",
"description_url": "https://www.ycombinator.com/companies/starsling/jobs/ZvHKf88-founding-software-engineer-full-stack",
"description": "We’re looking for a Founding Software Engineer (Full-Stack) to join our team in San Francisco, CA...",
"location": "San Francisco, CA, US",
"salary": "$150K - $190K",
"years_experience": "3+ years"
}
]
}
]
y-combinator-scraper/
├── src/
│ ├── runner.py
│ ├── extractors/
│ │ ├── yc_parser.py
│ │ └── utils.py
│ ├── outputs/
│ │ └── exporters.py
│ └── config/
│ └── settings.example.json
├── data/
│ ├── inputs.sample.txt
│ └── sample.json
├── requirements.txt
└── README.md
- Venture Capitalists use it to gather data on promising startups for investment decisions, so they can evaluate potential companies to back.
- Market Researchers use it to track trends in the startup ecosystem, so they can gain insights into emerging technologies and sectors.
- Recruiters use it to find job openings in Y Combinator-backed companies, so they can connect top talent with high-growth startups.
- Entrepreneurs use it to study successful business models from Y Combinator-backed companies, so they can find inspiration for their own ventures.
Q1: How do I run this scraper?
A1: Simply provide the Y Combinator search URL and click "Start." The scraper will extract the data, which you can then export in your desired format (CSV, JSON, Excel, etc.).
Q2: Can I filter results by batch?
A2: Yes, you can specify the batch number in the search URL to filter companies by their participation in specific Y Combinator batches.
Q3: What formats can I export the data in?
A3: You can export the data in multiple formats such as CSV, JSON, XML, Excel, or HTML.
Primary Metric: Average scraping speed of 1,000 records per 5 minutes.
Reliability Metric: 98% success rate on scraping data from Y Combinator's directory.
Efficiency Metric: 95% accuracy in data extraction.
Quality Metric: Data completeness of 99%, with all relevant company, founder, and job data captured.
