Web-Scrapping-Java

⚠️ Problem

You have a HTML document that you want to extract data from. You know generally the structure of the HTML document.

♻️ Java HTML Parser

jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors.jsoup can parse HTML files, input streams, URLs, or even strings. It eases data extraction from HTML by offering Document Object Model (DOM) traversal methods and CSS and jQuery-like selectors. jsoup can manipulate the content: the HTML element itself, its attributes, or its text.

Visit https://jsoup.org/ for more details

🔰 Getting Started

✅ Prerequisites

Java 8
Maven 3.5
Git
An IDE or Editor of your choice

💻 Running the Application

Clone the repository

$ git clone https://github.com/Arham-12336/Web-Scrapping-Java-.git

Check into the cloned repository

$ cd main.xml

Install the dependencies and package the application

$ mvn package

Run the web scraper

Run the xml file on the IDE

🤝 Contribution

Please feel free to raise issues using this and I'll get back to you.

You can also fork the repository, make changes and submit a Pull Request.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.idea		.idea
ScrapFinal-master		ScrapFinal-master
src/com/company		src/com/company
Java web Scrapping.iml		Java web Scrapping.iml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web-Scrapping-Java

⚠️ Problem

♻️ Java HTML Parser

🔰 Getting Started

✅ Prerequisites

💻 Running the Application

🤝 Contribution

About

Releases

Packages

Languages

Arham-12336/Web-Scrapping-Java

Folders and files

Latest commit

History

Repository files navigation

Web-Scrapping-Java

⚠️ Problem

♻️ Java HTML Parser

🔰 Getting Started

✅ Prerequisites

💻 Running the Application

🤝 Contribution

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages