This program batch-edits the HTML files in a folder. It uses the HtmlAgilityPack library to parse and manipulate the HTML content.
The program is specifically designed for the modification of excerpts from books published by the NOMIKI BIBLIOTHIKI Group (Νομική Βιβλιοθήκη), helping format and organize the documents more efficiently.
The program performs various modifications, such as:
- Modifying elements with page numbers (`PAGENUMBER` class).
- Modifying elements with side numbers (`SIDENUMBER` class).
- Modifying titles and headings (e.g., `KEFALAIO-TITLOS`, `PRAKTIKO` classes).
- Removing `<a>` tags, footnotes, and `<br>` tags.
- Replacing `<li>` tags with `<p>` tags for better layout.
- Changing elements with the `dir='rtl'` attribute to `ltr`.
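As a rough illustration of how a few of these modifications could be written with HtmlAgilityPack, here is a minimal sketch. It is not the program's actual code: the class `ExcerptCleanupSketch` and its `Select` and `Clean` methods are hypothetical names, and the sketch assumes that removing an `<a>` tag means dropping the tag while keeping its inner text. The class-based modifications (`PAGENUMBER`, `SIDENUMBER`, and so on) are left out because their exact behavior is not described here.

```csharp
using System.Collections.Generic;
using System.IO;
using System.Linq;
using HtmlAgilityPack;

public static class ExcerptCleanupSketch
{
    // Hypothetical helper: SelectNodes returns null when nothing matches,
    // so normalize the result to a (possibly empty) snapshot list.
    private static List<HtmlNode> Select(HtmlDocument doc, string xpath)
    {
        HtmlNodeCollection nodes = doc.DocumentNode.SelectNodes(xpath);
        return nodes == null ? new List<HtmlNode>() : nodes.ToList();
    }

    // Hypothetical entry point: applies four of the listed modifications
    // to one HTML string and returns the rewritten markup.
    public static string Clean(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        // Remove every <br> element.
        foreach (HtmlNode br in Select(doc, "//br"))
            br.Remove();

        // Assumption: drop the <a> wrapper but keep its children (the link text).
        foreach (HtmlNode a in Select(doc, "//a"))
            a.ParentNode.RemoveChild(a, true);

        // Turn <li> elements into <p> elements by renaming the node.
        foreach (HtmlNode li in Select(doc, "//li"))
            li.Name = "p";

        // Switch right-to-left elements to left-to-right.
        foreach (HtmlNode el in Select(doc, "//*[@dir='rtl']"))
            el.SetAttributeValue("dir", "ltr");

        // Serialize the modified tree back to a string.
        using (var writer = new StringWriter())
        {
            doc.Save(writer);
            return writer.ToString();
        }
    }
}
```

A call such as `ExcerptCleanupSketch.Clean("<ul><li dir='rtl'>Text<br></li></ul>")` would return the rewritten markup as a string. Taking a snapshot list before modifying the tree keeps the loops independent of the changes they make.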
- Processes and modifies multiple HTML files in a specified folder.
- Adjusts CSS styling for better presentation (center-aligned text, bold, etc.).
- Removes unwanted tags and content (e.g., `<a>`, footnotes).
- Replaces list tags (`<li>`) with paragraph tags (`<p>`).
To run this program, you will need:
- .NET Framework or .NET Core
- HtmlAgilityPack library for parsing HTML
- Download or clone this repository.
- Ensure that the HtmlAgilityPack library is installed via NuGet (for example, with `dotnet add package HtmlAgilityPack`).
- Run the program.
- The program will prompt you to enter the path of the folder containing the HTML files.
- Once the folder is selected, it will iterate through all `.html` files and apply the defined modifications.
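The prompt-and-iterate flow described in these steps could look roughly like the sketch below. It is an illustration under stated assumptions, not the program's real source: the class name `FolderLoopSketch` is hypothetical, files are assumed to be UTF-8 encoded and overwritten in place, and a single `<br>` removal stands in for the full set of modifications.

```csharp
using System;
using System.IO;
using System.Linq;
using System.Text;
using HtmlAgilityPack;

public static class FolderLoopSketch
{
    public static void Main()
    {
        // Ask for the folder that holds the HTML files.
        Console.Write("Folder containing the HTML files: ");
        var folder = Console.ReadLine();

        if (string.IsNullOrWhiteSpace(folder) || !Directory.Exists(folder))
        {
            Console.WriteLine("Folder not found.");
            return;
        }

        // Visit every .html file directly inside the folder (no subfolders).
        foreach (string path in Directory.GetFiles(folder, "*.html"))
        {
            var doc = new HtmlDocument();
            // Assumption: the files are UTF-8 encoded.
            doc.Load(path, Encoding.UTF8);

            // Placeholder modification: remove <br> elements.
            // SelectNodes returns null when nothing matches.
            var brs = doc.DocumentNode.SelectNodes("//br");
            if (brs != null)
            {
                foreach (HtmlNode br in brs.ToList())
                    br.Remove();
            }

            // Assumption: the file is overwritten in place.
            doc.Save(path, Encoding.UTF8);
            Console.WriteLine("Processed " + Path.GetFileName(path));
        }
    }
}
```

Because this sketch overwrites the files, working on a copy of the folder is the safer choice when trying it out.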
This project is licensed under the MIT License - see the LICENSE file for details.
Feel free to open issues or pull requests if you encounter any bugs or want to contribute improvements.