This application handles PDF file operations such as splitting, renaming, and editing based on facility and sequence data. It is built with React and uses pdf-lib and pdfjs-dist for PDF manipulation, and JSZip for creating ZIP archives of the processed files. The second part, the General Order Analyzer, is responsible for parsing PDFs, finding user-supplied keywords, and extracting the matching data into Excel sheets.
- Split PDF Files:
  - Users can upload PDF files, and the application will split them into individual pages. Each page is saved as a separate PDF and zipped into a single file for download.
- Rename and Zip PDF Files:
  - Users can upload PDF files along with facility and sequence data. The application processes the files, matches the sequence numbers and facility IDs, and renames the PDF pages accordingly. Files with the same sequence number are merged and saved as SEQxxxx.pdf.
- Add Annotations to PDF Pages:
  - Users can add work order and sequence number annotations to each page of the PDF files. The annotated PDFs are zipped and available for download.
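The merge-by-sequence step in the rename feature can be illustrated with a minimal sketch. It assumes that `xxxx` in `SEQxxxx.pdf` is the sequence number zero-padded to four digits, which the description above does not confirm. The `PageRef` type and both function names are hypothetical and not taken from the application. The sketch only decides which pages end up in which output file; the actual PDF merging is left out.

```typescript
// Hypothetical description of one page that has been matched to a sequence number.
interface PageRef {
  sourceFile: string;
  pageIndex: number; // zero-based index within sourceFile
  sequence: number;
}

// Assumption: "xxxx" is the sequence number zero-padded to four digits.
function sequenceFileName(sequence: number): string {
  const digits = String(sequence);
  const padded = digits.length >= 4 ? digits : "0000".slice(digits.length) + digits;
  return "SEQ" + padded + ".pdf";
}

// Group pages by output file name, keeping the input order inside each group.
function groupBySequence(pages: PageRef[]): Record<string, PageRef[]> {
  const groups: Record<string, PageRef[]> = {};
  for (const page of pages) {
    const name = sequenceFileName(page.sequence);
    if (!groups[name]) {
      groups[name] = [];
    }
    groups[name].push(page);
  }
  return groups;
}
```

Each key of the returned object is an output file name, and its value lists the pages that would be merged into that file.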
- Upload PDF Files:
  - Use the drag-and-drop interface to upload PDF files for splitting or renaming.
- Process Facility Data:
  - Enter the sequence numbers and facility IDs in the specified format. The application parses this data and uses it to rename and annotate the PDF pages.
- Download Processed Files:
  - After processing, the application provides options to download the split, renamed, or annotated PDF files as a ZIP archive.
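The exact facility data format is not spelled out in this README. As a rough sketch, the parser below assumes one entry per line, with a numeric sequence number followed by a facility ID, separated by a comma or whitespace. The format, the `FacilityEntry` type, and the `parseFacilityData` function are all assumptions made for illustration.

```typescript
// Hypothetical parsed entry: a sequence number paired with a facility ID.
interface FacilityEntry {
  sequence: number;
  facilityId: string;
}

// Assumed input format: one "sequence, facilityId" pair per line.
// Blank lines are skipped; anything else that does not fit is rejected.
function parseFacilityData(input: string): FacilityEntry[] {
  const entries: FacilityEntry[] = [];
  for (const rawLine of input.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "") {
      continue;
    }
    const parts = line.split(/[\s,]+/);
    if (parts.length !== 2 || !/^\d+$/.test(parts[0])) {
      throw new Error('Unrecognized facility line: "' + line + '"');
    }
    entries.push({ sequence: parseInt(parts[0], 10), facilityId: parts[1] });
  }
  return entries;
}
```

Rejecting malformed lines early, rather than skipping them silently, makes it easier to spot a typo before any PDF is renamed.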
- PDF File Upload:
  - Users can upload multiple PDF files using a drag-and-drop interface for processing and analysis.
- Keyword Search:
  - Users can input keywords (comma-separated) to search for in the uploaded PDF files. The application identifies sentences containing these keywords and generates a report.
- General Order Analysis:
  - The application performs a specialized analysis for General Order 95 (GO 95) and General Order 128 (GO 128) rules. It extracts and matches sentences against these rules and includes the definitions in the report.
- Excel Report Generation:
  - After analysis, the application generates an Excel report with detailed information on the keyword occurrences, including the sentence context, page number, and any matched General Order rules and definitions.
- Interactive Help:
  - Users can access a help popup with a video tutorial explaining how to use the application.
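The keyword search feature can be sketched as follows. This is not the application's actual implementation. It assumes the page text has already been extracted into one string per page, that matching is case-insensitive, and that sentences end at `.`, `!`, or `?`. The names `parseKeywords`, `findKeywordSentences`, and `SentenceHit` are hypothetical.

```typescript
// Split a comma-separated keyword string into trimmed, lowercase keywords.
function parseKeywords(input: string): string[] {
  return input
    .split(",")
    .map((keyword) => keyword.trim().toLowerCase())
    .filter((keyword) => keyword.length > 0);
}

// Hypothetical result record: one keyword found in one sentence on one page.
interface SentenceHit {
  keyword: string;
  page: number; // one-based page number
  sentence: string;
}

// pageTexts[i] holds the extracted text of page i + 1.
function findKeywordSentences(pageTexts: string[], keywords: string[]): SentenceHit[] {
  const hits: SentenceHit[] = [];
  pageTexts.forEach((text, index) => {
    // Naive sentence split: a run of text up to and including ., ! or ?.
    const sentences = text.match(/[^.!?]+[.!?]*/g) || [];
    for (const raw of sentences) {
      const sentence = raw.trim();
      if (sentence === "") {
        continue;
      }
      const lowered = sentence.toLowerCase();
      for (const keyword of keywords) {
        if (lowered.indexOf(keyword) !== -1) {
          hits.push({ keyword: keyword, page: index + 1, sentence: sentence });
        }
      }
    }
  });
  return hits;
}
```

The sentence splitter is deliberately simple and will break on abbreviations such as "e.g."; a real analyzer would likely need something more robust.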
- Upload PDF Files:
  - Use the drag-and-drop interface to upload PDF files for analysis. The application reads the text content of each page in the uploaded PDFs.
- Enter Keywords:
  - Enter the keywords you want to search for in the PDFs. Keywords should be comma-separated.
- Perform Analysis:
  - Click on the "Normal Analysis" button to perform a standard keyword search.
  - Click on the "General Order Analysis" button to perform a specialized analysis for GO 95 and GO 128 rules.
- Download Report:
  - After the analysis is complete, the application generates an Excel file containing the results. The report includes details such as the keyword, file name, page number, occurrence count, and any matched General Order rules and definitions.
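To make the report layout concrete, the sketch below builds the rows for the keyword, file name, page number, and occurrence count columns mentioned above as a plain array of arrays, which a spreadsheet library could then write to an Excel file. The General Order columns are left out. The column titles, the `PageText` type, and both function names are assumptions for illustration, and counting is assumed to be case-insensitive and non-overlapping.

```typescript
// Hypothetical input: the extracted text of one page of one uploaded file.
interface PageText {
  fileName: string;
  page: number; // one-based page number
  text: string;
}

// Count case-insensitive, non-overlapping occurrences of keyword in text.
function countOccurrences(text: string, keyword: string): number {
  if (keyword === "") {
    return 0;
  }
  const haystack = text.toLowerCase();
  const needle = keyword.toLowerCase();
  let count = 0;
  let from = haystack.indexOf(needle);
  while (from !== -1) {
    count++;
    from = haystack.indexOf(needle, from + needle.length);
  }
  return count;
}

// Build a header row plus one row per (keyword, page) pair with at least one match.
function buildReportRows(pages: PageText[], keywords: string[]): (string | number)[][] {
  const rows: (string | number)[][] = [
    ["Keyword", "File Name", "Page Number", "Occurrence Count"],
  ];
  for (const keyword of keywords) {
    for (const page of pages) {
      const count = countOccurrences(page.text, keyword);
      if (count > 0) {
        rows.push([keyword, page.fileName, page.page, count]);
      }
    }
  }
  return rows;
}
```

Keeping the rows as plain arrays separates the analysis from the choice of Excel library.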
- Clone the repository: git clone https://github.com/your-username/autoCTR.git
- Navigate to the project directory: cd autoCTR
- Install dependencies: npm install
- Start the application: npm start