Vulnerability Scanner V2.0 Development #133

ChaohuiLi0321 · 2025-09-07T12:41:19Z

Integrate Vulnerability Scanner V2.0 (Automated) into NutriHelp

Summary

This PR integrates the new plugin‑based Vulnerability Scanner V2.0 under Vulnerability_Tool_V2 and exposes a set of API endpoints for starting, monitoring, and retrieving scan results (JSON / HTML).
Local developer onboarding is now automated: a postinstall bootstrap prepares (or gracefully skips) the Python scanner environment.
CI workflows were updated to align with local behavior, and the scheduled security assessment now also runs a full V2 scan.

Key Changes

Scanner Core (Vulnerability_Tool_V2)

Plugin architecture (JWT configuration, missing protection, general security)
Progress emission + structured JSON output
Reports directory for persisted HTML / JSON outputs

Node Integration (scanner.js)

New endpoints under /api/scanner:

GET /api/scanner/test – simple availability check
GET /api/scanner/health – scanner presence & version
GET /api/scanner/plugins – enumerate available/enabled plugins
POST /api/scanner/scan – start an asynchronous scan (returns scan_id)
GET /api/scanner/scan/:scanId/status – live status & progress (0–100%, message)
GET /api/scanner/scan/:scanId/result – final JSON results (with severity summary & findings)
GET /api/scanner/scan/:scanId/report?format=html|json – generate/download HTML (lazy) or retrieve JSON
GET /api/scanner/scan/:scanId/raw – raw diagnostic / salvage output for debugging
POST /api/scanner/quick-scan – synchronous, fast scan (salvages JSON even on non‑zero exit)

Features:

Real‑time progress parsing via stdout sentinel lines (PROGRESS|pct|message)
Resilient JSON recovery (handles noisy stderr, emojis, partial writes)
Unified success messaging even on non‑zero exit when output is parseable
HTML report fallback generation in Node if Python renderer not used
Filename-safe scan IDs with timestamp + optional tag (quick-scan)

Automation & Scripts

bootstrap.js (postinstall) – installs Node deps (if needed), creates placeholder .env (if missing), prepares scanner venv, soft env validation
prepareScanner.js – idempotent Python venv creation + dependency hash check
ensureScannerReady.js – lightweight readiness & auto-repair
Postinstall hook: runs bootstrap in soft mode; strict mode via npm run setup

CI Updates

vulnerability-scan.yml: now uses node scripts/prepareScanner.js (aligned with local behavior), runs JSON + HTML V2 scan
security-assessment.yml: adds full V2 scan (JSON & HTML artifacts) alongside existing assessment logic
Legacy workflow (security.yml) left unchanged intentionally (backward-compatible / incremental adoption)

Documentation

Simplified README.md install flow to just:
```
npm install
npm start
```
Removed obsolete manual venv setup doc (README_SETUP.md)
Added notes about automated scanner bootstrap & graceful Python absence

Resilience / Quality

Cross-platform Python detection (override > local project .venv > venv > python3 > python > py)
UTF‑8 enforced for scanner subprocess to prevent Windows code page breakage
Progress message stability (final “Scan completed successfully” not overwritten)
Dependency change detection via .deps_hash
Graceful degradation if Python missing (API still runs; scanner endpoints return meaningful errors)

Why

Reduce onboarding friction (no manual pip / venv steps)
Provide consistent security scanning locally and in CI
Enable operational & scheduled scanning (security posture visibility)
Harden scanner integration against partial failures / platform inconsistencies

How to Test (Current Flow)

Local:

git clone <repo-url>
cd Nutrihelp-api
npm install         # auto bootstrap (postinstall)
npm start

Then (examples):

curl http://localhost:80/api/scanner/test
curl http://localhost:80/api/scanner/plugins
curl -X POST http://localhost:80/api/scanner/scan -H \"Content-Type: application/json\" -d '{\"target_path\":\"./\"}'
# Use returned scan_id:
curl http://localhost:80/api/scanner/scan/<scan_id>/status
curl http://localhost:80/api/scanner/scan/<scan_id>/result
curl http://localhost:80/api/scanner/scan/<scan_id>/report?format=html
curl -X POST http://localhost:80/api/scanner/quick-scan -H \"Content-Type: application/json\" -d '{\"target_path\":\"./\"}'

CI (GitHub Actions):

Run “Manual Vulnerability & Test Scan” → verify artifacts: vulnerability_report.json / .html
Run “Monthly Security Assessment” → verify security-report-v2.json / .html in artifacts

Risk & Mitigations

Area	Risk	Mitigation
Python env variability	Missing interpreter / mismatched versions	Graceful skip + explicit warnings
Partial scanner output	Non-zero exit / encoding issues	Salvage parser + UTF-8 enforcement
Long-running scans	Potential timeout in CI	Separate manual workflow + incremental adoption
Env config drift	Placeholder .env misuse	Soft validation now; can extend to strict in full mode
Report persistence	Path conflicts	Segregated `Vulnerability_Tool_V2/reports` + fallback logic

Reviewer Focus

scanner.js (async flow, progress parsing, report generation fallback)
scanner_engine.py (progress emission, plugin loop)
prepareScanner.js / bootstrap.js
Updated workflows (vulnerability-scan.yml, security-assessment.yml)
Ensure no real secrets introduced; only placeholders present

Follow-Up Suggestions (Not part of this PR)

Add npm run doctor for single-shot environment diagnostics
Cache Python venv in CI (actions/cache) keyed by requirements hash
Incremental diff-based scan workflow using changed files set
Strict env schema validation (fail build when placeholders remain)
PR comment bot summarizing severity stats on scan completion

Request Reviewers: @madhavi2809

…and implement the plugin base class system.

…uration files.

…dular security scanner main program.

…ture is built correctly.

…ctionality of the security scanning tool named Vulnerability_Tool_V2.

…ing.py

…jwt_config.py

…ttp://localhost:8001/scanner/docs. Run the following command: python -m uvicorn api.scanner_api:app --host 0.0.0.0 --port 8001 --reload

…king properly.

…ent with the report generated by scanning in Swagger UI.

…tput security_report.html --verbose" to generate a debugged report

…te reports in the updated debug format (use the command "python -m uvicorn api.scanner_api:app --host 0.0.0.0 --port 8001 --reload" to start the SwaggerUI integration of NutriHelp Security Scanner V2.0)

…lhost/api-docs, and test the GET and POST methods in the API interface separately.

…grate them into the API interface scanning function in Swagger UI.

- Introduced a comprehensive set of security rules in `rules_v1.yaml` to detect vulnerabilities such as SQL injection, XSS, hardcoded credentials, and insecure file handling across JavaScript, Python, and text files. - Implemented tests for the new rules in `test_general_security_legacy_rules.py`, ensuring detection of hardcoded API keys and permissive CORS configurations. - Enhanced the testing framework with new test cases for excluding paths in `test_exclude_paths.py` and verifying JSON output fields in `test_output_json_fields.py`. - Added a script `rename_reports_security_to_vulnerability.py` for batch renaming legacy security report files to a new naming convention. - Improved the debug rules toggle functionality and HTML report generation in `test_debug_rules_and_html.py`.

…ment

…lp-api into Vulnerability_Scanner_V2.0_Development

… V2.0

…nner_V2.0_Development

…cumentation generation

…canner.js to use new report paths

…nner_V2.0_Development

- Introduced a GitHub Actions workflow for manual vulnerability scanning and optional unit tests. - Updated README to include instructions on running the new workflow and details about inputs and artifacts. - Enhanced the vulnerability scanner to include sensible default global excludes to reduce noise during scans. - Implemented a CI helper script to check for critical findings in the vulnerability report and fail the job if any are found.

…nner_V2.0_Development

- Added `hasInstallScript` to package-lock.json for npm install script support. - Updated package.json with new scripts for scanner preparation and environment validation. - Improved `scanner.js` to allow for explicit Python executable overrides and enhanced progress tracking. - Introduced `bootstrap.js` for one-shot setup of Node and Python dependencies, including environment validation. - Created `ensureScannerReady.js` to check and prepare the scanner environment if necessary. - Implemented `prepareScanner.js` to manage the creation of the Python virtual environment and installation of dependencies.

merge from upstream/master

…y assessment workflow

ChaohuiLi0321 added 30 commits September 5, 2025 21:44

Install basic dependencies

3ea7bf5

Add temporary file ignore filter settings

60c0fd0

Create the plugin package initialization file plugins/base_plugin.py …

e137e6e

…and implement the plugin base class system.

Create the core engine core/scanner_engine.py

1c9a181

Create a configuration management system config/scanner_config.yaml

9367023

Configuration Manager - handles loading and validation of YAML config…

714d083

…uration files.

NutriHelp Security Scanner V2.0 - Main Entry Point: scanner_v2.py, Mo…

b3fe677

…dular security scanner main program.

Phase 1 Quick verification script——Verify that the modular infrastruc…

f5fde8d

…ture is built correctly.

Create a file named test_basic_functionality.py to test the basic fun…

fe7e9c9

…ctionality of the security scanning tool named Vulnerability_Tool_V2.

Create a JWT missing protection plug-in plugins/jwt_security/jwt_miss…

7a16146

…ing.py

Create a JWT configuration verification plug-in plugins/jwt_security/…

d0a8f7f

…jwt_config.py

Generate HTML report, view HTML report

2b66c52

Temporarily add a Scanner to http://localhost/api-docs/

f637efd

Integrate Vulnerability_Scanner_V2.0 into Swagger UI, with the URL: h…

f0995aa

…ttp://localhost:8001/scanner/docs. Run the following command: python -m uvicorn api.scanner_api:app --host 0.0.0.0 --port 8001 --reload

resolve conflicts and commit

19bad1e

Change Comment

1a7925e

Update comment

45c8264

Update - Ensure both command scanning and Swagger UI scanning are wor…

a49416e

…king properly.

Update - Ensured that the report generated by command scan is consist…

37015cc

…ent with the report generated by scanning in Swagger UI.

Use the command "python scanner_v2.py --target ../ --format html --ou…

ca3e920

…tput security_report.html --verbose" to generate a debugged report

Use Swagger UI: http://localhost:8001/scanner/docs to test and genera…

6375f5b

…te reports in the updated debug format (use the command "python -m uvicorn api.scanner_api:app --host 0.0.0.0 --port 8001 --reload" to start the SwaggerUI integration of NutriHelp Security Scanner V2.0)

Integrate Vulnerability_Scanner_V2.0 into the Swagger UI: http://loca…

702082a

…lhost/api-docs, and test the GET and POST methods in the API interface separately.

Update security report

7c28bc8

Add standard password or related security testing mechanisms and inte…

ecfe6fe

…grate them into the API interface scanning function in Swagger UI.

update comment

ac49ba6

include general_security in GET /api/scanner/plugins

3788ba0

chore: update .gitignore to ignore venv and pytest cache

3e9f647

Add Python executable resolution and setup script for virtual environ…

97d3d1d

…ment

Merge branch 'master' of https://github.com/Gopher-Industries/Nutrihe…

3b9bbd9

…lp-api into Vulnerability_Scanner_V2.0_Development

ChaohuiLi0321 added 7 commits September 14, 2025 04:09

Add migration task document for Vulnerability Scanner CI from V1.4 to…

c1498cd

… V2.0

Merge remote-tracking branch 'upstream/master' into Vulnerability_Sca…

07359f4

…nner_V2.0_Development

Add v8-to-istanbul dependency to package.json and package-lock.json

ba07154

Enhance API documentation, and integrate swagger-jsdoc for dynamic do…

22ec9c4

…cumentation generation

Add jest configuration for test matching in package.json

0662947

Add .gitignore entries for vulnerability scanner reports and update s…

230e742

…canner.js to use new report paths

Merge remote-tracking branch 'upstream/master' into Vulnerability_Sca…

1bd8b52

…nner_V2.0_Development

ChaohuiLi0321 changed the title ~~Add Vulnerability Scanner V2.0 integration and Swagger UI endpoint Vulnerability scanner v2.0 development~~ Vulnerability Scanner V2.0 Development Sep 18, 2025

ChaohuiLi0321 requested a review from madhavi2809 September 19, 2025 18:11

ChaohuiLi0321 and others added 7 commits September 25, 2025 21:05

Merge remote-tracking branch 'upstream/master' into Vulnerability_Sca…

a5ba1ea

…nner_V2.0_Development

Update package-lock.json due to “npm install”

a43df7c

Merge remote-tracking branch 'upstream/master' into Vulnerability_Sca…

e0e8dab

…nner_V2.0_Development

Merge pull request #6 from Gopher-Industries/master

8ede8a0

merge from upstream/master

refactor: remove Python setup and V2 scanner integration from securit…

690a065

…y assessment workflow

refactor: remove unused API files and dependencies from the project

7450193

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vulnerability Scanner V2.0 Development #133

Vulnerability Scanner V2.0 Development #133

Uh oh!

ChaohuiLi0321 commented Sep 7, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Vulnerability Scanner V2.0 Development #133

Are you sure you want to change the base?

Vulnerability Scanner V2.0 Development #133

Uh oh!

Conversation

ChaohuiLi0321 commented Sep 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Integrate Vulnerability Scanner V2.0 (Automated) into NutriHelp

Summary

Key Changes

Scanner Core (Vulnerability_Tool_V2)

Node Integration (scanner.js)

Automation & Scripts

CI Updates

Documentation

Resilience / Quality

Why

How to Test (Current Flow)

Risk & Mitigations

Reviewer Focus

Follow-Up Suggestions (Not part of this PR)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ChaohuiLi0321 commented Sep 7, 2025 •

edited

Loading