Breaking out of LLM constraints and building an AI Task Assistant


botesjuan/Breaking-out-LLM


Breaking-out-LLM

My work-in-progress notes as my knowledge expands while researching and learning how to reach my objective:
breaking out of LLM constraints and building an AI Task Assistant that performs interactive security assessments.

The research and learnings documented here are for building a local AI assistant that performs authorized penetration testing tasks. The aim is to let it execute commands against an approved target, review the output to perform additional activities based on the results, and document the artifacts.


Objectives

  1. Create a private, secure local LLM by running Ollama on a personal Ubuntu host.
  2. Start with an LLM that provides the initial interactive input.
  3. Based on the input prompt, the LLM must perform the security activity by running local bash commands on Ubuntu.
  4. The output from the executed commands must be imported into the LLM chat input as results, then analysed to produce the next steps to execute.
  5. The LLM must continue until it needs more input from the human security analyst, or guidance if stuck.
  6. Results must be saved and recorded as evidence to be put into a document, with screenshots taken as part of the evidence.
  7. Screenshots must include dates for timeline and logging.
  8. The LLM is trained on security methodology documentation containing example commands and results to enhance the logic flow.

Constraints

This research is constrained by the available resources and funding for performance testing. This proof of concept (POC) validates the execution functionality and results, not the speed.


Design

Automated LLM Security Assessment Assistant process flow across the LLM, the Ollama Chat API, and the Middleware API on the Ubuntu host:

Process Flow Legend

  1. Scoping user prompt
  2. Convert to an LLM-generated command
  3. Send request to the Ollama Chat API
  4. Send the command to the Middleware API
  5. Middleware API executes the command
  6. Response is sent back to the Ollama Chat API
  7. LLM explains the response
  8. LLM suggests the next command
  9. User prompt
  • Return to step 2 when instructing the LLM with the suggested next command. Future work is to enhance, automate, and continue the security assessment tasks...
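The loop above can be sketched in Python against the endpoints listed later under Setup. The URLs, the `chat_payload` helper, and the result-wrapping wording are assumptions for illustration, not the project's actual code:

```python
import json
import urllib.request

OLLAMA_CHAT = "http://0.0.0.0:11434/api/chat"  # Ollama Chat API (step 3)
MIDDLEWARE = "http://0.0.0.0:5001/execute"     # Middleware API (steps 4-5)

def chat_payload(messages, model="llama3"):
    """Build the JSON body for the Ollama /api/chat endpoint."""
    return {"model": model, "messages": messages, "stream": False}

def post_json(url, body):
    """POST a JSON body and return the parsed JSON response."""
    req = urllib.request.Request(
        url, data=json.dumps(body).encode(),
        headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())

def assessment_loop(scoping_prompt, max_turns=5):
    """Steps 1-9: prompt -> generated command -> execute -> feed results back."""
    messages = [{"role": "user", "content": scoping_prompt}]
    for _ in range(max_turns):
        reply = post_json(OLLAMA_CHAT, chat_payload(messages))  # steps 2-3
        command = reply["message"]["content"]
        result = post_json(MIDDLEWARE, {"command": command})    # steps 4-5
        messages.append(reply["message"])                       # step 6
        messages.append({"role": "user",                        # back to step 2
                         "content": f"Command output:\n{result}"})

# Example (requires both services running):
#   assessment_loop("Scope: scan the approved target 192.168.255.57")
```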

[Diagram: breakout LLM design jail]

Sample Integration Result

The following image from the peanut-ai host shows early, basic proof-of-concept (POC) results of the automated LLM security assessment assistant code. The prompt takes user input and suggests the next command to the user to continue the process flow:

[Screenshot: sample integration result]


Setup

Physical Hardware and OS:

  • Ubuntu 24.04.2
  • Architecture: 64-bit
  • CPU: AMD Ryzen 5 2600
  • CPU cores: 12
  • GPU: NVIDIA GeForce RTX 4060 Ti with 16 GB video memory
  • Onboard memory: 32 GB

Software

  • Local Ollama API target installed at http://0.0.0.0:11434/api/chat
  • Local WebUI installed at http://0.0.0.0:8080/
  • Middleware API executor at http://0.0.0.0:5001 (Python Flask app)
  • Interface processing code: cmd_line_llm_prompt_send_cmd_to_middleware.py

The configuration was altered to make Ollama reachable from all local interfaces; edit the service with:

sudo systemctl edit --full ollama
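The override typically sets the documented `OLLAMA_HOST` environment variable in the unit's `[Service]` section; a minimal sketch (confirm the exact unit contents on your host):

```ini
[Service]
Environment="OLLAMA_HOST=0.0.0.0:11434"
```

After saving, restart the service with `sudo systemctl restart ollama`.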

Not currently relevant - set the Docker instance for Ollama-WebUI to auto-start the container after Ubuntu boots:

sudo docker run -d --restart unless-stopped -p 3000:8080 --name open-webui ghcr.io/open-webui/open-webui:main

Docker instance of Open WebUI: http://127.0.0.1:3000/
Connected to Ollama local instance: http://192.168.255.57:11434

Large Language Model LLM

llama3, the LLM that provides the initial interactive input, is used as an agentic large language model.

Ollama API list of Models Downloaded locally: http://192.168.255.57:11434/api/tags

Agentic Model testing: llama3

Integration Authorization

Apply access control over which services may interact with the Middleware API, to prevent unauthorized access by rogue actors.
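One minimal way to do this is a shared-secret header check in the middleware before any command runs. This is a sketch, not the project's implementation; the `X-Api-Key` header name and the hard-coded key are assumptions:

```python
import hmac

# Hypothetical shared secret; in practice load it from an environment
# variable or a config file, never hard-code it.
API_KEY = "change-me"

def is_authorized(headers, api_key=API_KEY):
    """Return True only if the request carries the expected X-Api-Key header.

    hmac.compare_digest avoids timing side channels when comparing secrets.
    In the Flask middleware this check could run in a before_request hook
    so every route, including /execute, is protected.
    """
    supplied = headers.get("X-Api-Key", "")
    return hmac.compare_digest(supplied, api_key)
```

IP allow-listing (e.g. only the Docker bridge and localhost) could be layered on top of the header check.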

Middleware API

Develop the API integration: based on the input prompt, the LLM must perform the security activity by running local bash commands on Ubuntu.

Path Python API Script: /home/peanut/Downloads/breakout_api

Start API: python3 ollama_command_executor.py
Running on http://0.0.0.0:5001
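The core of such an executor can be sketched as a helper that runs one bash command line and captures its result for the LLM. This is an illustrative sketch under assumed behavior, not the contents of `ollama_command_executor.py`; the Flask route shown in the comment is also an assumption:

```python
import subprocess

def run_command(command, timeout=60):
    """Run one bash command line and capture its output for the LLM.

    WARNING: executing LLM-generated shell commands is dangerous; keep this
    behind access control and point it only at approved targets.
    """
    proc = subprocess.run(
        command, shell=True, capture_output=True, text=True, timeout=timeout)
    return {
        "command": command,
        "stdout": proc.stdout,
        "stderr": proc.stderr,
        "returncode": proc.returncode,
    }

# In the Flask app this would back the POST /execute route, e.g.:
#   @app.route("/execute", methods=["POST"])
#   def execute():
#       return jsonify(run_command(request.json["command"]))
```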

The Ollama LLM processes the text-based input from the user but does not inherently send HTTP requests. The Ollama WebUI frontend running in Docker sends user queries to the local Ollama instance, and if Ollama is instructed to execute commands via the middleware, the WebUI must generate and send the HTTP POST requests.

Troubleshooting Integration

Test communication and permissions between the Docker WebUI and the localhost middleware on port 5001:

sudo docker exec -it open-webui bash
curl -X POST http://192.168.255.57:5001/execute -H "Content-Type: application/json" -d '{"command":"ls -al /home"}'

The above tests whether the Docker instance's bash shell has permission to reach the middleware API running on localhost port 5001 and execute against it.

Import Results from Tasks

The output from the executed commands must be imported into the LLM chat input as results, then analysed to produce the next steps to execute.
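Wrapping an executed command's result as the next chat message can be as simple as the sketch below; the message wording and field names are assumptions for illustration:

```python
def results_to_message(command, stdout, returncode):
    """Wrap a command result as a user-role chat message for /api/chat,
    asking the model to analyse it and propose the next step."""
    content = (
        f"Command executed: {command}\n"
        f"Exit code: {returncode}\n"
        f"Output:\n{stdout}\n"
        "Analyse these results and suggest the next command to execute."
    )
    return {"role": "user", "content": content}
```

Appending this message to the running conversation keeps the full command history in context for the LLM's next suggestion.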

Loop Methodology

Penetration testing methodology.
The LLM must continue until it needs more input from the human security analyst, or guidance if stuck.

Record Artifacts

Results must be saved and recorded as evidence to be put into a document, with screenshots taken as part of the evidence.

Screenshots Evidence

Screenshots must include dates for timeline and logging purposes.
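A small helper can enforce dated filenames for every screenshot so evidence sorts into a timeline; the naming scheme below is an assumption:

```python
from datetime import datetime

def evidence_filename(label, ext="png", now=None):
    """Build a screenshot filename with an ISO-style timestamp, e.g.
    'nmap-scan_2025-01-15T10-30-00.png'. Colons are avoided so the
    name is safe on common filesystems."""
    stamp = (now or datetime.now()).strftime("%Y-%m-%dT%H-%M-%S")
    return f"{label}_{stamp}.{ext}"
```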

Training & Enhance

The LLM is trained on personal data sets of security methodology documentation containing example commands and results to enhance the LLM knowledge base.
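Short of full fine-tuning, one lightweight way to give the model this methodology knowledge is to inject the documentation as a system message at the start of each chat. This is an assumed alternative for illustration, not the project's stated approach:

```python
def methodology_system_message(doc_text):
    """Wrap methodology documentation (example commands and expected
    results) as a system-role message so it steers the LLM's logic flow
    on the /api/chat endpoint."""
    return {
        "role": "system",
        "content": (
            "You are a security assessment assistant. Follow this "
            "methodology, including its example commands and results:\n"
            + doc_text
        ),
    }
```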


Status: In-progress research since January 2025
