Skylos: Guard your Code

Skylos is a static analysis tool for Python codebases which locates dead code, performs quality checks, and finds security vulnerabilties.

Quick Start

Objective	Command	Outcome	Remarks
Hunt Dead Code	`skylos .`	Prune unreachable functions and unused imports instantly.
Precise Hunt	`skylos . --coverage`	Cross-reference results with actual runtime data	Run `skylos . --coverage` first then run your actual scan `skylos . --danger`
Audit Risk & Quality	`skylos . --secrets --danger --quality`	Kill security leaks, tainted data, and architectural rot.	You can run one of the flags, or all 3
Automated Repair	`skylos . --audit --fix`	Let the watchdog handle the labor of cleaning your code.
Secure the Gate	`skylos --gate`	Block risky code from merging with hard-coded standards.

Features

Security & Vulnerability Audit

Taint-Flow Tracking: Follows untrusted input from the API edge to your database to stop SQLi, SSRF, and Path Traversal
Credentials Detection: Detects API keys & secrets (GitHub, GitLab, AWS, Google, SendGrid, private key blocks)
Vulnerability Detection: Flags dangerous patterns including eval/exec, unsafe yaml/pickle loads, and weak cryptographic hashes
Implicit Reference Detection: Catches dynamic patterns like getattr(mod, f"handle_{x}"), framework decorators (@app.route, @pytest.fixture), and f-string dispatch patterns

Codebase Optimization

CST-safe removals: Uses LibCST to remove selected imports or functions (handles multiline imports, aliases, decorators, async etc..)
Logic Awareness: Deep integration for Python frameworks (Django, Flask, FastAPI) and TypeScript (Tree-sitter) to identify active routes and dependencies.
Granular Filtering: Skip lines tagged with # pragma: no skylos, # pragma: no cover, or # noqa

Operational Governance & Runtime

Coverage Integration: Auto-detects .coverage files to verify dead code with runtime data
Quality Gates: Enforces hard thresholds for complexity, nesting, and security risk via pyproject.toml to block non-compliant PRs
Interactive CLI: Manually verify and remove/comment-out findings through an inquirer-based terminal interface
Audit Mode: Leverages an independent reasoning loop to identify "hallucinations" and broken dependencies

Multi-Language Support

Language	Parser	Dead Code	Security	Quality
Python	AST	✅	✅	✅
TypeScript	Tree-sitter	Limited	Limited	Limited

No Node.js required - parser is built-in.

Installation

Basic Installation

## from pypi
pip install skylos

## or from source
git clone https://github.com/duriantaco/skylos.git
cd skylos

pip install .

Performance

For dead code detection benchmarks vs Vulture, Flake8, Ruff, see BENCHMARK.md.

To run the benchmark: python compare_tools.py /path/to/sample_repo

How it works

Skylos builds a reference graph of your entire codebase - who defines what, who calls what, across all files.

Parse all files -> Build definition map -> Track references -> Find orphans (zero refs = dead)

Confidence Scoring

Not all dead code is equally dead. Skylos assigns confidence scores to handle ambiguity:

Confidence	Meaning	Action
100	Definitely unused	Safe to delete
60	Probably unused (default threshold)	Review first
40	Maybe unused (framework helpers)	Likely false positive
20	Possibly unused (decorated/routes)	Almost certainly used
0	Show everything	Debug mode

skylos . -c 60  # Default: high-confidence findings only
skylos . -c 30  # Include framework helpers  
skylos . -c 0  # Everything

Framework Detection

When Skylos sees Flask, Django, or FastAPI imports, it adjusts scoring automatically:

Pattern	Handling
`@app.route`, `@router.get`	Entry point → marked as used
`@pytest.fixture`, `@celery.task`	Entry point → marked as used
`getattr(mod, "func")`	Tracks dynamic reference
`getattr(mod, f"handle_{x}")`	Tracks pattern `handle_*`

Test File Exclusion

Tests call code in weird ways that look like dead code. By default, Skylos excludes:

Detected By	Examples
Path	`/tests/`, `/test/`, `*_test.py`
Imports	`pytest`, `unittest`, `mock`
Decorators	`@pytest.fixture`, `@patch`

# These are auto-excluded (confidence set to 0)
/project/tests/test_user.py
/project/test/helper.py  

# These are analyzed normally
/project/user.py
/project/test_data.py  # Doesn't end with _test.py

Want test files included? Use --include-folder tests.

Philosophy

When ambiguous, we'd rather miss dead code than flag live code as dead.

Framework endpoints are called externally (HTTP, signals). Name resolution handles aliases. When things get unclear, we err on the side of caution.

Gating

Block bad code before it merges. Configure thresholds, run locally, then automate in CI.

1. Initialize Configuration

skylos init

Creates [tool.skylos] in your pyproject.toml:

[tool.skylos]
# Quality thresholds
complexity = 10
nesting = 3
max_args = 5
max_lines = 50
ignore = [] 
model = "gpt-4.1"

# Language overrides (optional)
[tool.skylos.languages.typescript]
complexity = 15
nesting = 4

# Gate policy
[tool.skylos.gate]
fail_on_critical = true
max_security = 0      # Zero tolerance
max_quality = 10      # Allow up to 10 warnings
strict = false

2. Run the Gate

skylos . --quality --danger --gate

If thresholds exceeded, Skylos exits non-zero (blocking CI/CD or git push). You'll be prompted to select files manually or push all at once.

Use --force to bypass in emergencies.

3. GitHub Actions

Full workflow (click to expand)

Create .github/workflows/skylos.yml:

name: Skylos Deadcode Scan

on:
  pull_request:
  push:
    branches: [ main, master ]
  workflow_dispatch:

jobs:
  scan:
    runs-on: ubuntu-latest
    env:
      SKYLOS_STRICT: ${{ vars.SKYLOS_STRICT || 'false' }}
    steps:
      - uses: actions/checkout@v4

      - uses: actions/setup-python@v5
        with:
          python-version: '3.11'
          cache: 'pip'

      - name: Install Skylos
        run: pip install skylos

      - name: Run Skylos
        env:
          REPORT: skylos_${{ github.run_number }}_${{ github.sha }}.json
        run: |
          echo "REPORT=$REPORT" >> "$GITHUB_OUTPUT"
          skylos . --json > "$REPORT"
        id: scan

      - name: Fail if there are findings
        continue-on-error: ${{ env.SKYLOS_STRICT != 'true' }}
        env:
          REPORT: ${{ steps.scan.outputs.REPORT }}
        run: |
            python - << 'PY'
            import json, sys, os
            report = os.environ["REPORT"]
            data = json.load(open(report, "r", encoding="utf-8"))
            count = 0
            for value in data.values():
                if isinstance(value, list):
                    count += len(value)
            print(f"Findings: {count}")
            if count > 0:
              print(f"::warning title=Skylos findings::{count} potential issues found. See {report}")
            sys.exit(1 if count > 0 else 0)
            PY

      - name: Upload report artifact
        if: always()
        uses: actions/upload-artifact@v4
        with:
          name: ${{ steps.scan.outputs.REPORT }}
          path: ${{ steps.scan.outputs.REPORT }}

      - name: Summarize in job log
        if: always()
        run: |
          echo "Skylos report: ${{ steps.scan.outputs.REPORT }}" >> $GITHUB_STEP_SUMMARY

Strict mode: Go to GitHub → Settings → Secrets and variables → Actions → Variables → Add SKYLOS_STRICT with value true.

4. Pre-commit

Pick one approach:

Option A: Skylos hook repo

## .pre-commit-config.yaml
repos:
  - repo: local
    hooks:
      - id: skylos-scan
        name: skylos report
        entry: python -m skylos.cli
        language: system
        pass_filenames: false
        require_serial: true
        args: [".", "--output", "report.json", "--confidence", "70", "--danger"]

      - id: skylos-fail-on-findings
        name: skylos gate
        language: system
        pass_filenames: false
        require_serial: true
        entry: python scripts/skylos_gate.py

Option B: Self-contained local hook

repos:
  - repo: local
    hooks:
      - id: skylos-scan
        name: skylos report
        language: python
        entry: python -m skylos.cli
        pass_filenames: false
        require_serial: true
        additional_dependencies: [skylos==2.8.0]
        args: [".", "--output", "report.json", "--confidence", "70"]

      - id: skylos-fail-on-findings
        name: skylos (soft)
        language: python
        language_version: python3
        pass_filenames: false
        require_serial: true
        entry: >
          python -c "import os, json, sys, pathlib;
          p=pathlib.Path('report.json');
          if not p.exists(): sys.exit(0);
          data=json.loads(p.read_text(encoding='utf-8'));
          count = sum(len(v) for v in data.values() if isinstance(v, list));
          print(f'[skylos] findings: {count}');
          sys.exit(0 if os.getenv('SKYLOS_SOFT') or count==0 else 1)"

If you chose option A, then do remember to put this script below in a folder scripts/sylos_gate.py

#!/usr/bin/env python3
import json
import os
import sys
from pathlib import Path

REPORT = Path("report.json")

def main() -> int:
    if not REPORT.exists():
        print("[skylos] report.json missing (skipping gate)")
        return 0

    txt = REPORT.read_text(encoding="utf-8", errors="ignore").strip()
    if not txt:
        print("[skylos] report.json empty (skipping gate)")
        return 0

    try:
        data = json.loads(txt)
    except Exception as e:
        print(f"[skylos] report.json invalid JSON (skipping gate): {e}")
        return 0

    if isinstance(data, dict):
        vals = data.values()
    elif isinstance(data, list):
        vals = data
    else:
        vals = []

    count = 0
    for v in vals:
        if isinstance(v, list):
            count += len(v)

    print(f"[skylos] findings: {count}")
    soft = os.getenv("SKYLOS_SOFT", "").strip()
    if soft or count == 0:
        return 0
    else:
        return 1

if __name__ == "__main__":
    raise SystemExit(main())

Install:

pip install pre-commit
pre-commit install
pre-commit run --all-files

Run pre-commit in CI

Create .github/workflows/pre-commit.yml:

name: pre-commit
on: [push, pull_request]
jobs:
  run:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with: { python-version: "3.11", cache: "pip" }
      - uses: pre-commit/action@v3.0.1
        with: { extra_args: --all-files }

Note: The second hook is soft by default (SKYLOS_SOFT=1) - prints findings but passes. Remove the env to block commits on findings.

Integration and Ecosystem

Skylos is designed to live everywhere your code does—from your IDE to your deployment pipeline.

1. Integration Environments

Environment	Tool	Use Case
VS Code	Skylos Extension	Real-time guarding. Highlights code rot and risks on-save.
Web UI	`skylos run`	Launch a local dashboard at `localhost:5090` for visual auditing.
CI/CD	GitHub Actions / Pre-commit	Automated gates that audit every PR before it merges.
Quality Gate	`skylos --gate`	Block deployment if security or complexity thresholds are exceeded.

2. Output Formats

Control how you consume the watchdog's findings.

Flag	Format	Primary Use
`--table`	Rich Table	Default human-readable CLI summary.
`--tree`	Logic Tree	Visualizes code hierarchy and structural dependencies.
`--json`	Machine Raw	Piping results to `jq`, custom scripts, or log aggregators.
`-o, --output`	File Export	Save the audit report directly to a file instead of `stdout`.

Auditing and Precision

By default, Skylos finds dead code. Enable additional scans with flags.

Security (`--danger`)

Tracks tainted data from user input to dangerous sinks.

skylos . --danger

Catches	Example
SQL injection	`cur.execute(f"SELECT * FROM users WHERE name='{name}'")`
Command injection	`os.system("zip -r out.zip " + folder)`
SSRF	`requests.get(request.args["url"])`
Path traversal	`open(request.args.get("p"))`
Unsafe deserialize	`pickle.load()`, `yaml.load()` without SafeLoader
Weak crypto	`hashlib.md5()`, `hashlib.sha1()`

Full list in DANGEROUS_CODE.md.

Secrets (`--secrets`)

Detects hardcoded credentials.

skylos . --secrets

Providers: GitHub, GitLab, AWS, Stripe, Slack, Google, SendGrid, Twilio, private keys.

Quality (`--quality`)

Flags functions that are hard to maintain.

skylos . --quality

Rule	ID	What It Catches
Complexity
Cyclomatic complexity	SKY-Q301	Too many branches/loops (default: >10)
Deep nesting	SKY-Q302	Too many nested levels (default: >3)
Structure
Too many arguments	SKY-C303	Functions with >5 args
Function too long	SKY-C304	Functions >50 lines
Logic
Mutable default	SKY-L001	`def foo(x=[])` - causes state leaks
Bare except	SKY-L002	`except:` swallows SystemExit
Dangerous comparison	SKY-L003	`x == None` instead of `x is None`
Anti-pattern try block	SKY-L004	Nested try, or try wrapping too much logic
Performance
Memory load	SKY-P401	`.read()` / `.readlines()` loads entire file
Pandas no chunk	SKY-P402	`read_csv()` without `chunksize`
Nested loop	SKY-P403	O(N²) complexity
Unreachable
Dead branch	SKY-U001	`if False:` or `else` after always-true
Dead statement	SKY-U002	Code after `return`/`raise`/`break`

To ignore a specific rule:

# pyproject.toml
[tool.skylos]
ignore = ["SKY-P403"]  # Allow nested loops

Tune thresholds and disable rules in pyproject.toml:

[tool.skylos]
# Adjust thresholds
complexity = 15        # Default: 10
nesting = 4            # Default: 3
max_args = 7           # Default: 5
max_lines = 80

AI Auditing (`--audit`)

LLM-powered logic review.

skylos . --audit
skylos . --audit --model claude-haiku-4-5-20251001

Finds:

Hallucination Detection: Finds calls to functions that don't actually exist in your repo.
Logic Flaws: Detects "confident but wrong" logic, bare exceptions, and architectural rot.
Using a specific model: --model claude-haiku-4-5-20251001

Autonomous Fix (`--fix`)

Let the LLM fix what it found.

skylos . --fix

API keys stored in your system keychain (macOS Keychain, Windows Credential Locker). Never plaintext.

Combine Everything

skylos . --danger --secrets --quality  # All static scans
skylos . --danger --quality --audit --fix  # Full AI-assisted cleanup

Coverage Integration

Static analysis can't see everything. Python's dynamic nature means patterns like getattr(), plugin registries, and string-based dispatch look like dead code—but they're not.

Coverage integration solves this. If a function actually ran during tests or execution, it's definitely not dead.

Quick Start

# Step 1: 
# Option 1: Let Skylos run your tests first
skylos . --coverage

# Option 2: Use existing coverage data
coverage run -m pytest    # or: coverage run app.py
skylos .                   # Auto-detects .coverage file

# Step 2:
# Run skylos as per usual
skylos . --danger --quality

How It Works

Analysis Type	Confidence	What It Catches
Static only	60-95%	Direct calls, imports, decorators
+ Coverage	100%	Dynamic dispatch, plugins, registries

Example

# Static analysis thinks this is dead (no direct call visible)
def handle_login():
    return "Login handler"

# But it's called dynamically at runtime
action = request.args.get("action")  
func = getattr(module, f"handle_{action}")
func()  # Calls handle_login

Without Coverage	With Coverage
`handle_login` flagged as dead ❌	`handle_login` marked as used ✅

When To Use

Situation	Command
Have pytest/unittest tests	`skylos . --coverage`
No tests, but can run app	`coverage run app.py` then `skylos .`
No tests, can't run app	`skylos .` (static only)

What Coverage Catches

These patterns are invisible to static analysis but caught with coverage:

# 1. Dynamic dispatch
func = getattr(module, f"handle_{action}")
func()

# 2. Plugin/registry patterns  
PLUGINS = []
def register(f): PLUGINS.append(f); return f

@register
def my_plugin(): ...  # Called via: for p in PLUGINS: p()

# 3. Subclass discovery
for cls in BasePlugin.__subclasses__():
    cls().run()

# 4. String-based access
globals()["my_" + "func"]()
locals()[func_name]()

Important Notes

Coverage only adds information. Low test coverage will not create false positives. It just means some dynamic patterns may still be flagged.
Any execution helps. Even running your app once and hitting a few endpoints provides useful data.
Tests don't need to pass. Coverage records are what is executed. Irregardless of pass/fail status of your tests

Filtering

Control what Skylos analyzes and what it ignores.

Inline Suppression

Silence specific findings with comments:

# Ignore dead code detection on this line
def internal_hook():  # pragma: no skylos
    pass

# this also works
def another():  # pragma: no cover
    pass

def yet_another():  # noqa
    pass

Folder Exclusion

By default, Skylos excludes: __pycache__, .git, .pytest_cache, .mypy_cache, .tox, htmlcov, .coverage, build, dist, *.egg-info, venv, .venv

# See what's excluded by default
skylos --list-default-excludes

# Add more exclusions
skylos . --exclude-folder vendor --exclude-folder generated

# Force include an excluded folder
skylos . --include-folder venv

# Scan everything (no exclusions)
skylos . --no-default-excludes

Rule Suppression

Disable rules globally in pyproject.toml:

[tool.skylos]
ignore = [
    "SKY-P403",   # Allow nested loops
    "SKY-L003",   # Allow == None
    "SKY-S101",   # Allow hardcoded secrets (not recommended)
]

Summary

Want to...	Do this
Skip one line	`# pragma: no skylos`
Skip one secret	`# skylos: ignore[SKY-S101]`
Skip a folder	`--exclude-folder NAME`
Skip a rule globally	`ignore = ["SKY-XXX"]` in pyproject.toml
Include excluded folder	`--include-folder NAME`
Scan everything	`--no-default-excludes`

CLI Options

Usage: skylos [OPTIONS] PATH

Arguments:
  PATH  Path to the Python project to analyze

Options:
  -h, --help                   Show this help message and exit
  --json                       Output raw JSON instead of formatted text  
  --tree                       Output results in tree format
  --table                      Output results in table format via the CLI
  -c, --confidence LEVEL       Confidence threshold 0-100 (default: 60)
  --comment-out                Comment out code instead of deleting
  -o, --output FILE            Write output to file instead of stdout
  -v, --verbose                Enable verbose output
  --version                    Checks version
  -i, --interactive            Interactively select items to remove
  --dry-run                    Show what would be removed without modifying files
  --exclude-folder FOLDER      Exclude a folder from analysis (can be used multiple times)
  --include-folder FOLDER      Force include a folder that would otherwise be excluded
  --no-default-excludes        Don't exclude default folders (__pycache__, .git, venv, etc.)
  --list-default-excludes      List the default excluded folders and
  --secrets                    Scan for api keys/secrets
  --danger                     Scan for dangerous code
  --quality                    Code complexity and maintainability
  --coverage                   Run tests with coverage first
  --audit                      LLM-powered logic review
  --fix                        LLM auto-repair
  --model MODEL                LLM model (default: gpt-4.1)
  --gate                       Fail on threshold breach (for CI)
  --force                      Bypass quality gate (emergency override)

Interactive Mode

The interactive mode lets you select specific functions and imports to remove:

Select items: Use arrow keys and spacebar to select/unselect
Confirm changes: Review selected items before applying
Auto-cleanup: Files are automatically updated

FAQ

Q: Why doesn't Skylos find 100% of dead code? A: Python's dynamic features (getattr, globals, etc.) can't be perfectly analyzed statically. No tool can achieve 100% accuracy. If they say they can, they're lying.

Q: Are these benchmarks realistic? A: They test common scenarios but can't cover every edge case. Use them as a guide, not gospel.

Q: Why doesn't Skylos detect my unused Flask routes? A: Web framework routes are given low confidence (20) because they might be called by external HTTP requests. Use --confidence 20 to see them. We acknowledge there are current limitations to this approach so use it sparingly.

Q: What confidence level should I use? A: Start with 60 (default) for safe cleanup. Use 30 for framework applications. Use 20 for more comprehensive auditing.

Q: What does --coverage do? A: It runs pytest (or unittest) with coverage tracking before analysis. Functions that actually executed are marked as used with 100% confidence, eliminating false positives from dynamic dispatch patterns.

Q: Do I need 100% test coverage for --coverage to be useful? A: No. However, we STRONGLY encourage you to have tests. Any coverage helps. If you have 30% test coverage, that's 30% of your code verified. The other 70% still uses static analysis. Coverage only removes false positives, it never adds them.

Q: My tests are failing. Can I still use --coverage? A: Yes. Coverage tracks execution, not pass/fail. Even failing tests provide coverage data.

Limitations and Troubleshooting

Limitations

Dynamic code: getattr(), globals(), runtime imports are hard to detect
Frameworks: Django models, Flask, FastAPI routes may appear unused but aren't
Test data: Limited scenarios, your mileage may vary
False positives: Always manually review before deleting code
Secrets PoC: May emit both a provider hit and a generic high-entropy hit for the same token. All tokens are detected only in py files (.py, .pyi, .pyw)
Quality limitations: The current --quality flag does not allow you to configure the cyclomatic complexity.
Coverage requires execution: The --coverage flag only helps if you have tests or can run your application. Pure static analysis is still available without it.

Troubleshooting

Permission Errors
```
Error: Permission denied when removing function
```
Check file permissions before running in interactive mode.
Missing Dependencies
```
Interactive mode requires 'inquirer' package
```
Install with: pip install skylos[interactive]

Contributing

We welcome contributions! Please read our Contributing Guidelines before submitting pull requests.

Quick Contribution Guide

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Roadmap

More stuff coming soon!

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

Contact

Author: oha
Email: aaronoh2015@gmail.com
GitHub: @duriantaco

Name		Name	Last commit message	Last commit date
Latest commit History 221 Commits
.github/workflows		.github/workflows
assets		assets
editors/vscode		editors/vscode
scripts		scripts
skylos		skylos
test		test
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.pre-commit-hooks.yaml		.pre-commit-hooks.yaml
AUTHORS.md		AUTHORS.md
BENCHMARK.md		BENCHMARK.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
DANGEROUS_CODE.MD		DANGEROUS_CODE.MD
Dockerfile		Dockerfile
License		License
MANIFEST.in		MANIFEST.in
QUALITY.md		QUALITY.md
README.md		README.md
SECURITY.md		SECURITY.md
app.py		app.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

License

duriantaco/skylos

Folders and files

Latest commit

History

Repository files navigation

Skylos: Guard your Code

Table of Contents

Quick Start

Features

Security & Vulnerability Audit

Codebase Optimization

Operational Governance & Runtime

Multi-Language Support

Installation

Basic Installation

Performance

How it works

Confidence Scoring

Framework Detection

Test File Exclusion

Philosophy

Gating

1. Initialize Configuration

2. Run the Gate

3. GitHub Actions

4. Pre-commit

Integration and Ecosystem

1. Integration Environments

2. Output Formats

Auditing and Precision

Security (--danger)

Secrets (--secrets)

Quality (--quality)

AI Auditing (--audit)

Autonomous Fix (--fix)

Combine Everything

Coverage Integration

Quick Start

How It Works

Example

When To Use

What Coverage Catches

Important Notes

Filtering

Inline Suppression

Folder Exclusion

Rule Suppression

Summary

CLI Options

Interactive Mode

FAQ

Limitations and Troubleshooting

Limitations

Troubleshooting

Contributing

Quick Contribution Guide

Roadmap

License

Contact

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 25

Packages 0

Uh oh!

Contributors 2

Languages

Security (`--danger`)

Secrets (`--secrets`)

Quality (`--quality`)

AI Auditing (`--audit`)

Autonomous Fix (`--fix`)

Packages