Skip to content

Yet another static analysis tool for Python codebases written in Python that detects dead code. Faster and better than the rest :)

License

Notifications You must be signed in to change notification settings

duriantaco/skylos

Repository files navigation

Skylos 🔍

License: Apache 2.0 100% Local PyPI version Security Policy

Skylos Logo

A static analysis tool for Python codebases written in Python (formerly was written in Rust but we ditched that) that detects unreachable functions and unused imports, aka dead code. Faster and better results than many alternatives like Flake8 and Pylint, and finding more dead code than Vulture in our tests with comparable speed.

Table of Contents

Features

  • Framework-Aware Detection: Attempt at handling Flask, Django, FastAPI routes and decorators
  • Test File Exclusion: Auto excludes test files (you can include it back if you want)
  • Interactive Cleanup: Select specific items to remove from CLI
  • Unused Functions & Methods: Finds functions and methods that not called
  • Unused Classes: Detects classes that are not instantiated or inherited
  • Unused Imports: Identifies imports that are not used
  • Folder Management: Inclusion/exclusion of directories

Benchmark (You can find this benchmark test in test folder)

The benchmark checks how well static analysis tools spot dead code in Python. Things such as unused functions, classes, imports, variables, that kinda stuff. To read more refer down below.

The methodology and process for benchmarking can be found in BENCHMARK.md

Tool Time (s) Items TP FP FN Precision Recall F1 Score
Skylos (Local Dev) 0.013 34 22 12 7 0.6471 0.7586 0.6984
Vulture (0%) 0.054 32 11 20 18 0.3548 0.3793 0.3667
Vulture (60%) 0.044 32 11 20 18 0.3548 0.3793 0.3667
Flake8 0.371 16 5 7 24 0.4167 0.1724 0.2439
Pylint 0.705 11 0 8 29 0.0000 0.0000 0.0000
Ruff 0.140 16 5 7 24 0.4167 0.1724 0.2439

To run the benchmark: python compare_tools.py /path/to/sample_repo

Note: More can be found in BENCHMARK.md

Installation

Basic Installation

pip install skylos

From Source

# Clone the repository
git clone https://github.com/duriantaco/skylos.git
cd skylos

## Install your dependencies 
pip install .

Quick Start

# Analyze a project
skylos /path/to/your/project

# Interactive mode - select items to remove
skylos --interactive /path/to/your/project 

# Dry run - see what would be removed
skylos --interactive --dry-run /path/to/your/project 

# Output to JSON
skylos --json /path/to/your/project 

# With confidence
skylos path/to/your/file --confidence 20 ## or whatever value u wanna set

Understanding Confidence Levels

Skylos uses a confidence-based system to try to handle Python's dynamic nature and web frameworks.

How Confidence Works

  • Confidence 100: 100% unused (default imports, obvious dead functions)
  • Confidence 60: Default value - conservative detection
  • Confidence 40: Framework helpers, functions in web app files
  • Confidence 20: Framework routes, decorated functions
  • Confidence 0: Show everything

Framework Detection

When Skylos detects web framework imports (Flask, Django, FastAPI), it applies different confidence levels:

# only obvious dead codes
skylos app.py --confidence 60  # THIS IS THE DEFAULT

# include unused helpers in framework files  
skylos app.py --confidence 30

# include potentially unused routes
skylos app.py --confidence 20

# everything.. shows how all potential dead code
skylos app.py --confidence 0

Test File Detection

Skylos automatically excludes test files from analysis because test code patterns often appear as "dead code" but are actually called by test frameworks. Should you need to include them in your test, just refer to the Folder Management

What Gets Detected as Test Files

File paths containing:

  • /test/ or /tests/ directories
  • Files ending with _test.py

Test imports:

  • pytest, unittest, nose, mock, responses

Test decorators:

  • @pytest.fixture, @pytest.mark.*
  • @patch, @mock.*

Test File Behavior

When Skylos detects a test file, it by default, will apply a confidence penalty of 100, which will essentially filter out all dead code detection

# This will show 0 dead code because its treated as test file
/project/tests/test_user.py
/project/test/helper.py  
/project/utils_test.py

# The files will be analyzed normally. Note that it doesn't end with _test.py
/project/user.py
/project/test_data.py 

Including & Excluding Files

Default Exclusions

By default, Skylos excludes common folders: __pycache__, .git, .pytest_cache, .mypy_cache, .tox, htmlcov, .coverage, build, dist, *.egg-info, venv, .venv

Folder Options

# List default excluded folders
skylos --list-default-excludes

# Exclude single folder (The example here will be venv)
skylos /path/to/your/project --exclude-folder venv 

# Exclude multiple folders
skylos /path/to/your/project --exclude-folder venv --exclude-folder build

# Force include normally excluded folders
skylos /path/to/your/project --include-folder venv 

# Scan everything (no exclusions)
skylos path/to/your/project --no-default-excludes 

CLI Options

Usage: skylos [OPTIONS] PATH

Arguments:
  PATH  Path to the Python project to analyze

Options:
  -h, --help                    Show this help message and exit
  -j, --json                   Output raw JSON instead of formatted text  
  -o, --output FILE            Write output to file instead of stdout
  -v, --verbose                Enable verbose output
  -i, --interactive            Interactively select items to remove
  --dry-run                    Show what would be removed without modifying files
  --exclude-folder FOLDER      Exclude a folder from analysis (can be used multiple times)
  --include-folder FOLDER      Force include a folder that would otherwise be excluded
  --no-default-excludes        Don't exclude default folders (__pycache__, .git, venv, etc.)
  --list-default-excludes      List the default excluded folders and
  -c, --confidence LEVEL       Confidence threshold (0-100). Lower values will show more items.

Example Output

Python Static Analysis Results
===================================

Summary:
  • Unreachable functions: 48
  • Unused imports: 8

Unreachable Functions
========================

 1. module_13.test_function
    └─ /Users/oha/project/module_13.py:5
 2. module_13.unused_function
    └─ /Users/oha/project/module_13.py:13
...


Unused Imports
=================

 1. os
    └─ /Users/oha/project/module_13.py:1
 2. json
    └─ /Users/oha/project/module_13.py:3
...

Next steps:

• Use --interactive to select specific items to remove

• Use --dry-run to preview changes before applying them

Interactive Mode

The interactive mode lets you select specific functions and imports to remove:

  1. Select items: Use arrow keys and space to select/deselect
  2. Confirm changes: Review selected items before applying
  3. Auto-cleanup: Files are automatically updated

Development

Prerequisites

  • Python ≥3.9
  • pytest
  • inquirer

Setup

# Clone the repository
git clone https://github.com/duriantaco/skylos.git
cd skylos

# Create a virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Running Tests

# Run Python tests
python -m pytest tests/

FAQ

Q: Why doesn't Skylos find 100% of dead code? A: Python's dynamic features (getattr, globals, etc.) can't be perfectly analyzed statically. No tool can achieve 100% accuracy. If they say they can, they're lying.

Q: Why are the results different on my codebase? A: These benchmarks use specific test cases. Your code patterns (frameworks, legacy code, etc.) will give different results.

Q: Are these benchmarks realistic? A: They test common scenarios but can't cover every edge case. Use them as a guide, not gospel.

Q: Should I automatically delete everything flagged as unused? A: No. Always review results manually, especially for framework code, APIs, and test utilities.

Q: Why did Ruff underperform? A: Like all other tools, Ruff is focused on detecting specific, surface-level issues. Tools like Vulture and Skylos are built SPECIFICALLY for dead code detection. It is NOT a specialized dead code detector. If your goal is dead code, then ruff is the wrong tool. It is a good tool but it's like using a wrench to hammer a nail. Good tool, wrong purpose.

Q: Why doesn't Skylos detect my unused Flask routes? A: Web framework routes are given low confidence (20) because they might be called by external HTTP requests. Use --confidence 20 to see them. We acknowledge there are current limitations to this approach so use it sparingly.

Q: What confidence level should I use? A: Start with 60 (default) for safe cleanup. Use 30 for framework applications. Use 20 for more comprehensive auditing.

Limitations

  • Dynamic code: getattr(), globals(), runtime imports are hard to detect
  • Frameworks: Django models, Flask, FastAPI routes may appear unused but aren't
  • Test data: Limited scenarios, your mileage may vary
  • False positives: Always manually review before deleting code

This is a STATIC code analyzer. If we can detect 100% of all dead code in any structure and framework, we wouldn't be sitting here. But we definitely tried our best.

Troubleshooting

Common Issues

  1. Permission Errors

    Error: Permission denied when removing function
    

    Check file permissions before running in interactive mode.

  2. Missing Dependencies

    Interactive mode requires 'inquirer' package
    

    Install with: pip install skylos[interactive]

Contributing

We welcome contributions! Please read our Contributing Guidelines before submitting pull requests.

Quick Contribution Guide

  1. Fork the repository
  2. Create a feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

Roadmap

  • Expand our test cases
  • Configuration file support
  • Git hooks integration
  • CI/CD integration examples - [] Support for other languages (unless I have contributors, this ain't possible)
  • Further optimization

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

Contact

About

Yet another static analysis tool for Python codebases written in Python that detects dead code. Faster and better than the rest :)

Topics

Resources

License

Security policy

Stars

Watchers

Forks

Packages

No packages published

Languages