upcast

A comprehensive static analysis toolkit for Python projects. Upcast provides 15 specialized scanners to analyze code without execution, extracting insights about Django models, environment variables, HTTP requests, logging patterns, concurrency patterns, code complexity, Redis usage, and more.

Github repository: https://github.com/mrlyc/upcast/
Documentation: https://mrlyc.github.io/upcast/
📚 Scanner Documentation: docs/README.md - Detailed scanner rules and field references
🔬 Type Inference: docs/type-inference.md - Static type and value inference mechanism

Quick Start

# Install
pip install upcast

# Scan environment variables
upcast scan-env-vars /path/to/project

# Analyze logging patterns
upcast scan-logging /path/to/project

# Analyze Django models
upcast scan-django-models /path/to/django/project

# Scan Redis usage patterns
upcast scan-redis-usage /path/to/django/project

# Find blocking operations
upcast scan-blocking-operations /path/to/project -o blocking.yaml

# Check cyclomatic complexity
upcast scan-complexity-patterns /path/to/project --threshold 15

Installation

pip install upcast

Common Options

All scanner commands support powerful file filtering options:

# Include specific patterns
upcast scan-env-vars . --include "app/**" --include "core/**"

# Exclude patterns
upcast scan-env-vars . --exclude "tests/**" --exclude "migrations/**"

# Disable default excludes (venv/, build/, dist/, etc.)
upcast scan-env-vars . --no-default-excludes

# Combine options
upcast scan-env-vars . --include "src/**" --exclude "**/*_test.py"

By default, file discovery also respects the scanned target directory's .gitignore, in addition to Upcast's built-in default excludes (such as venv/, build/, and dist/).

Other common options:

-o, --output FILE: Save results to file instead of stdout
--format FORMAT: Choose output format (yaml or json)
-v, --verbose: Enable detailed logging

Scanners

Upcast provides 15 specialized scanners for comprehensive static code analysis. Each scanner extracts specific insights without executing code, making analysis safe and fast.

💡 See example outputs: All scanner results are available in example/scan-results/ based on the blueking-paas project.

📖 Detailed documentation: For complete field references and scanning rules, see Scanner Documentation.

Django Scanners

scan-django-models

Analyze Django model definitions, extracting fields, relationships, and metadata.

upcast scan-django-models /path/to/django/project

Output example:

See full output: example/scan-results/django-models.yaml

app.models.User:
  name: User
  module: app.models
  bases:
    - models.Model
  description: "User model for authentication and profile management."
  fields:
    username:
      type: models.CharField
      max_length: 150
      unique: true
    email:
      type: models.EmailField
    created_at:
      type: models.DateTimeField
      auto_now_add: true
  relationships:
    - type: ForeignKey
      to: Group
      field: group

Key features:

Detects all Django model classes
Extracts model descriptions from docstrings
Extracts field types and parameters
Identifies relationships (ForeignKey, ManyToMany, OneToOne)
Captures model metadata and options

scan-django-settings

Find all references to Django settings variables and extract settings definitions from your Django project.

Basic usage (scan usages only):

upcast scan-django-settings /path/to/django/project

Scan definitions only:

upcast scan-django-settings --definitions-only /path/to/django/project

Scan both definitions and usages:

upcast scan-django-settings --combined /path/to/django/project

Output example (usages):

See full output: example/scan-results/django-settings.yaml

DEBUG:
  count: 5
  locations:
    - file: app/views.py
      line: 15
      column: 8
      code: settings.DEBUG
      pattern: attribute_access
    - file: middleware/security.py
      line: 23
      column: 12
      code: getattr(settings, 'DEBUG', False)
      pattern: getattr

Output example (definitions):

definitions:
  settings.base:
    DEBUG:
      value: true
      line: 10
      type: bool
    SECRET_KEY:
      value: "base-secret-key"
      line: 11
      type: string
    INSTALLED_APPS:
      value: ["django.contrib.admin", "django.contrib.auth"]
      line: 15
      type: list
    __star_imports__:
      - settings.common

  settings.dev:
    DEBUG:
      value: false
      line: 5
      type: bool
      overrides: settings.base
    DEV_MODE:
      value: true
      line: 6
      type: bool

Output example (combined):

definitions:
  settings.base:
    DEBUG:
      value: true
      line: 10
      type: bool

usages:
  DEBUG:
    count: 5
    locations:
      - file: app/views.py
        line: 15

Key features:

Usage scanning:
- Detects settings.VARIABLE access patterns
- Finds getattr(settings, ...) calls
- Tracks from django.conf import settings imports
- Aggregates usage counts per setting
Definition scanning:
- Scans settings modules in settings/ or config/ directories
- Extracts setting values with type inference
- Detects inheritance via from .base import *
- Tracks which settings override base settings
- Supports dynamic imports (importlib.import_module)
- Dynamic values wrapped in backticks (e.g., `os.environ.get('KEY')`)

CLI options:

--definitions-only: Scan and output only settings definitions
--usages-only: Scan and output only settings usages (default behavior)
--combined: Scan and output both definitions and usages
--no-usages: Skip usage scanning (faster when only definitions needed)
--no-definitions: Skip definition scanning (default behavior)
-o, --output FILE: Write results to YAML file
-v, --verbose: Enable verbose output

scan-django-urls

Scan Django URL configurations and extract route patterns, including view resolution.

upcast scan-django-urls /path/to/django/project

Output example:

See full output: example/scan-results/django-urls.yaml

apiserver.paasng.paas_wl.apis.admin.urls:
  urlpatterns:
    - pattern: wl_api/platform/process_spec_plan/manage/
      type: path
      name: null
      view_module: paas_wl.apis.admin.views.processes
      view_name: ProcessSpecPlanManageViewSet
      converters: []
      named_groups: []
      is_partial: false
      is_conditional: false
    - pattern: api/users/<int:pk>/
      type: path
      name: user-detail
      view_module: users.views
      view_name: UserDetailView
      converters:
        - pk:int
      named_groups: []
      is_partial: false
      is_conditional: false
    - pattern: api/accounts/
      type: include
      include_module: accounts.urls
      namespace: accounts

Key features:

Extracts URL patterns from urlpatterns
Resolves view functions and classes - Automatically resolves view_module and view_name for all views including ViewSets
Unified field names - Both regular views and ViewSets use view_module and view_name fields consistently
Fallback extraction - Even when full module path resolution fails, view names are still extracted
Captures route names
Detects path converters (<int:id>, <slug:slug>)
Handles include() patterns and namespaces
Supports Django REST Framework router registrations and ViewSet.as_view() patterns

View resolution:

view_module: Full module path where the view/ViewSet is defined (e.g., users.views)
view_name: Function or class name, including ViewSets (e.g., UserDetailView, UserViewSet)
Both fields are set to null when resolution fails (e.g., for include() patterns or dynamic views)
Resolution success rate typically exceeds 80% on real-world codebases

scan-signals

Discover Django and Celery signal definitions and handlers.

# Basic usage
upcast scan-signals /path/to/project

# Save to file (supports JSON and YAML)
upcast scan-signals /path/to/project -o signals.yaml
upcast scan-signals /path/to/project -o signals.json --format json

# Filter signal files
upcast scan-signals /path/to/project --include "**/signals/**"

Output format:

See full output: example/scan-results/signals.yaml

metadata:
  root_path: /path/to/project
  scanner_name: signal
summary:
  total_count: 5
  files_scanned: 3
  django_receivers: 4
  celery_receivers: 1
  custom_signals_defined: 2
  unused_custom_signals: 0
results:
  - signal: post_save
    type: django
    category: model_signals
    receivers:
      - handler: create_profile
        file: users/signals.py
        line: 25
        sender: User
        context:
          type: function
  - signal: user_logged_in
    type: django
    category: custom_signals
    receivers:
      - handler: log_login
        file: auth/handlers.py
        line: 42
    status: active

Key features:

Detects Django signals (pre_save, post_save, m2m_changed, etc.)
Finds Celery task signals (task_success, task_failure, etc.)
Identifies custom signal definitions
Tracks signal receivers and senders
Supports decorator-based connections (@receiver)
Detects unused custom signals
Provides comprehensive statistics in summary

scan-redis-usage

Analyze Redis usage patterns in Django projects, including cache backends, Celery configuration, and direct redis-py usage.

# Basic usage
upcast scan-redis-usage /path/to/project

# Save to file
upcast scan-redis-usage /path/to/project -o redis-usage.yaml

# Filter specific directories
upcast scan-redis-usage /path/to/project --include "app/**" --include "core/**"

Output example:

See full output: example/scan-results/redis-usage.yaml

summary:
  total_count: 5
  total_usages: 5
  files_scanned: 3
  scan_duration_ms: 12096
  categories:
    celery_broker: 1
    celery_result: 1
    direct_client: 3
  warnings: []

results:
  celery_broker:
    - type: celery_broker
      file: settings/__init__.py
      line: 609
      library: redis
      config:
        location: settings.get('CELERY_BROKER_URL', REDIS_URL)
      statement: CELERY_BROKER_URL = "..."

  direct_client:
    - type: direct_client
      file: misc/metrics/workloads/deployment.py
      line: 40
      library: django_redis
      operation: get
      key: metrics:unavailable_deployments_total
      statement: cache.get(cache_key)
      has_ttl: null
      timeout: null
      is_pipeline: false

    - type: direct_client
      file: misc/metrics/workloads/deployment.py
      line: 67
      library: django_redis
      operation: set
      key: metrics:unavailable_deployments_total
      statement: cache.set(cache_key, gauge_family, timeout=60 * 5)
      has_ttl: true
      timeout: 300
      is_pipeline: false

    - type: direct_client
      file: svc-rabbitmq/tasks/management/commands/worker.py
      line: 59
      library: django_redis
      operation: get_or_set
      key: "..."
      statement: cache.get_or_set(settings.TASK_LEADER_KEY, ...)
      has_ttl: null
      timeout: null
      is_pipeline: false

  distributed_lock:
    - type: distributed_lock
      file: workers/tasks.py
      line: 45
      library: django_redis
      operation: lock
      key: task:...:lock
      statement: cache.lock(f"task:{task_id}:lock", timeout=60)
      timeout: 60
      pattern: cache_lock
      is_pipeline: false

Key features:

Configuration Detection:
- Django cache backends (CACHES, django-redis)
- Celery broker and result backend settings
- Django Channels Redis layer configuration
- DRF throttling with Redis
Usage Pattern Analysis:
- Django cache API (cache.get(), cache.set(), etc.)
- Direct redis-py client operations
- Distributed locks (cache.lock())
- Pipeline operations tracking
Smart Key Inference:
- Extracts literal Redis keys when possible
- Uses ... for dynamic/unresolvable keys
- Handles f-strings, format(), % formatting
- Resolves variable assignments in scope
TTL & Timeout Tracking:
- Detects TTL configuration in cache operations
- Warns about missing TTL on set operations
- Extracts timeout values from lock operations
- Identifies operations without expiration
Library Detection:
- Distinguishes between django_redis, redis-py, channels_redis
- Tracks session storage and rate limiting configurations
- Identifies custom Redis client usage

Usage Types:

cache_backend: Django CACHES configuration
session_storage: SESSION_ENGINE with Redis
celery_broker: Celery message broker
celery_result: Celery result backend
channels: Django Channels layer
rate_limiting: DRF throttling
direct_client: Direct cache/Redis API calls
distributed_lock: Distributed locking patterns

Code Analysis Scanners

scan-module-symbols

Analyze Python modules to extract imports and symbol definitions including variables, functions, and classes with their metadata.

upcast scan-module-symbols /path/to/project

Features:

Extract all import types (import, from...import, from...import *)
Track attribute access patterns on imported symbols
Extract module-level variables, functions, and classes
Capture decorators, docstrings, and function signatures
Track symbol definition context (module, if, try, except blocks)
Compute body MD5 hashes for functions and classes
Filter private symbols (configurable with --include-private)

Output example:

See full output: example/scan-results/module-symbols.yaml

metadata:
  scanner_name: module_symbols

results:
  path/to/file.py:
    imported_modules:
      os:
        module_path: os
        attributes: ["path", "environ"]
        blocks: ["module"]
      django:
        module_path: django
        attributes: []
        blocks: ["module"]

    imported_symbols:
      Path:
        module_path: pathlib
        attributes: ["home"]
        blocks: ["module"]
      execute_from_command_line:
        module_path: django.core.management.execute_from_command_line
        attributes: []
        blocks: ["module"]

    star_imported: []

    variables:
      DEBUG:
        module_path: path.to.file
        attributes: []
        value: "True"
        statement: "DEBUG = True"
        blocks: ["module"]

    functions:
      helper:
        signature: "def helper(arg1: int, arg2: str) -> bool:"
        docstring: "A helper function."
        body_md5: "abc123..."
        attributes: []
        decorators:
          - name: decorator_name
            args: []
            kwargs: {}
        blocks: ["module"]

    classes:
      MyClass:
        docstring: "My class documentation"
        body_md5: "def456..."
        attributes: ["attr1", "attr2"]
        methods: ["method1", "method2"]
        bases: ["BaseClass"]
        decorators:
          - name: dataclass
            args: []
            kwargs: {}
        blocks: ["module"]

Options:

--include-private: Include private symbols (starting with _)
--exclude: Exclude specific file patterns
--format: Output format (yaml or json)

scan-concurrency-patterns

Identify concurrency patterns including async/await, threading, and multiprocessing with detailed context and parameter extraction.

upcast scan-concurrency-patterns /path/to/project

Output example:

See full output: example/scan-results/concurrency-patterns.yaml

summary:
  total_count: 15
  files_scanned: 8
  scan_duration_ms: 120
  by_category:
    threading: 6
    multiprocessing: 3
    asyncio: 6

results:
  threading:
    thread_creation:
      - file: workers/processor.py
        line: 78
        pattern: thread_creation
        function: start_worker
        class_name: WorkerManager
        details:
          target: process_batch
          name: worker-1
        statement: threading.Thread(target=process_batch, name="worker-1")

    thread_pool_executor:
      - file: tasks/parallel.py
        line: 45
        pattern: thread_pool_executor
        function: setup_executor
        details:
          max_workers: 4
        statement: ThreadPoolExecutor(max_workers=4)

    submit:
      - file: tasks/parallel.py
        line: 67
        pattern: executor_submit_thread
        function: process_items
        details:
          function: worker_function
        api_call: submit
        statement: executor.submit(worker_function, item)

  multiprocessing:
    process_creation:
      - file: compute/workers.py
        line: 123
        pattern: process_creation
        function: start_compute
        details:
          target: compute_intensive_task
        statement: multiprocessing.Process(target=compute_intensive_task)

    run_in_executor:
      - file: api/handlers.py
        line: 89
        pattern: run_in_executor_process
        function: async_handler
        details:
          executor_type: ProcessPoolExecutor
          function: cpu_intensive
        api_call: run_in_executor
        statement: await loop.run_in_executor(process_pool, cpu_intensive)

  asyncio:
    async_function:
      - file: api/client.py
        line: 45
        pattern: async_function
        function: fetch_data
        statement: async def fetch_data

    create_task:
      - file: api/client.py
        line: 78
        pattern: create_task
        function: fetch_all
        details:
          coroutine: fetch_data
        api_call: create_task
        statement: asyncio.create_task(fetch_data(url))

    run_in_executor:
      - file: api/handlers.py
        line: 56
        pattern: run_in_executor_thread
        function: async_handler
        details:
          executor_type: ThreadPoolExecutor
          function: io_operation
        api_call: run_in_executor
        statement: await loop.run_in_executor(thread_pool, io_operation)

Key features:

Pattern-Specific Detection: Distinguishes Thread creation, ThreadPoolExecutor, submit() calls, etc.
Context Extraction: Captures enclosing function and class names for each pattern
Parameter Extraction: Extracts target functions, max_workers, coroutine names
Executor Resolution: Two-pass scanning resolves executor variables in submit() and run_in_executor()
API Call Tracking: Identifies specific API methods (create_task, submit, run_in_executor)
Smart Filtering: Skips asyncio.create_task() with unresolvable coroutines to reduce noise

scan-blocking-operations

Find blocking operations that may cause performance issues in async code.

upcast scan-blocking-operations /path/to/project

Output example:

See full output: example/scan-results/blocking-operations.yaml

summary:
  total_operations: 8
  by_category:
    time_based: 3
    synchronization: 2
    subprocess: 3
  files_analyzed: 5

operations:
  time_based:
    - location: api/handlers.py:45:8
      type: time_based.sleep
      statement: time.sleep(5)
      duration: 5
      async_context: true
      function: async_handler

  synchronization:
    - location: cache/manager.py:67:12
      type: synchronization.lock_acquire
      statement: lock.acquire(timeout=10)
      timeout: 10
      async_context: true

  subprocess:
    - location: scripts/runner.py:23:15
      type: subprocess.run
      statement: subprocess.run(['ls', '-la'], timeout=30)
      timeout: 30

Key features:

Detects time.sleep() in async functions
Finds blocking lock operations
Identifies subprocess calls without async wrappers
Detects Django ORM select_for_update()
Flags anti-patterns in async code

scan-unit-tests

Analyze unit test files and extract test information.

upcast scan-unit-tests /path/to/tests

Output example:

See full output: example/scan-results/unit-tests.yaml

tests/test_users.py:
  framework: pytest
  test_count: 15
  tests:
    - name: test_create_user
      line: 45
      type: function
      async: false
      fixtures:
        - db
        - user_factory
    - name: test_update_user_email
      line: 67
      type: function
      async: false
      markers:
        - slow
        - integration

tests/test_api.py:
  framework: unittest
  test_count: 8
  classes:
    - name: UserAPITestCase
      line: 12
      tests:
        - test_get_user
        - test_create_user
        - test_delete_user

Key features:

Detects pytest and unittest tests
Extracts test function/method names
Identifies async tests
Captures fixtures and markers
Counts tests per file

Infrastructure Scanners

scan-env-vars

Scan for environment variable usage with advanced type inference.

upcast scan-env-vars /path/to/project

Output example:

See full output: example/scan-results/env-vars.yaml

DATABASE_URL:
  types:
    - str
  defaults:
    - postgresql://localhost/db
  usages:
    - location: config/settings.py:15
      statement: os.getenv('DATABASE_URL', 'postgresql://localhost/db')
      type: str
      default: postgresql://localhost/db
      required: false
    - location: config/database.py:8
      statement: env.str('DATABASE_URL')
      type: str
      required: true
  required: true

API_TIMEOUT:
  types:
    - int
  defaults:
    - 30
  usages:
    - location: api/client.py:23
      statement: int(os.getenv('API_TIMEOUT', '30'))
      type: int
      default: 30
      required: false
  required: false

Key features:

Advanced type inference from default values and conversions
Detects os.getenv(), os.environ[], os.environ.get()
Supports django-environ patterns (env.int(), env.bool())
Identifies required vs optional variables
Aggregates all usages per variable

scan-prometheus-metrics

Extract Prometheus metrics definitions with full metadata.

upcast scan-prometheus-metrics /path/to/project

Output example:

See full output: example/scan-results/metrics.yaml

http_requests_total:
  type: counter
  help: Total HTTP requests
  labels:
    - method
    - path
    - status
  namespace: myapp
  subsystem: api
  usages:
    - location: api/metrics.py:15
      pattern: instantiation
      statement: Counter('http_requests_total', 'Total HTTP requests', ['method', 'path', 'status'])

request_duration_seconds:
  type: histogram
  help: Request duration in seconds
  labels:
    - endpoint
  buckets:
    - 0.1
    - 0.5
    - 1.0
    - 5.0
  usages:
    - location: middleware/metrics.py:23
      pattern: instantiation
      statement: Histogram('request_duration_seconds', ...)

Key features:

Detects Counter, Gauge, Histogram, Summary
Extracts metric names, help text, and labels
Captures namespace and subsystem
Identifies histogram buckets
Supports decorator patterns

HTTP & Exception Scanners

scan-http-requests

Find HTTP and API requests throughout your codebase with intelligent URL pattern detection and filtering.

upcast scan-http-requests /path/to/project

Output example:

See full output: example/scan-results/http-requests.yaml

"...":
  library: requests
  method: GET
  usages:
    - file: accessories/cloudapi/components/http.py
      line: 38
      method: REQUEST
      statement: requests.request(method, url, **kwargs)
      session_based: false
      is_async: false
      timeout: null
    - file: accessories/dev_sandbox/management/commands/renew_dev_sandbox_expired_at.py
      line: 56
      method: GET
      statement: requests.get(url)
      session_based: false
      is_async: false

https://api.example.com/users/...:
  method: GET
  library: requests
  usages:
    - file: api/client.py
      line: 45
      method: GET
      statement: requests.get(f"https://api.example.com/users/{user_id}")
      session_based: false
      is_async: false

https://api.example.com/api/v2/data:
  method: GET
  library: requests
  usages:
    - file: services/http.py
      line: 67
      method: GET
      statement: requests.get(BASE_URL + "/api/v2/data")
      timeout: 30
      session_based: false
      is_async: false

Key features:

Smart URL Detection: Detects requests, httpx, urllib, aiohttp
Accurate Request Identification: Filters out non-request classes (RequestException, Response, Auth, Adapter, etc.)
Request Constructor Support: Correctly handles requests.Request(method, url) with positional and keyword arguments
Context-Aware Resolution: Infers variable values from assignments in the same scope
Pattern Preservation: Preserves static URL parts while replacing dynamic segments with ...
- f"https://api.example.com/users/{user_id}" → https://api.example.com/users/...
- f"{proto}://{host}/api/v1/path" → ...://.../api/v1/path
- BASE_URL + "/api/data" → resolves BASE_URL if defined
- f"{a}{b}{c}{d}" → ... (merges consecutive ... into one)
Extracts HTTP methods (GET, POST, PUT, DELETE, REQUEST)
Identifies URLs when possible, with smart pattern normalization
Checks for timeout configuration
Detects async requests and session-based calls
Extracts request parameters: data, json_body, headers, params

scan-logging

Detect and analyze all logging statements in your Python codebase, supporting multiple logging libraries.

upcast scan-logging /path/to/project

# Use custom sensitive keywords
upcast scan-logging /path/to/project --sensitive-keywords password --sensitive-keywords api_key

# Combine with other options
upcast scan-logging /path/to/project --sensitive-keywords db_password --format json -o logs.json

Output example:

See full output: example/scan-results/logging.yaml

metadata:
  scanner_name: logging

results:
  src/auth/login.py:
    logging:
      - logger_name: auth.login
        lineno: 45
        level: info
        message: "User {} logged in successfully"
        args: ["username"]
        type: fstring
        block: function
        sensitive_patterns: []
      - logger_name: auth.login
        lineno: 52
        level: warning
        message: "Failed login attempt for user %s"
        args: ["username"]
        type: percent
        block: if
        sensitive_patterns: []
      - logger_name: auth.login
        lineno: 67
        level: error
        message: "Password validation failed"
        args: []
        type: string
        block: except
        sensitive_patterns: ["password"]

    loguru:
      - logger_name: loguru
        lineno: 89
        level: info
        message: "Application started on port {}"
        args: ["port"]
        type: format
        block: module
        sensitive_patterns: []

    django: []
    structlog: []

  src/api/client.py:
    structlog:
      - logger_name: api.client
        lineno: 23
        level: info
        message: "API request completed"
        args: ["user_id", "status_code"]
        type: string
        block: function
        sensitive_patterns: []
      - logger_name: api.client
        lineno: 78
        level: error
        message: "API token expired: {}"
        args: ["token"]
        type: format
        block: except
        sensitive_patterns: ["token"]

    logging: []
    loguru: []
    django: []

Key features:

Multi-Library Support: Detects logging from logging, loguru, structlog, and Django
Logger Name Resolution: Resolves logger names from getLogger(__name__) to actual module paths
Message Format Detection: Identifies format types (string literal, f-string, % formatting, .format())
Argument Extraction: Captures all arguments passed to log calls
Block Type Detection: Identifies code block context for each log call
- Detects block types: function, class, try, except, finally, for, while, if, else, with, module
- Helps understand log context and control flow
Sensitive Data Detection: Automatically flags potentially sensitive information in logs
- Default keywords: password, token, api_key, secret, ssn, credit_card, private_key, etc.
- Custom keywords: Use --sensitive-keywords to specify your own list (can be repeated)
- Identifies JWT tokens (eyJ... pattern)
- Lists matched patterns for review
Smart Library Detection: Uses import analysis to correctly categorize logging calls

CLI Options:

--sensitive-keywords KEYWORD: Add custom sensitive keyword patterns (can be repeated)
-o, --output FILE: Save results to file
--format FORMAT: Output format (yaml or json)
-v, --verbose: Enable detailed logging

scan-exception-handlers

Analyze exception handling patterns in your code.

upcast scan-exception-handlers /path/to/project

Output example:

See full output: example/scan-results/exception-handlers.yaml

handlers:
  - file: api/views.py
    lineno: 45
    end_lineno: 52
    type: try-except
    exceptions:
      - ValueError
      - KeyError
    has_bare_except: false
    reraises: false

  - file: services/processor.py
    lineno: 78
    end_lineno: 85
    type: try-except
    exceptions:
      - Exception
    has_bare_except: false
    reraises: true
    logs_error: true

  - file: legacy/old_code.py
    lineno: 123
    end_lineno: 127
    type: try-except
    has_bare_except: true
    warning: Bare except clause detected

Key features:

Detects try-except blocks
Identifies caught exception types
Flags bare except clauses (anti-pattern)
Checks for error logging
Detects exception re-raising

Code Quality Scanners

scan-complexity-patterns

Analyze cyclomatic complexity to identify functions that may need refactoring.

upcast scan-complexity-patterns /path/to/project

Output example:

See full output: example/scan-results/complexity-patterns.yaml

summary:
  high_complexity_count: 12
  files_analyzed: 28
  by_severity:
    warning: 7 # 11-15
    high_risk: 4 # 16-20
    critical: 1 # >20

modules:
  app/services/user.py:
    - name: process_user_registration
      line: 45
      end_line: 98
      complexity: 14
      severity: warning
      message: "Complexity 14 exceeds threshold 11"
      description: "Validate user registration with multiple checks"
      signature: "def process_user_registration(data: dict, strict: bool = True) -> Result:"
      comment_lines: 8
      code_lines: 54
      code: |
        def process_user_registration(data: dict, strict: bool = True) -> Result:
            """Validate user registration with multiple checks."""
            # Check required fields
            if not data.get('email'):
                return Result.error('Email required')
            ...

Usage options:

# Scan with default threshold (11)
upcast scan-complexity-patterns /path/to/project

# Custom threshold
upcast scan-complexity-patterns . --threshold 15

# Include test files (excluded by default)
upcast scan-complexity-patterns . --include-tests

# Save to file
upcast scan-complexity-patterns . -o complexity-report.yaml

# JSON format
upcast scan-complexity-patterns . --format json

Severity levels:

healthy (≤5): Very simple, minimal maintenance
acceptable (6-10): Reasonable complexity
warning (11-15): Refactoring recommended
high_risk (16-20): Significant maintenance cost
critical (>20): Design issues likely

Key features:

Accurate Calculation: Counts decision points (if/elif, loops, except, boolean operators)
Code Extraction: Full function source code included
Comment Statistics: Uses Python tokenize for accurate comment counting
Test Exclusion: Automatically excludes test files (configurable)
Detailed Metadata: Function signature, docstring, line numbers
Actionable Output: Sorted by severity with clear recommendations

Architecture

Data Models

Starting from version 0.3.0, Upcast provides standardized Pydantic models for all scanner outputs in the upcast.models package. This enables type-safe data handling for both scanners and future analyzers.

Base Models:

from upcast.models import ScannerSummary, ScannerOutput

# All scanners extend these base classes
class MySummary(ScannerSummary):
    # Scanner-specific summary fields
    pass

class MyOutput(ScannerOutput[ResultType]):
    summary: MySummary
    results: ResultType

Available Models:

base: ScannerSummary, ScannerOutput - Base classes for all scanners
blocking_operations: BlockingOperation, BlockingOperationsSummary, BlockingOperationsOutput
concurrency: ConcurrencyUsage, ConcurrencyPatternSummary, ConcurrencyPatternOutput
complexity: ComplexityResult, ComplexitySummary, ComplexityOutput
django_models: DjangoField, DjangoModel, DjangoModelSummary, DjangoModelOutput
django_settings: SettingsUsage, SettingDefinition, DjangoSettingsSummary, DjangoSettings*Output
django_urls: UrlPattern, DjangoUrlSummary, DjangoUrlOutput
env_vars: EnvVarInfo, EnvVarSummary, EnvVarOutput
exceptions: ExceptionHandler, ExceptionHandlerSummary, ExceptionHandlerOutput
http_requests: HttpRequestInfo, HttpRequestSummary, HttpRequestOutput
metrics: MetricInfo, PrometheusMetricSummary, PrometheusMetricOutput
redis_usage: RedisUsage, RedisUsageSummary, RedisUsageOutput
signals: SignalInfo, SignalSummary, SignalOutput
unit_tests: UnitTestInfo, UnitTestSummary, UnitTestOutput

Usage Example:

from upcast.models import EnvVarOutput, EnvVarInfo
from upcast.env_var_scanner import EnvironmentVariableScanner

# Type-safe scanner output
scanner = EnvironmentVariableScanner()
output: EnvVarOutput = scanner.scan(project_path)

# Access with full type hints
for var_name, var_info in output.results.items():
    var_info: EnvVarInfo
    print(f"{var_name}: required={var_info.required}")
    for location in var_info.locations:
        print(f"  {location.file}:{location.line}")

Common Utilities

Upcast uses a shared utilities package (upcast.common) for consistency:

file_utils: File discovery, path validation, package root detection
patterns: Glob pattern matching with configurable excludes
ast_utils: Unified AST analysis with astroid
export: Consistent YAML/JSON output formatting

Benefits:

300+ fewer lines of duplicate code
Consistent behavior across scanners
Better error messages
Unified file filtering

Key Features

Static Analysis: No code execution - safe for any codebase
14 Specialized Scanners: Comprehensive project analysis
Advanced Type Inference: Smart detection of types and patterns
Powerful File Filtering: Glob-based include/exclude patterns
Multiple Output Formats: YAML, JSON, and Markdown with multi-language support
Aggregated Results: Group findings by variable/model/metric name
Cross-Platform: Works on Windows, macOS, and Linux
Well-Tested: Comprehensive test suite with high coverage

Markdown Rendering

Upcast can render scanner outputs directly to markdown format using the --format markdown option. This provides readable documentation with support for multiple languages.

Basic Usage

# Scan and output as markdown (English)
upcast scan-complexity-patterns . --format markdown -o report.md

# Scan and output as markdown (Chinese)
upcast scan-env-vars . --format markdown --markdown-language zh

# Specify custom title
upcast scan-django-models . --format markdown --markdown-title "Django Models Report"

# Output to stdout
upcast scan-http-requests . --format markdown

Features

Multi-Language Support: Templates available in English (en) and Chinese (zh)
Integrated with All Scanners: Works with all 13 scanner commands
Structured Output: Consistent format with metadata, summary, and detailed results
Readable Tables: Field details presented in organized markdown tables
Customizable: Override output filename, language, and document title

Markdown Options

All scanner commands support these markdown-specific options:

--format markdown: Output format (use instead of yaml or json)
--markdown-language [en|zh]: Language for markdown output (default: en)
--markdown-title TEXT: Custom title for the markdown document

Example Outputs

See example/markdown-examples/ for rendered markdown examples:

English: http-requests-en.md, django-models-en.md, complexity-patterns-en.md
Chinese: http-requests-zh.md, django-models-zh.md, complexity-patterns-zh.md

Programmatic Usage

You can also use the render module programmatically:

from upcast.models.http_requests import HttpRequestOutput
from upcast.render import render_to_markdown, render_to_file

# Load scanner output
output = HttpRequestOutput(...)

# Render to string
markdown = render_to_markdown(output, language='en', title='HTTP Requests')

# Or render directly to file
render_to_file(output, 'output.md', language='en', title='HTTP Requests')

Supported Scanner Types

All scanner outputs can be rendered to markdown:

HTTP Requests (HttpRequestOutput)
Django Models (DjangoModelOutput)
Code Complexity (ComplexityOutput)
Environment Variables (EnvVarOutput)
Signal Usage (SignalOutput)
And all other scanner types...

Template Customization

Templates are located in upcast/templates/{language}/:

base.md.jinja2 - Base template for all scanners
http_requests.md.jinja2 - HTTP request analysis template
django_models.md.jinja2 - Django model analysis template
complexity.md.jinja2 - Complexity analysis template
And more...

You can create custom templates for new languages by adding a new language directory with the required templates.

Integration Testing

Upcast includes comprehensive integration tests that validate all scanners on a real-world Django project (blueking-paas).

Running Integration Tests

# Run all scanners on example project
make test-integration

This command:

Scans example/blueking-paas with all 12 scanners
Outputs results to example/scan-results/*.yaml
Takes approximately 1-2 minutes to complete

CI Validation

The GitHub Actions workflow automatically:

Runs integration tests on every PR
Compares scan results against committed versions using Git
Fails if scanner output changes unexpectedly
Helps detect regressions in scanner behavior

Accepting Result Changes

When scanner improvements intentionally change output:

# 1. Run integration tests
make test-integration

# 2. Review the changes
git diff example/scan-results/

# 3. Commit updated results
git add example/scan-results/
git commit -m "Update scan results: [describe changes]"

Note: All 13 scanners are tested on the blueking-paas project, with results available in example/scan-results/.

Contributing

Contributions are welcome! Please see our Contributing Guide for details.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 305 Commits
.github		.github
docs		docs
example		example
openspec		openspec
tests		tests
upcast		upcast
.editorconfig		.editorconfig
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
codecov.yaml		codecov.yaml
mise.toml		mise.toml
pyproject.toml		pyproject.toml
tox.ini		tox.ini
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

upcast

Quick Start

Installation

Common Options

Scanners

Django Scanners

scan-django-models

scan-django-settings

scan-django-urls

scan-signals

scan-redis-usage

Code Analysis Scanners

scan-module-symbols

scan-concurrency-patterns

scan-blocking-operations

scan-unit-tests

Infrastructure Scanners

scan-env-vars

scan-prometheus-metrics

HTTP & Exception Scanners

scan-http-requests

scan-logging

scan-exception-handlers

Code Quality Scanners

scan-complexity-patterns

Architecture

Data Models

Common Utilities

Key Features

Markdown Rendering

Basic Usage

Features

Markdown Options

Example Outputs

Programmatic Usage

Supported Scanner Types

Template Customization

Integration Testing

Running Integration Tests

CI Validation

Accepting Result Changes

Contributing

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages