Skip to content

professional-ALFIE/github-extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

English | 한국어 | 日本語 | 中文

GitHub Extractor Skill for Claude Code / Codex CLI / Gemini CLI

Ask an AI agent to analyze open source? It clones the entire 500MB repo to /tmp. Ask again? It clones again. Tokens wasted, time wasted. Use web search instead? Less accurate than reading the actual code, more hallucinations, and still burns tokens.

Not anymore. Download only the folders you need, or extract raw links and use them directly.

🚀 Feed raw links into NotebookLM and build your own Deep Wiki from any GitHub repo.


What It Does

Input Output Use Case
GitHub specific folder Raw URL list Feed to NotebookLM, LLMs
GitHub specific folder Downloaded files Partial clone (skip the whole repo)

Installation

Quick Install (Recommended)

curl -sL https://raw.githubusercontent.com/professional-ALFIE/github-extractor/master/install.sh | bash

This installs the skill and scripts to ~/.claude/skills/github-extractor/.


Scripts

1. extractor_githubRawLinks.py

Get raw URLs for files in a specific GitHub path.

Why not git clone? You only need one folder, not 500MB of repo.

./extractor_githubRawLinks.py https://github.com/owner/repo/tree/main/docs

Output:

https://raw.githubusercontent.com/owner/repo/main/docs/guide.md
https://raw.githubusercontent.com/owner/repo/main/docs/api.md
  • Auto-copied to clipboard (macOS)
  • Supports any branch, tag, or commit

Requirements: Python 3 + requests

Tip: Set GITHUB_TOKEN to increase rate limit (60 → 5000 req/hour)


2. extractor_githubRawFiles.py

Download files from a specific GitHub directory — skip the full clone.

./extractor_githubRawFiles.py https://github.com/owner/repo/tree/main/docs ./local-docs

Auto-ignores: node_modules/, .git/, __pycache__/, dist/, build/, .next/, venv/, .pyc, .exe, .dll, .so

Requirements: Python 3 + requests

Tip: Set GITHUB_TOKEN to increase rate limit (60 → 5000 req/hour)


Use with Claude Code

Once installed, just ask:


License

MIT

About

Download specific folders from GitHub — no full clone. Extract raw links and feed them into NotebookLM to build your own Deep Wiki from any repo.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors