
Starred repositories
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
A collection of awesome web crawlers and spiders in different languages
A Kafka Connect Source Connector for DynamoDB
Counter Strike : Global Offensive Source Code
Leak of CS:GO Source code, provided by yours truly so go rep me
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
Obsidian Vault Backup; Notes on ML Articles/Books/Lectures
This is the official code of the published paper 'A Multi-action Deep Reinforcement Learning Framework for Flexible Job-shop Scheduling Problem'
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/mEkkMXFG
Turn any webpage into structured data using LLMs
🎭 Playwright integration for Scrapy
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
Causal Inference for Time Series Data (with CausalML Demo)
This Scrapy project uses Redis and Kafka to create a distributed, on-demand scraping cluster.
Scrapy+Splash for JavaScript integration
Distributed web crawler admin platform for managing spiders, regardless of language or framework.
💡 LeetCode in C++20/Java/Python/MySQL/TypeScript (respect coding conventions)
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of Langchain, OpenAI GPTs, META LLAMA3, and Agents.
A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks
Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of Langchain, OpenAI GPTs, META LLAMA3, and Agents.
Notebooks for Large Language Models (LLMs) Specialization