-
Notifications
You must be signed in to change notification settings - Fork 71
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Draft refactor docs * Include layout for the api * Layout for the docs * Redirect imports of LLMs * Draft overview and getting started * Update docstrings * Fix docstrings * Fix argilla reference * Remove extra line-break * Refactor and rename `llm` -> `llms` * Refactor and rename `task` -> `tasks` * Remove extra line-breaks * Add missing `type: ignore` * Update `tasks` and `llms` imports * Fix imports in `tests/` * Fix `QualityScorer.format_input` signature * Update `extra.md` * Fix `mkdocs.yml` API reference for LLMs * Add `docs/papers` (WIP) * Update `docs/papers` (WIP) * Fix imports after rename to `tasks` * Remove not used files * Update main page * Move argilla docs * Move papers to sections * Remove old tutorials * Update nav * Remove navigation * Advances on docs, learn section (#497) * Add section for distiset * Update distiset * Add sample images for screenshots of pipeline runs * Remove unused files * Draft including tutorial and advances steps, work in progress * Fix minor bugs and add `docs/sections/papers/*.md` (#499) * Fix `distilabel.steps.tasks` imports in `__init__` * Fix formatting in `__init__.py` * Remove `commit_message` from `push_to_hub` * Add missing `super().load()` to load `logging` * Fix `outputs` in `UltraFeedback` * Add `model_post_init` in `Argilla` to supress `warnings` * Add `docs/sections/papers/ultrafeedback.md` * Add `docs/sections/papers/instruction_backtranslation.md` * Fix `tests/unit` * Add `httpx` under `TYPE_CHECKING` * Fix `argilla` optional dependency handling * Revert `AnthropicLLM.http_client` typing and add `httpx` dependency instead * Apply suggestions from code review Co-authored-by: David Berenstein <davidberenstein1957@users.noreply.github.com> --------- Co-authored-by: David Berenstein <davidberenstein1957@users.noreply.github.com> * Docs cli (#502) * Update serialization method of _BatchManager to write each of the inner steps to a file and load them back (#496) * Update replacing string * Add guide to the CLI * Added CLI to api reference a small reference to that from the tutorial * Docs caching (#500) * Update serialization method of _BatchManager to write each of the inner steps to a file and load them back (#496) * Add docstrings to lost argument in Distiset * Add section for caching in advanced tutorial * Add `AzureOpenAILLM` (#505) * Add `AzureOpenAILLM` * Update `distilabel.llms` imports * Fix `base_url` env var and add `api_version` env var * Add `AzureOpenAILLM` to `test_imports` * Add `TestAzureOpenAILLM` * Fix `base_url` docstring * Remove `together` extra and place `tests` extra properly * Fix extras alphabetic order in `pyproject.toml` * Update `docs/index.md` and `README.md` * Add `docs/api/llms/azure.md` * Docs steps (#503) * Update layout of steps * Add step guide and draft of special types of steps * Add reference for the step decorator * Include step decorator in the tutorial * Add intro to the different types of steps * Add generator steps * Update general and global steps * Fix typos * Missing argilla steps examples in general steps * Create initial layout for tasks * Add special tasks * Add `StepInput` missing import * Deita tutorial for docs (#504) * docs: add deita notebook from community meetup * Add `asyncio.get_running_loop` for Colab * feat: refactor into individual steps * fix: patch async active loops * chore: tidy print incremental steps * fix: remove nested asyncio * convert tutorial to markdown and move * add assets to repo * reference tutorial in mkdocs menu bar * formatting and prose in deita tutorial * Add mathjax to render math properly * Update sections to render properly and add some stylistic choices for variable names * update imports to shortcuts in Deita tutorial Co-authored-by: Alvaro Bartolome <alvaro@argilla.io> * Update docs/sections/papers/deita.md Co-authored-by: David Berenstein <david.m.berenstein@gmail.com> * Update docs/sections/papers/deita.md Co-authored-by: David Berenstein <david.m.berenstein@gmail.com> * docs: respond to prose feedback --------- Co-authored-by: Alvaro Bartolome <alvaro@argilla.io> Co-authored-by: plaguss <agustin@argilla.io> Co-authored-by: David Berenstein <david.m.berenstein@gmail.com> * Update docs/sections/learn/steps/index.md Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> * Update docs/sections/learn/steps/index.md Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> * Update docs/sections/learn/steps/index.md Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> * Update docs/sections/learn/steps/index.md Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> * Docs tasks (#506) * Add feedback tasks * Add text generation and self instruct * Add fix for runtime parameter of extra arguments * Update docs/sections/learn/tasks/feedback_tasks.md Co-authored-by: David Berenstein <david.m.berenstein@gmail.com> * Add text generation specific tasks * Add example of custom task * Add runtime parameters * Modify place of runtime parameters --------- Co-authored-by: David Berenstein <david.m.berenstein@gmail.com> * Add `docs/sections/learn/argilla.md` (#509) * Fix wrong formatting around `#` * Add `{TextGeneration,Preference}ToArgilla` in docs * Add `argilla.md` and move Argilla docs there * Add detailed examples in `argilla.md` * Add `assets` for `argilla.md` * Add deployment tips in `argilla.md` * Add `docs/sections/learn/llms/index.md` (#514) * Add `docs/sections/learn/llms/index.md` * Update docs/sections/learn/llms/index.md --------- Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> * Docs pipeline (#512) * Draft of pipeline section * Finish pipeline docs section * Add CLI `run` example --------- Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> --------- Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> Co-authored-by: Alvaro Bartolome <alvaro@argilla.io> Co-authored-by: David Berenstein <davidberenstein1957@users.noreply.github.com> Co-authored-by: burtenshaw <ben@argilla.io> Co-authored-by: David Berenstein <david.m.berenstein@gmail.com>
- Loading branch information
1 parent
6c4d9ae
commit fbcaf6f
Showing
247 changed files
with
3,834 additions
and
21,460 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,48 @@ | ||
# Command Line Interface | ||
|
||
This section contains the API reference for the command line interface. | ||
|
||
## CLI commands | ||
|
||
This section shows the CLI commands: | ||
|
||
### distilabel pipeline run | ||
|
||
```bash | ||
$ distilabel pipeline info --help | ||
|
||
Usage: distilabel pipeline info [OPTIONS] | ||
|
||
Get information about a Distilabel pipeline. | ||
|
||
╭─ Options ───────────────────────────────────────────────────────────────────────────╮ | ||
│ * --config TEXT Path or URL to the Distilabel pipeline configuration file. │ | ||
│ [default: None] │ | ||
│ [required] │ | ||
│ --help Show this message and exit. │ | ||
╰─────────────────────────────────────────────────────────────────────────────────────╯ | ||
``` | ||
|
||
### distilabel pipeline info | ||
|
||
```bash | ||
$ distilabel pipeline --help | ||
|
||
Usage: distilabel pipeline [OPTIONS] COMMAND [ARGS]... | ||
|
||
Commands to run and inspect Distilabel pipelines. | ||
|
||
╭─ Options ───────────────────────────────────────────────────────────────────────────────╮ | ||
│ --help Show this message and exit. │ | ||
╰─────────────────────────────────────────────────────────────────────────────────────────╯ | ||
╭─ Commands ──────────────────────────────────────────────────────────────────────────────╮ | ||
│ info Get information about a Distilabel pipeline. │ | ||
│ run Run a Distilabel pipeline. │ | ||
╰─────────────────────────────────────────────────────────────────────────────────────────╯ | ||
``` | ||
|
||
## Utility functions for the pipeline commands | ||
|
||
Here are some utility functions to help working with the pipelines in the console. | ||
|
||
::: distilabel.cli.pipeline.utils |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## AnthropicLLM | ||
|
||
::: distilabel.llms.anthropic |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## AnyscaleLLM | ||
|
||
::: distilabel.llms.anyscale |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
## AzureOpenAILLM | ||
|
||
::: distilabel.llms.azure | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,11 @@ | ||
# Hugging Face | ||
|
||
This section contains the reference for Hugging Face integrations: | ||
|
||
## Inference Endpoints | ||
|
||
::: distilabel.llms.huggingface.inference_endpoints | ||
|
||
## Transformers | ||
|
||
::: distilabel.llms.huggingface.transformers |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## LiteLLM | ||
|
||
::: distilabel.llms.litellm |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## LlamacppLLM | ||
|
||
::: distilabel.llms.llamacpp |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## MistralLLM | ||
|
||
::: distilabel.llms.mistral |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## OllamaLLM | ||
|
||
::: distilabel.llms.ollama |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## OpenaiLLM | ||
|
||
::: distilabel.llms.openai |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## TogetherLLM | ||
|
||
::: distilabel.llms.together |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## VertexaiLLM | ||
|
||
::: distilabel.llms.vertexai |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
# vLLM | ||
|
||
::: distilabel.llms.vllm |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
# Pipeline | ||
|
||
## Base Pipeline | ||
|
||
::: distilabel.pipeline.base | ||
|
||
## Local Pipeline | ||
|
||
::: distilabel.pipeline.local | ||
|
||
## Extra | ||
|
||
::: distilabel.pipeline.utils |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
# Argilla | ||
|
||
::: distilabel.steps.argilla.base | ||
::: distilabel.steps.argilla.preference | ||
::: distilabel.steps.argilla.text_generation |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
# step decorator | ||
|
||
This section contains the reference for the `@step` decorator, | ||
|
||
::: distilabel.steps.decorator |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
# Extra | ||
|
||
::: distilabel.steps.combine | ||
::: distilabel.steps.conversation | ||
::: distilabel.steps.decorator | ||
::: distilabel.steps.deita | ||
::: distilabel.steps.expand | ||
::: distilabel.steps.keep | ||
::: distilabel.steps.typing | ||
::: distilabel.steps.tasks.typing |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
# Generator Steps | ||
|
||
::: distilabel.steps.generators.data | ||
::: distilabel.steps.generators.huggingface |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
# Global Steps | ||
|
||
::: distilabel.steps.globals.huggingface |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
# Steps | ||
|
||
::: distilabel.steps.base |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
# Embeddings | ||
|
||
::: distilabel.steps.tasks.generate_embeddings |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
# Preference tasks | ||
|
||
::: distilabel.steps.tasks.ultrafeedback |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,29 @@ | ||
# Tasks | ||
|
||
::: distilabel.steps.tasks.base | ||
|
||
## General Text Generation | ||
|
||
::: distilabel.steps.tasks.text_generation | ||
|
||
## Evol Instruct | ||
|
||
::: distilabel.steps.tasks.evol_instruct.base | ||
::: distilabel.steps.tasks.evol_instruct.generator | ||
::: distilabel.steps.tasks.evol_instruct.utils | ||
|
||
### Evol Complexity | ||
|
||
::: distilabel.steps.tasks.evol_instruct.evol_complexity.base | ||
::: distilabel.steps.tasks.evol_instruct.evol_complexity.generator | ||
::: distilabel.steps.tasks.evol_instruct.evol_complexity.utils | ||
|
||
## Evol Quality | ||
|
||
::: distilabel.steps.tasks.evol_quality.base | ||
::: distilabel.steps.tasks.evol_quality.utils | ||
|
||
## DEITA Scorers | ||
|
||
::: distilabel.steps.tasks.complexity_scorer | ||
::: distilabel.steps.tasks.quality_scorer |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
--- | ||
description: Get familiar with the distilabel's pipelines. | ||
--- | ||
|
||
# Overview of Distilabel | ||
|
||
AI Feedback (AIF) framework to build datasets with and for LLMs: | ||
|
||
## Pipeline | ||
|
||
Define your pipeline like you would a Directed Acyclic Graph (DAG)... | ||
|
||
## Steps | ||
|
||
... | ||
|
||
## Command Line Interface | ||
|
||
Distilabel comes with a CLI to easily reproduce datasets from a `pipeline.yaml`. | ||
... | ||
|
Oops, something went wrong.