chore: Updated documentation and README.
anirbanbasu committed Jun 8, 2024
1 parent 0259a81 commit beb26fc
Showing 10 changed files with 72 additions and 106 deletions.
95 changes: 3 additions & 92 deletions README.md
@@ -2,98 +2,9 @@

# Too Long, Didn't Read, Let's Chat (TLDRLC)

Too Long Didn't Read Let's Chat (TLDRLC) is a chatbot prototype that allows users to chat with a _large language model (LLM)_ about various data sources. This chatbot utilises _retrieval augmented generation (RAG)_ with _knowledge graph_ and _semantic search_.
Too Long Didn't Read Let's Chat (TLDRLC) is a chatbot prototype that allows a user to chat with a _large language model (LLM)_ about various data sources. This chatbot utilises _retrieval augmented generation (RAG)_ backed by _knowledge graph_ and _semantic search_ indices.

TLDRLC is experimental software. It is not associated with, and should not be interpreted as, a reliable chatbot service. The rationale behind this project is experimentation with retrieval augmented generation. The implementation is based on concepts from publicly available information, including tutorials and courses such as [DeepLearning.ai: Knowledge Graphs for RAG](https://www.deeplearning.ai/short-courses/knowledge-graphs-rag/) and [DeepLearning.ai: Building and Evaluating Advanced RAG](https://www.deeplearning.ai/courses/building-evaluating-advanced-rag/), to name a few.

## Installation

If you want to run this app in a Docker container, you can skip the installation of Python dependencies below. The process of running the app with Docker is described in the usage section. You may still need to set up the language model provider, and the storage for documents, indices and graphs.

In addition to installation, you can configure the application using environment variables. Some settings can be modified through the app's web interface. All settings can be supplied through a local `.env` file or as environment variables. However, settings changed on the app's Settings page are not written back to the environment, so the settings in one browser session are independent of those in another, even if both sessions are initialised from the same environment variables. A comprehensive list of the supported environment variables is available in the `.env.template` file in this repository, which can serve as a starting point.
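As an illustration, a minimal `.env` might look like the following sketch; only variables mentioned in this README are used, and the values are placeholders rather than authoritative defaults.

```
# Sketch of a minimal .env file; values are placeholders, not authoritative defaults.
LLM_PROVIDER = "Ollama"
OLLAMA_MODEL = "llama3"
# Disable Neo4j and use the in-memory graph database instead.
NEO4J_DISABLE = "True"
# Leave Langfuse-based evaluation off.
EVAL_USE_LANGFUSE = "False"
```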

### Python dependencies

You will need Python installed on your computer. The code in this repository has been tested on Python 3.12.0. Refrain from using the system Python; use a Python version and virtual environment manager such as [pyenv](https://github.com/pyenv/pyenv), and create a new Python virtual environment for Python 3.12.0 or above.
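A minimal sketch of creating and activating such an environment, assuming pyenv with the pyenv-virtualenv plugin is installed (the environment name `tldrlc-venv` is only an example):

```
# Assumes pyenv with the pyenv-virtualenv plugin; the environment name is arbitrary.
pyenv install 3.12.0
pyenv virtualenv 3.12.0 tldrlc-venv
pyenv activate tldrlc-venv
```

With the virtual environment active, you can install all the dependencies for this application by running the following.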

```
pip install --upgrade pip
pip install -r requirements.txt
```
#### Optional: Uninstall all dependencies to start afresh

If necessary, you can uninstall everything previously installed by `pip` (in a virtual environment) by running the following.

```
pip freeze | xargs pip uninstall -y
```

#### Upgrading Python dependencies

The currently installed packages can be upgraded and the `requirements.txt` updated accordingly by running the following.

```
sed 's/==.*$//' requirements.txt | xargs pip install --upgrade
pip-autoremove -f > requirements.txt
```

### Language model providers, documents, index and graph storage

#### Supported language model providers

You can specify the language model provider using the environment variable `LLM_PROVIDER`, which defaults to `Ollama` if not specified. The supported language model providers are:

1. Cohere.
2. Ollama.
3. Open AI.

If using [Ollama](https://ollama.com/), you will also need to install it or point the chatbot to a remotely hosted Ollama server. You also need to pull the Ollama model that you specify with the `OLLAMA_MODEL` environment variable by running `ollama pull <model-name>` (replace `<model-name>` with the actual model that you want to use) on your Ollama server. Check the [available Ollama models](https://ollama.com/library).
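For example, a sketch assuming the arbitrarily chosen model name `llama3`:

```
# Pull an example model on the Ollama server; "llama3" is only an illustrative choice.
ollama pull llama3
# Tell the app which Ollama model to use (this could also go into the .env file).
export OLLAMA_MODEL="llama3"
```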

Open AI can be used by specifying an `OPENAI_API_KEY`, an `OPENAI_MODEL`, and by choosing `Open AI` as the `LLM_PROVIDER`. Follow [this link](https://platform.openai.com/account/api-keys) to get an Open AI API key. Similarly, Cohere can be used by specifying a `COHERE_API_KEY`, a `COHERE_MODEL` (which defaults to `command-r-plus`), and by choosing `Cohere` as the `LLM_PROVIDER`. Follow [this link](https://cohere.com/pricing) to obtain a Cohere API key.
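As a further sketch, selecting Cohere could look like the following `.env` fragment; the API key value is a placeholder.

```
# Sketch: use Cohere as the LLM provider; the API key value is a placeholder.
LLM_PROVIDER = "Cohere"
COHERE_API_KEY = "<your-cohere-api-key>"
COHERE_MODEL = "command-r-plus"
```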

See the settings in the `.env.template` file for customisation of the LLM settings.

#### Documents and index storage

A [Redis Stack installation](https://redis.io/docs/install/install-stack/) is required to store the parsed document chunks and the index, which can then be reloaded without having to rebuild the index from the sources. Although the default settings are okay, refer to the [Redis persistence configuration](https://redis.io/docs/management/persistence/) if necessary. As an alternative to a local Redis installation, you can also opt for a remote one, such as on [Render](https://render.com/). Depending on your hosting plan, various limits will apply to such cloud-based managed Redis installations.
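As a sketch, a local Redis Stack instance could be started with Docker as follows; the container name is arbitrary and the ports are the Redis Stack defaults.

```
# Start a local Redis Stack instance; 6379 is the Redis port, 8001 is RedisInsight.
docker run -d --name redis-stack -p 6379:6379 -p 8001:8001 redis/redis-stack:latest
```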

#### Graph database

Similarly, you will need the [Neo4j graph database](https://neo4j.com/). You can either install it locally or use [Neo4j Aura DB](https://neo4j.com/cloud/platform/aura-graph-database/), which is a cloud-hosted version of Neo4j.

_Note that unless you specify `NEO4J_DISABLE = "True"` to disable Neo4J and use an in-memory graph database, the Neo4J server must be accessible using the specified connection credentials. Otherwise, the application will display an error at the time of starting up._
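For a local setup, a Neo4j instance could be started with Docker as in the sketch below; the credentials are placeholders and must match whatever connection settings you give the app.

```
# Start a local Neo4j instance; 7474 is the HTTP port, 7687 is the Bolt port.
# The username/password pair is a placeholder -- change it and mirror it in the app settings.
docker run -d --name neo4j -p 7474:7474 -p 7687:7687 -e NEO4J_AUTH=neo4j/please-change-me neo4j:latest
```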

### Performance evaluation using Langfuse

If you wish to use [Langfuse](https://langfuse.com/) for performance evaluation then set `EVAL_USE_LANGFUSE = "True"` in the `.env` file followed by `LANGFUSE_PUBLIC_KEY`, `LANGFUSE_PRIVATE_KEY`, `LANGFUSE_URL`. You can set these up by running Langfuse self-hosted or by signing up and signing in to [the Langfuse cloud](https://cloud.langfuse.com/).
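A sketch of the corresponding `.env` entries, with placeholder values:

```
# Sketch: enable Langfuse tracing; the keys are placeholders.
EVAL_USE_LANGFUSE = "True"
LANGFUSE_PUBLIC_KEY = "<your-langfuse-public-key>"
LANGFUSE_PRIVATE_KEY = "<your-langfuse-private-key>"
# Point this at your self-hosted Langfuse instance if you are not using the cloud.
LANGFUSE_URL = "https://cloud.langfuse.com"
```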

## Usage

### On your computer
Once you have installed the dependencies mentioned above in your Python virtual environment, run the chatbot by executing `solara run app.py`. It will automatically open a browser tab (unless you are using a headless terminal) showing the chatbot application. An alternative way of running the app is to execute the script `run_starlette.sh`, which loads the app using the [Starlette framework](https://www.starlette.io/) on the Asynchronous Server Gateway Interface (ASGI) server [uvicorn](https://www.uvicorn.org/).
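The two launch options side by side, as a sketch (both assume that the virtual environment is active and that `run_starlette.sh` is executable):

```
# Option 1: run with Solara's built-in server.
solara run app.py
# Option 2: run through Starlette on the uvicorn ASGI server using the provided script.
./run_starlette.sh
```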

### Docker
You can run the app in a Docker container. By default, the app inside the Docker container will not use persistable index, document or graph storage. To run the app in a Docker container, you have to build its image (which we name `tldrlc`, although you can choose any other name) and run an instance of that image (which we name `tldrlc-container`, but you can also pick a name of your choice), as follows.

```
docker build -f local.dockerfile -t tldrlc .
docker create -p 8765:8765/tcp --name tldrlc-container tldrlc
docker container start tldrlc-container
```

Following this, the app will be accessible on your Docker host, for example at [http://localhost:8765](http://localhost:8765) -- assuming that nothing else on the host is blocking port 8765 when the container starts.

<!-- If you want to change the settings of the app itself inside the container, login to the container as `root`. You can do this by running `docker exec -it tldrlc-container bash`. Once, you have the shell access in the container, edit the file `/app/.env` using the `nano` editor that is installed for convenience. For example, you can change the default behaviour of the containerised app to use your preferred remote graph, index and document storage. Then, restart the _same_ container, by running `docker container restart tldrlc-container`. Remember that these changes _will not_ propagate to any new container that you spin out of the image. -->

The Docker container depends on an external LLM provider, a graph database, and document and index storage. If any of these, such as `Ollama`, is running on the Docker host, then you should use `host.docker.internal` or `gateway.docker.internal` as the host name for that service. See [the networking documentation of Docker desktop](https://docs.docker.com/desktop/networking/) for details.
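As a sketch, host-side services could be passed to the container through an environment file; the variable name for the Ollama endpoint below is illustrative only (consult `.env.template` for the actual names), and 11434 is Ollama's default port.

```
# Sketch: the Ollama endpoint variable name is illustrative; check .env.template for the real one.
echo 'OLLAMA_URL=http://host.docker.internal:11434' > .env.docker
docker create -p 8765:8765/tcp --env-file .env.docker --name tldrlc-container tldrlc
```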

### Cloud deployment

There is a public deployment of this app available through Hugging Face Spaces at [https://huggingface.co/spaces/xtremebytes/TLDRLC](https://huggingface.co/spaces/xtremebytes/TLDRLC). Note that this deployment is experimental and bugs are quite likely. There may be additional problems due to restrictions of the Hugging Face Spaces infrastructure that will not manifest in a local deployment.

For a cloud deployment, you have to use Open AI or Cohere. By default, the graph, index and documents will be stored in memory with no disk persistence. If you want persistence with a cloud deployment, you must use [Neo4j Aura DB](https://neo4j.com/cloud/platform/aura-graph-database/) and a remotely hosted Redis (e.g., [Redis on Render](https://docs.render.com/redis)). Alternatively, you can use locally or on-premises hosted Ollama, Neo4j and Redis by exposing those services publicly with TCP (or TLS) tunnels through [ngrok](https://ngrok.com/).
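For instance, exposing a locally hosted Neo4j's Bolt port through an ngrok TCP tunnel could be as simple as the sketch below; the forwarding address that ngrok prints is what you would then use as the graph database host.

```
# Expose the local Neo4j Bolt port (default 7687) through a public ngrok TCP tunnel.
ngrok tcp 7687
```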

**Note** that for the cloud deployment(s) mentioned above, unless you create a separate cloud deployment yourself, Langfuse evaluation is enabled by default and cannot be turned off. This is meant for evaluation of this project; hence, all information about your interactions with the chatbot will be available to the maintainer(s) of this project. _For local, Docker or cloud deployments that you create yourself, Langfuse is **not** enabled by default. Even when you enable it, the Langfuse traces will be available to you, and not to the maintainer(s) of this project_.
To learn more about it, check the [project documentation](https://anirbanbasu.github.io/tldrlc/).

## Support

@@ -109,4 +20,4 @@ Apache License, Version 2.0, January 2004. See: [http://www.apache.org/licenses/

## Project status

Actively maintained early stage prototype for experimentation only.
Actively maintained early stage prototype for experimentation only.
4 changes: 2 additions & 2 deletions docs/book.toml
@@ -4,7 +4,7 @@ language = "en"
text-direction = "ltr"
multilingual = false
src = "src"
title = "Too Long, Didn't Read, Let's Chat (TLDRLC)"
title = "TLDRLC"
description = "Too Long Didn't Read Let's Chat (TLDRLC) is a chatbot prototype that allows users to chat with a large language model (LLM) about various data sources. This chatbot utilises retrieval augmented generation (RAG) with knowledge graph and semantic search."

[rust]
@@ -33,7 +33,7 @@ git-repository-icon = "fa-github"
site-url = "/"

[output.html.fold]
enable = true # whether or not to enable section folding
enable = false # whether or not to enable section folding
level = 0 # the depth to start folding


23 changes: 14 additions & 9 deletions docs/src/SUMMARY.md
@@ -1,14 +1,19 @@
# Summary

[Overview](overview.md)
# Concepts

# User guide
- [Overview](concepts/overview.md)

- [Installation](guide/installation.md)
- [Python dependencies](guide/python.md)
- [Containerisation](guide/container.md)
- [LLM and indices](guide/llm-indices.md)
- [Evaluation](guide/evaluation.md)
# User guide

--
[Draft Chapter]()
- [Usage]()
- [Components]()
- [Language model](guide/llm.md)
- [Indices](guide/indices.md)
- [Installation](guide/installation.md)
- [Python dependencies](guide/python.md)
- [Containerisation](guide/container.md)
- [Run]()
- [Locally](guide/run-local.md)
- [Cloud deployments](guide/cloud-deployments.md)
- [Evaluation](guide/evaluation.md)
5 changes: 5 additions & 0 deletions docs/src/concepts/overview.md
@@ -0,0 +1,5 @@
# Too Long, Didn't Read, Let's Chat (TLDRLC)

Too Long Didn't Read Let's Chat (TLDRLC) is a chatbot prototype that allows a user to chat with a _large language model (LLM)_ about various data sources. This chatbot utilises _retrieval augmented generation (RAG)_ backed by _knowledge graph_ and _semantic search_ indices.

TLDRLC is experimental software. It is not associated with, and should not be interpreted as, a reliable chatbot service. The rationale behind this project is experimentation with retrieval augmented generation. The implementation is based on concepts from publicly available information, including tutorials and courses such as [DeepLearning.ai: Knowledge Graphs for RAG](https://www.deeplearning.ai/short-courses/knowledge-graphs-rag/) and [DeepLearning.ai: Building and Evaluating Advanced RAG](https://www.deeplearning.ai/courses/building-evaluating-advanced-rag/), to name a few.
5 changes: 5 additions & 0 deletions docs/src/guide/cloud-deployments.md
@@ -0,0 +1,5 @@
# Cloud deployment

There is a public deployment of this app available through Hugging Face Spaces at [https://huggingface.co/spaces/xtremebytes/TLDRLC](https://huggingface.co/spaces/xtremebytes/TLDRLC). Note that this deployment is experimental and bugs are quite likely. There may be additional problems due to restrictions of the Hugging Face Spaces infrastructure that will not manifest in a local deployment.

For a cloud deployment, you have to use Open AI or Cohere. By default, the graph, index and documents will be stored in memory with no disk persistence. If you want persistence with a cloud deployment, you must use [Neo4j Aura DB](https://neo4j.com/cloud/platform/aura-graph-database/) and a remotely hosted Redis (e.g., [Redis on Render](https://docs.render.com/redis)). Alternatively, you can use locally or on-premises hosted Ollama, Neo4j and Redis by exposing those services publicly with TCP (or TLS) tunnels through [ngrok](https://ngrok.com/).
5 changes: 3 additions & 2 deletions docs/src/guide/container.md
@@ -4,5 +4,6 @@ You can run the app in a Docker container. By default, the app inside the Docker
```
docker build -f local.dockerfile -t tldrlc .
docker create -p 8765:8765/tcp --name tldrlc-container tldrlc
docker container start tldrlc-container
```
```

<!-- If you want to change the settings of the app itself inside the container, login to the container as `root`. You can do this by running `docker exec -it tldrlc-container bash`. Once, you have the shell access in the container, edit the file `/app/.env` using the `nano` editor that is installed for convenience. For example, you can change the default behaviour of the containerised app to use your preferred remote graph, index and document storage. Then, restart the _same_ container, by running `docker container restart tldrlc-container`. Remember that these changes _will not_ propagate to any new container that you spin out of the image. -->
4 changes: 3 additions & 1 deletion docs/src/guide/evaluation.md
@@ -1,3 +1,5 @@
# Performance evaluation using Langfuse

If you wish to use [Langfuse](https://langfuse.com/) for performance evaluation then set `EVAL_USE_LANGFUSE = "True"` in the `.env` file followed by `LANGFUSE_PUBLIC_KEY`, `LANGFUSE_PRIVATE_KEY`, `LANGFUSE_URL`. You can set these up by running Langfuse self-hosted or by signing up and signing in to [the Langfuse cloud](https://cloud.langfuse.com/).
If you wish to use [Langfuse](https://langfuse.com/) for performance evaluation then set `EVAL_USE_LANGFUSE = "True"` in the `.env` file followed by `LANGFUSE_PUBLIC_KEY`, `LANGFUSE_PRIVATE_KEY`, `LANGFUSE_URL`. You can set these up by running Langfuse self-hosted or by signing up and signing in to [the Langfuse cloud](https://cloud.langfuse.com/).

**Note** that for the [cloud deployment(s)](cloud-deployments.md), unless you create a separate cloud deployment yourself, Langfuse evaluation is enabled by default and cannot be turned off. This is meant for evaluation of this project; hence, all information about your interactions with the chatbot will be available to the maintainer(s) of this project. _For local, Docker or cloud deployments that you create yourself, Langfuse is **not** enabled by default. Even when you enable it, the Langfuse traces will be available to you, and not to the maintainer(s) of this project_.
11 changes: 11 additions & 0 deletions docs/src/guide/indices.md
@@ -0,0 +1,11 @@
# Documents, index and graph storage

## Documents and index storage

A [Redis Stack installation](https://redis.io/docs/install/install-stack/) is required to store the parsed document chunks and the index, which can then be reloaded without having to rebuild the index from the sources. Although the default settings are okay, refer to the [Redis persistence configuration](https://redis.io/docs/management/persistence/) if necessary. As an alternative to a local Redis installation, you can also opt for a remote one, such as on [Render](https://render.com/). Depending on your hosting plan, various limits will apply to such cloud-based managed Redis installations.

## Graph database

Similarly, you will need the [Neo4j graph database](https://neo4j.com/). You can either install it locally or use [Neo4j Aura DB](https://neo4j.com/cloud/platform/aura-graph-database/), which is a cloud-hosted version of Neo4j.

_Note that unless you specify `NEO4J_DISABLE = "True"` to disable Neo4J and use an in-memory graph database, the Neo4J server must be accessible using the specified connection credentials. Otherwise, the application will display an error at the time of starting up._
15 changes: 15 additions & 0 deletions docs/src/guide/llm.md
@@ -0,0 +1,15 @@
# Language model providers

## Supported language model providers

You can specify the language model provider using the environment variable `LLM_PROVIDER`, which defaults to `Ollama` if not specified. The supported language model providers are:

1. Cohere.
2. Ollama.
3. Open AI.

If using [Ollama](https://ollama.com/), you will also need to install it or point the chatbot to a remotely hosted Ollama server. You also need to pull the Ollama model that you specify with the `OLLAMA_MODEL` environment variable by running `ollama pull <model-name>` (replace `<model-name>` with the actual model that you want to use) on your Ollama server. Check the [available Ollama models](https://ollama.com/library).

Open AI can be used by specifying an `OPENAI_API_KEY`, an `OPENAI_MODEL`, and by choosing `Open AI` as the `LLM_PROVIDER`. Follow [this link](https://platform.openai.com/account/api-keys) to get an Open AI API key. Similarly, Cohere can be used by specifying a `COHERE_API_KEY`, a `COHERE_MODEL` (which defaults to `command-r-plus`), and by choosing `Cohere` as the `LLM_PROVIDER`. Follow [this link](https://cohere.com/pricing) to obtain a Cohere API key.

See the settings in the `.env.template` file for customisation of the LLM settings.
11 changes: 11 additions & 0 deletions docs/src/guide/run-local.md
@@ -0,0 +1,11 @@
# Run locally

## The non-containerised way

Once you have installed the dependencies mentioned in the installation guide in your Python virtual environment, run the chatbot by executing `solara run app.py`. It will automatically open a browser tab (unless you are using a headless terminal) showing the chatbot application. An alternative way of running the app is to execute the script `run_starlette.sh`, which loads the app using the [Starlette framework](https://www.starlette.io/) on the Asynchronous Server Gateway Interface (ASGI) server [uvicorn](https://www.uvicorn.org/).

## Containerised (Docker)

Following the [creation of the container](container.md), you can start the app using `docker container start tldrlc-container`. The app will then be accessible on your Docker host, for example at [http://localhost:8765](http://localhost:8765), assuming that nothing else on the host is blocking port 8765 when the container starts.

The Docker container depends on an external LLM provider, a graph database, and document and index storage. If any of these, such as `Ollama`, is running on the Docker host, then you should use `host.docker.internal` or `gateway.docker.internal` as the host name for that service. See [the networking documentation of Docker desktop](https://docs.docker.com/desktop/networking/) for details.
