-
Notifications
You must be signed in to change notification settings - Fork 8
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Guangsen Wang
committed
Nov 24, 2022
0 parents
commit 86357c8
Showing
153 changed files
with
31,792 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,38 @@ | ||
name: docs | ||
|
||
on: | ||
push: | ||
branches: [ main ] | ||
pull_request: | ||
branches: [ main ] | ||
release: | ||
types: [ published ] | ||
|
||
jobs: | ||
build: | ||
|
||
runs-on: ubuntu-18.04 | ||
|
||
steps: | ||
- uses: actions/checkout@v2 | ||
with: | ||
fetch-depth: 0 | ||
- name: Set up Python | ||
uses: actions/setup-python@v2 | ||
with: | ||
python-version: '3.8' | ||
- name: Install dependencies | ||
run: | | ||
python -m pip install --upgrade pip setuptools wheel | ||
sudo apt-get update | ||
sudo apt-get install openjdk-11-jdk | ||
sudo apt-get install pandoc | ||
- name: Build Sphinx docs | ||
run: | | ||
docs/build_docs.sh | ||
- name: Deploy to gh-pages | ||
uses: peaceiris/actions-gh-pages@v3 | ||
if: ${{ github.ref == 'refs/heads/main' || github.event_name == 'release' }} | ||
with: | ||
github_token: ${{ secrets.GITHUB_TOKEN }} | ||
publish_dir: docs/_build/html |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,34 @@ | ||
FROM nvcr.io/nvidia/pytorch:20.12-py3 | ||
|
||
|
||
ARG device | ||
|
||
COPY requirements_${device}.txt requirements_${device}.txt | ||
|
||
ENV DEBIAN_FRONTEND=noninteractive | ||
|
||
RUN apt-get update && apt-get install -y --no-install-recommends \ | ||
locales \ | ||
wget \ | ||
build-essential \ | ||
vim \ | ||
htop \ | ||
curl \ | ||
git less ssh cmake \ | ||
zip unzip gzip bzip2 \ | ||
python3-tk gcc g++ libpq-dev | ||
|
||
|
||
RUN pip install -U pip && pip install -r requirements_${device}.txt | ||
RUN pip install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio==0.9.0 -f https://download.pytorch.org/whl/torch_stable.html | ||
|
||
COPY . /app | ||
WORKDIR /app | ||
ENV PYTHONPATH="${PYTHONPATH}:/app" | ||
#ENV STORAGE="S3" | ||
ENV DATABASE_URL="/app/db/botsim_sqlite_demo.db" | ||
|
||
EXPOSE 8501 | ||
ENTRYPOINT ["streamlit", "run", "/app/botsim/streamlit_app/app.py"] | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
BSD 3-Clause License | ||
|
||
Copyright (c) 2022 Salesforce, Inc. | ||
All rights reserved. | ||
|
||
Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met: | ||
|
||
1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. | ||
|
||
2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. | ||
|
||
3. Neither the name of Salesforce.com nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission. | ||
|
||
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,127 @@ | ||
<p align="center"> | ||
<br> | ||
<img src="docs/_static/logo.png" width="400"/> | ||
<br> | ||
<p> | ||
|
||
<div align="center"> | ||
<a href="https://github.com/salesforce/botsim/releases"><img alt="Latest Release" src="https://img.shields.io/github/release/salesforce/LAVIS.svg" /></a> | ||
<a href="https://opensource.salesforce.com/botsim/index.html"> | ||
<img alt="docs" src="https://github.com/salesforce/LAVIS/actions/workflows/docs.yaml/badge.svg"/> | ||
<a href="https://opensource.org/licenses/BSD-3-Clause"> | ||
<img alt="license" src="https://img.shields.io/badge/License-BSD_3--Clause-blue.svg"/> | ||
</a> | ||
</div> | ||
|
||
<div align="center"> | ||
<a href="https://arxiv.org/abs/2211.11982">System Demo Paper</a>, | ||
<a href="https://arxiv.org/abs/2211.11982">Technical Report</a>, | ||
<a href="https://salesforce-botsim.herokuapp.com/">Demo</a>, | ||
<a href="https://opensource.salesforce.com/botsim//latest/index.html">Documentation</a>, | ||
<a href="">Blog</a> | ||
</div> | ||
|
||
|
||
# BotSIM: An End-to-End Bot Simulation Toolkit for Commercial Task-Oriented Dialog Systems | ||
|
||
|
||
## Table of Contents | ||
1. [Introduction](#introduction) | ||
2. [Installation](#installation) | ||
3. [Getting Started](#getting-started) | ||
4. [Tutorials](#tutorial) | ||
4. [Documentation](#documentation) | ||
5. [System Demo Paper and Technical Report](#system-demo-paper-and-technical-report) | ||
|
||
|
||
## Introduction | ||
BotSIM is a Bot SIMulation toolkit for performing large-scale data-efficient end-to-end evaluation, diagnosis and remediation of commercial task-oriented dialog (TOD) systems to accelerate bot development and evaluation, reduce cost and time-to-market. | ||
As a modular framework, BotSIM can be extended by bot developers to support new bot platforms. As a toolkit, it offers an easy-to-use App and a suite of command line tools for bot admins or practitioners to readily perform evaluation and remediation of their bots. | ||
|
||
Key features of BotSIM include: | ||
|
||
- **Multi-stage bot evaluation**: BotSIM can be used for both pre-deployment testing and potentially post-deployment performance monitoring. | ||
- **Data-efficient dialogue generation**: Equipped with a deep network based paraphrasing model, BotSIM can generate an extensive set of test intent queries from the limited number of input intent utterances, which can be used to evaluate the bot intent model at scale. | ||
- **End-to-end bot evaluation via dialogue simulation**: Through automatic chatbot simulation, BotSIM can identify existing issues of the bot and evaluate both the natural language understanding (NLU) performance (for instance, intent or NER error rates) and the end-to-end dialogue performance such as goal completion rates. | ||
- **Bot health report dashboard**: The bot health report dashboard presents a multi-granularity top-down view of bot performance consisting of historical performance, current bot test performance and dialogue-specific performance. Together with the analytical tools, they help bot practitioners quickly identify the most urgent issues and properly plan their resources for troubleshooting. | ||
- **Easy extension to new bot platform**: BotSIM was built with a modular task-agnostic design, with multiple platform support in mind, so it can be easily extended to support new bot platforms. BotSIM currently supports [Salesforce Einstein BotBuilder](https://help.salesforce.com/s/articleView?id=sf.bots_service_intro.htm&type=5) and [Google DialogFlow CX](https://cloud.google.com/dialogflow/cx/docs/basics) | ||
|
||
## Installation | ||
|
||
1. (Optional) Creating conda environment | ||
```bash | ||
conda create -n botsim python=3.9 | ||
conda activate botsim | ||
``` | ||
|
||
2. Cloning and building dependencies | ||
``` bash | ||
git clone https://github.com/salesforce/botsim.git | ||
cd BotSIM | ||
pip install . | ||
``` | ||
|
||
## Getting Started | ||
### Streamlit Web App | ||
The Streamlit Web App can be used to | ||
<p align="center" width="100%"> | ||
<img width="100%" src="docs/BotSIM_App.png"> | ||
</p> | ||
|
||
The following commands can be used to run BotSIM as a Streamlit Web App locally: | ||
```bash | ||
export PYTHONPATH=./:$PYTHONPATH | ||
export DATABASE_URL="db/botsim_sqlite_demo.db" | ||
streamlit run botsim/streamlit_app/app.py | ||
``` | ||
The App can also be deplpyed as a docker image: | ||
``` | ||
# build the docker image | ||
docker build -t botsim-streamlit . | ||
# run the docker container | ||
docker run -p 8501:8501 botsim-streamlit | ||
``` | ||
### Command Line Tools | ||
Alternatively, users can also use the command line tools to deep-dive into BotSIM's generation-simulation-remediation pipeline. | ||
|
||
## Tutorial | ||
We provide the following tutorials in the tutorial section of the documentation. | ||
- Using Streamlit Web App | ||
- Using BotSIM command line tools | ||
- Navigating through bot health dashboard | ||
- Applying remedidation suggestions | ||
|
||
## Documentation | ||
For more details of the system components and advanced usages, please refer to [code documentation]((https://opensource.salesforce.com/botsim//latest/index.html#)]). | ||
We welcome the contribution from the open-source community to improve the toolkit! To support new bot platforms, please also follow the guidelines detailed in the code documentation. | ||
|
||
## System Demo Paper and Technical Report | ||
You can find more details in our technical report and system demo paper. | ||
If you're using BotSIM in your research or applications, please cite using this BibTeX for technical report: | ||
``` | ||
@article{guangsen2022-botsim-tr, | ||
author = {Guangsen Wang and Shafiq Joty and Junnan Li and Steven Hoi}, | ||
title = {BotSIM: An End-to-End Bot Simulation Toolkit for Commercial Task-Oriented Dialog Systems}, | ||
year = {2022}, | ||
doi = {}, | ||
url = {}, | ||
archivePrefix = {arXiv}, | ||
} | ||
``` | ||
or the following BibTex for our system demo paper: | ||
``` | ||
@article{guangsen2022-botsim-demo, | ||
author = {Guangsen Wang and Samson Tan and Shafqi Joty and Guang Wu and Jimmy Au and Steven Hoi}, | ||
title = {BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems}, | ||
year = {2022}, | ||
doi = {}, | ||
url = {}, | ||
archivePrefix = {arXiv}, | ||
} | ||
``` | ||
|
||
## Contact Us | ||
Feel free to contact botsim@salesforce.com for any comments, issues or suggestions. | ||
|
||
## License | ||
[BSD 3-Clause License](LICENSE.txt) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
# Copyright (c) 2022, salesforce.com, inc. | ||
# All rights reserved. | ||
# SPDX-License-Identifier: BSD-3-Clause | ||
# For full license text, see the LICENSE file in the repo root or https://opensource.org/licenses/BSD-3-Clause | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
# Copyright (c) 2022, salesforce.com, inc. | ||
# All rights reserved. | ||
# SPDX-License-Identifier: BSD-3-Clause | ||
# For full license text, see the LICENSE file in the repo root or https://opensource.org/licenses/BSD-3-Clause | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
"""Get the version.""" | ||
|
||
# Copyright (c) 2022, salesforce.com, inc. | ||
# All rights reserved. | ||
# SPDX-License-Identifier: BSD-3-Clause | ||
# For full license text, see the LICENSE file in the repo root or https://opensource.org/licenses/BSD-3-Clause | ||
|
||
# Third party | ||
import pkg_resources | ||
|
||
try: | ||
__version__ = pkg_resources.get_distribution("clana").version | ||
except pkg_resources.DistributionNotFound: | ||
__version__ = "not installed" |
Oops, something went wrong.