DisplAI - Interactive Multimedia AI Assistant with Stand-alone UI

DisplAI is an Interactive Multimedia AI Assistant with Stand-alone UI that leverages OpenAI GPT, DALL-E, Google GTTS, Spotify, OpenWeatherMap, NewsAPI to create an immersive and engaging user experience.

The AI assistant DisplAI listens to user input, transcribes it, gets activated on specific keywords, can talk with user, play music, share real-time weather forecast or news and generate artistic images based on conversations. This repository contains the source code and an example script for implementing and using the DisplAI Assistant.

Disclaimer:

User discretion recommended. As with any generative AI application, this AI Assistant may produce factually incorrect / misleading content. Use this AI Assistant purely for entertainment and fun purposes. Do not use content generated from this software as facts or reality.

Demo (YouTube):

Library

The DisplAI library consists of several modules:

lib.viewer: the Viewer class for displaying images and animations based on the current state of the chatbot (e.g., awake, asleep, thinking, or speaking).
lib.chatbot: the ChatBot class - manages the chatbot's core functionalities such as handling conversation history, generating responses using GPT, and creating images using DALL-E.
lib.audutils: the AudioHelper class - handles audio input and output using Google speech recognition and GTTS for text-to-speech.
lib.imgutils: utility functions for working with images, such as converting images from strings to image objects.
lib.OAIWrapper: the OAIWrapper class - wraps the OpenAI API for GPT, DALL-E.
lib.action: the action function - provides news, weather or music based on user input.

Example

To use DisplAI in your own projects, simply import the necessary modules and classes from the library, and create instances of the Viewer, ChatBot, and AudioHelper classes. Then, use the provided methods to interact with the chatbot and manage the conversation.

For example:

from lib.viewer import Viewer
from lib.chatbot import ChatBot
from lib.audutils import AudioHelper

viewer = Viewer()
chatbot = ChatBot("sophia")
audio_helper = AudioHelper()

Once you have instantiated these classes, you can use their methods to handle user input, generate chatbot responses, and display images based on the conversation.

Run Sophia example

python sophia.py

and say: hi sophia, how are you?

Getting Started

Installation

To use DisplAI, you will need to install dependencies listed in requirements.txt:

pip install -r requirements.txt

Setup API Keys

To use DisplAI you will need OpenAI API key (for GPT, DALL-E). Follow the instructions on the OpenAI API documentation to obtain your API key.

Once you have your API key, add it to your environment variables. Linux example:

export OPENAI_API_KEY="your_api_key_here"

To use DisplAI with action features (weather, music, news) you will need 4 API keys:

NEWS_API_KEY: from News API
OWM_API_KEY: from OpenWeatherMap API
SPOT_CLIENT_ID: log in at Spotify Developer, create an app, go to settings, add your email id as an authorized user.
SPOT_CLIENT_SECRET: copy client id and client secret from Spotify, save them as environment variables.

Note: upon running the action script first time, Spotify will prompt the user to login on their default web browser, and paste the browser URL onto cmd / terminal.

Documentation

Viewer Class

The Viewer class is responsible for displaying images and animations on the screen based on the chatbot's current state. It is a subclass of threading.Thread and can be used as follows:

viewer = Viewer()
viewer.change_state("awake")  # or "sleep", "thinking", or "saying"
viewer.show_image("path/to/image.png")

ChatBot Class

The ChatBot class manages the chatbot's core functionalities, such as handling conversation history, generating responses using GPT, and creating images using DALL-E. Here are some of its methods:

gen_image_from_conversation(): Generates an image based on the current conversation using DALL-E.
is_awake(): Checks if the chatbot is currently awake (has been active within the last 60 seconds).
add_user_msg(user_name, user_text): Adds a user message to the conversation and updates the chatbot's awake status.
get_and_save_bot_next_msg(context_window=10): Generates and saves the chatbot's next message based on the conversation history and context.

AudioHelper Class

The AudioHelper class handles audio input and output using Whisper for speech recognition and GTTS for text-to-speech. Some of its methods include:

listen(): Listens for user input and returns an audio object.
transcript(audio): Transcribes the given audio object using Whisper's speech recognition.
say(text, lang="en", accent="us", path=''): Converts the given text to speech using GTTS and plays the resulting audio.

Contributing

We welcome contributions to the DisplAI project! If you would like to contribute, please follow these steps:

Fork the repository.
Create a new branch for your feature or bugfix.
Commit your changes to the new branch.
Submit a pull request for your branch.

Please make sure to follow the project's coding style and guidelines, and include tests and documentation for any new features or changes.

License

DisplAI is released under the non-commercial [CC BY-NC 4.0: International License](https://creativecommons.org/licenses/by-nc/4.0/). See the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
config		config
gifs		gifs
lib		lib
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
sophia.py		sophia.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DisplAI - Interactive Multimedia AI Assistant with Stand-alone UI

Library

Example

Run Sophia example

Getting Started

Installation

Setup API Keys

Documentation

Viewer Class

ChatBot Class

AudioHelper Class

Contributing

License

About

Releases

Packages

Contributors 2

Languages

License

TrNi/displAI

Folders and files

Latest commit

History

Repository files navigation

DisplAI - Interactive Multimedia AI Assistant with Stand-alone UI

Library

Example

Run Sophia example

Getting Started

Installation

Setup API Keys

Documentation

Viewer Class

ChatBot Class

AudioHelper Class

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages