Skip to content

onuratakan/gpt-computer-assistant

Repository files navigation

Logo
Logo

GPT Computer Assistant

Static Badge

gpt-4o for windows, macos and ubuntu
Documentation . Explore the capabilities »


Made_with_python . pypi_downloads

Static Badge

|ENGLISH|简体中文|正體中文

GPT Computer Assistant

Hi, this is an alternative work for providing ChatGPT MacOS app to Windows and Linux. In this way this is a fresh and stable work. You can easily install as Python library for this time but we will prepare a pipeline for providing native install scripts (.exe).

Powered by Upsonic Tiger 🐅 A function hub for llm agents.

Installation and Run

Needed >= Python 3.9

pip3 install 'gpt-computer-assistant[base]'
computerassistant

Wake Word | NEW

We have added Pvporcupine integration. To use it, you need to install an additional library:

pip3 install 'gpt-computer-assistant[wakeword]'

After that, please enter your Pvporcupine API key and enable the wake word feature.




Logo

Agent Infrastructure

With this way you can create crewai agents and using it into gpt-computer-assistant gui and tools.

pip3 install 'gpt-computer-assistant[base]'
pip3 install 'gpt-computer-assistant[agentic]'
from gpt_computer_assistant import Agent, start

manager = Agent(
  role='Project Manager',
  goal='understands project needs and assist coder',
  backstory="""You're a manager at a large company.""",
)

coder = Agent(
  role='Senior Python Coder',
  goal='writing python scripts and copying to clipboard',
  backstory="""You're a python developer at a large company.""",
)


start()

Logo

Adding Custom Tools

Now you are able to add custom tools that run in the agentic infra and assistant processes.

from gpt_computer_assistant import Tool, start

@Tool
def sum_tool(first_number: int, second_number: int) -> str:
    """Useful for when you need to sum two numbers together."""
    return first_number + second_number

start()

Logo

API | NEW

Now you can use your GPT Computer Assistant remotely! GUI still active, for this there is few steps:

pip3 install 'gpt-computer-assistant[base]'
pip3 install 'gpt-computer-assistant[api]'
computerassistant --api
from gpt_computer_assistant.remote import remote

output = remote.input("Hi, how are you today?", screen=False, talk=False)
print(output)

remote.just_screenshot()

remote.talk("TTS test")

# Other Functionalities
remote.reset_memory()
remote.profile("default")

remote.enable_predefined_agents()
remote.disable_predefined_agents()

remote.enable_online_tools()
remote.disable_online_tools()









demo_out.mp4

Usage

options

Use cases

Take Meeting Notes Daily Assistant
Read Docs Coding Assistant

Roadmap

Feature Status Target Release
Clear Chat History Completed Q2 2024
Long Audios Support (Split 20mb) Completed Q2 2024
Text Inputs Completed Q2 2024
Just Text Mode (Mute Speech) Completed Q2 2024
Added profiles (Different Chats) Completed Q2 2024
More Feedback About Assistant Status Completed Q2 2024
Local Model Vision and Text (With Ollama, and vision models) Completed Q2 2024
Our Customizable Agent Infrastructure Completed Q2 2024
Supporting Groq Models Completed Q2 2024
Adding Custom Tools Completed Q2 2024
Click on something on the screen (text and icon) Completed Q2 2024
New UI Completed Q2 2024
Native Applications, exe, dmg Failed (Agentic Infra libraries not supported for now) Q2 2024
Collaborated Speaking Different Voice Models on long responses. Completed Q2 2024
Auto Stop Recording, when you complate talking Completed Q2 2024
Wakeup Word Completed Q2 2024
Continuously Conversations Completed Q2 2024
Adding more capability on device Planned Q2 2024
DeepFace Integration (Facial Recognition) Planned Q2 2024

Capabilities

At this time we have many infrastructure elements. We just aim to provide whole things that already in ChatGPT app.

Capability Status
Screen Read OK
Click to and Text or Icon in the screen OK
Move to and Text or Icon in the screen OK
Typing Something OK
Pressing to Any Key OK
Scrolling OK
Microphone OK
System Audio OK
Memory OK
Open and Close App OK
Open a URL OK
Clipboard OK
Search Engines OK
Writing and running Python OK
Writing and running SH OK
Using your Telegram Account OK
Knowledge Management OK
Add more tool ?

Predefined Agents

If you enable it your assistant will work with these teams:

Team Name Status
search_on_internet_and_report_team OK
generate_code_with_aim_team_ OK
Add your own one ?
Logo

Contributors