CommandWindows is a desktop operating system copilot based on multi-modal large language models, supporting every platform that has application windows.

c0mm4nd/command-windows

⌘ Command ⊞ Windows

CommandWindows is a desktop operating system copilot based on multi-modal large language models, supporting every platform that has application windows.

Supported Models

  • GPT4 Vision from OpenAI (gpt-4-vision-preview)
  • Gemini Pro Vision from Google (gemini-pro-vision)
  • Gemini Nano/Ultra Vision from Google
  • Local Vision Model
    • Vary-toy
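
As a rough illustration of what sending a screenshot to one of these models can look like, the sketch below builds a request body in the image-input format of the OpenAI Chat Completions API, using the `gpt-4-vision-preview` identifier from the list above. The helper name `buildVisionRequest`, the `max_tokens` value, and the stand-in buffer are assumptions made for this example; it is not taken from the CommandWindows source and may differ from how the project actually calls the model.

```javascript
// Hypothetical helper, not part of the CommandWindows codebase.
// Builds a Chat Completions request body that pairs a text instruction
// with a PNG screenshot encoded as a base64 data URL.
function buildVisionRequest(instruction, screenshotPng, model = 'gpt-4-vision-preview') {
  const dataUrl = `data:image/png;base64,${screenshotPng.toString('base64')}`;
  return {
    model,
    max_tokens: 512, // arbitrary cap chosen for this example
    messages: [
      {
        role: 'user',
        content: [
          { type: 'text', text: instruction },
          { type: 'image_url', image_url: { url: dataUrl } },
        ],
      },
    ],
  };
}

// Usage with a stand-in buffer instead of a real screenshot.
const body = buildVisionRequest('Open the browser', Buffer.from('fake-png-bytes'));
console.log(JSON.stringify(body, null, 2));
```

The function only assembles the payload, so it can be inspected or tested without an API key; sending it would still require an HTTP call with valid credentials.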

Features

Preview:

Help me create a Google doc and write the definition of Blockchain on that

CommandWin Test Video

  • A digital assistant rather than just software
  • Use multi-modal perception to help you operate your computer
  • Step-by-step notifications
  • Testing on more platforms
  • More detailed device information passed to the LLM
  • Enhanced chat experience with better reply content
  • More interactive operation
  • Convenient shortcuts
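
One way to picture how perception and step-by-step notifications could fit together is an observe, decide, act loop. The sketch below is purely illustrative: `runTask`, `capture`, `decide`, `act`, `notify`, and the action shape are all hypothetical names invented for this example, and the stubs stand in for real screen capture, model calls, and input control. It does not describe the project's actual architecture.

```javascript
// Illustrative only: every name here is hypothetical.
// Repeats "capture screen -> ask model for next action -> notify -> act"
// until the model reports the task is done or the step limit is reached.
async function runTask(task, { capture, decide, act, notify }, maxSteps = 10) {
  const history = [];
  for (let step = 1; step <= maxSteps; step++) {
    const screenshot = await capture();
    const action = await decide(task, screenshot, history);
    await notify(`Step ${step}: ${action.description}`);
    if (action.type === 'done') return history;
    await act(action);
    history.push(action);
  }
  throw new Error(`Task not finished after ${maxSteps} steps`);
}

// Stubbed dependencies so the loop can run without a screen or a model.
const notifications = [];
const scripted = [
  { type: 'click', description: 'click the New Document button' },
  { type: 'type', description: 'type the definition' },
  { type: 'done', description: 'task complete' },
];
const demo = runTask('Write a definition', {
  capture: async () => Buffer.alloc(0),
  decide: async (_task, _screenshot, history) => scripted[history.length],
  act: async () => {},
  notify: async (message) => { notifications.push(message); },
});
demo.then((history) => console.log(notifications, history.length));
```

Passing the dependencies in as functions keeps the loop independent of any particular model or platform, which is one possible way to approach the multi-model, multi-platform goal described above.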

How to use

An official release is not yet available because the project is still experimental, but you can try the tool by cloning the repository and running it on your system.

git clone https://github.com/c0mm4nd/command-windows
cd command-windows
npm i 
npm run start

The pre-built releases will be available soon!

How to build

Simply run

npm run make

The built file is inside the make folder

Status

Inspired by SOC, but written in JavaScript with Electron.

Currently, this project is

  • actively in development and experimental, not suitable for production use
  • welcoming any kind of issues and pull requests!
