⌘ Command ⊞ Windows

CommandWindows is a desktop operating system copilot based on multi-modal large language models. It supports any platform that has application windows.

Supported Models

  • GPT4 Vision from OpenAI (gpt-4-vision-preview)
  • Gemini Pro Vision from Google (gemini-pro-vision)
  • Gemini Nano/Ultra Vision from Google
  • Local Vision Model
    • Vary-toy
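As a rough illustration of what "multi-modal" means for the hosted models above, the sketch below packages a text prompt and a screenshot into a request body in the shape the OpenAI Chat Completions API accepts for image input. It is not taken from this repository: the helper name `buildVisionRequest`, the `max_tokens` value, and the placeholder image bytes are assumptions made for the example, and the project's actual request code may look quite different.

```javascript
// Hypothetical helper, NOT part of this repository: builds a Chat Completions
// request body that pairs a user prompt with a PNG screenshot (base64).
function buildVisionRequest(prompt, screenshotBase64, model = "gpt-4-vision-preview") {
  if (typeof prompt !== "string" || prompt.length === 0) {
    throw new TypeError("prompt must be a non-empty string");
  }
  if (typeof screenshotBase64 !== "string" || screenshotBase64.length === 0) {
    throw new TypeError("screenshotBase64 must be a non-empty base64 string");
  }
  return {
    model,
    max_tokens: 300, // assumed limit, chosen only for this example
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: prompt },
          {
            // The image travels inline as a data URL.
            type: "image_url",
            image_url: { url: `data:image/png;base64,${screenshotBase64}` },
          },
        ],
      },
    ],
  };
}

// Usage with placeholder bytes standing in for a real screenshot.
const body = buildVisionRequest(
  "Help me create a Google doc",
  Buffer.from("placeholder-bytes").toString("base64")
);
console.log(JSON.stringify(body, null, 2));
```

The function only builds the payload, so it can be inspected without an API key; sending it would still require an authenticated POST to the provider's endpoint.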

Features

Preview:

Help me create a Google doc and write the definition of Blockchain on that

CommandWin Test Video

  • A digital assistant rather than just software
  • Use multi-modal perception to help you operate your computer
  • Step-by-step notifications
  • Testing on more platforms
  • More detailed device information sent to the LLM
  • Enhanced chat experience with better reply content
  • More interactive operation
  • Convenient shortcuts

How to use

No official release is available yet because the project is still experimental, but you can try the tool by cloning the repository and running it on your system:

git clone https://github.com/c0mm4nd/command-windows
cd command-windows
npm i 
npm run start

Pre-built releases will be available soon!

How to build

Simply run

npm run make

The build output is placed in the make folder.

Status

Inspired by SOC, but written in JavaScript with Electron.

Currently, this project is

  • in active development and experimental, so not suitable for production use
  • open to issues and pull requests of any kind!