zer0 🤖

zer0 is an AI-powered browser extension that acts as a bridge between your thoughts and actions. Using advanced language models (OpenAI/Azure OpenAI), zer0 can understand natural language instructions and autonomously perform tasks in your browser.

🌟 Features

AI-Powered Automation: Execute complex browser tasks using natural language commands
Intelligent DOM Interaction: Automatically clicks, fills forms, and navigates websites
Task History Tracking: Monitor and review all actions performed by the agent
Voice Input Support: Control the extension using voice commands
Multi-Model Support: Compatible with OpenAI and Azure OpenAI services
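Privacy-Focused: Page analysis and action execution run locally in your browser; prompts are sent only to the OpenAI or Azure OpenAI endpoint you configure with your own API key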

🛠️ Technologies Used

Frontend: React 18, TypeScript, Chakra UI
State Management: Zustand with Immer
AI Integration: OpenAI API, Azure OpenAI
Build Tools: Webpack, Babel
Testing: Jest
Extension Platform: Chrome Extension Manifest V3

📂 Repository Structure

```
browser-extension-master/
├── src/
│   ├── common/              # Reusable React components
│   │   ├── App.tsx
│   │   ├── TaskUI.tsx
│   │   ├── TaskHistory.tsx
│   │   └── ModelDropdown.tsx
│   ├── helpers/             # Core logic and utilities
│   │   ├── availableActions.ts      # Defines browser actions
│   │   ├── determineNextAction.ts   # AI decision logic
│   │   ├── domActions.ts            # DOM manipulation
│   │   ├── parseResponse.ts         # AI response parsing
│   │   └── simplifyDom.ts           # DOM simplification
│   ├── pages/               # Extension pages
│   │   ├── Background/      # Service worker
│   │   ├── Content/         # Content script
│   │   ├── Popup/           # Extension popup
│   │   ├── Options/         # Settings page
│   │   └── Panel/           # DevTools panel
│   ├── state/               # Zustand state stores
│   │   ├── currentTask.ts
│   │   ├── settings.ts
│   │   └── ui.ts
│   └── manifest.json        # Extension manifest
├── utils/                   # Build scripts
├── package.json
└── webpack.config.js
```

👨‍💻 Team Members and Contributions

1. Avnish

Led the architecture and development of the AI agent system
Implemented OpenAI/Azure OpenAI integration
Designed the task execution pipeline

2. Armann

Developed the DOM interaction and manipulation system
Built the content script injection mechanism
Optimized the browser automation engine

3. Utkarsh

Implemented testing framework and wrote unit tests
Focused on debugging and error handling
Enhanced the deployment and build pipeline

4. Agam

Created the UI components and user interface
Developed the responsive layout using Chakra UI
Implemented the task history and status displays

🚀 Installation & Setup

Prerequisites

Node.js (v14 or higher)
pnpm (recommended) or npm
A Chromium-based browser (Chrome, Edge, Brave, etc.)
OpenAI API key or Azure OpenAI credentials

Installation Steps

Clone the repository

```
git clone https://github.com/avnishsinha/Zero-.git
cd Zero-/browser-extension-master
```

Install dependencies
```
pnpm install
# or
npm install
```
Build the extension
```
pnpm build
# or
npm run build
```
Load the extension in Chrome
- Open Chrome and navigate to chrome://extensions/
- Enable "Developer mode" (toggle in top-right corner)
- Click "Load unpacked"
- Select the build folder from the project directory
Configure API credentials
- Click the extension icon in your browser toolbar
- Go to Options/Settings
- Enter your OpenAI API key or Azure OpenAI credentials
- Select your preferred model
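
To illustrate why the two providers need different credentials, here is a minimal sketch of how one chat completion request could be assembled for each. The `ProviderSettings` shape and the `buildChatRequest` helper are hypothetical and are not taken from zer0's code, which may well use an SDK instead; only the endpoint URL patterns and header names follow the public OpenAI and Azure OpenAI REST conventions.

```typescript
// Hypothetical settings shape; the real store in src/state/settings.ts may differ.
type ProviderSettings =
  | { provider: "openai"; apiKey: string; model: string }
  | {
      provider: "azure";
      apiKey: string;
      resource: string; // Azure resource name (assumed field)
      deployment: string; // Azure deployment name (assumed field)
      apiVersion: string; // API version string supplied by the user
    };

interface ChatRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

// Builds the HTTP request for one chat completion call without sending it.
function buildChatRequest(settings: ProviderSettings, prompt: string): ChatRequest {
  const messages = [{ role: "user", content: prompt }];

  if (settings.provider === "openai") {
    // OpenAI selects the model in the request body and uses a Bearer token.
    return {
      url: "https://api.openai.com/v1/chat/completions",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${settings.apiKey}`,
      },
      body: JSON.stringify({ model: settings.model, messages }),
    };
  }

  // Azure OpenAI addresses a deployment in the URL and uses an api-key header.
  const { resource, deployment, apiVersion } = settings;
  return {
    url:
      `https://${resource}.openai.azure.com/openai/deployments/` +
      `${deployment}/chat/completions?api-version=${encodeURIComponent(apiVersion)}`,
    headers: { "Content-Type": "application/json", "api-key": settings.apiKey },
    body: JSON.stringify({ messages }),
  };
}
```

The returned object can be passed to `fetch(request.url, { method: "POST", headers: request.headers, body: request.body })`. Keeping the builder free of network calls makes it easy to check the provider-specific parts in isolation.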

💡 Usage

Activate the extension
- Use the keyboard shortcut: Ctrl+Shift+Y (Windows/Linux) or Cmd+Shift+Y (Mac)
- Or click the extension icon in your toolbar
Enter a task
- Type a natural language instruction (e.g., "Fill out this form with my information")
- Or use voice input to describe the task
Watch zer0 work
- The AI agent will analyze the page
- Execute actions step-by-step
- Display progress in the task history
Review results
- Check the task history to see all actions performed
- Review any errors or completion messages
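
The analyze, act, and record cycle described above can be sketched as a small loop. This is an illustration under stated assumptions, not zer0's actual implementation: `runTask`, `AgentDeps`, the `elementId` field, and the step limit are all hypothetical names that only mirror the roles of `simplifyDom.ts`, `determineNextAction.ts`, and `domActions.ts`.

```typescript
// All names here are hypothetical; they only mirror the roles of the helper files.
type AgentAction =
  | { name: "click"; elementId: number }
  | { name: "setValue"; elementId: number; value: string }
  | { name: "finish" }
  | { name: "fail" };

interface HistoryEntry {
  step: number;
  action: AgentAction;
}

type TaskStatus = "finished" | "failed" | "step-limit";

interface AgentDeps {
  // Returns a compact text view of the current page (role of simplifyDom.ts).
  readPage: () => Promise<string>;
  // Asks the model for the next action (role of determineNextAction.ts).
  decide: (task: string, page: string, history: HistoryEntry[]) => Promise<AgentAction>;
  // Performs a click or setValue on the page (role of domActions.ts).
  perform: (action: AgentAction) => Promise<void>;
}

async function runTask(
  task: string,
  deps: AgentDeps,
  maxSteps = 10 // assumed safety cap so a confused model cannot loop forever
): Promise<{ status: TaskStatus; history: HistoryEntry[] }> {
  const history: HistoryEntry[] = [];
  for (let step = 1; step <= maxSteps; step++) {
    const page = await deps.readPage();
    const action = await deps.decide(task, page, history);
    history.push({ step, action }); // every decision is recorded, including the last one
    if (action.name === "finish") return { status: "finished", history };
    if (action.name === "fail") return { status: "failed", history };
    await deps.perform(action);
  }
  return { status: "step-limit", history };
}
```

Passing the page reader, the model call, and the DOM executor in as functions keeps the loop independent of the browser, so it can be exercised with scripted actions in a unit test.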

Example Tasks

"Click the login button"
"Fill in the email field with example@email.com"
"Navigate to the pricing page"
"Search for AI tools on this website"

🧪 Development

Run in development mode

```
pnpm start
# or
npm start
```

Run tests

```
pnpm test
# or
npm test
```

Code formatting

```
pnpm prettier
# or
npm run prettier
```

Linting

```
pnpm lint
# or
npm run lint
```

🔧 Available Actions

The AI agent can perform the following actions on web pages:

click: Click on any element
setValue: Fill input fields with text
finish: Mark the task as complete
fail: Indicate inability to complete the task

More actions can be added by extending the availableActions array (see src/helpers/availableActions.ts).
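
As a rough sketch of what such a registry might look like, the block below declares the four listed actions and validates a model reply against them. The `ActionSpec` fields, the `elementId` argument, and the function-call syntax such as `click(12)` are assumptions made for illustration; the real array and the parser in `parseResponse.ts` may be shaped differently.

```typescript
// Hypothetical shape; the real array in the codebase may use different fields.
interface ActionSpec {
  name: string;
  description: string;
  args: { name: string; type: "number" | "string" }[];
}

const availableActions: ActionSpec[] = [
  { name: "click", description: "Click on any element", args: [{ name: "elementId", type: "number" }] },
  {
    name: "setValue",
    description: "Fill input fields with text",
    args: [
      { name: "elementId", type: "number" },
      { name: "value", type: "string" },
    ],
  },
  { name: "finish", description: "Mark the task as complete", args: [] },
  { name: "fail", description: "Indicate inability to complete the task", args: [] },
];

type ParsedCall =
  | { ok: true; name: string; args: (number | string)[] }
  | { ok: false; error: string };

// Parses text such as `setValue(5, "hello")` and checks it against the registry.
// The call syntax is an assumption, not zer0's documented response format.
function parseActionCall(text: string, specs: ActionSpec[] = availableActions): ParsedCall {
  const match = /^(\w+)\(([\s\S]*)\)$/.exec(text.trim());
  if (!match) return { ok: false, error: "not a function-style call" };
  const [, name, rawArgs] = match;

  const spec = specs.find((s) => s.name === name);
  if (!spec) return { ok: false, error: `unknown action: ${name}` };

  let args: unknown;
  try {
    args = JSON.parse(`[${rawArgs}]`); // arguments are read as JSON literals
  } catch {
    return { ok: false, error: "arguments are not valid JSON literals" };
  }
  if (!Array.isArray(args) || args.length !== spec.args.length) {
    return { ok: false, error: `expected ${spec.args.length} argument(s)` };
  }
  for (let i = 0; i < spec.args.length; i++) {
    if (typeof args[i] !== spec.args[i].type) {
      return { ok: false, error: `argument ${spec.args[i].name} must be a ${spec.args[i].type}` };
    }
  }
  return { ok: true, name, args: args as (number | string)[] };
}
```

With this layout, adding an action means appending one more `ActionSpec` entry; the parser picks it up without further changes because validation is driven by the array.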

🌟 Future Enhancements

Add more browser actions (hover, scroll, drag-and-drop)
Support for Firefox and Safari
Enhanced error recovery and retry logic
Local model support (Ollama, LM Studio)
Multi-step task planning and execution
Screenshot and vision capabilities
Task templates and saved workflows
Collaborative task sharing

⚠️ Privacy & Security

Data Privacy: All AI processing happens through your API keys; we don't store or access your data
API Keys: Your API keys are stored locally in the browser's secure storage
Permissions: The extension requires broad permissions to interact with web pages as requested
Review Actions: Always review the task history to understand what actions were performed

🤝 Contributing

Contributions are welcome! Please feel free to submit issues or pull requests.

Fork the repository
Create a feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📧 Contact

For inquiries or collaboration:

Email: aks526@nau.edu
LinkedIn: https://www.linkedin.com/in/avnishkumarsinha/

📜 License

This project is licensed under the MIT License.

🙏 Acknowledgments

OpenAI and Azure OpenAI for providing the AI capabilities
The Chrome Extensions community for excellent documentation
All contributors and testers who helped improve zer0

zer0 - Bridging the gap between your thoughts and browser actions 🚀
