The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
Updated Oct 13, 2025 - TypeScript
Your AI Operator for Web, Android, Automation & Testing.
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
Browser Operator - The AI browser with a built-in multi-agent platform. An open-source alternative to Perplexity Comet, Dia, and the Microsoft Copilot Edge browser.
A fully featured, GUI-powered local LLM agent sandbox with complete MCP support. Offers both a CLI and a full desktop environment, enabling AI agents to operate browsers, terminals, and other desktop applications just like humans. Based on E2B's open-source code.
The World's First Out-of-the-Box Computer Use Agent Powered by Gemini-CLI @openmule
Autonomous virtual computer agents at scale, fully open-source, safe, auditable, and production-ready.
AI-powered computer control for automated testing. Factifai uses vision models (Claude, GPT-4o, Gemini) to interact with applications naturally - clicking, typing, and verifying results just like a human would.
Use natural language to control your browser, powered by LLM and Playwright
Mark web pages for use with vision-language models
This is the CRUD backend for our QA test application
This is OpenAI's computer use hooked up to a Chrome extension.
ChatGPT Agent but in Cloudflare Containers
AI-powered computer control for automated testing in your CI/CD pipelines. Factifai agent uses vision models (Claude, GPT-4o) to interact with applications naturally - clicking, typing, and verifying results just like a human would.
Auto-Browse: AI Enabled Browser Automation
Build your own AI operators like OpenAI
An implementation of Anthropic's computer use in Node.js
LLM-powered computer control through local and Docker environments. Features VNC integration, automated interactions, and a chat interface for natural language system control.
A computer use chat demo that integrates the Cyberdesk SDK and AI SDK.