Desktop AI Companion

Turn your computer into a teammate that works beside you

CoView runs a visual control loop: observe the screen, understand the task, take one action, then observe again. It can keep real work moving across browsers, editors, documents, and desktop apps instead of staying inside a chat box.

Observe: Understand the current screen, foreground app, and multi-display setup
Operate: Click, scroll, use shortcuts, type, and read web pages or documents
Collaborate: Bring text, voice, wake words, and background Code Agents into one entry point
CoView screenshot
👁️
Real desktop awareness
Screen, foreground app, multiple displays
🧠
Feedback-driven execution
Act, observe, and plan the next step

This is not a concept demo. It is CoView working continuously on a real desktop.

From waking the companion and understanding the current interface to clicking, typing, moving the task forward, and reporting progress, this demo shows how CoView connects seeing with doing.

Real workflows: Actions happen directly inside desktop apps, not in a sandbox or a scripted mockup.
Continuous task progress: CoView observes again after every step before deciding what to do next.
Speak, see, and act: The floating panel, voice collaboration, and background Agents work together in one experience.
Looping demo · Real software recording

CoView stands out because it actually joins the work

It connects visual understanding, desktop operation, voice interaction, and background Agents into a continuous workflow, so AI does more than answer: it helps carry the task forward.

🖥️

See the real desktop

CoView observes screenshots, foreground apps, and multi-display context so it can enter the interface you are already using.

⌨️

Operate your computer directly

It can click, drag, scroll, press shortcuts, enter text, and read web pages or documents without constant manual switching.

🪟

Collaborate from one companion window

Task input, stop controls, live logs, suggestions, and final reports all live inside one floating workspace.

🔁

A feedback-driven execution loop

CoView is not built around one-off answers. Its core loop is simple: observe the screen, understand the task, take one action, and observe again. When pages load, interfaces change, or a task is interrupted, it can decide whether to continue, ask for confirmation, or switch strategy.
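The decision at the end of each cycle can be illustrated with a small Python sketch. The observation fields and rules below are assumptions chosen for illustration, not CoView's actual logic:

```python
# Hypothetical sketch of the decision a feedback-driven loop makes after each
# observation. Fields and rules are illustrative, not CoView's real API.
from dataclasses import dataclass

@dataclass
class Observation:
    screen_changed: bool      # did the UI change since the last action?
    needs_confirmation: bool  # e.g. a dialog asking to delete files
    action_failed: bool       # the last click or keystroke had no effect

def next_step(obs: Observation) -> str:
    """Pick the loop's next move: continue, ask the user, or re-plan."""
    if obs.needs_confirmation:
        return "ask_user"          # pause and hand control back
    if obs.action_failed or not obs.screen_changed:
        return "switch_strategy"   # the current plan is not working
    return "continue"              # the task is moving; take the next action
```

For example, an observation where the screen changed and nothing failed yields `"continue"`, while a pending confirmation dialog always wins and yields `"ask_user"`.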

CoView core workflow

How it keeps tasks moving inside real software

Instead of asking you to write a long prompt first, CoView understands the current environment and advances the task through step-by-step desktop actions.

1

Observe the current environment

Understand the screen, foreground app, and context without making you explain everything from scratch.

2

Understand the task intent

Use model reasoning, history, and safety boundaries to choose the best next action.

3

Take one concrete action

Click, type, scroll, use shortcuts, or read content so the task actually moves forward.

4

Report back and re-plan

Use interface changes to decide whether to continue, pause for confirmation, or choose a new path.
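The four steps above can be sketched as one small control loop. This is an illustrative Python sketch of the pattern, not CoView's implementation; `observe`, `plan`, and `act` are stand-in callables supplied by the caller:

```python
# Illustrative observe -> understand -> act -> report loop mirroring the four
# steps above. All names are stand-ins, not CoView internals.

def run_task(task, observe, plan, act, max_steps=10):
    """Advance a task one concrete action at a time, re-observing after each."""
    history = []
    for _ in range(max_steps):
        state = observe()                    # 1. observe the current environment
        action = plan(task, state, history)  # 2. choose the best next action
        if action is None:                   # planner decided the task is done
            break
        result = act(action)                 # 3. take one concrete action
        history.append((action, result))     # 4. record it, then re-plan
    return history
```

The `max_steps` cap is one simple way to keep such a loop from running away; a real system would also honor stop controls and confirmation policies.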

CoView task execution flow

A complete capability set from desktop actions to Agent handoff

CoView offers a GUI, voice entry, background Code Agents, CLI access, and a Python API, making it useful for both personal workflows and business integration.

🧭

Desktop-level context

Combine the current screen, foreground app, and task history so you do not have to repeat where you are or what you need.

🪄

Task input and state control

Enter tasks, review progress, pause execution, and accept suggestions directly in the floating companion.

📄

Web and document reading

Read current pages and documents, extract key information, and turn it into the basis for the next action.

📊

Progress reports

Execution is not a black box: CoView keeps reporting progress, completed actions, and final outcomes.

🖱️

Click, drag, and scroll

Perform mouse-level actions inside real desktop apps without making you take over each step.

⌨️

Shortcuts and text input

Use key combinations, enter text, and fill out forms to keep office work flowing.

🖥️

Cross-app workflows

Connect full action chains across browsers, editors, document tools, and desktop applications.

🛰️

Multi-display support

Understand display metadata across multiple monitors, matching real work setups more closely.

🎙️

Local wake words

Use local wake words such as "ni hao xiao tong" or "hey Lucy" to enter a natural desktop collaboration flow.

🧹

Realtime ASR and VAD

Voice input passes through VAD, optional echo cancellation, and realtime transcription for speak-while-working scenarios.
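As a simplified illustration of the VAD stage, here is a minimal energy-based gate in Python. Production VAD uses trained models and proper audio framing; the threshold below is an arbitrary assumption for the sketch:

```python
# Toy energy-based voice-activity gate: a simplified stand-in for the VAD
# stage, which decides which audio frames are worth sending to the ASR engine.
import math

def is_speech(frame, threshold=0.02):
    """Return True when a frame of samples (floats in [-1, 1]) carries voice energy."""
    rms = math.sqrt(sum(s * s for s in frame) / len(frame))
    return rms >= threshold

def gate(frames, threshold=0.02):
    """Keep only frames loud enough to transcribe; drop silence."""
    return [f for f in frames if is_speech(f, threshold)]
```

Dropping silent frames before transcription is what makes speak-while-working practical: the ASR engine only sees audio that plausibly contains speech.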

🗣️

Interruptions during execution

While tasks and voice playback are running, CoView can recognize intents such as `stop`, `new_task`, and `ignore`.
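The three-way decision between those intents can be illustrated with a toy classifier. CoView's recognizer is model-based; this keyword sketch, with invented word lists, only shows the shape of the decision:

```python
# Toy keyword classifier for the interruption intents named above
# (stop, new_task, ignore). Word lists are illustrative assumptions.

STOP_WORDS = {"stop", "cancel", "halt"}
TASK_WORDS = {"open", "write", "search", "summarize"}

def classify_interrupt(utterance: str) -> str:
    """Map a transcribed utterance to an interruption intent."""
    words = set(utterance.lower().split())
    if words & STOP_WORDS:
        return "stop"        # abort the current job
    if words & TASK_WORDS:
        return "new_task"    # queue a fresh task
    return "ignore"          # background chatter; keep working
```

The `ignore` fallback matters most in practice: speech that is neither a stop request nor a new task should not derail a running job.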

🔊

TTS result playback

Progress and final results can be spoken aloud, making the computer feel more like a working partner.

🤖

Background Code Agents

Complex work can be handed off to background Agents so the foreground workflow is not blocked.

🌐

Multiple model providers

Connect OpenAI-compatible model services and choose providers flexibly for different task types.

🧩

GUI, CLI, and Python API

Use the graphical app directly, or integrate CoView into command-line and Python workflows.
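As a rough illustration of what a Python integration might look like, here is a minimal sketch. The `CompanionClient` class and its methods are invented for this example and are not CoView's actual Python API; see the user guide for the real interface:

```python
# Hypothetical client wrapper: every name here is invented for illustration
# and does not reflect CoView's real Python API.

class CompanionClient:
    def __init__(self, provider: str = "openai"):
        self.provider = provider  # which OpenAI-compatible backend to use
        self.log = []             # tasks submitted through this client

    def run(self, task: str) -> str:
        """Submit a task; a real client would drive the desktop loop here."""
        self.log.append(task)
        return f"started: {task}"

client = CompanionClient(provider="openai")
status = client.run("summarize meeting notes and draft a weekly report")
```

The point of a programmatic entry like this is that desktop tasks become callable from scripts and pipelines, not just from the floating panel.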

🛡️

Controllable safety boundaries

Confirmation policies, logs, and permission boundaries help reduce the risks of desktop automation.

CoView voice flow
🔔

Wake and collaborate

After a local wake word is detected, the floating companion appears, shows voice status, and enters the current task context.

🧠

Understand speech intent in real time

When idle, transcription can become a new task. During execution, CoView can recognize interruption intents such as `stop`, `new_task`, and `ignore`.

🧭

Interrupt, continue, and report aloud

Keep listening, stop the current job, or exit the program, and hear progress or final results through TTS playback.

Default wake words and control commands

`ni hao xiao tong` · `hey Lucy` · `stop current work` · `close program`

Send long-running work to background Agents and keep using your desktop

When a task should not block the foreground, CoView can hand code analysis, repository scans, script generation, and batch jobs to a background Code Agent while streaming logs and final results back to you.

OpenAI · Anthropic · Qwen · Gemini
```
$ coview run "summarize meeting notes and draft a weekly report"
[observe] Current apps: document editor / browser
[plan] Task split: extract points -> group themes -> fill template
[agent] Background job created: Agent-1024
[done] Weekly report draft written with operation log
```
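The foreground/background split can be sketched with a worker thread that streams log lines back through a queue. This is a generic Python pattern, not CoView's internal mechanism; `scan_repo` is a stand-in job:

```python
# Generic background-job pattern: run work on a worker thread and stream its
# log lines back over a queue so the foreground is never blocked.
import queue
import threading

def run_in_background(job, *args):
    """Start `job` on a worker thread; return (thread, queue of log lines)."""
    logs: queue.Queue = queue.Queue()

    def worker():
        job(logs.put, *args)
        logs.put(None)  # sentinel: the job has finished

    t = threading.Thread(target=worker, daemon=True)
    t.start()
    return t, logs

def scan_repo(log, files):
    """Stand-in long-running job that emits one log line per file."""
    for f in files:
        log(f"[agent] scanned {f}")

t, logs = run_in_background(scan_repo, ["main.py", "utils.py"])
lines = list(iter(logs.get, None))  # foreground drains the stream
t.join()
```

The sentinel value (`None`) tells the foreground when the stream ends, so it can collect final results without polling or guessing.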
4 steps in the visual control loop: observe, understand, act, report
3 interaction entry points: text, voice, shortcuts
2 desktop platforms: macOS and Windows
3 integration paths: GUI, CLI, Python API

Start using CoView today

Choose the installer for your system and download it directly. We recommend reading the guide before your first run.


Windows Installer

For Windows devices. Click to download `CoView-2.0.0-Windows-Setup.exe` directly.

Download for Windows

macOS Installer

For macOS devices. Click to download `CoView-2.0.0-macOS.dmg` directly.

Download for macOS
📘

User Guide

Learn how to install, grant permissions, configure settings, and use CoView effectively.

Read the Guide