LogoCua Documentation

Quickstart (for Developers)

Get started with cua in 5 steps

Get up and running with cua in 5 simple steps.

Introduction

cua combines Computer (interface) + Agent (AI) for automating desktop apps. Computer handles clicks/typing, Agent provides the intelligence.

Create Your First cua Container

  1. Go to trycua.com/signin
  2. Navigate to Dashboard > Containers > Create Instance
  3. Create a Medium, Ubuntu 22 container
  4. Note your container name and API key

Install cua

bash pip install "cua-agent[all]" cua-computer

bash npm install @trycua/computer

Using Computer

from computer import Computer

async with Computer(
    os_type="linux",
    provider_type="cloud",
    name="your-container-name",
    api_key="your-api-key"
) as computer:
    # Take screenshot
    screenshot = await computer.interface.screenshot()

    # Click and type
    await computer.interface.left_click(100, 100)
    await computer.interface.type("Hello!")
import { Computer, OSType } from '@trycua/computer';

const computer = new Computer({
  osType: OSType.LINUX,
  name: "your-container-name",
  apiKey: "your-api-key"
});

await computer.run();

try {
  // Take screenshot
  const screenshot = await computer.interface.screenshot();

  // Click and type
  await computer.interface.leftClick(100, 100);
  await computer.interface.typeText("Hello!");
} finally {
  await computer.close();
}

Using Agent

from agent import ComputerAgent

agent = ComputerAgent(
    model="anthropic/claude-3-5-sonnet-20241022",
    tools=[computer],
    max_trajectory_budget=5.0
)

messages = [{"role": "user", "content": "Take a screenshot and tell me what you see"}]

async for result in agent.run(messages):
    for item in result["output"]:
        if item["type"] == "message":
            print(item["content"][0]["text"])

Next Steps