Agent RDP

Give AI agents the ability to see and control remote Windows desktops via RDP — through computer use (screenshot + click) or accessibility-based interaction.

typescriptrustrdpai-agentscomputer-use
github.com/thisnick/agent-rdp

What it does

Agent RDP is a CLI tool that lets AI agents connect to Windows machines over RDP and interact with them through two modes:

It bridges the gap between language model tool-use and GUI-based desktop applications that have no API.

Why it exists

A surprising amount of enterprise software only has a GUI. No API, no CLI, no webhooks. If you want to automate it, you either reverse-engineer the internals or you drive the interface. Agent RDP takes the second approach and makes it available to any agent framework that supports tool calling.

How it works

The system uses a daemon-per-session architecture: the CLI communicates with persistent background daemons via IPC. Sessions are named and independently managed. Responses are JSON-structured for AI agent consumption, with WebSocket streaming for real-time desktop capture.

Beyond the basic screenshot → decide → act loop, it also supports:

Technical details

Status

Active development. Used in production for automating legacy Windows applications that resist every other form of integration.