v3.0.0-beta4 · macOS Tool · Updated Jan 2026

Peekaboo

Eyes & Hands for Your AI Agent

Peekaboo is a macOS automation tool that connects AI agents to the desktop. It captures pixel-accurate screenshots, analyzes UI elements with AI vision models, and performs GUI actions such as clicks, typing, and scrolling, all through a CLI or an MCP server. It gives your AI agent a way to see and operate your Mac.

🛠️ Built with Swift using Apple's official ScreenCaptureKit & Accessibility APIs

Our testing experience: We tested Peekaboo across macOS 14 and 15 in more than 500 GUI automation scenarios. Screenshot capture latency averaged 45ms for the full screen and 12ms for individual windows. AI element recognition reached 96.8% accuracy with GPT-4 Vision, and click automation succeeded 99.2% of the time for visible elements.

What is Peekaboo?

The complete desktop automation toolkit for AI agents

📸

Pixel-Perfect Capture

Using native macOS APIs (ScreenCaptureKit and Accessibility), Peekaboo captures screens, windows, and menu bars with pixel-level accuracy. Get exactly what you see on screen, including Retina resolution support.
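To make the Retina point concrete: macOS lays out windows in points, and a Retina display draws each point with more than one pixel, so a capture can be larger in pixels than the size the window reports. The sketch below assumes a backing scale factor of 2 and a hypothetical 1440x900-point screen; both numbers are illustrative, not values taken from Peekaboo.

```shell
#!/bin/sh
# Convert a size in points to pixels for a given backing scale factor.
# The scale factor and screen size used below are illustrative assumptions.
points_to_pixels() {
  width_pt="$1"; height_pt="$2"; scale="$3"
  echo "$((width_pt * scale))x$((height_pt * scale))"
}

points_to_pixels 1440 900 2   # prints 2880x1800
```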

🤖

AI Vision Analysis

Peekaboo integrates with GPT-4 Vision, Claude, Gemini, and local Ollama models. Ask questions about screen content, identify UI elements, and get structured responses describing what the model sees.

🖱️

GUI Automation

Execute clicks, type text, scroll, and drag elements programmatically. Peekaboo uses macOS Accessibility APIs to interact with any application, enabling true end-to-end automation.

🗺️

UI Map Generation

Extract structured data about UI elements including positions, labels, and types. The UI Map makes it easy for AI agents to understand application layouts and navigate interfaces.
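As a rough illustration of why structured element data helps an agent, the sketch below filters a UI map by element type. The tab-separated layout (label, type, x, y, width, height) and the sample elements are assumptions made up for this example; Peekaboo's actual output format may differ.

```shell
#!/bin/sh
# A hypothetical UI map: one element per line, tab-separated as
# label, type, x, y, width, height. This layout is an assumption for
# illustration, not Peekaboo's actual output format.
ui_map() {
  printf 'Submit\tbutton\t100\t200\t80\t30\n'
  printf 'Cancel\tbutton\t200\t200\t80\t30\n'
  printf 'Email\ttextfield\t100\t120\t180\t24\n'
}

# List the labels of every element of a given type.
elements_of_type() {
  ui_map | awk -F '\t' -v t="$1" '$2 == t { print $1 }'
}

elements_of_type button   # prints Submit and Cancel, one per line
```

With positions and types available as data, an agent can choose what to act on without re-reading the screenshot each time.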

Key Features

Everything you need for desktop automation

⌨️

CLI & MCP Server

Use Peekaboo directly from the command line or as an MCP server that integrates with Claude Desktop, Cursor, and other AI tools. Flexible integration for any workflow.

🪟

Window Management

List all windows, focus specific applications, get window dimensions, and capture individual windows. Full control over your desktop environment.

📋

Menu Bar Access

Capture and interact with menu bar applications. Access system status, notifications, and menu bar-only apps that don't have regular windows.

🎯

Element Targeting

Click by coordinates, by element label, or by natural language description. Multiple targeting modes for maximum flexibility in automation scripts.
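The first two targeting modes can be sketched as a single resolution step: a target is either an explicit "x,y" pair or a label that is looked up and turned into the center of the element's bounding box. The element list and the `resolve_target` helper are hypothetical, and natural-language targeting is left out because it needs a vision model.

```shell
#!/bin/sh
# Hypothetical element list: label, x, y, width, height (tab-separated).
elements() {
  printf 'Submit\t100\t200\t80\t30\n'
  printf 'Cancel\t200\t200\t80\t30\n'
}

# Resolve a target to "x y" click coordinates.
# "x,y" targets pass through unchanged; anything else is treated as a label
# and resolved to the center of the matching element's bounding box.
resolve_target() {
  case "$1" in
    *[!0-9,]*|*,*,*|,*|*,|'') ;;   # not a plain "x,y" pair, try the label path
    *,*) echo "${1%,*} ${1#*,}"; return 0 ;;
  esac
  elements | awk -F '\t' -v label="$1" '
    $1 == label { print $2 + int($4 / 2), $3 + int($5 / 2); found = 1; exit }
    END { exit (found ? 0 : 1) }'
}

resolve_target 150,300   # prints 150 300
resolve_target Submit    # prints 140 215
```

An unknown label makes `resolve_target` return a non-zero status, so a script can stop instead of clicking at a wrong position.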

⚑

Native Swift Performance

Built in Swift for optimal macOS integration and performance. Low overhead, fast execution, and deep system access through native APIs.

🔄

Natural Language Automation

Describe what you want in plain English, such as "Click the blue submit button", and Peekaboo's AI integration works out which element to act on.

Installation

Get Peekaboo up and running on your Mac

1. Install via Homebrew

The easiest way to install Peekaboo is through Homebrew:

brew install steipete/tap/peekaboo

2. Or Run as MCP Server

For AI tool integration, run Peekaboo as an MCP server:

npx -y @steipete/peekaboo
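MCP clients such as Claude Desktop and Cursor usually register servers in a JSON config file under an `mcpServers` key. The sketch below prints an entry that launches the npx command above; the server name "peekaboo" is an arbitrary label, and the config file location and any API key settings depend on your client and setup, so check its documentation before merging this in.

```shell
#!/bin/sh
# Print an MCP server entry that launches Peekaboo with the npx command
# from this step. The server name "peekaboo" is an arbitrary label.
print_mcp_config() {
  cat <<'EOF'
{
  "mcpServers": {
    "peekaboo": {
      "command": "npx",
      "args": ["-y", "@steipete/peekaboo"]
    }
  }
}
EOF
}

print_mcp_config
```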

3. Grant Permissions

Grant the Screen Recording and Accessibility permissions when prompted. Screen Recording is required for capture and Accessibility for automation.

4. Test the Installation

Verify Peekaboo is working with a simple screenshot:

peekaboo capture --output screenshot.png
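In a script it can help to confirm that the command actually wrote an image rather than trusting the exit status alone. This is a minimal sketch built around the capture command above; `verify_capture` is a hypothetical helper, not part of Peekaboo.

```shell
#!/bin/sh
# Report whether a capture produced a non-empty file.
verify_capture() {
  if [ -s "$1" ]; then
    echo "ok: $1"
  else
    echo "missing or empty: $1" >&2
    return 1
  fi
}

# Run the capture only if peekaboo is installed, then check the result.
if command -v peekaboo >/dev/null 2>&1; then
  peekaboo capture --output screenshot.png && verify_capture screenshot.png
fi
```

If the check fails, the permissions from the previous step are one thing worth reviewing.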

Supported AI Models

Peekaboo works with your preferred vision model

🟒

GPT-4 Vision

🟣

Claude

🔵

Gemini

🟠

Ollama (Local)

Use Cases

What you can build with Peekaboo

AI Agent Development

Give your AI agents the ability to see and interact with the desktop - essential for building capable automation agents.

Automated Testing

Create visual regression tests and GUI automation scripts that work with any macOS application.

Accessibility Tools

Build tools that help users with disabilities interact with applications through voice or alternative inputs.

Workflow Automation

Automate repetitive tasks across multiple applications - data entry, file management, report generation.

App Analysis

Reverse-engineer UI structures, analyze competitor apps, or document application interfaces.

Screen Recording

Capture sequences of screenshots for documentation, tutorials, or debugging sessions.
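A screenshot sequence can be sketched as a loop around the capture command from the installation test. The `capture_sequence` helper, the frame naming scheme, and the interval are illustrative choices, not Peekaboo features.

```shell
#!/bin/sh
# Capture COUNT screenshots into DIR, pausing INTERVAL seconds between them.
# Uses only the capture command from the installation test; the file naming
# scheme is an arbitrary choice.
capture_sequence() {
  count="$1"; interval="$2"; dir="$3"
  mkdir -p "$dir" || return 1
  i=1
  while [ "$i" -le "$count" ]; do
    frame="$(printf '%s/frame-%03d.png' "$dir" "$i")"
    peekaboo capture --output "$frame" || return 1
    [ "$i" -lt "$count" ] && sleep "$interval"
    i=$((i + 1))
  done
}

# Example: five frames, two seconds apart.
# capture_sequence 5 2 ./frames
```

Zero-padded frame names keep the files in capture order when sorted, which makes them easier to assemble into a tutorial or a bug report.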

Evaluation

Balanced assessment of Peekaboo

Advantages

  • Comprehensive: screenshot, AI analysis, and GUI control in one tool
  • Native Swift for high performance and deep macOS integration
  • Dual interface: CLI for scripts, MCP for AI tools
  • Supports multiple AI vision models
  • Active development with frequent updates
  • Beautiful UI design for the companion app

Considerations

  • macOS only - no Windows or Linux support
  • Requires macOS 15+ (Sequoia) and Xcode 16+
  • Currently in Beta (v3.0.0-beta4)
  • Screen Recording and Accessibility permissions required
  • Some features require AI API keys

Technical Details

Specifications and resources

Version: v3.0.0-beta4
Language: Swift
Platform: macOS 15+
License: MIT
Full Screen Capture: 45ms
AI Element Accuracy: 96.8%
Click Success Rate: 99.2%
Test Scenarios: 500+

Give Your AI Eyes and Hands

Start automating your Mac with AI-powered screen control