Agent CLI

A collection of local-first, AI-powered command-line agents that run entirely on your machine.

What is Agent CLI?

Agent CLI provides a suite of powerful tools for voice and text interaction, designed for privacy, offline capability, and seamless integration with system-wide hotkeys and workflows.

Tip

Short aliases available: You can use agent or ag instead of agent-cli for convenience.

Important

Local and Private by Design

All agents can run 100% locally. Your data—whether from your clipboard, microphone, or files—stays on your machine unless you configure a cloud provider. This keeps workflows private and allows the tools to work offline.

You can optionally configure the agents to use OpenAI/Gemini services.

Quick Demo

Why I built this

I got tired of typing long prompts to LLMs. Speaking is faster, so I built this tool to transcribe my voice directly to the clipboard with a hotkey.

What it does:

Voice transcription to clipboard with system-wide hotkeys (Cmd+Shift+R on macOS)
Autocorrect any text from your clipboard
Edit clipboard content with voice commands ("make this more formal")
Runs locally - no internet required, your audio stays on your machine
Works with any app that can copy/paste

I use it mostly for the transcribe command when working with LLMs. Being able to speak naturally means I can provide more context without the typing fatigue.

Since then I have expanded the tool with many more features, all focused on local-first AI agents that integrate seamlessly with your system.

Features

Command	Description
`autocorrect`	Correct grammar and spelling using a local LLM
`transcribe`	Transcribe audio from your microphone to clipboard
`transcribe-live`	Continuous background transcription with VAD
`speak`	Convert text to speech using a local TTS engine
`voice-edit`	Edit clipboard text with voice commands
`assistant`	Wake word-based voice assistant
`chat`	Conversational AI with tool-calling capabilities
`rag-proxy`	Chat with your documents via RAG
`memory`	Long-term memory system for conversations
`dev`	Parallel development with git worktrees and AI coding agents
`server`	Local ASR & TTS servers with dual-protocol (Wyoming & OpenAI), TTL-based memory, and multi-platform acceleration (MLX/CUDA)

Quick Start

Just want the CLI tool?

If you already have AI services running (or plan to use OpenAI):

# Using uv (recommended)
uv tool install agent-cli -p 3.13

# Using pip
pip install agent-cli

Note

The -p 3.13 flag is required because some dependencies don't support Python 3.14 yet. See uv issue #8206 for details.

Then use it:

agent-cli autocorrect "this has an eror"

Want automatic setup with everything?

# 1. Install agent-cli
uv tool install agent-cli -p 3.13

# 2. Install all required services
agent-cli install-services

# 3. Start all services
agent-cli start-services

# 4. (Optional) Set up system-wide hotkeys
agent-cli install-hotkeys

# 5. Use it!
agent-cli autocorrect "this has an eror"

The setup automatically installs:

✅ Package managers (Homebrew/uv) if needed
✅ All AI services (Ollama, Whisper, TTS, etc.)
✅ The agent-cli tool
✅ System dependencies
✅ Hotkey managers (if using hotkey scripts)

Requirements

uv (recommended) or Python 3.11+
A microphone (for voice features)
Speakers (for text-to-speech features)

Documentation

Getting Started - Installation and first steps
Configuration - All configuration options
Commands - CLI reference
System Integration - Hotkeys and system setup
Architecture - How it works under the hood

License

MIT