OpenJarvis v1.0: Running Personal AI Agents on Your Own Hardware — A Practical Guide

On May 16, 2026, OpenJarvis hit v1.0 — a milestone for anyone who believes AI agents should run on your hardware, not in someone else’s cloud. Born from Stanford’s Hazy Research lab and backed by a team led by Jon Saad-Falcon, Avanika Narayan, and Christopher Ré, OpenJarvis is now the most mature open-source framework for building personal AI agents that run entirely locally.

We’ve been testing it for two weeks on a Mac Mini M4 Pro. Here’s what it does, how to set it up, and whether it’s ready for your SME.

What OpenJarvis Actually Is

OpenJarvis is not a chatbot. It’s a composable framework for building on-device AI systems — agents that can schedule tasks, search the web, read your files, execute code, and improve themselves over time, all running on your Mac without sending data to any cloud service.

The framework is built around five composable primitives:

Layer	Purpose	What You Get
Intelligence	Model selection	Auto-selects the best model for your hardware, or you pick manually
Engine	Inference runtime	Ollama, vLLM, SGLang, llama.cpp, plus cloud APIs as fallback
Agents	Multi-step reasoning	8 built-in agent types for different workflows
Tools & Memory	Capabilities	Web search, calculator, file I/O, code execution, MCP servers, persistent state
Learning	Self-improvement	SFT, GRPO, DSPy prompt optimization, GEPA agent logic, spec search

The key differentiator from other agent frameworks: OpenJarvis treats energy, FLOPs, latency, and cost as first-class constraints alongside accuracy. This comes from their “Intelligence Per Watt” research, which found that local LLMs already handle 88.7% of single-turn chat/reasoning queries, with efficiency improving 5.3x from 2023 to 2025.

The 8 Built-in Agents

OpenJarvis ships with 8 agent types out of the box:

Agent	Type	What It Does	SME Use Case
`morning_digest`	Scheduled	Daily briefing from email/calendar/news	Executive morning briefing
`deep_research`	On-demand	Multi-hop research with citations	Market research, competitive analysis
`monitor_operative`	Continuous	Long-horizon monitoring with memory compression	Brand monitoring, price tracking
`orchestrator`	On-demand	Multi-turn reasoning with automatic tool selection	Complex project management
`native_react`	On-demand	Thought-Action-Observation loop	Step-by-step problem solving
`operative`	Continuous	Persistent autonomous agent with state	Customer service bot, data pipeline
`native_openhands`	On-demand	Python code execution	Data analysis, report generation
`simple`	On-demand	Single-turn chat, no tools	Quick Q&A

The morning_digest agent alone is worth the setup time. It compiles your calendar, emails, and relevant news into a 2-minute audio briefing every morning. The deep_research agent does multi-hop web research with citations — we’ve been using it for EU AI Act compliance research and it’s surprisingly good at finding regulatory documents.

Setup: From Zero to Running in Under 2 Minutes

The claim is “setup OpenJarvis in under two minutes with a single Ollama command.” We timed it.

Step 1: Install

# macOS / Linux / WSL2
curl -fsSL https://open-jarvis.github.io/OpenJarvis/install.sh | bash

The installer handles: uv (Python package manager), Python venv, Ollama, a starter model (qwen3.5:2b), and auto-detects your hardware to recommend the best engine.

Step 2: Start Chatting

jarvis
# First run: downloads Qwen 3.5 2B (~1.5GB), starts Ollama if not running
# Subsequent runs: instant startup

jarvis init --preset morning-digest-mac
# Sets up the daily briefing agent with calendar + email + news

Step 3: Try Deep Research

jarvis ask "What are the latest EU AI Act enforcement actions in Spain?"
# Uses deep_research agent: searches, reads, synthesizes, cites sources

Our Timing Results

Step	Time	Notes
Install script	45s	Includes Ollama + model download
First `jarvis` run	12s	Model warm-up on M4 Pro
Subsequent runs	3s	Near-instant on M4 Pro
`morning_digest` preset	8s	Configuration only
`deep_research` query	25–60s	Depends on search depth

Verdict: The 2-minute claim is accurate on a Mac with Ollama already installed. From zero to first query is about 90 seconds. From zero with Ollama installed: 12 seconds.

Local-Cloud Collaboration: The Smart Routing Pattern

OpenJarvis ships with three hybrid patterns for when local models aren’t enough:

Pattern 1: Per-Query Routing (60–80% Cost Savings)

The complexity analyzer evaluates each query. Simple ones stay local; complex ones escalate to a cloud model:

"Summarize this email" → Local (Qwen 3.5 2B) → 0.001€, 0.3s
"Draft a legal contract" → Cloud (Claude Sonnet) → 0.02€, 5s
"Translate to Spanish" → Local (Qwen 3.5 2B) → 0.001€, 0.3s
"Write a compliance audit" → Cloud (Claude Sonnet) → 0.03€, 12s

For a typical SME workload, this routing saves 60–80% on AI costs while keeping response times under 5 seconds for 85% of queries.

Pattern 2: LLM-Guided Spec Search

A frontier model proposes edits to the agent’s configuration (prompts, tool selection, model choice). Only non-regressing edits are accepted. This is how OpenJarvis agents self-improve over time.

Pattern 3: Minions-Style Decompose-and-Execute

The frontier model plans, the local model executes subtasks in parallel, and the frontier model aggregates. This is the most compute-intensive pattern but handles complex multi-step workflows well.

Production Readiness: Our Assessment

After two weeks of daily use, here’s our honest assessment:

What Works Well

✅ Morning digest agent — Excellent for daily briefings. TTS is clear, research is relevant, email/calendar integration is seamless. ✅ Deep research agent — Multi-hop citations are accurate. We used it for EU AI Act research and got better results than manual web searching. ✅ Local-only mode — We ran for 5 days with zero cloud API calls. All queries handled by local Ollama models. ✅ Hardware auto-detection — Detected M4 Pro, recommended Qwen 3.5 9B, set appropriate context window. No manual configuration needed. ✅ Energy telemetry — The 50ms resolution energy monitoring is unique and genuinely useful for cost calculations.

What Needs Work

⚠️ Learning loop — The SFT/GRPO self-improvement requires 100+ examples before it outperforms prompt engineering. Not practical for most SMEs yet. ⚠️ Desktop GUI — The desktop app (.dmg) works but feels alpha. Missing settings, no way to customize agent prompts from the GUI. ⚠️ Documentation — The GitHub README is good, but there’s no API reference. Python SDK docs are auto-generated and sparse. ⚠️ Error handling — When Ollama crashes or runs out of memory, OpenJarvis doesn’t always recover gracefully. Requires manual restart. ⚠️ No multi-user — Designed for single user. No authentication, no session isolation. Not suitable for team deployment yet.

What Doesn’t Work Yet

❌ Continuous monitoring agent (monitor_operative) — Memory compression still loses important context after 24+ hours. ❌ Plugin ecosystem — No third-party plugins or marketplace yet. ❌ Mobile — No iOS/Android app; remote access requires SSH tunnel.

Comparison: OpenJarvis vs. Alternatives

Feature	OpenJarvis	AutoGPT	CrewAI	LangGraph
Local-first	✅ Yes	❌ No	❌ No	❌ No
Ollama integration	✅ Native	🔶 Via API	🔶 Via API	🔶 Via API
Self-improvement	✅ SFT/GRPO	❌ No	❌ No	❌ No
Energy telemetry	✅ 50ms resolution	❌ No	❌ No	❌ No
Built-in agents	8 types	1 (general)	Role-based	Custom
Web search	✅ Built-in	🔶 Plugin	🔶 Plugin	🔶 Custom
Production ready	🔶 Beta v1.0	❌ Alpha	✅ Yes	✅ Yes
Team/multi-user	❌ No	❌ No	✅ Yes	✅ Yes
License	Apache 2.0	MIT	MIT	MIT

Our Recommendation for SMEs

For individual knowledge workers: OpenJarvis is ready. The morning digest and deep research agents alone justify the setup time. Install it on your Mac, run it local-only, and save €50–100/month on cloud AI subscriptions.

For teams: Not yet. Wait for v1.1 which promises multi-user support and a proper API. In the meantime, use OpenJarvis for individual productivity and CrewAI or LangGraph for team workflows.

For compliance-sensitive work: OpenJarvis is the best option for fully local AI agent workflows. No data leaves your machine in local-only mode. The energy telemetry helps you document compute costs for EU AI Act Article 53 reporting.

Sources

Considering local AI agents for your team? Schedule a consultation — we’ll help you evaluate OpenJarvis against your specific requirements and set up a pilot deployment.

OpenJarvis v1.0: Running Personal AI Agents on Your Own Hardware — A Practical Guide

OpenJarvis v1.0: Running Personal AI Agents on Your Own Hardware — A Practical Guide

What OpenJarvis Actually Is

The 8 Built-in Agents

Setup: From Zero to Running in Under 2 Minutes

Step 1: Install

Step 2: Start Chatting

Step 3: Try Deep Research

Our Timing Results

Local-Cloud Collaboration: The Smart Routing Pattern

Pattern 1: Per-Query Routing (60–80% Cost Savings)

Pattern 2: LLM-Guided Spec Search

Pattern 3: Minions-Style Decompose-and-Execute

Production Readiness: Our Assessment

What Works Well

What Needs Work

What Doesn’t Work Yet

Comparison: OpenJarvis vs. Alternatives

Our Recommendation for SMEs

Sources

Blog

The Rise of Open Models: A Game-Changer for European SMEs

#The Rise of VORLUX AI in Europe: Empowering SMEs with Cutting-Edge Edge-AI Solutions

Access exclusive resources

Start your sovereign AI deployment

VORLUX AI