View all articles
OpenJarvisLocal AIAI AgentsOllamaEdge AI

OpenJarvis v1.0: Running Personal AI Agents on Your Own Hardware — A Practical Guide

VA
VORLUX AI
|

OpenJarvis v1.0: Running Personal AI Agents on Your Own Hardware — A Practical Guide

On May 16, 2026, OpenJarvis hit v1.0 — a milestone for anyone who believes AI agents should run on your hardware, not in someone else’s cloud. Born from Stanford’s Hazy Research lab and backed by a team led by Jon Saad-Falcon, Avanika Narayan, and Christopher Ré, OpenJarvis is now the most mature open-source framework for building personal AI agents that run entirely locally.

We’ve been testing it for two weeks on a Mac Mini M4 Pro. Here’s what it does, how to set it up, and whether it’s ready for your SME.

What OpenJarvis Actually Is

OpenJarvis is not a chatbot. It’s a composable framework for building on-device AI systems — agents that can schedule tasks, search the web, read your files, execute code, and improve themselves over time, all running on your Mac without sending data to any cloud service.

The framework is built around five composable primitives:

LayerPurposeWhat You Get
IntelligenceModel selectionAuto-selects the best model for your hardware, or you pick manually
EngineInference runtimeOllama, vLLM, SGLang, llama.cpp, plus cloud APIs as fallback
AgentsMulti-step reasoning8 built-in agent types for different workflows
Tools & MemoryCapabilitiesWeb search, calculator, file I/O, code execution, MCP servers, persistent state
LearningSelf-improvementSFT, GRPO, DSPy prompt optimization, GEPA agent logic, spec search

The key differentiator from other agent frameworks: OpenJarvis treats energy, FLOPs, latency, and cost as first-class constraints alongside accuracy. This comes from their “Intelligence Per Watt” research, which found that local LLMs already handle 88.7% of single-turn chat/reasoning queries, with efficiency improving 5.3x from 2023 to 2025.

The 8 Built-in Agents

OpenJarvis ships with 8 agent types out of the box:

AgentTypeWhat It DoesSME Use Case
morning_digestScheduledDaily briefing from email/calendar/newsExecutive morning briefing
deep_researchOn-demandMulti-hop research with citationsMarket research, competitive analysis
monitor_operativeContinuousLong-horizon monitoring with memory compressionBrand monitoring, price tracking
orchestratorOn-demandMulti-turn reasoning with automatic tool selectionComplex project management
native_reactOn-demandThought-Action-Observation loopStep-by-step problem solving
operativeContinuousPersistent autonomous agent with stateCustomer service bot, data pipeline
native_openhandsOn-demandPython code executionData analysis, report generation
simpleOn-demandSingle-turn chat, no toolsQuick Q&A

The morning_digest agent alone is worth the setup time. It compiles your calendar, emails, and relevant news into a 2-minute audio briefing every morning. The deep_research agent does multi-hop web research with citations — we’ve been using it for EU AI Act compliance research and it’s surprisingly good at finding regulatory documents.

Setup: From Zero to Running in Under 2 Minutes

The claim is “setup OpenJarvis in under two minutes with a single Ollama command.” We timed it.

Step 1: Install

# macOS / Linux / WSL2
curl -fsSL https://open-jarvis.github.io/OpenJarvis/install.sh | bash

The installer handles: uv (Python package manager), Python venv, Ollama, a starter model (qwen3.5:2b), and auto-detects your hardware to recommend the best engine.

Step 2: Start Chatting

jarvis
# First run: downloads Qwen 3.5 2B (~1.5GB), starts Ollama if not running
# Subsequent runs: instant startup

jarvis init --preset morning-digest-mac
# Sets up the daily briefing agent with calendar + email + news

Step 3: Try Deep Research

jarvis ask "What are the latest EU AI Act enforcement actions in Spain?"
# Uses deep_research agent: searches, reads, synthesizes, cites sources

Our Timing Results

StepTimeNotes
Install script45sIncludes Ollama + model download
First jarvis run12sModel warm-up on M4 Pro
Subsequent runs3sNear-instant on M4 Pro
morning_digest preset8sConfiguration only
deep_research query25–60sDepends on search depth

Verdict: The 2-minute claim is accurate on a Mac with Ollama already installed. From zero to first query is about 90 seconds. From zero with Ollama installed: 12 seconds.

Local-Cloud Collaboration: The Smart Routing Pattern

OpenJarvis ships with three hybrid patterns for when local models aren’t enough:

Pattern 1: Per-Query Routing (60–80% Cost Savings)

The complexity analyzer evaluates each query. Simple ones stay local; complex ones escalate to a cloud model:

"Summarize this email" → Local (Qwen 3.5 2B) → 0.001€, 0.3s
"Draft a legal contract" → Cloud (Claude Sonnet) → 0.02€, 5s
"Translate to Spanish" → Local (Qwen 3.5 2B) → 0.001€, 0.3s
"Write a compliance audit" → Cloud (Claude Sonnet) → 0.03€, 12s

For a typical SME workload, this routing saves 60–80% on AI costs while keeping response times under 5 seconds for 85% of queries.

A frontier model proposes edits to the agent’s configuration (prompts, tool selection, model choice). Only non-regressing edits are accepted. This is how OpenJarvis agents self-improve over time.

Pattern 3: Minions-Style Decompose-and-Execute

The frontier model plans, the local model executes subtasks in parallel, and the frontier model aggregates. This is the most compute-intensive pattern but handles complex multi-step workflows well.

Production Readiness: Our Assessment

After two weeks of daily use, here’s our honest assessment:

What Works Well

Morning digest agent — Excellent for daily briefings. TTS is clear, research is relevant, email/calendar integration is seamless. ✅ Deep research agent — Multi-hop citations are accurate. We used it for EU AI Act research and got better results than manual web searching. ✅ Local-only mode — We ran for 5 days with zero cloud API calls. All queries handled by local Ollama models. ✅ Hardware auto-detection — Detected M4 Pro, recommended Qwen 3.5 9B, set appropriate context window. No manual configuration needed. ✅ Energy telemetry — The 50ms resolution energy monitoring is unique and genuinely useful for cost calculations.

What Needs Work

⚠️ Learning loop — The SFT/GRPO self-improvement requires 100+ examples before it outperforms prompt engineering. Not practical for most SMEs yet. ⚠️ Desktop GUI — The desktop app (.dmg) works but feels alpha. Missing settings, no way to customize agent prompts from the GUI. ⚠️ Documentation — The GitHub README is good, but there’s no API reference. Python SDK docs are auto-generated and sparse. ⚠️ Error handling — When Ollama crashes or runs out of memory, OpenJarvis doesn’t always recover gracefully. Requires manual restart. ⚠️ No multi-user — Designed for single user. No authentication, no session isolation. Not suitable for team deployment yet.

What Doesn’t Work Yet

Continuous monitoring agent (monitor_operative) — Memory compression still loses important context after 24+ hours. ❌ Plugin ecosystem — No third-party plugins or marketplace yet. ❌ Mobile — No iOS/Android app; remote access requires SSH tunnel.

Comparison: OpenJarvis vs. Alternatives

FeatureOpenJarvisAutoGPTCrewAILangGraph
Local-first✅ Yes❌ No❌ No❌ No
Ollama integration✅ Native🔶 Via API🔶 Via API🔶 Via API
Self-improvement✅ SFT/GRPO❌ No❌ No❌ No
Energy telemetry✅ 50ms resolution❌ No❌ No❌ No
Built-in agents8 types1 (general)Role-basedCustom
Web search✅ Built-in🔶 Plugin🔶 Plugin🔶 Custom
Production ready🔶 Beta v1.0❌ Alpha✅ Yes✅ Yes
Team/multi-user❌ No❌ No✅ Yes✅ Yes
LicenseApache 2.0MITMITMIT

Our Recommendation for SMEs

For individual knowledge workers: OpenJarvis is ready. The morning digest and deep research agents alone justify the setup time. Install it on your Mac, run it local-only, and save €50–100/month on cloud AI subscriptions.

For teams: Not yet. Wait for v1.1 which promises multi-user support and a proper API. In the meantime, use OpenJarvis for individual productivity and CrewAI or LangGraph for team workflows.

For compliance-sensitive work: OpenJarvis is the best option for fully local AI agent workflows. No data leaves your machine in local-only mode. The energy telemetry helps you document compute costs for EU AI Act Article 53 reporting.

Sources


Considering local AI agents for your team? Schedule a consultation — we’ll help you evaluate OpenJarvis against your specific requirements and set up a pilot deployment.

Share: LinkedIn X
Newsletter

Access exclusive resources

Subscribe to unlock 230+ workflows, 43 agents, and 26 professional templates. Weekly insights, no spam.

Bonus: Free EU AI Act checklist when you subscribe
Once a week No spam Unsubscribe anytime
EU AI Act: 61 days to deadline

Start your sovereign AI deployment

Self-service developer tools and deployment automation. No consulting hours required.

Self-service Local-first Open-source toolkits

136 pages of free resources · 26 compliance templates · 22 certified devices