Files

Tanmay Karande ec8bd80a3d first commit

2026-02-13 23:56:09 -05:00

18 KiB

Raw Blame History

OpenClaw Analysis & "My Own OpenClaw" Comparison

Date: 2026-02-13
Source Repo: inspiration/openclaw/ (local clone)
Diagram Reference: inspiration/MyOwnOpenClaw.png

What Is OpenClaw?
OpenClaw Architecture Deep Dive
MyOwnOpenClaw — The Simplified Blueprint
Side-by-Side Comparison
Key Takeaways for Building Our Own
Recommended Build Process for Aetheel

1. What Is OpenClaw?

OpenClaw is an open-source personal AI assistant (MIT licensed, 176k+ stars, 443 contributors, 175k+ lines of TypeScript). It runs locally on your own devices and acts as a gateway-centric control plane that connects an AI agent to every messaging channel you already use.

Core value proposition: A single, always-on AI assistant that talks to you through WhatsApp, Telegram, Slack, Discord, Signal, iMessage, Microsoft Teams, Google Chat, Matrix, WebChat, and more — while keeping everything local and under your control.

2. OpenClaw Architecture Deep Dive

2.1 The Four Pillars

Based on both the source code analysis and the MyOwnOpenClaw.png diagram, OpenClaw's architecture rests on four core subsystems:

Pillar 1: Memory System — "How It Remembers You"

Source files: src/memory/ (49 files, including manager.ts at 2,300+ lines)

How it works:

Component	Details
Identity Files	`SOUL.md` — personality & values; `USER.md` — who you are; `AGENTS.md` — agent behavior rules; `HEARTBEAT.md` — what to proactively check
Long-term Memory	`MEMORY.md` — persisted decisions, lessons, context
Session Logs	`daily/` — session logs organized by date
Search	Hybrid search = vector (embeddings) + keyword (BM25) via `sqlite-vec` or `pgvector`
Embedding Providers	OpenAI, Voyage AI, Gemini, or local via `node-llama-cpp` (ONNX)
Storage	SQLite database with `sqlite-vec` extension for vector similarity
Sync	File watcher (chokidar) monitors workspace for changes, auto-re-indexes

Key architectural details from the code:

MemoryIndexManager class (2,300 LOC) manages the full lifecycle: sync → chunk → embed → store → search
Hybrid search weighting: configurable vector weight + keyword weight (default 0.7 × vector + 0.3 × keyword as shown in the diagram)
Supports batch embedding with Voyage, OpenAI, and Gemini batch APIs
FTS5 full-text search table for keyword matching
Vector table via sqlite-vec for similarity search
Automatic chunking with configurable token sizes and overlap

Pillar 2: Heartbeat — "How It Acts Proactively"

Source files: src/cron/ (37 files including service, scheduling, delivery)

How it works:

Component	Details
Scheduling	Cron-based scheduling using the `croner` library
Service Architecture	`src/cron/service/` — manages job lifecycle, timers, catch-up after restarts
Normalization	`normalize.ts` (13k) — normalizes cron expressions and job definitions
Delivery	`delivery.ts` — routes cron job output to the correct channel/session
Run Logging	`run-log.ts` — persists execution history
Session Reaper	`session-reaper.ts` — cleans up stale sessions

What happens on each heartbeat:

Cron fires at scheduled intervals
Gateway processes the event
Checks all integrated services (Gmail, Calendar, Asana, Slack, etc.)
AI reasons over the data
Sends notification if needed (e.g., "Meeting in 15 min — prep doc is empty")
Or returns HEARTBEAT_OK (nothing to report)

Key detail: Runs without user prompting — this is what makes it feel "proactive."

Pillar 3: Channel Adapters — "How It Works Everywhere"

Source files: src/channels/, src/whatsapp/, src/telegram/, src/discord/, src/slack/, src/signal/, src/imessage/, src/web/, plus extensions/ (35 extension directories)

Built-in channels:

Channel	Library	Status
WhatsApp	`@whiskeysockets/baileys`	Core
Telegram	`grammy`	Core
Slack	`@slack/bolt`	Core
Discord	`discord.js` / `@buape/carbon`	Core
Signal	`signal-cli`	Core
iMessage	BlueBubbles (recommended) or legacy `imsg`	Core
WebChat	Built into Gateway WS	Core

Extension channels (via plugin system): Microsoft Teams, Matrix, Zalo, Zalo Personal, Google Chat, IRC, Mattermost, Twitch, LINE, Feishu, Nextcloud Talk, Nostr, Tlon, voice calls

Architecture:

Gateway-centric — all channels connect through a single WebSocket control plane (ws://127.0.0.1:18789)
Channel Dock (src/channels/dock.ts, 17k) — unified registration and lifecycle management
Session isolation — each channel/conversation gets its own session with isolated context
Group routing — configurable mention gating, reply tags, per-channel chunking
DM security — pairing codes for unknown senders, allowlists

Pillar 4: Skills Registry — "How It Extends to Anything"

Source files: skills/ (52 skill directories)

How it works:

Component	Details
Structure	Each skill is a directory with a `SKILL.md` file
Installation	Drop a file in `~/.openclaw/workspace/skills/<skill>/SKILL.md` — instantly available
Registry	ClawHub (5,700+ skills) — community-built extensions
Types	Bundled, managed, and workspace skills
Scope	Local files only — no public registry dependency, no supply chain attack surface

Built-in skill examples: 1password, apple-notes, apple-reminders, bear-notes, github, notion, obsidian, spotify-player, weather, canvas, coding-agent, discord, slack, openai-image-gen, openai-whisper, session-logs, summarize, video-frames, voice-call, etc.

2.2 Gateway Architecture

The Gateway is the central nervous system of OpenClaw:

WhatsApp / Telegram / Slack / Discord / Signal / iMessage / Teams / WebChat
                │
                ▼
┌───────────────────────────────┐
│            Gateway            │
│       (WS control plane)      │
│     ws://127.0.0.1:18789      │
├───────────────────────────────┤
│  • Session management         │
│  • Channel routing            │
│  • Cron/heartbeat engine      │
│  • Tool registration          │
│  • Presence & typing          │
│  • Auth & pairing             │
│  • Plugin loading             │
│  • Memory manager             │
│  • Config hot-reload          │
└──────────────┬────────────────┘
               │
               ├─ Pi agent (RPC) — AI reasoning engine
               ├─ CLI (openclaw …)
               ├─ WebChat UI
               ├─ macOS app (menu bar)
               ├─ iOS / Android nodes
               └─ Browser control (CDP)

Key source files:

src/gateway/server.impl.ts (22k) — main gateway server implementation
src/gateway/server-http.ts (17k) — HTTP server
src/gateway/ws-log.ts (14k) — WebSocket logging
src/gateway/session-utils.ts (22k) — session management
src/gateway/config-reload.ts (11k) — hot config reload

2.3 Configuration

File: ~/.openclaw/openclaw.json (JSON5 format)
Schema: Massive TypeBox schema system (src/config/schema.ts, schema.hints.ts at 46k, schema.field-metadata.ts at 45k)
Validation: Zod schemas (src/config/zod-schema.ts, 20k)
Hot Reload: Config changes apply without restart

2.4 Tech Stack

Category	Technology
Language	TypeScript (ESM)
Runtime	Node.js ≥22 (Bun also supported)
Package Manager	pnpm (bun optional)
Build	`tsdown` (based on Rolldown)
Testing	Vitest with V8 coverage
Linting	Oxlint + Oxfmt
AI Runtime	Pi agent (`@mariozechner/pi-agent-core`) in RPC mode
Database	SQLite with `sqlite-vec` for vector search
Embedding	OpenAI, Voyage, Gemini, or local ONNX
HTTP	Express 5
WebSocket	`ws` library

3. MyOwnOpenClaw — The Simplified Blueprint

The MyOwnOpenClaw.png diagram presents a dramatically simplified version of the same architecture, built with:

Tools: Claude Code + Claude Agent SDK + SQLite + Markdown + Obsidian

The 4 Custom Modules

① My Memory (SQLite + Markdown + Obsidian)

Feature	Implementation
`SOUL.md`	Personality & values
`USER.md`	Who I am, preferences
`MEMORY.md`	Decisions & lessons
`daily/`	Session logs
Hybrid Search	0.7 × vector + 0.3 × keyword (BM25)
Embeddings	SQLite (or Postgres) + FastEmbed (384-dim, ONNX)
Key principle	Fully local — zero API calls
Storage philosophy	"Markdown IS the database" — Obsidian syncs it everywhere

② My Heartbeat (Claude Agent SDK + Python APIs)

Feature	Implementation
Frequency	Every 30 minutes
Action	Python gathers data from sources: Gmail, Calendar, Asana, Slack
Reasoning	Claude reasons over the data, decides what's important
Notification	Sends notification if needed
Example	"Meeting in 15 min — prep doc is empty"
Fallback	`HEARTBEAT_OK (nothing to report)`

③ My Adapters (Slack + Terminal)

Feature	Implementation
Slack	Socket Mode — no public URL needed; each thread = persistent conversation
Terminal	Claude Code — direct interaction; full skill + hook access either way
One-shot	With Claude Code
Future	Discord, Teams — add when needed

④ My Skills (Local `.claude/skills/`)

Feature	Implementation
Location	Local `.claude/skills/` directory
Examples	`content-engine/`, `direct-integrations/`, `yt-script/`, `pptx-generator/`, `excalidraw-diagram/`, `...15+ more`
Installation	Drop in `SKILL.md` — instantly available
Security	Local files only — NO public registry, no supply chain attack surface

The Vision: "Your Ultra-Personalized AI Agent"

🔵 Remembers your decisions, preferences, and context

🟣 Checks your email and calendar — before you ask

🟢 Talk to it from Slack, terminal, anywhere

🟡 Add any capability with a single file

"Acts on your behalf. Anticipates what you need. Knows you better every day."

Build Stack

Claude Code ──→ Claude Agent SDK ──→ SQLite + Markdown ──→ Obsidian
(skills + hooks)  (heartbeat + background)  (hybrid search, fully local)  (your canvas, sync anywhere)

~2,000 lines of Python + Markdown — "You can build it in just a couple days."

4. Side-by-Side Comparison

Feature	OpenClaw (Full)	MyOwnOpenClaw (Custom)
Codebase Size	175k+ lines TypeScript	~2,000 lines Python + Markdown
Language	TypeScript (ESM)	Python
AI Provider	Any (Anthropic, OpenAI, etc. via Pi)	Claude (via Claude Agent SDK)
Memory System	SQLite + sqlite-vec, multiple embedding providers	SQLite + FastEmbed (384-dim ONNX)
Hybrid Search	Vector + BM25 (configurable weights)	0.7 vector + 0.3 keyword (BM25)
Embeddings	OpenAI, Voyage, Gemini, local ONNX	FastEmbed local ONNX only — zero API calls
Prompt Files	SOUL.md, USER.md, AGENTS.md, HEARTBEAT.md, TOOLS.md	SOUL.md, USER.md, MEMORY.md, daily/
Heartbeat	Full cron system with croner library	Simple 30-minute Python script
Data Sources	Configurable via plugins/skills	Gmail, Calendar, Asana, Slack
Channels	15+ (WhatsApp, Telegram, Slack, Discord, Signal, iMessage, Teams, Matrix, etc.)	Slack (Socket Mode) + Terminal (Claude Code)
Gateway	Full WS control plane with auth, routing, sessions	None — direct connection
Skills	52 bundled + ClawHub registry (5,700+)	Local `.claude/skills/` directory (15+ custom)
Skill Format	`SKILL.md` file in directory	`SKILL.md` file in directory (same pattern!)
Apps	macOS, iOS, Android, WebChat	None — Slack + CLI
Voice	Voice Wake + Talk Mode (ElevenLabs)	Not included
Browser	Playwright-based CDP control	Not included
Canvas	Agent-driven visual workspace (A2UI)	Not included
Config	JSON5 with massive schema validation	Simple Markdown files
Sync	File watcher (chokidar)	Obsidian sync
Storage Philosophy	SQLite is the DB	"Markdown IS the database" — Obsidian syncs everywhere
Installation	`npm install -g openclaw` + wizard	Clone repo + point Claude Code at it
Security	DM pairing, allowlists, Docker sandboxing	Local only by default
Multi-agent	Session isolation, agent-to-agent messaging	Not included
Complexity	Enterprise-grade, production-ready	Personal, lightweight, hackable

5. Key Takeaways for Building Our Own

What OpenClaw Gets Right (and we should learn from):

The Memory Architecture — The combination of identity files (SOUL.md, USER.md) + long-term memory (MEMORY.md) + session logs (daily/) is the core pattern. Both systems use this.
Hybrid Search — Vector + keyword search is essential for good memory retrieval. The 0.7/0.3 weighting is a good starting point.
Skill Drop-in Pattern — Just put a SKILL.md file in a directory and it's instantly available. No compilation, no registry. OpenClaw invented this pattern and the custom version copies it directly.
Proactive Heartbeat — Running on a schedule, checking your data sources before you ask. This is what makes the agent feel like an assistant rather than a chatbot.
The Separation of Concerns — Memory, Heartbeat, Adapters, and Skills are clean, independent modules. Each can be built and tested separately.

What MyOwnOpenClaw Simplifies:

No Gateway — Direct connections instead of a WS control plane. Much simpler but less flexible.
Python over TypeScript — More accessible for quick prototyping and data processing.
Claude-only — No model switching, no failover. Simpler but locked to one provider.
Obsidian as sync — Uses Obsidian's existing sync infrastructure instead of building custom file watching.
Two adapters max — Slack + Terminal vs. 15+ channels. Start small, add as needed.

The Process (from the diagram):

Clone the OpenClaw repository (MIT licensed, 100% open source)

Point your coding agent at it — "Explain how the memory system works"

"Now build that into my own system here (optional: with customization XYZ)"

Repeat for heartbeat, adapters, skills. That's it.

Use OpenClaw as your blueprint, not your dependency.

6. Recommended Build Process for Aetheel

Based on this analysis, here's the recommended order for building a custom AI assistant inspired by OpenClaw:

Phase 1: Memory System

Create SOUL.md, USER.md, MEMORY.md files
Implement SQLite database with sqlite-vec or FastEmbed for vector search
Build hybrid search (vector + BM25 keyword)
Set up file watching for automatic re-indexing
Use Obsidian for cross-device sync

Phase 2: Heartbeat

Build a Python script using Claude Agent SDK
Connect to Gmail, Calendar, Asana (start with most-used services)
Set up 30-minute cron schedule
Implement notification delivery (start with terminal notifications)

Phase 3: Adapters

Start with Terminal (Claude Code) for direct interaction
Add Slack (Socket Mode) for messaging
Build conversation threading support

Phase 4: Skills

Create .claude/skills/ directory structure
Port most-used skills from OpenClaw as inspiration
Build custom skills specific to your workflow

Appendix: OpenClaw File Structure Reference

openclaw/
├── src/                          # Core source code (175k+ LOC)
│   ├── memory/                   # Memory system (49 files)
│   │   ├── manager.ts            # Main memory manager (2,300 LOC)
│   │   ├── hybrid.ts             # Hybrid search (vector + keyword)
│   │   ├── embeddings.ts         # Embedding provider abstraction
│   │   ├── qmd-manager.ts        # Query+doc management (33k)
│   │   └── ...
│   ├── cron/                     # Heartbeat/cron system (37 files)
│   │   ├── service/              # Cron service lifecycle
│   │   ├── schedule.ts           # Scheduling logic
│   │   ├── delivery.ts           # Output delivery
│   │   └── ...
│   ├── channels/                 # Channel adapter framework (28 files)
│   │   ├── dock.ts               # Unified channel dock (17k)
│   │   ├── registry.ts           # Channel registration
│   │   └── ...
│   ├── gateway/                  # Gateway WS control plane (129+ files)
│   │   ├── server.impl.ts        # Main server (22k)
│   │   ├── server-http.ts        # HTTP layer (17k)
│   │   ├── session-utils.ts      # Session management (22k)
│   │   └── ...
│   ├── config/                   # Configuration system (130+ files)
│   ├── agents/                   # Agent runtime
│   ├── browser/                  # Browser control (Playwright)
│   └── ...
├── skills/                       # Built-in skills (52 directories)
│   ├── obsidian/
│   ├── github/
│   ├── notion/
│   ├── spotify-player/
│   └── ...
├── extensions/                   # Extension channels (35 directories)
│   ├── msteams/
│   ├── matrix/
│   ├── voice-call/
│   └── ...
├── apps/                         # Companion apps
│   ├── macos/
│   ├── ios/
│   └── android/
├── AGENTS.md                     # Agent behavior guidelines
├── openclaw.json                 # Configuration
└── package.json                  # Dependencies & scripts

18 KiB Raw Blame History Unescape Escape

OpenClaw Analysis & "My Own OpenClaw" Comparison

Table of Contents

1. What Is OpenClaw?

2. OpenClaw Architecture Deep Dive

2.1 The Four Pillars

Pillar 1: Memory System — "How It Remembers You"

Pillar 2: Heartbeat — "How It Acts Proactively"

Pillar 3: Channel Adapters — "How It Works Everywhere"

Pillar 4: Skills Registry — "How It Extends to Anything"

2.2 Gateway Architecture

2.3 Configuration

2.4 Tech Stack

3. MyOwnOpenClaw — The Simplified Blueprint

The 4 Custom Modules

① My Memory (SQLite + Markdown + Obsidian)

② My Heartbeat (Claude Agent SDK + Python APIs)

③ My Adapters (Slack + Terminal)

④ My Skills (Local .claude/skills/)

The Vision: "Your Ultra-Personalized AI Agent"

Build Stack

4. Side-by-Side Comparison

5. Key Takeaways for Building Our Own

What OpenClaw Gets Right (and we should learn from):

What MyOwnOpenClaw Simplifies:

The Process (from the diagram):

6. Recommended Build Process for Aetheel

Phase 1: Memory System

Phase 2: Heartbeat

Phase 3: Adapters

Phase 4: Skills

Appendix: OpenClaw File Structure Reference

18 KiB

Raw Blame History

④ My Skills (Local `.claude/skills/`)