ProductivityClaw

provenance:github:rah757/ProductivityClaw

WHAT THIS AGENT DOES

ProductivityClaw is an AI agent designed to help individuals prioritize their tasks and focus on what truly matters. It integrates with existing tools like calendars, notes, and messaging apps to build a comprehensive understanding of a user's digital life. The agent proactively suggests actions based on deadlines, habits, and priorities, eliminating the need for manual review of schedules and to-do lists. It's ideal for busy professionals and anyone seeking to improve their productivity without relying on complex new systems. ProductivityClaw operates locally on the user's device, ensuring data privacy and eliminating reliance on cloud services. The agent learns user patterns over time, providing increasingly relevant and personalized recommendations.

PROBLEM IT SOLVES

ProductivityClaw solves the problem of information overload and the difficulty of discerning what tasks are most important. Instead of simply displaying a list of commitments, it intelligently prioritizes them, saving users time and mental energy by guiding them toward the most impactful actions.

View Source ↗First seen 5mo agoNot yet hireable

CAPABILITIES & CONSTRAINTS

TECH & STACK

pythonmlxlanggraphsqlitemacostelegramllmproductivity

USE CASES

Scheduling Automation

README

# ProductivityClaw

A local-first AI agent that ingests your digital life — calendar, notes, messages, notifications — builds persistent context, and proactively tells you what to focus on.

Not "here's your calendar." More like: **"Based on your deadlines, habits, and priorities — here's what you should do right now."**

## Why This Exists

Every productivity tool shows you *what* you have. None of them tell you *what matters*. ProductivityClaw sits on top of your existing tools, learns your patterns over time, and surfaces what's actually important — without you asking.

## System Architecture
<img width="793" height="669" alt="image" src="https://github.com/user-attachments/assets/e12d98d8-906d-44d9-85f6-cf8299fda431" />
<img width="772" height="561" alt="image" src="https://github.com/user-attachments/assets/524a1531-5989-437c-8e1d-402c3444dc16" />
<img width="767" height="371" alt="image" src="https://github.com/user-attachments/assets/5354fbcb-8249-40e3-8641-87d8b1adc1b7" />


## Architecture Decisions

### Why Local LLM
All data stays on your device. Calendar events, personal notes, habits, routines — none of it leaves your machine. No cloud API calls, no data sharing, no latency dependency.

### Why MLX over Ollama
MLX uses ~50% less RAM than Ollama on Apple Silicon. The main reason: Qwen's 150K+ token vocabulary creates a 3-4GB embedding table in FP16. Ollama keeps this full-precision; MLX quantizes it down to ~1GB. MLX also exploits Apple Silicon's Unified Memory Architecture (no CPU↔GPU staging buffers) and uses custom Metal shaders. The codebase connects via OpenAI-compatible API (`ChatOpenAI`), so switching to Ollama for non-Mac deployment is a config change — same endpoint format.

### Why Qwen 3.5 35B-A3B
Mixture-of-Experts model: 35B total parameters, but only ~3B active per token (256 experts, 8 active). This gives you large-model quality at small-model speed — 2-4 second responses on a MacBook Pro. `think=False` works correctly (was broken on Qwen 3), keeping latency tight for chat while still available for background reasoning tasks.

### Why LangGraph
Not a simple prompt→response chain. LangGraph provides a stateful agent loop: the LLM can call tools, inspect results, call more tools, and maintain state across the cycle. Pending action state (for write confirmations) lives in the graph, not in fragile string parsing.

### Why SQLite + FTS5
Local-first, zero config, single file. FTS5 gives BM25-ranked full-text search over stored context — no vector database needed at personal scale. Every message, action, and tool call is linked by trace_id for full audit trails.

### Why Apple EventKit (Native)
No Google API keys, no OAuth dance, works offline. Direct access to macOS Calendar via PyObjC. Read-only by default — write actions (create/move events) require explicit user confirmation through Telegram buttons.

### Why Telegram
Free, instant setup, runs on your phone. Rich inline buttons enable the human-in-the-loop confirmation workflow for write actions. No web UI to build or maintain.

## Current Status — Phase 2: Memory, Tool Calling + Write Actions

- [x] Project architecture and technical design
- [x] MLX inference backend (migrated from Ollama)
- [x] Telegram bot — messages, HTML rendering, feedback buttons
- [x] Telegram streaming — tiered message editing for live LLM output
- [x] Apple Calendar read-only integration (EventKit)
- [x] Calendar write actions with confirmation (create_event, move_event)
- [x] Context dump ingestion (store_context + FTS5 search)
- [x] Living user profile (update_profile — add/remove/update CONTEXT.md)
- [x] Conversation logging with trace_id linking
- [x] Thumbs up/down feedback on every response
- [x] Heartbeat — proactive briefing system (morning/evening/meeting reminders)
- [x] Email ingestion — Apple Mail via ScriptingBridge + LLM classification (HIGH/LOW/NOISE)
- [x] Apple Notes ingestion — ScriptingBridge, 60-day window, auto-sync in heartbeat
- [x] LLM priority lock — chat always wins over background tasks (MLX single-threaded)
- [x] Custom LLM-as-judge eval suite (see Eval below)

**Phase 2 target:** Calendar writes with confirmation, email/notes awareness, streaming responses, and eval data proving quality.

## Tech Stack

| Component | Choice | Why |
|-----------|--------|-----|
| Language | Python | LangGraph ecosystem + PyObjC for macOS |
| Orchestration | LangGraph | Stateful tool-calling loops, pending action state |
| LLM | Qwen 3.5 35B-A3B-4bit | MoE: ~3B active/token, 2-4s responses, local |
| Inference | MLX (mlx_lm.server) | 50% less RAM than Ollama on Apple Silicon |
| Chat Interface | Telegram Bot API | Free, inline buttons for confirmation, runs on phone |
| Memory | SQLite + FTS5 | Local-first, BM25 search, zero config |
| Calendar | Apple EventKit (PyObjC) | Native macOS, no API keys, works offline |
| Email | Apple Mail (ScriptingBridge) | Same native pattern as EventKit, zero credentials |
| Notes | Apple Notes (ScriptingBridge) | Same native pattern, auto-syncs every heartbeat |
| Eval | Custom LLM-as-judge | Local Qwen judges its own outputs (no cloud eval APIs) |

## Skills

| Skill | Type | Description |
|-------|------|-------------|
| `get_calendar_events` | Read | Fetch events by timeframe (today, tomorrow, this_week, etc.) |
| `create_event` | Write | Propose a new calendar event — requires user confirmation |
| `move_event` | Write | Propose rescheduling an event — requires user confirmation |
| `store_context` | Write | Save notes, tasks, reminders to persistent memory with FTS5 indexing |
| `update_profile` | Write | Manage living user profile (preferences, schedule, routines, work) |
| `get_emails` | Read | Fetch classified emails by timeframe and priority filter |

Write skills that modify external systems (calendar) use a **pending action workflow**: the LLM proposes the action, the user sees a confirmation button in Telegram, and only an explicit tap executes the write.

## Eval

We tried DeepEval but it requires multiple chained LLM calls per metric, and Qwen 3.5 (MoE) intermittently returns empty responses — breaking DeepEval's JSON parsing pipeline. Instead we wrote a **custom LLM-as-judge eval** that:

- Sends **one prompt per test** (not 3+) with a simple "rate 1-5, respond with JSON" format
- **Retries with escalating temperature** (0.0 → 0.1 → 0.3 → 0.5 → 0.7) to break out of empty-response loops
- Strips `<think>` tags and extracts JSON from reasoning fields
- Marks tests as `flaky(reruns=2)` since local LLM non-determinism is inherent

**Test suite:**

| Suite | Tests | Requires MLX | What it covers |
|-------|-------|--------------|----------------|
| Deterministic | 32 | No | Calendar filtering, prompt assembly, email parsing, pending actions, tool detection |
| LLM-as-judge | 14 | Yes | Answer relevancy, faithfulness, hallucination detection, tool routing, correctness |

## Roadmap

| Phase | Focus | Status |
|-------|-------|--------|
| **1 — Prove the Loop** | Telegram + Calendar + Memory + Heartbeat + Eval | Complete |
| **2 — Memory + Write Actions** | Email/Notes ingestion, streaming, priority lock, calendar writes, eval | In progress |
| 3 — Proactive | Proactive suggestions, pattern recognition, smart reminders | Planned |
| 4 — Polish | Siri Shortcuts, vision, multi-modal, eval dashboard | Future |
| 5 — Ecosystem | Skill import pipeline, multi-agent, OSS community | Future |

## Setup

Requires **macOS with Apple Silicon** and 16GB+ RAM (36GB recommended for Qwen 3.5 35B).

```bash
# Clone
git clone https://github.com/rah757/ProductivityClaw.git
cd ProductivityClaw

# Environment
cp .env.example .env
# Edit .env: add TELEGRAM_BOT_TOKEN and TELEGRAM_ALLOWED_USER_IDS

# Dependencies
pip install -r requirements.txt

# Start MLX model server (separate terminal)
mlx_lm.server --model mlx-community/Qwen3.5-35B-A3B-4bit --port 8000

# Run the agent
python -m agent.main
```

## Privacy Model

- Agent runs 100% locally — API keys, data, memory 

[truncated…]

PUBLIC HISTORY

First discoveredMar 21, 2026

IDENTITY

inferred

Identity inferred from code signals. No PROVENANCE.yml found.

Is this yours? Claim it →

METADATA

platformgithub

first seenFeb 19, 2026

last updatedMar 21, 2026

last crawled4 months ago

version—

RELATED AGENTS

askimo

Askimo is a platform that lets you interact with artificial intelligence in a simple way, whether through chatting, sear

blog-writer-multi-agents

Here's a plain English summary of the blog-writer-multi-agents AI agent: This agent automatically creates professional-

boss-skill

This agent, boss-skill, is designed to help employees navigate challenging workplace dynamics, particularly those involv

strands-multi-engineer-agent

This agent helps businesses understand how different AI models perform when tackling the same engineering task. It runs

J.E.L.L.Y._AI

J.E.L.L.Y._AI is an article writing AI developed by its creator. This repository is publicly available for job seeking p

More Scheduling agents →

README BADGE

Add to your README:

![Provenance](https://getprovenance.dev/api/badge?id=provenance:github:rah757/ProductivityClaw)