Commit Graph

9 Commits

Author SHA1 Message Date
Sanju Sivalingam
c395f9d83e feat: add DB persistence, real-time WebSocket, goal preprocessor, and Android companion app
- Add device/session/step DB persistence in server agent loop
- Add goal preprocessor for compound goals (e.g., "open YouTube and search X")
- Add step-level logging to agent loop
- Fix dashboard WebSocket auth (direct DB token lookup instead of auth.api)
- Fix web layout to use locals.session.token instead of cookie
- Add dashboard-ws.svelte.ts WebSocket store with auto-reconnect
- Rewrite devices page with direct DB queries and real-time updates
- Add device detail page with live step display and session history
- Add Android companion app resources, themes, and screen capture consent

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-17 20:12:41 +05:30
Sanju Sivalingam
9e9f398ed6 Add auto-scroll to find_and_tap skill for off-screen elements
find_and_tap now scrolls down and rescans (up to 10 times) when the
target element isn't visible on the current screen. Stops as soon as
the element is found — no wasted scrolls. This removes the need for
LLMs to manually scroll-and-check in workflow prompts.

Also simplifies the Gemini-to-WhatsApp workflow prompts.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-16 21:56:34 +05:30
Sanju Sivalingam
2312f8bece update workflow 2026-02-16 21:14:16 +05:30
Sanju Sivalingam
4848587f0c Add Gemini-to-WhatsApp workflow example
Multi-app workflow: ask Gemini about droidclaw.ai, copy response,
switch to WhatsApp, find contact, paste and send.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-16 20:47:40 +05:30
Sanju Sivalingam
71021f48dd Organise examples into folders, rewrite readme with natural flow and tailscale section
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-14 20:28:35 +05:30
Sanju Sivalingam
389ac81c98 Add Maestro-style YAML flow runner for deterministic automation
New --flow mode executes scripted YAML steps without LLM, mapping 17
commands (tap, type, swipe, scroll, etc.) to existing actions. Element
finding uses accessibility tree text/hint/id matching.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-07 19:34:18 +05:30
Sanju Sivalingam
7875db1626 Add weather-to-whatsapp workflow using Google AI Mode + share intent
Google app → AI Mode → "Chennai today weather" → Share button → WhatsApp → Sanju

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-07 19:11:26 +05:30
Sanju Sivalingam
feeb52aade Add 33 practical day-to-day workflow examples
Messaging & Communication:
- whatsapp-reply, whatsapp-broadcast, whatsapp-to-email
- telegram-channel-digest, telegram-send-message
- slack-standup, slack-check-messages
- email-digest, email-reply, translate-and-reply

Social Media:
- social-media-post (cross-platform Twitter + LinkedIn)
- social-media-engage (like/comment rounds)
- instagram-post-check (engagement stats)

Productivity:
- morning-briefing (SMS + WhatsApp + Telegram + Calendar + Weather)
- calendar-create-event, notes-capture
- notification-cleanup, do-not-disturb (focus mode)
- github-check-prs, screenshot-share-slack

Research & Search:
- google-search-report, news-roundup
- multi-app-research (Google + YouTube + Reddit)
- price-comparison (Amazon + Flipkart + Google)

Lifestyle:
- food-order (Zomato), uber-ride (price check)
- maps-commute, check-flight-status
- spotify-playlist, youtube-watch-later
- fitness-log, expense-tracker, wifi-password-share

All workflows use: bun run src/kernel.ts --workflow examples/<name>.json

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-07 18:37:29 +05:30
Sanju Sivalingam
8bad65cd96 Add 7 new actions, workflow orchestration, and shell escaping fixes
- New actions: open_url, switch_app, notifications, pull_file, push_file, keyevent, open_settings
- Workflow system: runWorkflow() for multi-app sub-goal sequences with --workflow CLI flag
- Export runAgent() with {success, stepsUsed} return for workflow integration
- Fix clipboard_set shell escaping (single-quote wrapping matching skills.ts)
- Improve type action escaping for backticks, $, !, ?, brackets, braces
- Move parseJsonResponse to llm-providers.ts and export it
- Update SYSTEM_PROMPT and Zod schema for 22 total actions

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-07 18:33:24 +05:30