droidclaw

Author	SHA1	Message	Date
Somasundaram Mahesh	0b36d92fef	fix: thread AbortSignal through LLM calls so stop_goal cancels immediately The agent loop checked signal.aborted only at the top of each iteration, but the LLM fetch() call (which takes seconds) never received the signal. Now the signal is passed to fetch() and checked after LLM errors and before the inter-step sleep, so aborting takes effect mid-step.	2026-02-18 18:51:01 +05:30
Somasundaram Mahesh	011e2be291	feat: agent overlay, stop-goal support, and state persistence across app kill - Add draggable agent overlay pill (status dot + step text + stop button) that shows over other apps while connected. Fix ComposeView rendering in service context by providing a SavedStateRegistryOwner. - Add stop_goal protocol message so the overlay/client can abort a running agent session; server aborts via AbortController. - Persist screen-capture consent to SharedPreferences so it survives process death; restore on ConnectionService connect and Settings resume. - Query AccessibilityManager for real service state instead of relying on in-process MutableStateFlow that resets on restart. - Add overlay permission checklist item and SYSTEM_ALERT_WINDOW manifest entry. - Filter DroidClaw's own overlay nodes from the accessibility tree so the agent never interacts with them.	2026-02-18 18:51:01 +05:30
Sanju Sivalingam	88af77ddc7	fix: configure postgres idle timeout and connection recycling for Railway Railway proxy closes idle DB connections after ~60s, causing CONNECTION_CLOSED errors on stale sockets. Set idle_timeout=20s and max_lifetime=5m so postgres-js recycles connections before they die. Also fix sendCommand to fall back to persistent device ID on reconnect.	2026-02-18 13:56:34 +05:30
Sanju Sivalingam	3bab84f611	fix(auth): use internal secret for web→server calls instead of cookie forwarding Cookie forwarding between dash.droidclaw.ai and tunnel.droidclaw.ai was unreliable. Now the web app passes userId + shared internal secret via headers. Also removes debug logging from device auth and session middleware.	2026-02-18 12:40:49 +05:30
Sanju Sivalingam	8ef15af97a	debug: add logging to device auth for hash mismatch investigation	2026-02-18 12:27:56 +05:30
Sanju Sivalingam	05d1cc657d	fix code	2026-02-18 12:01:23 +05:30
Sanju Sivalingam	d03be7365e	debug: add logging to session middleware for auth investigation	2026-02-18 11:59:59 +05:30
Sanju Sivalingam	68ca812267	revert(server): use direct DB queries for all auth validation Reverts middleware and dashboard WS to direct DB session lookups. Replaces auth.api.verifyApiKey in device WS with direct DB query using SHA-256 hash matching, removing dependency on BETTER_AUTH_SECRET for auth validation.	2026-02-18 11:46:48 +05:30
Sanju Sivalingam	a1ec1ac731	fix(agent): use device screen dimensions for scroll/swipe coordinates Swipe coordinates were hardcoded for 1080x2400 screens, causing scrolls to fail on devices with different resolutions. Now reads screenWidth and screenHeight from DeviceInfo and computes coordinates proportionally.	2026-02-18 10:48:37 +05:30
Sanju Sivalingam	81d78684a5	refactor: use better-auth api for session validation in server middleware and websocket	2026-02-18 10:38:15 +05:30
Sanju Sivalingam	792b42974f	feat(agent): implement server-side multi-step skills Skills (copy_visible_text, find_and_tap, submit_message, read_screen, wait_for_content, compose_email) were CLI-only using direct ADB. The server prompt advertised them but they silently failed when chosen. Now intercepted in the agent loop before actionToCommand() and executed server-side using existing WebSocket primitives (get_screen, tap, swipe, clipboard_set). Each skill replaces 3-8 LLM calls with deterministic server-side logic.	2026-02-18 00:58:59 +05:30
Sanju Sivalingam	db995e4913	fix(agent): prevent stuck loop by adding action history to LLM prompt The UI agent had no memory of previous actions — each step was a fresh single-shot LLM call. After typing and sending a message, the LLM saw an empty text field and retyped the message in a loop. - Add RECENT_ACTIONS (last 5 actions with text/result) to user prompt - Add chat app completion detection rule to dynamic prompt - Add send-success hints for WhatsApp and Messages apps - Add git convention to CLAUDE.md (no co-author lines)	2026-02-18 00:53:13 +05:30
Sanju Sivalingam	9193b02d36	fix(agent): address code review issues - Add empty goal guard in parser (returns done instead of passthrough) - Replace `as any` casts in pipeline.ts with proper ActionDecision types - Add runtime type guards for untrusted LLM output in classifier - Add intent action to dynamic prompt so UI agent can fire intents Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 00:32:14 +05:30
Sanju Sivalingam	3769b21ed1	refactor(agent): delete preprocessor.ts (replaced by parser.ts)	2026-02-18 00:28:50 +05:30
Sanju Sivalingam	d5c3466554	feat(agent): wire intent-first pipeline into all entrypoints Replace preprocessor+runAgentLoop with runPipeline in both device.ts (WebSocket) and goals.ts (REST). The pipeline orchestrates: deterministic parser (stage 1) -> LLM classifier (stage 2) -> lean UI agent (stage 3). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 00:28:13 +05:30
Sanju Sivalingam	18b8509081	feat(agent): add pipeline mode with dynamic prompts to agent loop When pipelineMode is enabled in AgentLoopOptions, the loop uses buildDynamicPrompt() with per-screen context (editable fields, scrollable elements, app hints, stuck state) instead of the static mega-prompt. Legacy mode (default) is unchanged. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 00:24:25 +05:30
Sanju Sivalingam	3f389c5de6	feat(agent): add dynamic prompt builder for Stage 3 UI agent	2026-02-18 00:22:24 +05:30
Sanju Sivalingam	91a828452b	feat(agent): add Stage 2 LLM goal classifier	2026-02-18 00:15:56 +05:30
Sanju Sivalingam	5dd199e0b8	feat(agent): add Stage 1 deterministic goal parser	2026-02-18 00:09:15 +05:30
Sanju Sivalingam	122bf87e72	feat(agent): add app-specific hints registry	2026-02-18 00:07:24 +05:30
Sanju Sivalingam	e300f04e13	feat: installed apps, stop goal, auth fixes, remote commands - Android: fetch installed apps via PackageManager, send to server on connect - Android: add QUERY_ALL_PACKAGES permission for full app visibility - Android: fix duplicate Intent import, increase accessibility retry window - Android: default server URL to ws:// instead of wss:// - Server: store installed apps in device metadata JSONB - Server: inject installed apps context into LLM prompt - Server: preprocessor resolves app names from device's actual installed apps - Server: add POST /goals/stop endpoint with AbortController cancellation - Server: rewrite session middleware to direct DB token lookup - Server: goals route fetches user's saved LLM config from DB - Web: show installed apps in device detail Overview tab with search - Web: add Stop button for running goals - Web: replace API routes with remote commands (submitGoal, stopGoal) - Web: add error display for goal submission failures - Shared: add InstalledApp type and apps message to protocol	2026-02-17 22:50:18 +05:30
Sanju Sivalingam	fae5fd3534	fix: goals route now finds devices by persistent DB ID, not connection UUID	2026-02-17 21:22:43 +05:30
Sanju Sivalingam	bf92ff4742	feat: handle heartbeat messages, update battery in DB + dashboard Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 21:01:06 +05:30
Sanju Sivalingam	c395f9d83e	feat: add DB persistence, real-time WebSocket, goal preprocessor, and Android companion app - Add device/session/step DB persistence in server agent loop - Add goal preprocessor for compound goals (e.g., "open YouTube and search X") - Add step-level logging to agent loop - Fix dashboard WebSocket auth (direct DB token lookup instead of auth.api) - Fix web layout to use locals.session.token instead of cookie - Add dashboard-ws.svelte.ts WebSocket store with auto-reconnect - Rewrite devices page with direct DB queries and real-time updates - Add device detail page with live step display and session history - Add Android companion app resources, themes, and screen capture consent Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 20:12:41 +05:30
Sanju Sivalingam	4c8241c964	feat: add agent loop with LLM integration and stuck detection Server-side agent loop that adapts the CLI kernel to work over WebSocket. Three new modules: stuck detection, LLM provider abstraction (OpenAI/Groq/ OpenRouter), and the main perception-reasoning-action loop. Also wires up the goals route to start agent loops with duplicate-device protection. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 14:27:26 +05:30
Sanju Sivalingam	577c195862	feat: add REST routes for devices, goals, and health Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 14:21:11 +05:30
Sanju Sivalingam	8fe3ad9926	feat: add WebSocket handlers for device and dashboard connections Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 14:17:29 +05:30
Sanju Sivalingam	bc014fd587	feat: scaffold Hono server with auth and health check Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 14:07:19 +05:30

28 Commits