
Improve tool calling response handling#39

Open
MichaelAnders wants to merge 9 commits into Fast-Editor:main from MichaelAnders:feature/improve-tool-calling

Conversation


@MichaelAnders (Contributor) commented on Feb 6, 2026

Summary

  • Additional logging for tool call parsing and execution
  • Hard-coded shell commands for reliable tool execution
  • Deduplication of tool calls within a single response
  • Collect and return results from all called tools
  • Ensure Ollama uses the specified Ollama model
  • Fix double-serialized JSON parameters from some providers
  • Fix stripThinkingBlocks() destroying markdown bullet points in Ollama responses
  • Handle Ollama offline gracefully with retry and 503 response
  • Add optional TOPIC_DETECTION_MODEL env var to redirect topic detection to a lighter model
  • Add optional SUGGESTION_MODE_MODEL env var to control suggestion mode LLM calls (none to disable, or redirect to other model)
  • Remove tool_result_direct short-circuit and improve agentic loop
  • Add external file read with tilde expansion and user approval flow
  • Improve tool calling response handling for Ollama models

Problems

  • Topic detection requests use the same large model as the main request. For users running local models, this adds unnecessary GPU load for a simple classification task.
  • Several issues with the agentic loop and Ollama integration:
  1. stripThinkingBlocks() was destroying valid response content: the regex matched standard markdown bullets as "thinking markers", causing most list-based responses to be truncated after the first heading.
  2. Suggestion mode uses GPU resources: every user message triggers a full agentic loop with tools for suggestion prediction, blocking responses on large models (70b+).
  3. Tool calling response handling had edge cases with Ollama's response format.

Configuration for TOPIC_DETECTION_MODEL

# Use main model (default, unchanged behavior)
TOPIC_DETECTION_MODEL=default

# Redirect to a lighter model
TOPIC_DETECTION_MODEL=llama3.2:1b
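
Configuration for SUGGESTION_MODE_MODEL (analogous to the above; the lighter model name below is only an example, not a default shipped by this PR)

# Disable suggestion-mode LLM calls entirely
SUGGESTION_MODE_MODEL=none

# Redirect suggestion mode to a lighter model
SUGGESTION_MODE_MODEL=llama3.2:1b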

Files Changed

File - Change
src/api/middleware/logging.js - Enhanced request/response logging
src/api/middleware/request-logging.js - Additional request logging
src/api/router.js - Router updates for tool handling
src/config/index.js - Ollama model configuration; added TOPIC_DETECTION_MODEL env var (default: "default"), wired into the config object and reloadConfig() for hot reload support
src/orchestrator/index.js - Tool call deduplication, result collection, direct result return
src/tools/index.js - Deep-parse double-serialized JSON params, execution logging
src/tools/stubs.js - AskUserQuestion handler improvements
src/tools/workspace.js - Workspace tool updates

Test plan

  • Verify tool calls are deduplicated when model returns duplicates
  • Verify results from all called tools are collected and returned
  • Verify Ollama uses the configured model
  • Verify double-serialized JSON parameters (e.g. stringified arrays) are correctly parsed
  • Verified markdown bullet points survive in Ollama responses (the original bug)
  • Tested Ollama offline/online transitions
  • Tested suggestion mode disable (SUGGESTION_MODE_MODEL=none)

- Add fallback parsing for Ollama models that return tool calls as JSON
  text in message content instead of using the structured tool_calls field
  (see the sketch after this list)
- Return tool results directly to CLI instead of making a follow-up LLM
  call, reducing latency and preventing hallucinated rewrites of output
- Add dedicated Glob tool returning plain text (one path per line) instead
  of JSON, with workspace_list accepting both 'pattern' and 'patterns'
- Clarify why Glob is not aliased to workspace_list (format mismatch)
- Additional logging for tool call parsing and execution
- Hard-coded shell commands for reliable tool execution
- Deduplication of tool calls within a single response
- Collect and return results from all called tools
- Ollama uses specified Ollama model
- Fix double-serialized JSON parameters from some providers
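
A minimal sketch of the JSON-in-content fallback parse referenced in the first bullet above; the function name and accepted payload shapes are illustrative assumptions, not the PR's actual code:

// Sketch only: some Ollama models emit tool calls as JSON text in
// message.content instead of populating the structured tool_calls field.
function extractToolCallsFromContent(content) {
  if (typeof content !== "string") return [];
  const match = content.match(/\{[\s\S]*\}/); // first-to-last brace, JSON-looking block
  if (!match) return [];
  try {
    const parsed = JSON.parse(match[0]);
    if (Array.isArray(parsed.tool_calls)) return parsed.tool_calls;
    if (parsed.name) return [{ name: parsed.name, arguments: parsed.arguments ?? {} }];
  } catch {
    // Not valid JSON: treat the message as plain prose, no tool calls.
  }
  return [];
}
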
Tool results now loop back to the model for natural language synthesis
instead of being returned raw to the CLI. This fixes the bug where
conversational messages (e.g. "hi") triggered tool calls and dumped
raw output. (A minimal sketch of this loop follows the improvements list below.)

Additional improvements:
- Context-aware tiered compression that scales with model context window
- Empty response detection with retry-then-fallback
- _noToolInjection flag to prevent provider-level tool re-injection
- Auto-approve external file reads in tool executor
- Conversation context search in workspace_search
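
A minimal sketch of the loop-back behavior this commit describes; executeTool and invokeModel stand in for whatever the orchestrator actually exposes:

// Execute the requested tools, append their results to the conversation,
// and call the model again so the final reply is natural language rather
// than raw tool output dumped to the CLI.
async function runToolLoop(messages, toolCalls, executeTool, invokeModel) {
  for (const call of toolCalls) {
    const result = await executeTool(call);
    messages.push({
      role: "tool",
      name: call.name,
      content: typeof result === "string" ? result : JSON.stringify(result),
    });
  }
  return invokeModel(messages); // second pass: the model synthesizes the answer
}
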
Three concurrent runAgentLoop calls per user message waste GPU time with
large models (~30s each). This adds SUGGESTION_MODE_MODEL config to skip
("none") or redirect suggestion mode to a lighter model. Also adds ISO
timestamps and mode tags to debug logs for easier debugging.
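
Roughly how the new setting can be consumed (a sketch; the helper name is not from the diff):

// "none" disables suggestion-mode LLM calls; any other value redirects them
// to that model; unset falls back to the main model.
function resolveSuggestionModel(mainModel) {
  const configured = process.env.SUGGESTION_MODE_MODEL;
  if (configured === "none") return null; // caller skips the suggestion agent loop
  return configured || mainModel;
}
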
- Add ECONNREFUSED to retryable errors and check undici TypeError .cause.code
  so connection-refused errors get retried with backoff (sketched below)
- Wrap invokeModel in try/catch returning structured 503 with provider_unreachable
  error instead of raw TypeError bubbling to Express error middleware
- Fix suggestion mode early return response shape (json -> body) to match
  router expectations
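
A sketch of the error handling described in this commit; retry/backoff plumbing is omitted and the helper names and response shape are illustrative:

// undici's fetch wraps connection failures in a TypeError whose .cause carries
// the system error code (e.g. ECONNREFUSED), so check both places.
const RETRYABLE_CODES = new Set(["ECONNRESET", "ETIMEDOUT", "ECONNREFUSED"]);

function isRetryable(err) {
  const code = err?.code ?? err?.cause?.code;
  return RETRYABLE_CODES.has(code);
}

async function safeInvokeModel(invokeModel, payload) {
  try {
    return await invokeModel(payload); // backoff retries would wrap this call
  } catch (err) {
    if (isRetryable(err)) {
      // Structured 503 instead of a raw TypeError reaching Express error middleware.
      return { status: 503, body: { error: { type: "provider_unreachable", message: "LLM provider is not reachable" } } };
    }
    throw err;
  }
}
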
… responses

The heuristic-based stripThinkingBlocks() matched standard markdown bullets
(- item, * item) as "thinking block markers" and dropped all subsequent content.
Replace with stripThinkTags() that only strips <think>...</think> tags used by
models like DeepSeek and Qwen for chain-of-thought reasoning.
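
A sketch of what stripThinkTags() does per this description (the exact regex in the PR may differ):

// Remove only <think>...</think> blocks emitted by reasoning models such as
// DeepSeek and Qwen, leaving ordinary markdown bullets ("- item", "* item") intact.
function stripThinkTags(text) {
  if (typeof text !== "string") return text;
  return text.replace(/<think>[\s\S]*?<\/think>/g, "").trim();
}
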
The dotenv override (override: NODE_ENV !== "test") requires
NODE_ENV=test to prevent .env values from stomping on test-set
environment variables.
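
For reference, this is the standard dotenv override option (sketch):

// .env values overwrite pre-set variables in normal runs, but never when
// NODE_ENV=test, so test-set environment variables keep precedence.
require("dotenv").config({ override: process.env.NODE_ENV !== "test" });
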
MichaelAnders force-pushed the feature/improve-tool-calling branch from 6bcc803 to a2d78ee on February 11, 2026 at 11:46.
@vishalveerareddy123 (Collaborator) commented:

Can you please add a .env.example file as well?

Add TOPIC_DETECTION_MODEL, MODEL_DEFAULT, Ollama tuning
(OLLAMA_TIMEOUT_MS, OLLAMA_KEEP_ALIVE), and LLM Audit logging
section to .env.example for discoverability.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@MichaelAnders (Contributor, Author) replied:

Can you please add a .env.example file as well?

Done!

// Guard: drop hallucinated tool calls when no tools were sent to the model.
// Some models (e.g. Llama 3.1) hallucinate tool_call blocks from conversation
// history even when the request contained zero tool definitions.
const toolsWereSent = Array.isArray(cleanPayload.tools) && cleanPayload.tools.length > 0;


@MichaelAnders
Looks like this one is repeated. Can you please check?

@MichaelAnders (Contributor, Author) replied:


Good catch! You were right — the Ollama else if branch was incorrectly nested inside the OpenAI deduplication if (toolCalls.length > 0) block, which caused the OpenAI parsing + dedup code to appear twice (once at the outer level and again as the else fallback).

Fixed in 62a0710 by restructuring so all three provider branches (Anthropic, Ollama, OpenAI/Databricks) are siblings at the same nesting level, each with their own deduplication logic. The duplicate OpenAI block is removed.
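
The resulting structure, roughly (parser and dedupe helper names are illustrative, not the actual identifiers in the diff):

// All three provider branches are now siblings, each with its own dedup pass.
if (isAnthropicResponse(response)) {
  toolCalls = dedupe(parseAnthropicToolCalls(response));
} else if (isOllamaResponse(response)) {
  toolCalls = dedupe(parseOllamaToolCalls(response)); // includes the JSON-in-content fallback
} else {
  toolCalls = dedupe(parseOpenAIToolCalls(response)); // OpenAI / Databricks structured tool_calls
}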

@veerareddyvishal144 commented on Feb 16, 2026:


When I run this PR, I get the following error:

npm start

> lynkr@7.2.5 prestart
> node -e "if(process.env.HEADROOM_ENABLED==='true'&&process.env.HEADROOM_DOCKER_ENABLED!=='false'){process.exit(0)}else{process.exit(1)}" && docker compose --profile headroom up -d headroom 2>/dev/null || echo 'Headroom skipped (disabled or Docker not running)'

Headroom skipped (disabled or Docker not running)

> lynkr@7.2.5 start
> node index.js 2>&1 | npx pino-pretty --sync

[CONFIG] Tool execution mode: client
/Users/vishalveera.reddy/claude-code/src/orchestrator/index.js:2097
    const toolsWereSent = Array.isArray(cleanPayload.tools) && cleanPayload.tools.length > 0;
          ^

SyntaxError: Identifier 'toolsWereSent' has already been declared
    at internalCompileFunction (node:internal/vm:73:18)
    at wrapSafe (node:internal/modules/cjs/loader:1153:20)
    at Module._compile (node:internal/modules/cjs/loader:1205:27)
    at Module._extensions..js (node:internal/modules/cjs/loader:1295:10)
    at Module.load (node:internal/modules/cjs/loader:1091:32)
    at Module._load (node:internal/modules/cjs/loader:938:12)
    at Module.require (node:internal/modules/cjs/loader:1115:19)
    at require (node:internal/modules/helpers:130:18)
    at Object.<anonymous> (/Users/vishalveera.reddy/claude-code/src/api/router.js:2:28)
    at Module._compile (node:internal/modules/cjs/loader:1241:14)

Node.js v20.9.0


const toolsWereSent is declared twice.

The Ollama else-if branch was incorrectly nested inside the OpenAI
deduplication if-block, causing the OpenAI parsing + dedup code to
appear twice. Restructured so Ollama is a peer branch alongside
Anthropic and OpenAI/Databricks, each with its own dedup logic.

Addresses PR Fast-Editor#39 review feedback from veerareddyvishal144.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>