
Conversation

icecrasher321 (Collaborator) commented Dec 22, 2025

Summary

  • Memory + streaming fixes: wrap the streaming response so memory is persisted once accumulation completes (see the sketch after this list)
  • Memory should not be block-scoped; it is now managed via conversation ID
  • Move cost calculation to the same level as token count calculation; this fixes bugs where client-side execution did not track cost for chat
  • Type providers using official SDK types and remove fallbacks that failed silently when hit
  • Support for parallel tool calls
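
As a rough illustration of the first item, here is a minimal sketch of wrapping a streaming response so the accumulated assistant message is persisted only once the stream ends. wrapStreamForPersistence and persistToMemory are illustrative names, not the PR's actual implementation:

    // Buffer chunks while re-emitting them, then flush the full assistant
    // message to memory once the source stream completes.
    function wrapStreamForPersistence(
      source: ReadableStream<string>,
      persistToMemory: (content: string) => Promise<void>
    ): ReadableStream<string> {
      let accumulated = ''
      const reader = source.getReader()
      return new ReadableStream<string>({
        async pull(controller) {
          const { done, value } = await reader.read()
          if (done) {
            // Persist only after the full message has been accumulated.
            await persistToMemory(accumulated)
            controller.close()
            return
          }
          accumulated += value
          controller.enqueue(value)
        },
        cancel(reason) {
          return reader.cancel(reason)
        },
      })
    }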

Type of Change

  • Bug fix
  • New feature

Testing

Tested manually with OpenAI, Anthropic, and Gemini, in both streaming and non-streaming modes.

Checklist

  • Code follows project style guidelines
  • Self-reviewed my changes
  • Tests added/updated and passing
  • No new warnings introduced
  • I confirm that I have read and agree to the terms outlined in the Contributor License Agreement (CLA)


vercel bot commented Dec 22, 2025

The latest updates on your projects.

Project    Deployment    Review              Updated (UTC)
docs       Ready         Preview, Comment    Dec 22, 2025 10:57pm

@greptile-apps
Copy link
Contributor

greptile-apps bot commented Dec 22, 2025

Greptile Summary

This PR refactors memory management, improves provider type safety, and consolidates cost calculation logic. The changes address three key areas:

Memory Management Improvements

  • Changed memory scoping from workflowId to workspaceId for conversation-level persistence
  • Wrapped streaming responses to persist assistant messages after stream completion
  • Used atomic PostgreSQL array concatenation (||) for race-safe appends (sketched after this list)
  • Migration backfills workspaceId from workflows and deletes orphaned records
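
A hedged sketch of what the race-safe append can look like with Drizzle. The table and column names are assumptions rather than the actual schema, and db stands in for an initialized Drizzle Postgres client:

    import { sql, type SQL } from 'drizzle-orm'

    // Appending via jsonb concatenation (||) inside a single UPDATE keeps the
    // read-modify-write on the database side, so concurrent appends interleave
    // instead of overwriting one another.
    async function appendMessage(
      db: { execute: (query: SQL) => Promise<unknown> },
      workspaceId: string,
      key: string,
      message: unknown
    ): Promise<void> {
      await db.execute(sql`
        UPDATE memory
        SET data = data || ${JSON.stringify([message])}::jsonb
        WHERE workspace_id = ${workspaceId} AND key = ${key}
      `)
    }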

Provider Type Safety

  • Added explicit TypeScript types from official SDKs (ChatCompletionCreateParamsStreaming, RawMessageStreamEvent, etc.), as sketched after this list
  • Removed fallback code for legacy response formats (Google's content.function_call)
  • Enhanced streaming utilities to track token usage from provider events
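
A minimal sketch of the typed-request style with the OpenAI SDK. The model name is a placeholder, and stream_options.include_usage is shown because it makes the provider emit token usage on the final stream event:

    import OpenAI from 'openai'
    import type { ChatCompletionCreateParamsStreaming } from 'openai/resources/chat/completions'

    const client = new OpenAI()

    async function streamChat(prompt: string): Promise<void> {
      // Typing the params with the SDK's own type surfaces unsupported or
      // misspelled fields at compile time instead of via silent fallbacks.
      const params: ChatCompletionCreateParamsStreaming = {
        model: 'gpt-4o', // placeholder model name
        stream: true,
        stream_options: { include_usage: true },
        messages: [{ role: 'user', content: prompt }],
      }
      const stream = await client.chat.completions.create(params)
      for await (const chunk of stream) {
        process.stdout.write(chunk.choices[0]?.delta?.content ?? '')
        if (chunk.usage) {
          // With include_usage, the final chunk carries token counts.
          console.log(`\ntokens: ${chunk.usage.total_tokens}`)
        }
      }
    }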

Cost Calculation Consolidation

  • Moved cost calculation from logger to provider layer for consistency
  • Cost is now calculated in real time during streaming using onComplete callbacks (sketched after this list)
  • Both streaming and non-streaming paths now compute costs at the same level
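
To make the "same level" point concrete, a sketch with placeholder pricing; the rates and the Usage shape are assumptions, not the repository's actual model table:

    interface Usage {
      promptTokens: number
      completionTokens: number
    }

    // Placeholder USD rates per million tokens (assumed, not real pricing).
    const PRICE_PER_1M = { input: 2.5, output: 10 }

    function calculateCost(usage: Usage): number {
      return (
        (usage.promptTokens / 1_000_000) * PRICE_PER_1M.input +
        (usage.completionTokens / 1_000_000) * PRICE_PER_1M.output
      )
    }

    // Both paths funnel through the same helper: streaming calls it from the
    // stream's onComplete callback, non-streaming calls it directly.
    function onComplete(usage: Usage): { usage: Usage; cost: number } {
      return { usage, cost: calculateCost(usage) }
    }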

Issues Found

  • Memory seeding in agent-handler.ts:714 has race condition potential when concurrent requests use the same conversationId
  • Sliding-window memory modes use a fetch-modify-write pattern that could lose messages during concurrent appends (illustrated after this list)
  • Cost tracking happens in the streaming callback before the response completes, so interrupted streams may record costs for incomplete responses
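
A minimal sketch of the fetch-modify-write hazard behind the sliding-window finding; the function names are illustrative, not the handler's actual code:

    // Two concurrent calls can both read the same snapshot in step 1; the
    // later write in step 3 then silently drops the other call's message.
    async function appendWithSlidingWindow(
      fetchMessages: () => Promise<string[]>,
      saveMessages: (messages: string[]) => Promise<void>,
      message: string,
      windowSize: number
    ): Promise<void> {
      const existing = await fetchMessages()       // 1. read snapshot
      const next = [...existing, message]          // 2. modify locally
      await saveMessages(next.slice(-windowSize))  // 3. write back, racing step 1
    }

Serializing writers per conversation (for example with a transaction-scoped row lock) would close this window, at some cost in throughput.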

Confidence Score: 4/5

  • Safe to merge with minor race condition risks in high-concurrency scenarios
  • The PR implements solid architectural improvements with proper type safety and atomic database operations. However, memory seeding and sliding window modes have race condition vulnerabilities when multiple requests use the same conversationId concurrently. These are edge cases that may not surface in typical usage but could cause message loss or duplication under load. The provider type safety improvements and cost calculation consolidation are well-executed.
  • Focus on apps/sim/executor/handlers/agent/agent-handler.ts (seeding race condition) and apps/sim/executor/handlers/agent/memory.ts (sliding-window race condition)

Important Files Changed

apps/sim/executor/handlers/agent/memory.ts
    Refactored memory from workflow-scoped to workspace-scoped with atomic PostgreSQL operations; sliding-window modes still carry race condition risk
apps/sim/executor/handlers/agent/agent-handler.ts
    Wrapped streaming responses for memory persistence and moved memory seeding logic; seeding has a potential race condition with concurrent requests
apps/sim/providers/openai/index.ts
    Added proper TypeScript types from the OpenAI SDK and moved cost calculation into the streaming callback for real-time tracking
apps/sim/providers/anthropic/utils.ts
    Enhanced streaming to track token usage from Anthropic SDK events and invoke the callback with accumulated content and usage metrics
packages/db/schema.ts
    Changed the memory table from workflowId to workspaceId scoping, with a unique constraint on workspace+key
packages/db/migrations/0130_bored_master_chief.sql
    Migration backfills workspaceId from workflows and deletes orphaned memory records that couldn't be resolved
apps/sim/providers/google/utils.ts
    Removed the legacy function_call format fallback; now only supports functionCall in parts

Sequence Diagram

sequenceDiagram
    participant Client
    participant AgentHandler
    participant Provider
    participant MemoryService
    participant Database

    Client->>AgentHandler: execute(block, inputs)
    AgentHandler->>AgentHandler: buildMessages()
    
    alt Memory Enabled
        AgentHandler->>MemoryService: fetchMemoryMessages()
        MemoryService->>Database: SELECT data FROM memory
        Database-->>MemoryService: existing messages
        
        alt First Run (no existing)
            AgentHandler->>MemoryService: seedMemory(conversationMessages)
            MemoryService->>Database: INSERT memory record
        else Existing Memory
            alt New User Message
                AgentHandler->>MemoryService: appendToMemory(userMessage)
                MemoryService->>Database: UPDATE with array concat
            end
        end
    end

    AgentHandler->>Provider: executeProviderRequest()
    
    alt Streaming Response
        Provider-->>AgentHandler: StreamingExecution
        
        alt Memory Enabled
            AgentHandler->>MemoryService: wrapStreamForPersistence()
            MemoryService-->>AgentHandler: wrapped stream
        end
        
        AgentHandler-->>Client: stream chunks
        
        Note over MemoryService: Stream accumulates content
        
        MemoryService->>MemoryService: flush() callback
        MemoryService->>Database: UPDATE memory (append assistant message)
        MemoryService->>Provider: calculate cost from usage
        
    else Non-Streaming Response
        Provider-->>AgentHandler: BlockOutput
        Provider->>Provider: calculate cost from tokens
        
        alt Memory Enabled
            AgentHandler->>MemoryService: persistResponseToMemory()
            MemoryService->>Database: UPDATE memory (append assistant)
        end
        
        AgentHandler-->>Client: complete response
    end

icecrasher321 (Collaborator, Author) commented:

@greptile

greptile-apps bot (Contributor) left a comment:

39 files reviewed, 1 comment


icecrasher321 (Collaborator, Author) commented:

@greptile

greptile-apps bot (Contributor) left a comment:

52 files reviewed, 4 comments


icecrasher321 (Collaborator, Author) commented:

@greptile

greptile-apps bot (Contributor) left a comment:

Additional Comments (1)

  1. apps/sim/providers/openai/utils.ts, lines 37-39

    Logic: the code warns if usage data is missing, but still calls onComplete with zeros.

    If OpenAI doesn't return usage tokens (which can happen in some edge cases or on errors), the calculated cost will be zero. This silently underreports costs rather than failing visibly.

    Consider whether this should throw or fall back to token estimation (one possible shape is sketched below). Is zero cost acceptable for streams without usage data, or should you estimate tokens from the accumulated content?

    Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!
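
A hedged sketch of the estimation fallback the comment floats; the roughly-four-characters-per-token heuristic is a crude assumption, not an official tokenizer:

    // Fall back to estimating completion tokens from accumulated text when
    // the provider reported no usage data.
    function estimateTokens(text: string): number {
      return Math.ceil(text.length / 4) // ~4 chars per token, rough heuristic
    }

    function resolveUsage(
      reported: { promptTokens: number; completionTokens: number } | undefined,
      accumulatedContent: string
    ): { promptTokens: number; completionTokens: number } {
      if (reported && (reported.promptTokens > 0 || reported.completionTokens > 0)) {
        return reported
      }
      console.warn('No usage data from provider; estimating from accumulated content')
      return { promptTokens: 0, completionTokens: estimateTokens(accumulatedContent) }
    }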

54 files reviewed, 12 comments


waleedlatif1 (Collaborator) commented:

@greptile

greptile-apps bot (Contributor) left a comment:

115 files reviewed, 6 comments

