litellm - ✅(Solved) Fix [Bug]: Spend logs show 0 duration for failed requests — start_time and end_time are identical [3 pull requests, 1 participant]

madhu19991 · 2026-04-01T00:36:36Z

# PR #24906: fix: use actual start_time for failed request spend logs

- Repository: BerriAI/litellm
- Author: madhu19991
- State: closed | merged: True
- Link: https://github.com/BerriAI/litellm/pull/24906

## Description (problem / solution / changelog)

## Relevant issues

Fixes #24888

## Summary

`async_post_call_failure_hook` in `proxy_track_cost_callback.py` set both `start_time` and `end_time` to `datetime.now()` when logging failed requests to the spend database. This caused all failed requests (timeouts, auth errors, etc.) to show `Duration: 0.000s` in the UI, making it impossible to determine how long the request actually ran before failing.

The fix extracts the actual request start time from `litellm_logging_obj.start_time` (which is set when the request first arrives) and uses `datetime.now()` only for `end_time`.

**Before:** Failed requests show `Start Time == End Time`, duration 0.

**After:** Failed requests show the real duration from request arrival to failure.

## Pre-Submission checklist

- [x] I have Added testing: 1 new test verifying start_time comes from logging obj, not datetime.now()
- [x] My PR passes all unit tests: 13/13 tests pass in `test_proxy_track_cost_callback.py`
- [x] My PR's scope is as isolated as possible, it only solves 1 specific problem
- [ ] I have requested a Greptile review

## Type

🐛 Bug Fix

## Changes

- `litellm/proxy/hooks/proxy_track_cost_callback.py`: Extract `start_time` from `litellm_logging_obj` instead of calling `datetime.now()` for both timestamps
- `tests/test_litellm/proxy/hooks/test_proxy_track_cost_callback.py`: New test `test_async_post_call_failure_hook_uses_actual_start_time` verifying duration > 0 for simulated 60s request

## Changed files

- `litellm/proxy/hooks/proxy_track_cost_callback.py` (modified, +9/-1)
- `tests/test_litellm/proxy/hooks/test_proxy_track_cost_callback.py` (modified, +64/-0)

---

# PR #24936: fix(proxy): use actual request start_time for failed spend logs

- Repository: BerriAI/litellm
- Author: Vedanshu7
- State: closed | merged: False
- Link: https://github.com/BerriAI/litellm/pull/24936

## Description (problem / solution / changelog)

## Relevant issues

Fixes #24888: failed requests logging `Duration: 0.000s` because both `start_time` and `end_time` were set to `datetime.now()` at failure time.

## Pre-Submission checklist

**Please complete all items before asking a LiteLLM maintainer to review your PR**

- [x] I have Added testing in the [`tests/test_litellm/`](https://github.com/BerriAI/litellm/tree/main/tests/test_litellm) directory, **Adding at least 1 test is a hard requirement** - [see details](https://docs.litellm.ai/docs/extras/contributing_code)
- [x] My PR passes all unit tests on [`make test-unit`](https://docs.litellm.ai/docs/extras/contributing_code)
- [x] My PR's scope is as isolated as possible, it only solves 1 specific problem
- [ ] I have requested a Greptile review by commenting `@greptileai` and received a **Confidence Score of at least 4/5** before requesting a maintainer review

## Delays in PR merge?

If you're seeing a delay in your PR being merged, ping the LiteLLM Team on [Slack (#pr-review)](https://join.slack.com/t/litellmossslack/shared_invite/zt-3o7nkuyfr-p_kbNJj8taRfXGgQI1~YyA).

## CI (LiteLLM team)

> **CI status guideline:**
>
> - 50-55 passing tests: main is stable with minor issues.
> - 45-49 passing tests: acceptable but needs attention
> - <= 40 passing tests: unstable; be careful with your merges and assess the risk.

- [ ] **Branch creation CI run** Link:
- [ ] **CI run for the last commit** Link:
- [ ] **Merge / cherry-pick CI run** Links:

## Type

🐛 Bug Fix
✅ Test

## Changes

**Problem:** `async_post_call_failure_hook` in `litellm/proxy/hooks/proxy_track_cost_callback.py` (lines 123–124) called `datetime.now()` for both `start_time` and `end_time` when writing the spend log for a failed request. Both timestamps end up identical, so every failed request shows `Duration: 0.000s` regardless of how long the request actually ran.

**Fix:** `_litellm_logging_obj` is already fetched earlier in the same method (for trace ID propagation). Use `_litellm_logging_obj.start_time` as `actual_start_time`, with a `datetime.now()` fallback when the logging object is absent, and pass `actual_start_time` as `start_time` to `update_database`.

**Tests added** in `tests/test_litellm/proxy/hooks/test_proxy_track_cost_callback.py`:

- `test_async_post_call_failure_hook_uses_actual_start_time`: asserts `start_time` equals `litellm_logging_obj.start_time` and `end_time > start_time`
- `test_async_post_call_failure_hook_falls_back_start_time_when_no_logging_obj`: asserts the fallback path (`datetime.now()`) works cleanly when no logging object is present

## Changed files

- `litellm/proxy/hooks/proxy_track_cost_callback.py` (modified, +6/-1)
- `tests/test_litellm/

GitHub stats

BerriAI/litellm#24888 • Fetched 2026-04-08 01:59:04

Author: madhu19991

Participants: madhu19991

Timeline (top): referenced ×8, cross-referenced ×3, closed ×1

Root Cause

In litellm/proxy/hooks/proxy_track_cost_callback.py, the async_post_call_failure_hook method sets both timestamps to datetime.now():

# Lines 117-118
start_time=datetime.now(),
end_time=datetime.now(),

These should instead use the actual request start time from request_data["litellm_logging_obj"].start_time and set end_time to datetime.now().
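A small deterministic sketch of the root cause may help. It does not use LiteLLM; `FakeClock`, `buggy_timestamps`, and `fixed_timestamps` are hypothetical names introduced here, and the clock stands in for `datetime.now()` so the numbers are reproducible. In the real hook the two `datetime.now()` calls differ by at most a few microseconds, which still rounds to 0.000 s.

```python
from datetime import datetime, timedelta
from typing import Tuple


class FakeClock:
    """Hypothetical stand-in for datetime.now() so the example is deterministic."""

    def __init__(self, start: datetime) -> None:
        self.current = start

    def now(self) -> datetime:
        return self.current

    def advance(self, seconds: float) -> None:
        self.current += timedelta(seconds=seconds)


def buggy_timestamps(clock: FakeClock) -> Tuple[datetime, datetime]:
    # Both values are read at failure time, so they coincide.
    return clock.now(), clock.now()


def fixed_timestamps(arrival: datetime, clock: FakeClock) -> Tuple[datetime, datetime]:
    # start_time was captured on arrival; only end_time is read at failure time.
    return arrival, clock.now()


clock = FakeClock(datetime(2026, 3, 31, 16, 23, 24))
arrival = clock.now()  # the request arrives
clock.advance(300)     # it runs for about 300 s, then times out

buggy_start, buggy_end = buggy_timestamps(clock)
fixed_start, fixed_end = fixed_timestamps(arrival, clock)

buggy_duration = (buggy_end - buggy_start).total_seconds()
fixed_duration = (fixed_end - fixed_start).total_seconds()
print(f"buggy: {buggy_duration:.3f} s, fixed: {fixed_duration:.3f} s")
# prints "buggy: 0.000 s, fixed: 300.000 s"
```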

Fix Action

Fix

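# The logging object is attached to the request data; it may be absent.
litellm_logging_obj = request_data.get("litellm_logging_obj")
# Fall back to the current time only when there is no logging object.
actual_start_time = litellm_logging_obj.start_time if litellm_logging_obj else datetime.now()

await proxy_logging_obj.db_spend_update_writer.update_database(
    ...
    start_time=actual_start_time,  # when the request arrived
    end_time=datetime.now(),  # when the failure is recorded
    ...
)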
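The snippet above elides the other `update_database` arguments, so it cannot run on its own. The following is a self-contained sketch of the same pattern, assuming a stub writer that only records the keyword arguments it receives. `StubSpendWriter` and `log_failed_request` are hypothetical names, not LiteLLM APIs, and the 60-second gap loosely mirrors the simulated request described in PR #24906.

```python
import asyncio
from datetime import datetime, timedelta
from types import SimpleNamespace
from typing import Any, Dict, List


class StubSpendWriter:
    """Hypothetical stand-in for the spend writer; it only records its kwargs."""

    def __init__(self) -> None:
        self.calls: List[Dict[str, Any]] = []

    async def update_database(self, **kwargs: Any) -> None:
        self.calls.append(kwargs)


async def log_failed_request(request_data: Dict[str, Any], writer: StubSpendWriter) -> None:
    # Same pattern as the fix: prefer the start time recorded on arrival.
    logging_obj = request_data.get("litellm_logging_obj")
    actual_start_time = logging_obj.start_time if logging_obj else datetime.now()
    await writer.update_database(start_time=actual_start_time, end_time=datetime.now())


# Simulate a request that arrived 60 seconds before it failed.
arrived_at = datetime.now() - timedelta(seconds=60)
request_data = {"litellm_logging_obj": SimpleNamespace(start_time=arrived_at)}

writer = StubSpendWriter()
asyncio.run(log_failed_request(request_data, writer))

logged = writer.calls[0]
duration = (logged["end_time"] - logged["start_time"]).total_seconds()
print(f"logged duration: {duration:.1f} s")
```

With the stub in place, the logged `start_time` is the arrival time rather than the failure time, so the duration is at least 60 seconds.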

Bug

When a request fails (e.g., timeout), the spend log records start_time and end_time as the same timestamp, resulting in Duration: 0.000s. Successful requests log the correct duration.

Example from a timeout failure:

Start Time: 2026-03-31T16:28:24.191Z
End Time:   2026-03-31T16:28:24.191Z
Duration:   0.000 s

The actual request ran for ~300 seconds before Akamai timed it out.

Expected Behavior

Failed requests should show the actual duration from when the request was sent to when the failure was recorded, the same way successful requests do.
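To make the expected value concrete, here is a rough sketch of the duration arithmetic using the timestamp from the report. It assumes the UI simply subtracts `start_time` from `end_time` and prints three decimals, which is an assumption. `parse_utc` and `format_duration` are hypothetical helpers, and the 300-second figure is the approximate runtime quoted in the report.

```python
from datetime import datetime, timedelta


def parse_utc(timestamp: str) -> datetime:
    # datetime.fromisoformat() only accepts a trailing "Z" from Python 3.11 on,
    # so normalise it to an explicit UTC offset first.
    return datetime.fromisoformat(timestamp.replace("Z", "+00:00"))


def format_duration(start_time: datetime, end_time: datetime) -> str:
    # Mirror the "Duration: 0.000 s" style shown in the report.
    return f"{(end_time - start_time).total_seconds():.3f} s"


failure_time = parse_utc("2026-03-31T16:28:24.191Z")
# Assumption: the request arrived roughly 300 s before the failure was logged.
assumed_arrival = failure_time - timedelta(seconds=300)

logged = format_duration(failure_time, failure_time)      # what the spend log showed
expected = format_duration(assumed_arrival, failure_time)  # what it should show
print("logged:  ", logged)    # prints "logged:   0.000 s"
print("expected:", expected)  # prints "expected: 300.000 s"
```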

Environment

LiteLLM version: 1.81.14
Observed on gsk-prod cluster with Azure gpt-4.1 timeout failures

extent analysis

TL;DR

Update the async_post_call_failure_hook method to use the actual request start time from request_data["litellm_logging_obj"].start_time and set end_time to datetime.now() to correctly log the duration of failed requests.

Guidance

Verify that request_data["litellm_logging_obj"] is not None before accessing its start_time attribute to avoid potential errors.
Update the async_post_call_failure_hook method as shown in the provided fix to correctly set start_time and end_time for failed requests.
Test the updated method with both successful and failed requests to ensure the duration is logged correctly in all cases.
Consider adding error handling for cases where request_data["litellm_logging_obj"] is None or its start_time attribute is not set.
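The guidance on missing values can be sketched as a small helper. This is one possible shape, not the code merged into LiteLLM: `resolve_start_time` is a hypothetical function, and the optional `now` parameter exists only to make the fallback easy to test.

```python
from datetime import datetime
from types import SimpleNamespace
from typing import Any, Mapping, Optional


def resolve_start_time(request_data: Mapping[str, Any], now: Optional[datetime] = None) -> datetime:
    """Return the recorded request start time, or `now` when none is usable."""
    fallback = now if now is not None else datetime.now()
    logging_obj = request_data.get("litellm_logging_obj")
    # getattr covers both a missing logging object (None) and a missing attribute.
    start_time = getattr(logging_obj, "start_time", None)
    return start_time if isinstance(start_time, datetime) else fallback


failure_time = datetime(2026, 3, 31, 16, 28, 24)
arrival_time = datetime(2026, 3, 31, 16, 23, 24)

with_start = {"litellm_logging_obj": SimpleNamespace(start_time=arrival_time)}
unset_start = {"litellm_logging_obj": SimpleNamespace(start_time=None)}
no_logging_obj: dict = {}

print(resolve_start_time(with_start, now=failure_time))      # the recorded arrival time
print(resolve_start_time(unset_start, now=failure_time))     # falls back to `now`
print(resolve_start_time(no_logging_obj, now=failure_time))  # falls back to `now`
```

Unlike the one-line conditional in the proposed fix, this variant also falls back when the logging object exists but its `start_time` is unset, which is the second case the guidance mentions.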

Notes

This fix assumes that request_data["litellm_logging_obj"].start_time is set correctly for all requests. If this is not the case, additional debugging may be necessary to determine why the start time is not being set.
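If the start time turns out to be unreliable, a diagnostic along these lines could help narrow down why. It is only a sketch: `timestamp_problem` is a hypothetical helper, and the specific checks are assumptions about what might go wrong, not behaviour observed in LiteLLM. The naive versus timezone-aware check reflects standard Python behaviour, where subtracting one kind of `datetime` from the other raises `TypeError`.

```python
from datetime import datetime, timezone
from typing import Optional


def timestamp_problem(start_time: object, end_time: datetime) -> Optional[str]:
    """Return a short reason when the pair would give a misleading duration, else None."""
    if not isinstance(start_time, datetime):
        return "start_time is missing or not a datetime"
    if (start_time.tzinfo is None) != (end_time.tzinfo is None):
        # Subtracting naive and aware datetimes raises TypeError.
        return "start_time and end_time mix naive and timezone-aware values"
    if start_time > end_time:
        return "start_time is later than end_time"
    if start_time == end_time:
        return "start_time equals end_time (zero duration)"
    return None


end = datetime(2026, 3, 31, 16, 28, 24)
arrival = datetime(2026, 3, 31, 16, 23, 24)
aware_arrival = arrival.replace(tzinfo=timezone.utc)

print(timestamp_problem(None, end))
print(timestamp_problem(end, end))
print(timestamp_problem(aware_arrival, end))
print(timestamp_problem(arrival, end))
```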

Recommendation

Apply the fix by updating the async_post_call_failure_hook method so that start_time comes from the logging object and end_time comes from datetime.now(); failed requests will then log their real duration. PR #24906, which implements this change, has been merged.

PR fix notes

PR #24906: fix: use actual start_time for failed request spend logs

PR #24936: fix(proxy): use actual request start_time for failed spend logs

PR #25108: Litellm ishaan april1 (#25103)