llamaIndex - ✅(Solved) Fix [Bug]: `QueryFusionRetriever._aretrieve` blocks the event loop during query generation [2 pull requests, 1 comment, 2 participants]

llamaIndex · 2026-03-26 06:26:29

GitHub stats

run-llama/llama_index#21159 • Fetched 2026-04-08 01:31:19

Author

gautamvarmadatla

Participants

dosubot[bot]

gautamvarmadatla

Timeline (top)

cross-referenced ×4, labeled ×2, referenced ×2, closed ×1

Bug Description

_aretrieve() calls the synchronous _get_queries(), which blocks on self._llm.complete() instead of awaiting an async equivalent. When num_queries > 1 (the default), this blocks the current event loop during query generation and prevents other coroutines on that same loop from making progress until query expansion finishes.
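
The effect described above can be reproduced without any LLM. The following is a minimal sketch, not the library's code: `time.sleep` stands in for the synchronous `complete()` call, `asyncio.sleep` for an awaited async call, and the names `heartbeat`, `blocking_step`, `cooperative_step`, and `count_ticks` are hypothetical helpers introduced only for illustration.

```python
import asyncio
import time


async def heartbeat(ticks: list) -> None:
    """Record a timestamp about every 10 ms whenever the loop lets us run."""
    while True:
        ticks.append(time.perf_counter())
        await asyncio.sleep(0.01)


async def blocking_step() -> None:
    # Stand-in for a synchronous LLM call: never yields to the event loop.
    time.sleep(0.2)


async def cooperative_step() -> None:
    # Stand-in for an awaited async LLM call: yields while waiting.
    await asyncio.sleep(0.2)


async def count_ticks(step) -> int:
    """Run `step` next to a heartbeat task and return how many ticks it got."""
    ticks: list = []
    hb = asyncio.create_task(heartbeat(ticks))
    await step()
    hb.cancel()
    # Collect the cancellation without raising it here.
    await asyncio.gather(hb, return_exceptions=True)
    return len(ticks)


if __name__ == "__main__":
    print("ticks during blocking step:", asyncio.run(count_ticks(blocking_step)))
    print("ticks during cooperative step:", asyncio.run(count_ticks(cooperative_step)))
```

During the blocking step the heartbeat task gets no ticks at all, because the coroutine never hands control back to the loop; during the cooperative step it ticks repeatedly.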

Version

0.14.19

Steps to Reproduce

import asyncio

from llama_index.core.base.base_retriever import BaseRetriever
from llama_index.core.retrievers import QueryFusionRetriever
from llama_index.core.schema import NodeWithScore, QueryBundle, TextNode
from llama_index.llms.openai import OpenAI


class MockRetriever(BaseRetriever):
    # Returns one fixed node immediately, so any delay comes from query generation.
    def _retrieve(self, query_bundle: QueryBundle):
        return [NodeWithScore(node=TextNode(text="Hi I am Gautam!"), score=1.0)]


retriever = QueryFusionRetriever(
    retrievers=[MockRetriever()],
    llm=OpenAI(model="gpt-5"),
    num_queries=4,
)


async def other_work():
    await asyncio.sleep(0.1)
    print("other work ran")


async def main():
    # Schedule a short task, then call aretrieve() on the same event loop.
    task = asyncio.ensure_future(other_work())
    await retriever.aretrieve("What is the capital of France?")
    print(f"other_work done: {task.done()}")


# Outside a notebook, top-level await is not available, so run via asyncio.run().
asyncio.run(main())

Relevant Logs/Tracebacks

other_work done: False

In other words, the task had the entire duration of a real OpenAI HTTP call to complete its
0.1 s sleep, yet it was never scheduled.

extent analysis

Fix Plan

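To fix the issue, query generation on the async path must stop calling the blocking self._llm.complete() on the event loop. Since _get_queries() is a synchronous method that the synchronous retrieval path may also call, converting it in place risks breaking that path; adding an async counterpart is the safer route.

Here are the steps:

1. Add an async counterpart of _get_queries() (for example _aget_queries(), a hypothetical name) and await it from _aretrieve().
2. Inside it, replace the synchronous self._llm.complete() call with the async equivalent self._llm.acomplete(). If an LLM has no async equivalent, run the synchronous call in a worker thread with await asyncio.to_thread().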

Example Code

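# Option 1: await the LLM's async completion method (acomplete() in LlamaIndex).
# _aget_queries is an illustrative name for an async counterpart of _get_queries();
# the synchronous _get_queries() stays in place for synchronous callers.
async def _aget_queries(self, query_bundle):
    prompt_str = ...  # build the query-generation prompt as _get_queries() does
    response = await self._llm.acomplete(prompt_str)
    # ... rest of the method ...

# Option 2: no async equivalent is available, so run the blocking call in a worker thread.
async def _aget_queries(self, query_bundle):
    prompt_str = ...  # build the query-generation prompt as _get_queries() does
    response = await asyncio.to_thread(self._llm.complete, prompt_str)
    # ... rest of the method ...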
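
For a version that runs without LlamaIndex or network access, here is a self-contained sketch of the same pattern. Everything in it is an assumption made for illustration: `FakeLLM` and `SyncOnlyLLM` are stand-ins that return plain strings (real LlamaIndex LLMs return a response object with a `.text` attribute), and `generate_queries` and `demo` are hypothetical helpers, not the retriever's actual methods.

```python
import asyncio
import time


class FakeLLM:
    """Hypothetical client offering both a sync and an async completion call."""

    def complete(self, prompt: str) -> str:
        time.sleep(0.2)  # simulated blocking network call
        return "query a\nquery b"

    async def acomplete(self, prompt: str) -> str:
        await asyncio.sleep(0.2)  # simulated non-blocking network call
        return "query a\nquery b"


class SyncOnlyLLM:
    """Hypothetical client with no async method at all."""

    def complete(self, prompt: str) -> str:
        time.sleep(0.2)
        return "query a\nquery b"


async def generate_queries(llm, prompt: str) -> list:
    """Prefer the async method; otherwise push the sync call to a worker thread."""
    acomplete = getattr(llm, "acomplete", None)
    if acomplete is not None:
        text = await acomplete(prompt)
    else:
        text = await asyncio.to_thread(llm.complete, prompt)
    # One generated query per non-empty line.
    return [line for line in text.split("\n") if line.strip()]


async def demo(llm) -> tuple:
    """Return the generated queries and whether a short concurrent task finished."""

    async def other_work() -> None:
        await asyncio.sleep(0.05)

    task = asyncio.create_task(other_work())
    queries = await generate_queries(llm, "expand: capital of France")
    return queries, task.done()


if __name__ == "__main__":
    print(asyncio.run(demo(FakeLLM())))
    print(asyncio.run(demo(SyncOnlyLLM())))
```

In both cases the concurrent task finishes while the queries are being generated, which is the behaviour the fix is meant to restore.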

Verification

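To verify the fix, run the reproduction code again. Once query generation no longer blocks, other_work gets scheduled while aretrieve() waits on the LLM. A real OpenAI call takes far longer than the 0.1 s sleep, so the task should be finished by the time aretrieve() returns. The output should indicate that other_work is done: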

other_work done: True

Extra Tips

When working with asynchronous code, make sure no blocking call runs directly inside a coroutine, because it starves the event loop. Prefer the library's async API and await it. When only a synchronous API exists, use asyncio.to_thread() or loop.run_in_executor() to run it in a separate thread, allowing other coroutines to make progress.
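
The two thread-offloading helpers differ slightly in how arguments are passed, which the sketch below illustrates. `slow_complete` and its `temperature` keyword are hypothetical stand-ins for a blocking client method; only `asyncio.to_thread`, `loop.run_in_executor`, and `functools.partial` are real standard-library calls.

```python
import asyncio
import functools
import time


def slow_complete(prompt: str, *, temperature: float = 0.0) -> str:
    """Hypothetical blocking call standing in for a synchronous client method."""
    time.sleep(0.1)
    return f"{prompt}|t={temperature}"


async def via_to_thread() -> str:
    # asyncio.to_thread forwards positional and keyword arguments directly.
    return await asyncio.to_thread(slow_complete, "hi", temperature=0.5)


async def via_executor() -> str:
    # run_in_executor forwards positional arguments only, so bind keywords first.
    loop = asyncio.get_running_loop()
    call = functools.partial(slow_complete, "hi", temperature=0.5)
    return await loop.run_in_executor(None, call)


async def main() -> list:
    # Each call runs in a worker thread, so the event loop stays free meanwhile.
    return await asyncio.gather(via_to_thread(), via_executor())


if __name__ == "__main__":
    print(asyncio.run(main()))
```

asyncio.to_thread() requires Python 3.9 or later; on older versions loop.run_in_executor() is the available option.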

Fix Action

Fixed

PR fix notes

PR #21160: fix(core): use async query generation in `QueryFusionRetriever._aretrieve`
