vllm - 💡(How to fix) Fix [Bug]: compute_slot_mapping crashes on non-Triton platforms (ppc64le/CPU) after PR #32951 [1 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
vllm-project/vllm#37983Fetched 2026-04-08 01:22:10
View on GitHub
Comments
1
Participants
2
Timeline
5
Reactions
0
Timeline (top)
closed ×1commented ×1labeled ×1mentioned ×1

Error Message

(APIServer pid=1328759) INFO: Started server process [1328759] (APIServer pid=1328759) INFO: Waiting for application startup. (APIServer pid=1328759) INFO: Application startup complete. (APIServer pid=1328759) INFO: 127.0.0.1:40838 - "GET /metrics HTTP/1.1" 200 OK (APIServer pid=1328759) INFO: 127.0.0.1:40838 - "POST /v1/completions HTTP/1.1" 200 OK (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] EngineCore encountered a fatal error. (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] Traceback (most recent call last): (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1101, in run_engine_core (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] engine_core.run_busy_loop() (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1142, in run_busy_loop (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] self._process_engine_step() (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1181, in _process_engine_step (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] outputs, model_executed = self.step_fn() (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 392, in step (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] future = self.model_executor.execute_model(scheduler_output, non_block=True) (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 114, in execute_model (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] output.result() (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] return self.__get_result() (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] raise self._exception (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 84, in collective_rpc (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] result = run_method(self.driver_worker, method, args, kwargs) (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/serial_utils.py", line 510, in run_method (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] return func(*args, **kwargs) (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/worker_base.py", line 332, in execute_model (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] return self.worker.execute_model(scheduler_output) (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] return func(*args, **kwargs) (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_worker.py", line 803, in execute_model (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] output = self.model_runner.execute_model( (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] return func(*args, **kwargs) (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 3840, in execute_model (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] logits_indices, spec_decode_metadata = self._prepare_inputs( (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 1987, in _prepare_inputs (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] self.input_batch.block_table.compute_slot_mapping( (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 293, in compute_slot_mapping (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] block_table.compute_slot_mapping(num_reqs, query_start_loc, positions) (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 144, in compute_slot_mapping (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] _compute_slot_mapping_kernel[(num_reqs + 1,)]( (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] TypeError: 'function' object is not subscriptable (EngineCore pid=1328911) Process EngineCore: (APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] AsyncLLM output_handler failed. (EngineCore pid=1328911) Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 663, in output_handler (APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] outputs = await engine_core.get_output_async() (APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core_client.py", line 985, in get_output_async (APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] raise self._format_exception(outputs) from None (APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (EngineCore pid=1328911) File "/usr/lib64/python3.12/multiprocessing/process.py", line 314, in _bootstrap (EngineCore pid=1328911) self.run() (EngineCore pid=1328911) File "/usr/lib64/python3.12/multiprocessing/process.py", line 108, in run (EngineCore pid=1328911) self._target(*self._args, **self._kwargs) (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1112, in run_engine_core (EngineCore pid=1328911) raise e (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1101, in run_engine_core (EngineCore pid=1328911) engine_core.run_busy_loop() (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1142, in run_busy_loop (EngineCore pid=1328911) self._process_engine_step() (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1181, in _process_engine_step (EngineCore pid=1328911) outputs, model_executed = self.step_fn() (EngineCore pid=1328911) ^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 392, in step (EngineCore pid=1328911) future = self.model_executor.execute_model(scheduler_output, non_block=True) (EngineCore pid=1328911) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 114, in execute_model (EngineCore pid=1328911) output.result() (EngineCore pid=1328911) File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result (EngineCore pid=1328911) return self.__get_result() (EngineCore pid=1328911) ^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result (EngineCore pid=1328911) raise self._exception (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 84, in collective_rpc (EngineCore pid=1328911) result = run_method(self.driver_worker, method, args, kwargs) (EngineCore pid=1328911) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/serial_utils.py", line 510, in run_method (EngineCore pid=1328911) return func(*args, **kwargs) (EngineCore pid=1328911) ^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/worker_base.py", line 332, in execute_model (EngineCore pid=1328911) return self.worker.execute_model(scheduler_output) (EngineCore pid=1328911) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context (EngineCore pid=1328911) return func(*args, **kwargs) (EngineCore pid=1328911) ^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_worker.py", line 803, in execute_model (EngineCore pid=1328911) output = self.model_runner.execute_model( (EngineCore pid=1328911) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context (EngineCore pid=1328911) return func(*args, **kwargs) (EngineCore pid=1328911) ^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 3840, in execute_model (EngineCore pid=1328911) logits_indices, spec_decode_metadata = self._prepare_inputs( (EngineCore pid=1328911) ^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 1987, in _prepare_inputs (EngineCore pid=1328911) self.input_batch.block_table.compute_slot_mapping( (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 293, in compute_slot_mapping (EngineCore pid=1328911) block_table.compute_slot_mapping(num_reqs, query_start_loc, positions) (EngineCore pid=1328911) File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 144, in compute_slot_mapping (EngineCore pid=1328911) _compute_slot_mapping_kernel[(num_reqs + 1,)]( (EngineCore pid=1328911) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^ (EngineCore pid=1328911) TypeError: 'function' object is not subscriptable (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 582, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] out = q.get_nowait() or await q.get() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/output_processor.py", line 85, in get (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise output (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 663, in output_handler (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] outputs = await engine_core.get_output_async() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core_client.py", line 985, in get_output_async (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise self._format_exception(outputs) from None (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40852 - "POST /v1/completions HTTP/1.1" 200 OK (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] q = await self.add_request( (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise EngineDeadError() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40856 - "POST /v1/completions HTTP/1.1" 200 OK (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] q = await self.add_request( (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise EngineDeadError() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40866 - "POST /v1/completions HTTP/1.1" 200 OK (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] q = await self.add_request( (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise EngineDeadError() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40876 - "POST /v1/completions HTTP/1.1" 200 OK (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] q = await self.add_request( (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise EngineDeadError() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40892 - "POST /v1/completions HTTP/1.1" 200 OK (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] q = await self.add_request( (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise EngineDeadError() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40896 - "POST /v1/completions HTTP/1.1" 200 OK (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] q = await self.add_request( (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise EngineDeadError() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40910 - "POST /v1/completions HTTP/1.1" 200 OK (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] q = await self.add_request( (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise EngineDeadError() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40918 - "POST /v1/completions HTTP/1.1" 200 OK (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] q = await self.add_request( (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise EngineDeadError() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40934 - "POST /v1/completions HTTP/1.1" 200 OK (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator. (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last): (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for prompt_idx, res in result_generator: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] async for item in iterators[0]: (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] q = await self.add_request( (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] ^^^^^^^^^^^^^^^^^^^^^^^ (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] raise EngineDeadError() (APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause. (APIServer pid=1328759) INFO: 127.0.0.1:40838 - "GET /metrics HTTP/1.1" 200 OK (APIServer pid=1328759) INFO: Shutting down (APIServer pid=1328759) INFO: Waiting for application shutdown. (APIServer pid=1328759) INFO: Application shutdown complete. (APIServer pid=1328759) INFO: Finished server process [1328759]

Root Cause

(APIServer pid=1328759) INFO:     Started server process [1328759]
(APIServer pid=1328759) INFO:     Waiting for application startup.
(APIServer pid=1328759) INFO:     Application startup complete.
(APIServer pid=1328759) INFO:     127.0.0.1:40838 - "GET /metrics HTTP/1.1" 200 OK
(APIServer pid=1328759) INFO:     127.0.0.1:40838 - "POST /v1/completions HTTP/1.1" 200 OK
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] EngineCore encountered a fatal error.
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] Traceback (most recent call last):
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1101, in run_engine_core
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     engine_core.run_busy_loop()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1142, in run_busy_loop
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     self._process_engine_step()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1181, in _process_engine_step
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     outputs, model_executed = self.step_fn()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]                               ^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 392, in step
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     future = self.model_executor.execute_model(scheduler_output, non_block=True)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 114, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     output.result()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return self.__get_result()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     raise self._exception
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 84, in collective_rpc
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     result = run_method(self.driver_worker, method, args, kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/serial_utils.py", line 510, in run_method
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return func(*args, **kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/worker_base.py", line 332, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return self.worker.execute_model(scheduler_output)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return func(*args, **kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_worker.py", line 803, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     output = self.model_runner.execute_model(
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return func(*args, **kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 3840, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     logits_indices, spec_decode_metadata = self._prepare_inputs(
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]                                            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 1987, in _prepare_inputs
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     self.input_batch.block_table.compute_slot_mapping(
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 293, in compute_slot_mapping
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     block_table.compute_slot_mapping(num_reqs, query_start_loc, positions)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 144, in compute_slot_mapping
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     _compute_slot_mapping_kernel[(num_reqs + 1,)](
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] TypeError: 'function' object is not subscriptable
(EngineCore pid=1328911) Process EngineCore:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] AsyncLLM output_handler failed.
(EngineCore pid=1328911) Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 663, in output_handler
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]     outputs = await engine_core.get_output_async()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core_client.py", line 985, in get_output_async
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]     raise self._format_exception(outputs) from None
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/multiprocessing/process.py", line 314, in _bootstrap
(EngineCore pid=1328911)     self.run()
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/multiprocessing/process.py", line 108, in run
(EngineCore pid=1328911)     self._target(*self._args, **self._kwargs)
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1112, in run_engine_core
(EngineCore pid=1328911)     raise e
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1101, in run_engine_core
(EngineCore pid=1328911)     engine_core.run_busy_loop()
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1142, in run_busy_loop
(EngineCore pid=1328911)     self._process_engine_step()
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1181, in _process_engine_step
(EngineCore pid=1328911)     outputs, model_executed = self.step_fn()
(EngineCore pid=1328911)                               ^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 392, in step
(EngineCore pid=1328911)     future = self.model_executor.execute_model(scheduler_output, non_block=True)
(EngineCore pid=1328911)              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 114, in execute_model
(EngineCore pid=1328911)     output.result()
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result
(EngineCore pid=1328911)     return self.__get_result()
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result
(EngineCore pid=1328911)     raise self._exception
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 84, in collective_rpc
(EngineCore pid=1328911)     result = run_method(self.driver_worker, method, args, kwargs)
(EngineCore pid=1328911)              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/serial_utils.py", line 510, in run_method
(EngineCore pid=1328911)     return func(*args, **kwargs)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/worker_base.py", line 332, in execute_model
(EngineCore pid=1328911)     return self.worker.execute_model(scheduler_output)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911)     return func(*args, **kwargs)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_worker.py", line 803, in execute_model
(EngineCore pid=1328911)     output = self.model_runner.execute_model(
(EngineCore pid=1328911)              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911)     return func(*args, **kwargs)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 3840, in execute_model
(EngineCore pid=1328911)     logits_indices, spec_decode_metadata = self._prepare_inputs(
(EngineCore pid=1328911)                                            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 1987, in _prepare_inputs
(EngineCore pid=1328911)     self.input_batch.block_table.compute_slot_mapping(
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 293, in compute_slot_mapping
(EngineCore pid=1328911)     block_table.compute_slot_mapping(num_reqs, query_start_loc, positions)
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 144, in compute_slot_mapping
(EngineCore pid=1328911)     _compute_slot_mapping_kernel[(num_reqs + 1,)](
(EngineCore pid=1328911)     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) TypeError: 'function' object is not subscriptable
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 582, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     out = q.get_nowait() or await q.get()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]                             ^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/output_processor.py", line 85, in get
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise output
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 663, in output_handler
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     outputs = await engine_core.get_output_async()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core_client.py", line 985, in get_output_async
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise self._format_exception(outputs) from None
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40852 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40856 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40866 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40876 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40892 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40896 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40910 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40918 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40934 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40838 - "GET /metrics HTTP/1.1" 200 OK
(APIServer pid=1328759) INFO:     Shutting down
(APIServer pid=1328759) INFO:     Waiting for application shutdown.
(APIServer pid=1328759) INFO:     Application shutdown complete.
(APIServer pid=1328759) INFO:     Finished server process [1328759]

Fix Action

Fix / Workaround

============================== CPU Info

Architecture: ppc64le Byte Order: Little Endian CPU(s): 80 On-line CPU(s) list: 0-79 Model name: POWER10 (architected), altivec supported Model: 2.0 (pvr 0080 0200) Thread(s) per core: 8 Core(s) per socket: 10 Socket(s): 1 Hypervisor vendor: pHyp Virtualization type: para L1d cache: 640 KiB (20 instances) L1i cache: 960 KiB (20 instances) L2 cache: 20 MiB (20 instances) L3 cache: 80 MiB (20 instances) NUMA node(s): 2 NUMA node1 CPU(s): 0-79 NUMA node3 CPU(s):
Vulnerability Gather data sampling: Not affected Vulnerability Indirect target selection: Not affected Vulnerability Itlb multihit: Not affected Vulnerability L1tf: Not affected Vulnerability Mds: Not affected Vulnerability Meltdown: Not affected Vulnerability Mmio stale data: Not affected Vulnerability Reg file data sampling: Not affected Vulnerability Retbleed: Not affected Vulnerability Spec rstack overflow: Not affected Vulnerability Spec store bypass: Not affected Vulnerability Spectre v1: Mitigation; __user pointer sanitization, ori31 speculation barrier enabled Vulnerability Spectre v2: Mitigation; Software count cache flush (hardware accelerated), Software link stack flush Vulnerability Srbds: Not affected Vulnerability Tsa: Not affected Vulnerability Tsx async abort: Not affected Vulnerability Vmscape: Not affected

Code Example

Collecting environment information...
==============================
        System Info
==============================
OS                           : Red Hat Enterprise Linux 9.7 (Plow) (ppc64le)
GCC version                  : (GCC) 11.5.0 20240719 (Red Hat 11.5.0-11)
Clang version                : 20.1.8 (Red Hat, Inc. 20.1.8-3.el9)
CMake version                : version 3.26.5
Libc version                 : glibc-2.34

==============================
       PyTorch Info
==============================
PyTorch version              : 2.10.0+cpu
Is debug build               : False
CUDA used to build PyTorch   : None
ROCM used to build PyTorch   : N/A

==============================
      Python Environment
==============================
Python version               : 3.12.12 (main, Feb 27 2026, 00:00:00) [GCC 11.5.0 20240719 (Red Hat 11.5.0-11)] (64-bit runtime)
Python platform              : Linux-5.14.0-611.36.1.el9_7.ppc64le-ppc64le-with-glibc2.34

==============================
       CUDA / GPU Info
==============================
Is CUDA available            : False
CUDA runtime version         : No CUDA
CUDA_MODULE_LOADING set to   : N/A
GPU models and configuration : No CUDA
Nvidia driver version        : No CUDA
cuDNN version                : No CUDA
HIP runtime version          : N/A
MIOpen runtime version       : N/A
Is XNNPACK available         : False

==============================
          CPU Info
==============================
Architecture:                            ppc64le
Byte Order:                              Little Endian
CPU(s):                                  80
On-line CPU(s) list:                     0-79
Model name:                              POWER10 (architected), altivec supported
Model:                                   2.0 (pvr 0080 0200)
Thread(s) per core:                      8
Core(s) per socket:                      10
Socket(s):                               1
Hypervisor vendor:                       pHyp
Virtualization type:                     para
L1d cache:                               640 KiB (20 instances)
L1i cache:                               960 KiB (20 instances)
L2 cache:                                20 MiB (20 instances)
L3 cache:                                80 MiB (20 instances)
NUMA node(s):                            2
NUMA node1 CPU(s):                       0-79
NUMA node3 CPU(s):                       
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Not affected
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization, ori31 speculation barrier enabled
Vulnerability Spectre v2:                Mitigation; Software count cache flush (hardware accelerated), Software link stack flush
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

==============================
Versions of relevant libraries
==============================
[pip3] numpy==2.2.6
[pip3] pyzmq==26.4.0
[pip3] torch==2.10.0+cpu
[pip3] torchaudio==2.8.0
[pip3] torchvision==0.25.0+cpu
[pip3] transformers==4.57.6
[conda] Could not collect

==============================
         vLLM Info
==============================
ROCM Version                 : Could not collect
vLLM Version                 : 0.18.1rc1.dev69+g2e67fa756 (git sha: 2e67fa756)
vLLM Build Flags:
  CUDA Archs: Not Set; ROCm: Disabled
GPU Topology:
  Could not collect

==============================
     Environment Variables
==============================
LD_LIBRARY_PATH=/home/akashk/protbuf/lib64::/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//libprotobuf/lib64/:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//openblas/lib/:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//ffmpeg/lib/:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//libvpx/lib/:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//lame/lib/
PYTORCH_NVML_BASED_CUDA_CHECK=1
TORCHINDUCTOR_COMPILE_THREADS=1
TORCHINDUCTOR_CACHE_DIR=/tmp/torchinductor_akashk

---

(APIServer pid=1328759) INFO:     Started server process [1328759]
(APIServer pid=1328759) INFO:     Waiting for application startup.
(APIServer pid=1328759) INFO:     Application startup complete.
(APIServer pid=1328759) INFO:     127.0.0.1:40838 - "GET /metrics HTTP/1.1" 200 OK
(APIServer pid=1328759) INFO:     127.0.0.1:40838 - "POST /v1/completions HTTP/1.1" 200 OK
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] EngineCore encountered a fatal error.
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] Traceback (most recent call last):
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1101, in run_engine_core
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     engine_core.run_busy_loop()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1142, in run_busy_loop
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     self._process_engine_step()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1181, in _process_engine_step
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     outputs, model_executed = self.step_fn()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]                               ^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 392, in step
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     future = self.model_executor.execute_model(scheduler_output, non_block=True)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 114, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     output.result()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return self.__get_result()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     raise self._exception
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 84, in collective_rpc
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     result = run_method(self.driver_worker, method, args, kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/serial_utils.py", line 510, in run_method
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return func(*args, **kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/worker_base.py", line 332, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return self.worker.execute_model(scheduler_output)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return func(*args, **kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_worker.py", line 803, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     output = self.model_runner.execute_model(
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return func(*args, **kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 3840, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     logits_indices, spec_decode_metadata = self._prepare_inputs(
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]                                            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 1987, in _prepare_inputs
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     self.input_batch.block_table.compute_slot_mapping(
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 293, in compute_slot_mapping
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     block_table.compute_slot_mapping(num_reqs, query_start_loc, positions)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 144, in compute_slot_mapping
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     _compute_slot_mapping_kernel[(num_reqs + 1,)](
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] TypeError: 'function' object is not subscriptable
(EngineCore pid=1328911) Process EngineCore:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] AsyncLLM output_handler failed.
(EngineCore pid=1328911) Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 663, in output_handler
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]     outputs = await engine_core.get_output_async()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core_client.py", line 985, in get_output_async
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]     raise self._format_exception(outputs) from None
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/multiprocessing/process.py", line 314, in _bootstrap
(EngineCore pid=1328911)     self.run()
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/multiprocessing/process.py", line 108, in run
(EngineCore pid=1328911)     self._target(*self._args, **self._kwargs)
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1112, in run_engine_core
(EngineCore pid=1328911)     raise e
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1101, in run_engine_core
(EngineCore pid=1328911)     engine_core.run_busy_loop()
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1142, in run_busy_loop
(EngineCore pid=1328911)     self._process_engine_step()
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1181, in _process_engine_step
(EngineCore pid=1328911)     outputs, model_executed = self.step_fn()
(EngineCore pid=1328911)                               ^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 392, in step
(EngineCore pid=1328911)     future = self.model_executor.execute_model(scheduler_output, non_block=True)
(EngineCore pid=1328911)              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 114, in execute_model
(EngineCore pid=1328911)     output.result()
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result
(EngineCore pid=1328911)     return self.__get_result()
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result
(EngineCore pid=1328911)     raise self._exception
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 84, in collective_rpc
(EngineCore pid=1328911)     result = run_method(self.driver_worker, method, args, kwargs)
(EngineCore pid=1328911)              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/serial_utils.py", line 510, in run_method
(EngineCore pid=1328911)     return func(*args, **kwargs)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/worker_base.py", line 332, in execute_model
(EngineCore pid=1328911)     return self.worker.execute_model(scheduler_output)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911)     return func(*args, **kwargs)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_worker.py", line 803, in execute_model
(EngineCore pid=1328911)     output = self.model_runner.execute_model(
(EngineCore pid=1328911)              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911)     return func(*args, **kwargs)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 3840, in execute_model
(EngineCore pid=1328911)     logits_indices, spec_decode_metadata = self._prepare_inputs(
(EngineCore pid=1328911)                                            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 1987, in _prepare_inputs
(EngineCore pid=1328911)     self.input_batch.block_table.compute_slot_mapping(
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 293, in compute_slot_mapping
(EngineCore pid=1328911)     block_table.compute_slot_mapping(num_reqs, query_start_loc, positions)
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 144, in compute_slot_mapping
(EngineCore pid=1328911)     _compute_slot_mapping_kernel[(num_reqs + 1,)](
(EngineCore pid=1328911)     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) TypeError: 'function' object is not subscriptable
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 582, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     out = q.get_nowait() or await q.get()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]                             ^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/output_processor.py", line 85, in get
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise output
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 663, in output_handler
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     outputs = await engine_core.get_output_async()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core_client.py", line 985, in get_output_async
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise self._format_exception(outputs) from None
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40852 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40856 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40866 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40876 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40892 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40896 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40910 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40918 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40934 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40838 - "GET /metrics HTTP/1.1" 200 OK
(APIServer pid=1328759) INFO:     Shutting down
(APIServer pid=1328759) INFO:     Waiting for application shutdown.
(APIServer pid=1328759) INFO:     Application shutdown complete.
(APIServer pid=1328759) INFO:     Finished server process [1328759]
RAW_BUFFERClick to expand / collapse

Your current environment

<details> <summary>The output of <code>python collect_env.py</code></summary>
Collecting environment information...
==============================
        System Info
==============================
OS                           : Red Hat Enterprise Linux 9.7 (Plow) (ppc64le)
GCC version                  : (GCC) 11.5.0 20240719 (Red Hat 11.5.0-11)
Clang version                : 20.1.8 (Red Hat, Inc. 20.1.8-3.el9)
CMake version                : version 3.26.5
Libc version                 : glibc-2.34

==============================
       PyTorch Info
==============================
PyTorch version              : 2.10.0+cpu
Is debug build               : False
CUDA used to build PyTorch   : None
ROCM used to build PyTorch   : N/A

==============================
      Python Environment
==============================
Python version               : 3.12.12 (main, Feb 27 2026, 00:00:00) [GCC 11.5.0 20240719 (Red Hat 11.5.0-11)] (64-bit runtime)
Python platform              : Linux-5.14.0-611.36.1.el9_7.ppc64le-ppc64le-with-glibc2.34

==============================
       CUDA / GPU Info
==============================
Is CUDA available            : False
CUDA runtime version         : No CUDA
CUDA_MODULE_LOADING set to   : N/A
GPU models and configuration : No CUDA
Nvidia driver version        : No CUDA
cuDNN version                : No CUDA
HIP runtime version          : N/A
MIOpen runtime version       : N/A
Is XNNPACK available         : False

==============================
          CPU Info
==============================
Architecture:                            ppc64le
Byte Order:                              Little Endian
CPU(s):                                  80
On-line CPU(s) list:                     0-79
Model name:                              POWER10 (architected), altivec supported
Model:                                   2.0 (pvr 0080 0200)
Thread(s) per core:                      8
Core(s) per socket:                      10
Socket(s):                               1
Hypervisor vendor:                       pHyp
Virtualization type:                     para
L1d cache:                               640 KiB (20 instances)
L1i cache:                               960 KiB (20 instances)
L2 cache:                                20 MiB (20 instances)
L3 cache:                                80 MiB (20 instances)
NUMA node(s):                            2
NUMA node1 CPU(s):                       0-79
NUMA node3 CPU(s):                       
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Not affected
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization, ori31 speculation barrier enabled
Vulnerability Spectre v2:                Mitigation; Software count cache flush (hardware accelerated), Software link stack flush
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

==============================
Versions of relevant libraries
==============================
[pip3] numpy==2.2.6
[pip3] pyzmq==26.4.0
[pip3] torch==2.10.0+cpu
[pip3] torchaudio==2.8.0
[pip3] torchvision==0.25.0+cpu
[pip3] transformers==4.57.6
[conda] Could not collect

==============================
         vLLM Info
==============================
ROCM Version                 : Could not collect
vLLM Version                 : 0.18.1rc1.dev69+g2e67fa756 (git sha: 2e67fa756)
vLLM Build Flags:
  CUDA Archs: Not Set; ROCm: Disabled
GPU Topology:
  Could not collect

==============================
     Environment Variables
==============================
LD_LIBRARY_PATH=/home/akashk/protbuf/lib64::/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//libprotobuf/lib64/:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//openblas/lib/:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//ffmpeg/lib/:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//libvpx/lib/:/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages//lame/lib/
PYTORCH_NVML_BASED_CUDA_CHECK=1
TORCHINDUCTOR_COMPILE_THREADS=1
TORCHINDUCTOR_CACHE_DIR=/tmp/torchinductor_akashk
</details>

🐛 Describe the bug

PR #32951 replaced the numpy-based compute_slot_mapping with a Triton-only kernel launch (_compute_slot_mapping_kernel(grid,)) with no fallback — when Triton is absent, @triton.jit becomes the TritonPlaceholder no-op decorator which leaves the function as a plain Python function, making it non-subscriptable

(APIServer pid=1328759) INFO:     Started server process [1328759]
(APIServer pid=1328759) INFO:     Waiting for application startup.
(APIServer pid=1328759) INFO:     Application startup complete.
(APIServer pid=1328759) INFO:     127.0.0.1:40838 - "GET /metrics HTTP/1.1" 200 OK
(APIServer pid=1328759) INFO:     127.0.0.1:40838 - "POST /v1/completions HTTP/1.1" 200 OK
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] EngineCore encountered a fatal error.
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] Traceback (most recent call last):
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1101, in run_engine_core
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     engine_core.run_busy_loop()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1142, in run_busy_loop
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     self._process_engine_step()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1181, in _process_engine_step
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     outputs, model_executed = self.step_fn()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]                               ^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 392, in step
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     future = self.model_executor.execute_model(scheduler_output, non_block=True)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 114, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     output.result()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return self.__get_result()
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     raise self._exception
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 84, in collective_rpc
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     result = run_method(self.driver_worker, method, args, kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/serial_utils.py", line 510, in run_method
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return func(*args, **kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/worker_base.py", line 332, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return self.worker.execute_model(scheduler_output)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return func(*args, **kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_worker.py", line 803, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     output = self.model_runner.execute_model(
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     return func(*args, **kwargs)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 3840, in execute_model
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     logits_indices, spec_decode_metadata = self._prepare_inputs(
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]                                            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 1987, in _prepare_inputs
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     self.input_batch.block_table.compute_slot_mapping(
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 293, in compute_slot_mapping
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     block_table.compute_slot_mapping(num_reqs, query_start_loc, positions)
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 144, in compute_slot_mapping
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     _compute_slot_mapping_kernel[(num_reqs + 1,)](
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110]     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) ERROR 03-24 03:32:45 [core.py:1110] TypeError: 'function' object is not subscriptable
(EngineCore pid=1328911) Process EngineCore:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] AsyncLLM output_handler failed.
(EngineCore pid=1328911) Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 663, in output_handler
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]     outputs = await engine_core.get_output_async()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core_client.py", line 985, in get_output_async
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707]     raise self._format_exception(outputs) from None
(APIServer pid=1328759) ERROR 03-24 03:32:45 [async_llm.py:707] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/multiprocessing/process.py", line 314, in _bootstrap
(EngineCore pid=1328911)     self.run()
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/multiprocessing/process.py", line 108, in run
(EngineCore pid=1328911)     self._target(*self._args, **self._kwargs)
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1112, in run_engine_core
(EngineCore pid=1328911)     raise e
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1101, in run_engine_core
(EngineCore pid=1328911)     engine_core.run_busy_loop()
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1142, in run_busy_loop
(EngineCore pid=1328911)     self._process_engine_step()
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 1181, in _process_engine_step
(EngineCore pid=1328911)     outputs, model_executed = self.step_fn()
(EngineCore pid=1328911)                               ^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core.py", line 392, in step
(EngineCore pid=1328911)     future = self.model_executor.execute_model(scheduler_output, non_block=True)
(EngineCore pid=1328911)              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 114, in execute_model
(EngineCore pid=1328911)     output.result()
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result
(EngineCore pid=1328911)     return self.__get_result()
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result
(EngineCore pid=1328911)     raise self._exception
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/executor/uniproc_executor.py", line 84, in collective_rpc
(EngineCore pid=1328911)     result = run_method(self.driver_worker, method, args, kwargs)
(EngineCore pid=1328911)              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/serial_utils.py", line 510, in run_method
(EngineCore pid=1328911)     return func(*args, **kwargs)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/worker_base.py", line 332, in execute_model
(EngineCore pid=1328911)     return self.worker.execute_model(scheduler_output)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911)     return func(*args, **kwargs)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_worker.py", line 803, in execute_model
(EngineCore pid=1328911)     output = self.model_runner.execute_model(
(EngineCore pid=1328911)              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
(EngineCore pid=1328911)     return func(*args, **kwargs)
(EngineCore pid=1328911)            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 3840, in execute_model
(EngineCore pid=1328911)     logits_indices, spec_decode_metadata = self._prepare_inputs(
(EngineCore pid=1328911)                                            ^^^^^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/gpu_model_runner.py", line 1987, in _prepare_inputs
(EngineCore pid=1328911)     self.input_batch.block_table.compute_slot_mapping(
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 293, in compute_slot_mapping
(EngineCore pid=1328911)     block_table.compute_slot_mapping(num_reqs, query_start_loc, positions)
(EngineCore pid=1328911)   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/worker/block_table.py", line 144, in compute_slot_mapping
(EngineCore pid=1328911)     _compute_slot_mapping_kernel[(num_reqs + 1,)](
(EngineCore pid=1328911)     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
(EngineCore pid=1328911) TypeError: 'function' object is not subscriptable
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 582, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     out = q.get_nowait() or await q.get()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]                             ^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/output_processor.py", line 85, in get
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise output
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 663, in output_handler
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     outputs = await engine_core.get_output_async()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/core_client.py", line 985, in get_output_async
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise self._format_exception(outputs) from None
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40852 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40856 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40866 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40876 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40892 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40896 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40910 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40918 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40934 - "POST /v1/completions HTTP/1.1" 200 OK
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Error in completion stream generator.
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] Traceback (most recent call last):
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/entrypoints/openai/completion/serving.py", line 294, in completion_stream_generator
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for prompt_idx, res in result_generator:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/utils/async_utils.py", line 281, in merge_async_iterators
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     async for item in iterators[0]:
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 563, in generate
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     q = await self.add_request(
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]         ^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]   File "/home/akashk/vllm_workspace/vllm-wheel-managment/v0.11.1/vllm_env/lib64/python3.12/site-packages/vllm-0.18.1rc1.dev69+g2e67fa756.cpu-py3.12-linux-ppc64le.egg/vllm/v1/engine/async_llm.py", line 308, in add_request
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450]     raise EngineDeadError()
(APIServer pid=1328759) ERROR 03-24 03:32:45 [serving.py:450] vllm.v1.engine.exceptions.EngineDeadError: EngineCore encountered an issue. See stack trace (above) for the root cause.
(APIServer pid=1328759) INFO:     127.0.0.1:40838 - "GET /metrics HTTP/1.1" 200 OK
(APIServer pid=1328759) INFO:     Shutting down
(APIServer pid=1328759) INFO:     Waiting for application shutdown.
(APIServer pid=1328759) INFO:     Application shutdown complete.
(APIServer pid=1328759) INFO:     Finished server process [1328759]

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

extent analysis

Fix Plan

The issue arises from the compute_slot_mapping function being replaced with a Triton-only kernel launch without a fallback. When Triton is absent, the @triton.jit decorator becomes a no-op, leaving the function as a plain Python function, which is non-subscriptable.

To fix this issue, we need to add a fallback for when Triton is not available. Here are the steps:

  • Check if Triton is installed and available before using the @triton.jit decorator.
  • If Triton is not available, use a numpy-based implementation of compute_slot_mapping as a fallback.

Here's an example of how the code could be modified:

import numpy as np
import triton

def compute_slot_mapping(num_reqs, query_start_loc, positions):
    # Check if Triton is available
    if triton.is_available():
        # Use Triton implementation
        @triton.jit
        def _compute_slot_mapping_kernel(num_reqs, query_start_loc, positions):
            # Triton implementation
            pass
        return _compute_slot_mapping_kernel(num_reqs, query_start_loc, positions)
    else:
        # Use numpy-based fallback implementation
        def _compute_slot_mapping_numpy(num_reqs, query_start_loc, positions):
            # Numpy implementation
            pass
        return _compute_slot_mapping_numpy(num_reqs, query_start_loc, positions)

Verification

To verify that the fix worked, you can test the compute_slot_mapping function with and without Triton installed. The function should work correctly in both cases.

Extra Tips

  • Make sure to handle any potential errors that may occur when using the Triton implementation.
  • Consider adding a warning or log message when the numpy-based fallback implementation is used, to indicate that Triton is not available.
  • If you're using a virtual environment, ensure that Triton is installed and available in the environment.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

vllm - 💡(How to fix) Fix [Bug]: compute_slot_mapping crashes on non-Triton platforms (ppc64le/CPU) after PR #32951 [1 comments, 2 participants]