Description
vLLM, an inference and serving engine for large language models (LLMs), has an issue in versions 0.6.5 through 0.8.4 that ONLY impacts environments using the `PyNcclPipe` KV cache transfer integration with the V0 engine. No other configurations are affected. vLLM supports the use of the `PyNcclPipe` class to establish a peer-to-peer communication domain for data transmission between distributed nodes. GPU-side KV-Cache transmission is implemented through the `PyNcclCommunicator` class, while CPU-side control message passing is handled via the `send_obj` and `recv_obj` methods. The intention was that this interface would only be exposed to a private network using the IP address specified by the `--kv-ip` CLI parameter, and the vLLM documentation covers how it must be limited to a secured network. However, the default and intentional behavior of PyTorch's `TCPStore` is to listen on ALL interfaces, regardless of the IP address provided; the given IP address is used only as a client-side connection address. vLLM was fixed with a workaround that forces the `TCPStore` instance to bind its socket to the specified private interface. As of version 0.8.5, vLLM limits the `TCPStore` socket to the private interface as configured.
A flaw was found in vLLM. This vulnerability allows unauthorized access to key-value caches via network exposure of the `TCPStore` interface when using the `PyNcclPipe` KV cache transfer integration with the V0 engine.
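To illustrate the binding problem and the shape of the fix, the following is a minimal sketch of pre-binding the listening socket so `TCPStore` cannot fall back to listening on all interfaces. This is not vLLM's exact code: the private address and port are hypothetical, and `use_libuv=False` is an assumption made here because PyTorch's libuv store backend did not initially accept a pre-bound socket.

```python
import socket
from datetime import timedelta

from torch.distributed import TCPStore

private_ip = "10.0.0.2"  # hypothetical value passed as --kv-ip
port = 14579             # hypothetical port

# By default, TCPStore(host_name, port, is_master=True, ...) listens on
# ALL interfaces; host_name is only used by clients when connecting.
# To restrict the server, first bind a socket to the private interface...
listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
listener.bind((private_ip, port))
listener.listen()

# ...then hand the pre-bound socket to TCPStore via master_listen_fd,
# so the store server never binds 0.0.0.0 itself.
store = TCPStore(
    host_name=private_ip,
    port=port,
    is_master=True,
    timeout=timedelta(seconds=300),
    use_libuv=False,  # assumption: libuv backend lacked pre-bound fd support
    master_listen_fd=listener.fileno(),
)
```

With this arrangement, connections are only accepted on the configured private interface, which matches the behavior the `--kv-ip` parameter was documented to provide.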
Statement
By default, Red Hat products are configured to restrict vLLM nodes to an isolated network. However, this vulnerability could become relevant if customers change these specific configurations, and therefore Red Hat products are affected. This vulnerability is classified as Moderate rather than Critical because its exploitability and impact are constrained by specific deployment contexts and assumptions about network trust boundaries. While the use of `pickle.loads` on untrusted input typically leads to remote code execution (RCE), the vulnerable `PyNcclPipe` interface is not intended to be exposed to the internet or untrusted networks; it is designed for use within a secured, internal cluster environment, as explicitly documented by vLLM. Successful exploitation requires an attacker to have direct network access to a misconfigured or poorly segmented system where the KV cache transfer service is bound to a public interface. Additionally, the vulnerable code path exists only in a niche configuration (the V0 engine with `PyNcclPipe`), further reducing its exposure. Therefore, while the flaw does introduce RCE risk in misconfigured setups, the combination of non-default exposure, clear documentation, and limited applicability justifies the reduced impact rating.
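As background on the RCE mechanism mentioned above, the snippet below is a generic illustration of why `pickle.loads` on attacker-controlled bytes amounts to code execution; it is not vLLM code, and the `Payload` class is purely hypothetical.

```python
import pickle

# A pickled object can name any callable to be invoked during
# deserialization via __reduce__; unpickling then runs that callable.
class Payload:
    def __reduce__(self):
        import os
        return (os.system, ("echo code runs during unpickling",))

attacker_bytes = pickle.dumps(Payload())

# Any service that feeds untrusted network bytes to pickle.loads,
# as an exposed control-message channel would, executes the
# attacker's callable at this point.
pickle.loads(attacker_bytes)
```

This is why network exposure of the control channel elevates the issue from unauthorized cache access to full remote code execution.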
Mitigation
No mitigation is currently available that meets Red Hat Product Security’s standards for usability, deployment, applicability, or stability.
Affected Packages
Platform | Package | State | Recommendation | Release |
---|---|---|---|---|
Red Hat AI Inference Server | vllm-cuda-rhel9 | Affected | | |
Red Hat AI Inference Server | vllm-rocm-rhel9 | Affected | | |
Red Hat Enterprise Linux AI (RHEL AI) | rhelai1/bootc-amd-rhel9 | Affected | | |
Red Hat Enterprise Linux AI (RHEL AI) | rhelai1/bootc-aws-nvidia-rhel9 | Affected | | |
Red Hat Enterprise Linux AI (RHEL AI) | rhelai1/bootc-azure-amd-rhel9 | Affected | | |
Red Hat Enterprise Linux AI (RHEL AI) | rhelai1/bootc-azure-nvidia-rhel9 | Affected | | |
Red Hat Enterprise Linux AI (RHEL AI) | rhelai1/bootc-gcp-nvidia-rhel9 | Affected | | |
Red Hat Enterprise Linux AI (RHEL AI) | rhelai1/bootc-intel-rhel9 | Affected | | |
Red Hat Enterprise Linux AI (RHEL AI) | rhelai1/bootc-nvidia-rhel9 | Affected | | |
Red Hat Enterprise Linux AI (RHEL AI) | rhelai1/instructlab-amd-rhel9 | Affected | | |
Additional Information
CVSS3: 7.5 (High)
Related Vulnerabilities
vLLM Allows Remote Code Execution via PyNcclPipe Communication Service