DEV Community

# vllm

Feb 5

Running Claude Code with Local LLMs via vLLM and LiteLLM

#claudecode #vllm #selfhosted #ai

6 min read

Ben

Feb 1

vLLM — Session 2: The Engine Layer — Request Management

#vllm #llm #python #machinelearning

13 min read

Ben

Feb 1

Session 1: vLLM Overview and the User API

#vllm #llm #python #machinelearning

12 min read

Gláucio for Magalu Cloud

Feb 5

Stop Playing with Local LLMs: Take Open Source Generative AI to Production on Magalu Cloud

#ai #llm #vllm #docker

22 min read

Dec 29 '25

The Hidden Switchboard Behind vLLM Attention

#vllm #llm #attention #aiinference

10 min read

raphiki for Technology at Worldline

Dec 29 '25

The Ultimate LLM Inference Battle: vLLM vs. Ollama vs. ZML

#qsos #zml #ollama #vllm

6 min read

Karl Weinmeister for Google AI

Nov 6 '25

Deploy Faster with Terraform: Your Guide to vLLM on GKE with Infrastructure-as-Code

#vllm #gke #terraform #ai

7 min read

Marco Gonzalez for AWS Community Builders

Aug 26 '25

vLLM on x86: Because Not Everyone Can Afford a GPU Cluster

#ai #machinelearning #vllm #production

12 min read

Hyogeun Oh (오효근)

Jun 19 '25

Code Review: Deep Dive into vLLM's Architecture and Implementation Analysis of OpenAI-Compatible Serving (2/2)

#python #pytorch #vllm #fastapi

37 min read

Hyogeun Oh (오효근)

Jun 15 '25

Code Review: Deep Dive into vLLM's Architecture and Implementation Analysis of OpenAI-Compatible Serving (1/2)

#python #pytorch #vllm #fastapi

28 min read

Torque for MechCloud Academy

Apr 13 '25

Ollama vs vLLM: A Detailed Comparison of LLM Frameworks

#ollama #vllm #llm #genai

10 min read

Emilien Lancelot

Jan 17 '25

Making vLLM work on WSL2

#vllm #ia #inference #llm

4 min read
