Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
vllm
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Running Claude Code with Local LLMs via vLLM and LiteLLM
Donald Cruver
Donald Cruver
Donald Cruver
Follow
Feb 5
Running Claude Code with Local LLMs via vLLM and LiteLLM
#
claudecode
#
vllm
#
selfhosted
#
ai
Comments
Add Comment
6 min read
vLLM — Session 2: The Engine Layer — Request Management
Ben
Ben
Ben
Follow
Feb 1
vLLM — Session 2: The Engine Layer — Request Management
#
vllm
#
llm
#
python
#
machinelearning
Comments
Add Comment
13 min read
Session 1: vLLM Overview and the User API
Ben
Ben
Ben
Follow
Feb 1
Session 1: vLLM Overview and the User API
#
vllm
#
llm
#
python
#
machinelearning
Comments
Add Comment
12 min read
Pare de Brincar com LLMs Locais: Leve a IAG Open Source para a Produção na Magalu Cloud
Gláucio
Gláucio
Gláucio
Follow
for
Magalu Cloud
Feb 5
Pare de Brincar com LLMs Locais: Leve a IAG Open Source para a Produção na Magalu Cloud
#
ai
#
llm
#
vllm
#
docker
1
 reaction
Comments
2
 comments
22 min read
The Hidden Switchboard Behind vLLM Attention
Mahmoud Zalt
Mahmoud Zalt
Mahmoud Zalt
Follow
Dec 29 '25
The Hidden Switchboard Behind vLLM Attention
#
vllm
#
llm
#
attention
#
aiinference
Comments
Add Comment
10 min read
The Ultimate LLM Inference Battle: vLLM vs. Ollama vs. ZML
raphiki
raphiki
raphiki
Follow
for
Technology at Worldline
Dec 29 '25
The Ultimate LLM Inference Battle: vLLM vs. Ollama vs. ZML
#
qsos
#
zml
#
ollama
#
vllm
1
 reaction
Comments
Add Comment
6 min read
Deploy Faster with Terraform: Your Guide to vLLM on GKE with Infrastructure-as-Code
Karl Weinmeister
Karl Weinmeister
Karl Weinmeister
Follow
for
Google AI
Nov 6 '25
Deploy Faster with Terraform: Your Guide to vLLM on GKE with Infrastructure-as-Code
#
vllm
#
gke
#
terraform
#
ai
87
 reactions
Comments
1
 comment
7 min read
vLLM on x86: Because Not Everyone Can Afford a GPU Cluster
Marco Gonzalez
Marco Gonzalez
Marco Gonzalez
Follow
for
AWS Community Builders
Aug 26 '25
vLLM on x86: Because Not Everyone Can Afford a GPU Cluster
#
ai
#
machinelearning
#
vllm
#
production
5
 reactions
Comments
Add Comment
12 min read
Code Review: Deep Dive into vLLM's Architecture and Implementation Analysis of OpenAI-Compatible Serving (2/2)
Hyogeun Oh (오효근)
Hyogeun Oh (오효근)
Hyogeun Oh (오효근)
Follow
Jun 19 '25
Code Review: Deep Dive into vLLM's Architecture and Implementation Analysis of OpenAI-Compatible Serving (2/2)
#
python
#
pytorch
#
vllm
#
fastapi
Comments
Add Comment
37 min read
Code Review: Deep Dive into vLLM's Architecture and Implementation Analysis of OpenAI-Compatible Serving (1/2)
Hyogeun Oh (오효근)
Hyogeun Oh (오효근)
Hyogeun Oh (오효근)
Follow
Jun 15 '25
Code Review: Deep Dive into vLLM's Architecture and Implementation Analysis of OpenAI-Compatible Serving (1/2)
#
python
#
pytorch
#
vllm
#
fastapi
1
 reaction
Comments
Add Comment
28 min read
Ollama vs vLLM: A Detailed Comparison of LLM Frameworks
Torque
Torque
Torque
Follow
for
MechCloud Academy
Apr 13 '25
Ollama vs vLLM: A Detailed Comparison of LLM Frameworks
#
ollama
#
vllm
#
llm
#
genai
11
 reactions
Comments
Add Comment
10 min read
Making VLLM work on WSL2
Emilien Lancelot
Emilien Lancelot
Emilien Lancelot
Follow
Jan 17 '25
Making VLLM work on WSL2
#
vllm
#
ia
#
inference
#
llm
26
 reactions
Comments
Add Comment
4 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account