Encoder signals turn raw requests into legible semantic state.
Signal before scale
Built on Shannon signals, entropy folding, and neural-symbolic routing.
A router should feel like a system brain: encoder-guided, entropy-aware, and ruthlessly clear.
13 signal families spanning intent, safety, modality, context, and preference.
12 selectors across symbolic policy, latency heuristics, reinforcement learning, and ML routing.
One architecture across cpu-local, amd-local, and ci-k8s.
Install locally in one line.
The supported first-run path is a single installer that sets up the CLI and local serve flow on macOS and Linux.
curl -fsSL https://vllm-semantic-router.com/zh-Hans/install.sh | bash
Neural-symbolic routing, kept legible.
Encoder priors, Shannon mapping, entropy folding, and model selection stay visible from research prototypes to production paths.
Neural signals meet symbolic rules in auditable routing logic.
Cache, safety, rewrite, and tracing attach as composable behaviors.
Natural language intent compiles into neural-symbolic policy before execution begins.
Selection stays measurable enough for papers, benchmarks, and production tuning.
Docs, papers, and product routes read as one system, not scattered collateral.
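Routing blueprint
How the system works
Explore signal extraction, decision logic, and model routing behavior through an interactive demo.
Shannon mapping
A structural mapping from communication theory to the routing pipeline.
The user request is the raw source message before encoding.
Encoder-driven intelligence
Purpose-built encoder models extract semantics from every request: understanding intent, ranking relevance, and classifying content across modalities in real time.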
Sequence classification, token labeling, embeddings, and reranking collapse into one system-intelligence layer.
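Multimodal
Detects text, image, and audio inputs and routes them to the appropriate modality model.
Bi-Encoder embeddings
Encodes queries and candidates independently into dense vectors for similarity search and semantic caching.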
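To make the bi-encoder idea concrete, here is a minimal sketch of similarity search and a semantic-cache lookup over precomputed vectors. It assumes the embeddings already exist as plain lists of floats; the function names, the cache layout, and the 0.9 threshold are hypothetical choices for illustration, not the router's actual API or defaults.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length dense vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def best_match(query_vec, candidate_vecs):
    """Return (index, score) of the candidate most similar to the query."""
    scores = [cosine_similarity(query_vec, c) for c in candidate_vecs]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


def cache_lookup(query_vec, cache, threshold=0.9):
    """Return a cached response if a stored query is similar enough.

    cache is a list of (embedding, response) pairs (hypothetical layout).
    threshold is an illustrative value, not a documented default.
    """
    if not cache:
        return None
    idx, score = best_match(query_vec, [vec for vec, _ in cache])
    return cache[idx][1] if score >= threshold else None
```

Because the query and the candidates are encoded independently, candidate vectors can be computed once and reused; only the query has to be embedded at request time.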
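Cross-Encoder learning
Joint cross-attention scores query-candidate pairs for high-precision reranking.
Classification
In-house BERT-based classifiers for domain, jailbreak, PII, and fact-checking, covering multiple signals.
Full attention
Bidirectional attention across tokens and sentences: full context in both directions, with no causal mask.
2DMSE
Adaptively adjusts the number of embedding layers and dimensions at inference time, balancing compute and accuracy on demand.
MRL
Truncate embedding vectors to any dimension without retraining, balancing accuracy and speed per request.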
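The truncation step behind MRL (Matryoshka Representation Learning) can be sketched in a few lines. This is an illustrative example only: `truncate_embedding` is a hypothetical helper, and re-normalizing after truncation is a common practice assumed here rather than something this page specifies. Truncation preserves quality only when the encoder was trained so that the leading dimensions carry most of the information.

```python
import math


def truncate_embedding(vec, dim):
    """Keep the first `dim` components and rescale to unit length.

    Hypothetical helper: assumes `vec` comes from an encoder trained
    with a Matryoshka-style objective, so a prefix is still meaningful.
    """
    if not 1 <= dim <= len(vec):
        raise ValueError("dim must be between 1 and len(vec)")
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return list(head)  # nothing to rescale
    return [x / norm for x in head]
```

A caller could pick a smaller `dim` for latency-sensitive requests and the full length when accuracy matters more, which is the per-request trade-off described above.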
Meet our team
The outstanding people behind vLLM Semantic Router
Maintainer: Huamin Chen
Distinguished Engineer @Red Hat
Maintainer: Chen Wang
Senior Staff Research Scientist @IBM
Maintainer: Yue Zhu
Staff Research Scientist @IBM
Maintainer: Xunzhuo Liu
Intelligent Routing @vLLM
Committer: Senan Zedan
R&D Manager @Red Hat
Committer: samzong
AI Infrastructure / Cloud-Native PM @DaoCloud
Liav Weiss
Software Engineer @Red Hat
Asaad Balum
Senior Software Engineer @Red Hat
Yehudit
Software Engineer @Red Hat
Noa Limoy
Software Engineer @Red Hat
Committer: JaredforReal
Software Engineer @Z.ai
Srinivas A
Software Engineer @Yokogawa
carlory
Open Source Engineer @DaoCloud
Committer: Yossi Ovadia
Senior Principal Engineer @Red Hat
Committer: Jintao Zhang
Senior Software Engineer @Kong
Committer: yuluo-yx
Individual Contributor
Committer: cryo-zd
Individual Contributor
Committer: OneZero-Y
Individual Contributor
Committer: aeft
Individual Contributor
Committer: Hao Wu
Individual Contributor
Committer: Qiping Pan
Individual Contributor
Acknowledgements
vLLM Semantic Router is born from open source and built on open source.
Architecture, written to be used.
Install, configure, train, and operate from one dense documentation graph.
Docs index
Research and builders in one loop.
Papers, working groups, and contributors evolve the same system in public.
Community routes




