©RillNews
KVarN: Native vLLM backend for KV-cache quantization by Huawei
github.com
70 points by theanonymousone 3 hours ago | 7 comments