©RillNews
new
show
ask
jobs
submit
login
Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU
github.com
187 points by
xaskasdf
9 hours ago
|
46 comments
add comment