©RillNews
Is One Layer Enough? A Single Transformer Layer Matches Full-Parameter RL Train
arxiv.org
86 points by tcp_handshaker 5 hours ago | 20 comments