Value-Based Deep RL Scales Predictably by bearseascape

0CommentsShare PostShare on Facebook Share on XShare by EmailSend Link

News

Value-Based Deep RL Scales Predictably by bearseascape

ByHackTech February 8, 2025

0Comments

Share This Article

Sed ut perspiciatis unde.

Send to HN

[Submitted on 6 Feb 2025]

View PDF
HTML (experimental)

Abstract:Scaling data and compute is critical to the success of machine learning. However, scaling demands predictability: we want methods to not only perform well with more compute or data, but also have their performance be predictable from small-scale runs, without running the large-scale experiment. In this paper, we show that value-based off-policy RL methods are p

Tags: Scales Value-Based

0Likes

Written by

HackTech

View all posts by HackTech

Value-Based Deep RL Scales Predictably by bearseascape

Value-Based Deep RL Scales Predictably by bearseascape

Share This Article

Newsletter

HackTech

Leave a comment Cancel reply

Editor's Choice

Value-Based Deep RL Scales Predictably by bearseascape

Value-Based Deep RL Scales Predictably by bearseascape

Share This Article

Newsletter

HackTech

Leave a comment Cancel reply

Editor's Choice

Sign Up to Our Newsletter