Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

experiment #7

Closed
Jqq3482840604 opened this issue Jul 4, 2024 · 1 comment
Closed

experiment #7

Jqq3482840604 opened this issue Jul 4, 2024 · 1 comment

Comments

@Jqq3482840604
Copy link

Sorry, could you please tell me if the baseline CQL in the article updates its offline buffer during the fine-tuning phase, or does it keep the buffer fixed?

@nakamotoo
Copy link
Owner

Hi, we used a mixing ratio hyperparameter to mix the offline buffer and online buffer, as shown in Table 3 of the paper, for CQL

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants