You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Use vmap strategy for T operator to match T_σ implementation
- Replace vectorized B with vmap-based B(v, model, i, j, ip)
- Add staged vmap application: B_1, B_2, B_vmap
- Update T and get_greedy to use B_vmap with index arrays
- Consistent with T_σ implementation which also uses vmap
- Performance: ~6.7x speedup (slightly better than vectorized version)
This makes the codebase more consistent by using the same vmap strategy
for both the Bellman operator and the policy operator.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
0 commit comments