Hacker News — vinext + Cloudflare Workers

new
past
show
ask
show
jobs
submit

▲Show HN: I built an integration for RL training of browser agents for everyone (github.com)

7 points by filtr12 2 days ago | 1 comment

nithisha2201 2 days ago [-]

Interesting, how do you handle the observability side during training? One thing I ran into with multi-agent RL is that reward signals alone don't tell you much about why an agent is failing. Curious if you've built any tooling around that.

Remi_Etien 2 days ago [-]

[dead]

georaa 2 days ago [-]

[flagged]

Rendered at 18:36:01 GMT+0000 (Coordinated Universal Time) with Cloudflare Workers.