uni-rlhf/uni-rlhf.github.ioPublic NotificationsYou must be signed in to change notification settings Fork0 Star1 master 1Branch0Tags Code Folders and files Name Last commit message Last commit date Latest commit pickxiguapi Update site
Pull requests Actions Projects Security Insights Additional navigation options Files main assets d4rl scripts uni_rlhf .gitignore LICENSE.txt README.md __init__.py requirements.txt run.py run.sh pickxiguapi/Uni-RLHF-Platform is licensed under the ...
Uni-RLHF contains three packages: 1) a universal multi-feedback annotation platform, 2) large-scale crowdsourced feedback datasets, and 3) modular offline RLHF baseline implementations. Uni-RLHF develops a user-friendly annotation interface tailored to various feedback types, compatible with a ...
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024) - Uni-RLHF-Platform/requirements.txt at main · pickxiguapi/Uni-RLHF-Platform
The Uni-RLHF platform consists of a vue front-end and a flask back-end. Also, we support a wide range of mainstream RL environments for annotation. Clone the repo git clone https://github.com/TJU-DRL-LAB/Uni-RLHF.gitcdUni-RLHF ...
Saved searches Use saved searches to filter your results more quickly Cancel Create saved search Sign in Sign up Reseting focus {{ message }} uni-rlhf / uni-rlhf.github.io Public Notifications You must be signed in to change notification settings Fork 0 Star 1 ...
uni-rlhf / uni-rlhf.github.io Public Notifications Fork 0 Star 1 Code Issues Pull requests Actions Projects Security Insights Files master .idea assets static videos .gitignore index.htmlBreadcrumbs uni-rlhf.github.io / .gitignore ...
Actions: uni-rlhf/uni-rlhf.github.io Actions All workflows pages-build-deployment Management Caches Deployments All workflows Showing runs from all workflows 3 workflow runs Event Status Branch Actor pages build and deployment pages-build-deployment #14: by pickxiguapi master ...
uni-rlhf.github.iouni-rlhf.github.ioPublic JavaScript1 2 contributions in the last year No contributions on September 24th.No contributions on October 1st.No contributions on October 8th.No contributions on October 15th.No contributions on October 22nd.No contributions on October 29th.No contributio...
Clean-Offline-RLHF Project Website· Paper· Platform· Datasets· Clean Offline RLHF This is the official PyTorch implementation of the paper "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback". Clean-Offline-RLHF is an Offline Reinforcement...