Handshake AI Research
Untangling the science of frontier data and evaluation
Popular repositories
- gandalf-the-grader Public
Agent-as-a-Judge grading framework for evaluating AI outputs/deliverables
Python 2
- harbor Public Forked from harbor-framework/harbor
Harbor is a framework for running agent evaluations and creating and using RL environments.
Python 1
- hart-org-gh-actions Public
Common GitHub Actions for supporting our workflows
JavaScript
Repositories
Showing 5 of 5 repositories
- harbor Public Forked from harbor-framework/harbor
Harbor is a framework for running agent evaluations and creating and using RL environments.