verl

verl (Volcano Engine Reinforcement Learning) is a library for efficient LLM post-training and the open-source release of HybridFlow.

Introduction

Overview

verl implements the HybridFlow RLHF framework. Its hybrid-controller programming model represents an RLHF workload as a dataflow graph and flexibly maps the models in that graph onto different sets of GPUs.

Key Capabilities
  • Hybrid-controller API for PPO, GRPO and custom RL flows
  • 3D-HybridEngine to cut memory & communication overhead
  • Plug-ins for Megatron-LM, FSDP, vLLM, SGLang
  • Flexible device placement & mixed-precision training
  • Tutorials, recipes and PyPI wheels for quick adoption

Information

  • Website: github.com
  • Authors: ByteDance Seed / Volcano Engine
  • Published date: 2024/12/11

Categories