LogoAIAny
Icon for item

LLaMA-Factory

Zero-code CLI & WebUI to fine-tune 100+ LLMs/VLMs with LoRA, QLoRA, PPO, DPO and more.

Introduction

Overview

LLaMA-Factory lowers the barrier to task-specific tuning of open-source LLMs. It bundles data prep, training, evaluation and deployment behind a simple command or WebUI.

Key Capabilities
  • Supports SFT, reward-model, PPO, DPO, ORPO, KTO, etc.
  • 16-bit, LoRA, QLoRA (2-8 bit) and GaLore/DoRA optimizers
  • DeepSpeed, FSDP and NativeDDP distributed back-ends
  • FlashAttention-2 & Unsloth acceleration
  • Integrated dashboards: LlamaBoard, W&B, MLflow, SwanLab

Information

  • Websitegithub.com
  • Authorshiyouga
  • Published date2024/02/01

Categories