axolotl

majiayu000

Updated 11 days ago

13 views

DesignFine-TuningAxolotlLLMLoRAQLoRADPOKTOORPOGRPOYAMLHuggingFaceDeepSpeedMultimodal

About

This skill provides expert guidance for fine-tuning LLMs using the Axolotl framework, helping developers configure YAML files and implement techniques like LoRA/QLoRA and DPO/KTO. It's designed for when you're working with Axolotl features, debugging code, or implementing fine-tuning solutions. Key capabilities include support for 100+ models, multimodal training, and integration with tools like HuggingFace and DeepSpeed.

Quick Install

Claude Code

Recommended

Primary

npx skills add majiayu000/claude-skill-registry -a claude-code

Plugin CommandAlternative

/plugin add https://github.com/majiayu000/claude-skill-registry

Git CloneAlternative

git clone https://github.com/majiayu000/claude-skill-registry.git ~/.claude/skills/axolotl

Copy and paste this command in Claude Code to install this skill

GitHub Repository

majiayu000/claude-skill-registry

Path: skills/axolotl

Related Skills

quantizing-models-bitsandbytes

Other

This skill quantizes LLMs to 8-bit or 4-bit precision using bitsandbytes, achieving 50-75% memory reduction with minimal accuracy loss. It's ideal for running larger models on limited GPU memory or accelerating inference, supporting formats like INT8, NF4, and FP4. The skill integrates with HuggingFace Transformers and enables QLoRA training and 8-bit optimizers.

View skill

blip-2-vision-language

Design

BLIP-2 is a vision-language framework that connects a frozen image encoder with a large language model for multimodal tasks. Use it for zero-shot image captioning, visual question answering, or image-text retrieval without task-specific fine-tuning. It's ideal for developers needing to add state-of-the-art visual understanding to LLM-based applications.

View skill

weights-and-biases

Design

This skill integrates Weights & Biases for comprehensive ML experiment tracking and MLOps. It automatically logs metrics, visualizes training in real-time, and manages hyperparameter sweeps and model versions. Use it to compare runs, optimize models, and collaborate within team workspaces directly from your development environment.

View skill

openrlhf-training

Design

OpenRLHF is a high-performance RLHF training framework for fine-tuning large language models (7B-70B+ parameters) using methods like PPO, DPO, and GRPO. It leverages Ray for distributed architecture and vLLM for accelerated inference, achieving speeds 2x faster than alternatives like DeepSpeedChat. Use this skill when you need efficient, distributed RLHF training with optimized GPU resource sharing and ZeRO-3 support.

View skill