Instruction Tuning¶

Instruction tuning is the process of further training LLMs on a dataset consisting of (instruction, output) pairs in a supervised fashion, which bridges the gap between the next-word prediction objective of LLMs and the users’ objective of having LLMs adhere to human instructions. This implementation deploys a Ray cluster for the task.

Table of contents¶

Architecture
Deployment Options
Instruction Tuning Service Usage

Architecture¶

The instruction tuning application is a customizable end-to-end workflow that takes user provided instruction dataset to finetune the user specified LLMs.

Deployment Options¶

Deploy Instruction Tuning Service on Xeon¶

Refer to the Xeon Guide for detail.

Deploy Instruction Tuning Service on Gaudi¶

Refer to the Gaudi Guide for detail.

Instruction Tuning Service Usage¶

1. Upload a training file¶

Download a training file alpaca_data.json and upload it to the server with below command, this file can be downloaded in here:

# upload a training file
curl http://${your_ip}:8015/v1/files -X POST -H "Content-Type: multipart/form-data" -F "file=@./alpaca_data.json" -F purpose="fine-tune"

2. Create fine-tuning job¶

After a training file like alpaca_data.json is uploaded, use the following command to launch a finetuning job using meta-llama/Llama-2-7b-chat-hf as base model:

# create a finetuning job
curl http://${your_ip}:8015/v1/fine_tuning/jobs \
  -X POST \
  -H "Content-Type: application/json" \
  -d '{
    "training_file": "alpaca_data.json",
    "model": "meta-llama/Llama-2-7b-chat-hf"
  }'

The outputs of the finetune job (adapter_model.safetensors, adapter_config,json… ) are stored in /home/user/comps/finetuning/src/output and other execution logs are stored in /home/user/ray_results

3. Manage fine-tuning job¶

Below commands show how to list finetuning jobs, retrieve a finetuning job, cancel a finetuning job and list checkpoints of a finetuning job.

# list finetuning jobs
curl http://${your_ip}:8015/v1/fine_tuning/jobs -X GET

# retrieve one finetuning job
curl http://${your_ip}:8015/v1/fine_tuning/jobs/retrieve -X POST -H "Content-Type: application/json" -d '{"fine_tuning_job_id": ${fine_tuning_job_id}}'

# cancel one finetuning job
curl http://${your_ip}:8015/v1/fine_tuning/jobs/cancel -X POST -H "Content-Type: application/json" -d '{"fine_tuning_job_id": ${fine_tuning_job_id}}'

# list checkpoints of a finetuning job
curl http://${your_ip}:8015/v1/finetune/list_checkpoints -X POST -H "Content-Type: application/json" -d '{"fine_tuning_job_id": ${fine_tuning_job_id}}'

Validated Configurations¶

Deploy Method	Hardware
Docker Compose	Intel Xeon