# Llama 3.2 1B Instruct Recipes

This folder provides Olive optimization / fine-tuning / quantization / evaluation recipes for `meta-llama/Llama-3.2-1B-Instruct`.

Each recipe is a self-contained JSON passed to the Olive CLI: `olive run --config <file>.json`.
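
As a rough orientation, every recipe follows the same top-level layout: an input model, a `passes` section describing the transformation chain, and an output location. The skeleton below is only an illustrative sketch of that shape (the `HfModel` type and field names follow Olive's usual config schema but are not copied from any recipe here; see the JSON files for the exact contents):

```json
{
  "input_model": {
    "type": "HfModel",
    "model_path": "meta-llama/Llama-3.2-1B-Instruct"
  },
  "passes": {
    "<pass_name>": { "type": "<PassType>" }
  },
  "output_dir": "models"
}
```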

## Quick Start

Install dependencies (make sure you are in the `olive-recipes/meta-llama-Llama-3.2-1B-Instruct/olive` directory):

```bash
python -m pip install -r requirements.txt
```

Typical steps:

1. Run optimization / fine-tuning:

   ```bash
   olive run --config qlora.json
   ```

2. Find the output models and adapters under the `output_dir` (default `models/`).

## Recipe Summary

| File | Goal | Main Pass Chain (order) |
|------|------|-------------------------|
| `qlora.json` | QLoRA PEFT finetune + export + ORT opt + extract adapters | `q` (qlora) → `m` (ModelBuilder fp16) → `o` (OrtTransformersOptimization fp16) → `e` (ExtractAdapters) |
| `loha.json` | LoHa finetune + ONNX export + ORT opt + extract | `l` (loha) → `c` (OnnxConversion) → `o` (ORT opt) → `e` (ExtractAdapters) → `m` (metadata) |
| `lokr.json` | LoKr finetune + ONNX export + ORT opt + extract | `l` (lokr) → `c` (OnnxConversion) → `o` → `e` → `m` |
| `dora.json` | DoRA finetune + ORT opt + extract | `d` (dora) → `m` (ModelBuilder fp16) → `o` → `e` |
| `rtn.json` | Block-wise RTN quantization (ONNX) | `m` (ModelBuilder fp16) → `q` (OnnxBlockWiseRtnQuantization) |
| `hqq.json` | HQQ quantization (ONNX) | `m` (ModelBuilder fp16) → `q` (OnnxHqqQuantization) |
| `lmeval.json` | HF (fp16/fp32) evaluation with LMEval | evaluator only |
| `lmeval_onnx.json` | INT4 ModelBuilder + LMEval | `mb` (ModelBuilder int4) + evaluator |
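
The letters in the pass-chain column are the keys of each recipe's `passes` object, listed in execution order. As an illustrative sketch only (the pass keys and types come from the table above, but option names such as `precision` are assumptions; check the actual file), the `rtn.json` chain maps to a `passes` section roughly like:

```json
{
  "passes": {
    "m": { "type": "ModelBuilder", "precision": "fp16" },
    "q": { "type": "OnnxBlockWiseRtnQuantization" }
  }
}
```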

## Example Commands

QLoRA training + optimization:

```bash
olive run --config qlora.json
```

LoHa adapter training and export to ONNX:

```bash
olive run --config loha.json
```

HQQ quantization (the ONNX build happens inside the pass chain):

```bash
olive run --config hqq.json
```

Run LM evaluation on the HF model:

```bash
olive run --config lmeval.json
```

Evaluate the INT4 ONNX build:

```bash
olive run --config lmeval_onnx.json
```

Clean cache for a fresh run (example):

```bash
olive run --config qlora.json --clean_cache
```