Mitigating Quantization Errors Due to Activation Spikes in GLU-Based LLMs
Requirements)
- pytorch==2.2.0
- transformers==4.38.1
pip install transformers==4.38.1 accelerate bitsandbytes easydict matplotlib datasets scipy seaborn sentencepiece protobuf
# install lm-eval
git clone https://github.com/EleutherAI/lm-evaluation-harness.git
cd lm-evaluation-harness
git checkout 6a1c19
pip install -e .
cd ..
# install act_spike
pip install -e .
Usage)
cd exp
python extract_misc.py {hf_model_name}
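For example, to run the extraction step on a specific checkpoint (the Hugging Face model identifier below is only illustrative; substitute any model supported by the script):
python extract_misc.py meta-llama/Llama-2-7b-hf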
cd exp
python eval.py {hf_model_name} {--flags}
Supported Flags)
--use_cache: enable QFeP
--except_layer: enable QFeM
--sq: enable SmoothQuant
--osp: enable Outlier Suppression+
--weight_quant: weight quantization scheme
--act_granul: activation quantization scheme
--bmm: enable BMM quantization
--fp16: enable FP16
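For example, assuming --use_cache and --except_layer act as on/off switches as described above, an evaluation with both QFeP and QFeM enabled could be launched as follows (the model identifier is only illustrative):
python eval.py meta-llama/Llama-2-7b-hf --use_cache --except_layer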
cd exp
python bench.py {hf_model_name} {--flags}
Supported Flags)
--use_cache: enable QFeP
--except_layer: enable QFeM
--seqlen: set sequence length
--n_samples: set number of samples
--act_granul: activation quantization scheme
--fp16: enable FP16
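For example, assuming --seqlen and --n_samples take integer values, a benchmark run with QFeP and QFeM enabled could look like the following (the model identifier and values are only illustrative):
python bench.py meta-llama/Llama-2-7b-hf --seqlen 2048 --n_samples 128 --use_cache --except_layer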