The first step is to compute each layer's sensitivity to pruning, which gives us an idea of how important each layer is. The approach is to measure the perplexity induced by removing a given layer:
```shell
python -m DistAya.src.pruning.perplexity_sensivity \
    --model CohereForAI/aya-23-8B \
    --batch_size 8 \
    --output_folder sensitivities \
    --subset 128
```

This will produce a CSV containing the sensitivity of each layer to pruning. The sensitivity score is simply the perplexity of the model when that layer is dropped: the higher the perplexity, the more the model depends on that layer.
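The idea behind the script can be sketched as follows. This is a hypothetical illustration, not the DistAya implementation: the model is treated as a stack of layers, and the sensitivity of layer `i` is the perplexity measured with layer `i` skipped. The names `layer_drop_sensitivities` and `evaluate_nlls` are illustrative only.

```python
import math

def perplexity(nlls):
    # Perplexity is the exponential of the mean negative log-likelihood per token.
    return math.exp(sum(nlls) / len(nlls))

def layer_drop_sensitivities(layers, evaluate_nlls):
    """Return, for each layer index, the perplexity with that layer removed.

    `evaluate_nlls(layers)` must return per-token NLLs for the reduced model;
    both names are illustrative, not part of the DistAya API.
    """
    scores = {}
    for i in range(len(layers)):
        reduced = layers[:i] + layers[i + 1:]  # model with layer i dropped
        scores[i] = perplexity(evaluate_nlls(reduced))
    return scores

# Toy usage: "layers" are simple numeric transforms on a scalar hidden state,
# and we pretend the distance to a target value is the per-token NLL.
layers = [lambda h: h + 1.0, lambda h: h * 2.0, lambda h: h + 0.5]

def evaluate_nlls(reduced_layers):
    h = 1.0
    for f in reduced_layers:
        h = f(h)
    return [abs(h - 5.0)]  # fake NLL: distance from a target of 5.0

sensitivities = layer_drop_sensitivities(layers, evaluate_nlls)
```

In the real pipeline the NLLs come from running the pruned transformer over an evaluation subset; the ranking logic over the resulting per-layer perplexities is the same.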
See the ShortGPT paper for background on layer-removal-based importance scoring.
...
...