Skip to content

rsk2327/DistAya

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pruning

Layer Sensitivity Scores

The first step is to compute the sensitivity of each layer to pruning. This gives us an idea of the importance of each layer.

Perplexity as a Sensitivity Score

The approach is to examine the perplexity induced when removing a given layer.

python -m DistAya.src.pruning.perplexity_sensivity \
            --model CohereForAI/aya-23-8B \
            --batch_size 8 \
            --output_folder sensitivities \
            --subset 128

This will produce a CSV representing the sensitivity of each layer to pruning. This sensitivity score is just the perplexity of the model when this layer is dropped.

Input/output similarity as a Sensivity Score

See the ShortGPT Paper

Compression

...

Distillation

...

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •