Add Bloom Model #1382

abuelnasr0 · 2023-12-27T22:47:01Z

The architecture is done. and the generates output successfully.
remaining two tasks:

add documentation
checkpoint conversion

once I finish, I will mention.

mattdangerw

Thank you!! This is awesome.

Left some initial comments. Adding a test file for the backbone should catch some things.

Just a heads up most everyone will be out for new years, so the next review will probably be next week!

keras_nlp/models/bloom/bloom_backbone.py

keras_nlp/models/bloom/bloom_decoder.py

mattdangerw · 2023-12-27T22:56:35Z

Make sure to run ./shell/format.sh too.

This reverts commit 889f204.

abuelnasr0 · 2024-01-01T14:48:11Z

checkpoint conversion script worked fine and the model produced output that is close to the huggingface output.
check this Gist : https://colab.research.google.com/gist/abuelnasr0/1edd8f43cb05630cc51c9823002e763c/bloom.ipynb

mattdangerw

Left some more comments on the code.

But maybe more importantly, I am looking into license here. We just integrated with Kaggle https://github.com/keras-team/keras-nlp/releases/tag/v0.7.0, which I believe gives us a way to support the open-RAIL license that bloom weights are release under. But we need to double check this. Hope to have an answer next week!

keras_nlp/models/bloom/bloom_attention.py

keras_nlp/models/bloom/bloom_decoder.py

keras_nlp/models/bloom/bloom_mlp.py

abuelnasr0 · 2024-01-06T13:53:32Z

@mattdangerw about the license. If you follow this link https://huggingface.co/bigscience/bloom#uses, you will find a hyperlink (BLOOM license) which points to this License: https://huggingface.co/spaces/bigscience/license

Also I have found these two licenses:

The BigScience RAIL License: https://bigscience.huggingface.co/blog/the-bigscience-rail-license
a license in a github repo but it's mentioned that the repo is deprecated: https://github.com/bigscience-workshop/model_card

This reverts commit 2d03d2c.

This reverts commit 531b1ff.

This reverts commit 2eeb5f4.

abuelnasr0 · 2024-01-06T20:00:19Z

check this gist to see model output compared to huggingface after applying requested changes: https://colab.research.google.com/gist/abuelnasr0/22877985ce1a1c9125e8ed46cfc87da2/bloom.ipynb

mattdangerw

Thanks! This looks good, and I think we are good to land the architecture here.

Can you do two things?

Rebase or merge to the latest changes to see if test are passing again? (we had a keras 3 breakage yesterday)
Send me your kaggle username if you have one?

keras_nlp/models/bloom/bloom_decoder.py

abuelnasr0 · 2024-01-11T12:37:20Z

Send me your kaggle username if you have one?

my kaggle username: mohamedabuelnasr

mattdangerw · 2024-01-15T04:32:40Z

@abuelnasr0 thanks! Sorry for the delay here, but I think you have been added to a list that will allow you to upload models.

I will pull this PR in, then you can proceed roughly as follows...

Create a PR for a tokenizer.
Update the conversion script to output to our new preset format. Use Update llama conversion script for new kaggle format #1402 as a rough reference. That will save weights in a new format ready for kaggle (essentially a directory with a config.json, tokenizer.json, models.weights.h5 and some tokenizer assets). Note the script expects you have installed the keras_nlp package using python pip_build.py --install.
Using the Kaggle UI -> https://www.kaggle.com/models/?new=true create a new bloom model under your username, and upload variants where variant name == preset name. You can just drag the entire contents of a local preset directory into the kaggle upload UI.
Create a new preset file following the pattern here, where essentially we just record some metadata and a link to the kaggle model. The link form will be kaggle://YOUR_USERNAME/bloom/keras/PRESET_ID/1.
Add preset tests.

One other note, the largest models 7b & 176b will require a lot of ram to load, even on a CPU. Feel free to just test the conversion with the smaller models, and we can do the conversion for the larger models on our own compute resources.

This is our first time going through this new Kaggle upload flow, so please let us know any feedback!

abuelnasr0 · 2024-01-16T20:58:47Z

@mattdangerw Thanks for the merge and the instructions. I will open the PR and add the models as soon as possible.

SamanehSaadat · 2024-01-17T18:24:28Z

Hi @abuelnasr0 !

Thanks for contributing this model. I'm working on the Falcon model which similar to the Bloom model, uses alibi. I was wondering if you are interested in separating your alibi implementation and making it reusable.

abuelnasr0 · 2024-01-18T00:33:29Z

@SamanehSaadat sure. I will open a PR for it.

Add Bloom Model

438bf01

mattdangerw requested changes Dec 27, 2023

View reviewed changes

abuelnasr0 added 12 commits December 28, 2023 16:32

Add Backbone test and some fixes

bb281d9

Add BloomBackbone to keras_nlp.models

762f1a0

Fix a typo in layer naming

a01fcd8

Remove self.built = True

889f204

Revert "Remove self.built = True"

7e0313f

This reverts commit 889f204.

Add built=True to MLP layer

6eba2fa

Add Checkpoint conversion script

b77a22e

Change LayerNorm name

a61267f

Fix typo

f64f532

Fix getting HF model output

103664a

Add and to allclose function in checkpoint conversion script

52a1160

Remove allclose check

8017f4e

Add doc for bloom

d700931

abuelnasr0 requested a review from mattdangerw January 2, 2024 18:23

mattdangerw requested changes Jan 5, 2024

View reviewed changes

abuelnasr0 added 10 commits January 6, 2024 16:05

Write batch size instead of _

3461a3e

Rename out_dense to output_dense

5964185

Rename out_dense to output_dense

7c95c10

Format to 80 chars and remove unnecessery check

069f0d2

Remove exporting BloomDecoder

d2514c9

Add intermediate_dim Arg

344c903

Format the code

b2076ff

Remove unnecessery comment

fd9c64c

Use keras gelu

4a5a114

Remove MLP layer and implement it inside BloomDecoder

1136ca2

abuelnasr0 added 9 commits January 6, 2024 20:21

Split q k v heads

2d03d2c

Remove shapes comments

2eeb5f4

Revert "Split q k v heads"

531b1ff

This reverts commit 2d03d2c.

Revert "Revert "Split q k v heads""

786677b

This reverts commit 531b1ff.

Revert "Remove shapes comments"

eccef98

This reverts commit 2eeb5f4.

Add bias axes

ea3063d

Add bias axes to the correct axes

38826c4

Update conversion script for splitting q,k,v

2ad1d17

format the code

b68300f

abuelnasr0 added 6 commits January 7, 2024 12:59

Rename _dropout -> _dropout_layer

67f7198

use clone initializer instead of paasing str name

33ddca2

Serialize kernal & bais initializers

e59e9e2

Format the code

56e4492

Add alibi_bias_max to _build_alibi_tensor function

d89a207

Format the code

0268dd7

mattdangerw reviewed Jan 11, 2024

View reviewed changes

keras_nlp/models/bloom/bloom_decoder.py Outdated Show resolved Hide resolved

Lowercase vairiable names

0c0233f

mattdangerw added the kokoro:force-run Runs Tests on GPU label Jan 11, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jan 11, 2024

mattdangerw merged commit c41e844 into keras-team:master Jan 15, 2024

abuelnasr0 deleted the bloom branch January 16, 2024 20:59

abuelnasr0 mentioned this pull request Jan 18, 2024

Add Alibi bias layer #1404

Merged

abuelnasr0 mentioned this pull request Feb 2, 2024

Add Electra Weights to Kaggle Models #1422

Open

Add Bloom Model #1382

Add Bloom Model #1382

Uh oh!

Conversation

abuelnasr0 commented Dec 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattdangerw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mattdangerw commented Dec 27, 2023

Uh oh!

abuelnasr0 commented Jan 1, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattdangerw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

abuelnasr0 commented Jan 6, 2024

Uh oh!

abuelnasr0 commented Jan 6, 2024

Uh oh!

mattdangerw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

abuelnasr0 commented Jan 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattdangerw commented Jan 15, 2024

Uh oh!

abuelnasr0 commented Jan 16, 2024

Uh oh!

SamanehSaadat commented Jan 17, 2024

Uh oh!

abuelnasr0 commented Jan 18, 2024

Uh oh!

Uh oh!

abuelnasr0 commented Dec 27, 2023 •

edited

Loading

abuelnasr0 commented Jan 1, 2024 •

edited

Loading

abuelnasr0 commented Jan 11, 2024 •

edited

Loading