Added an Example for BLEU. #799
Conversation
I have made some changes to the docstring example of the TokenAndPositionEmbedding layer, as suggested in issue #658.
I have changed the epsilon value from 1e-5 to 1e-12.
I have made some updates to the content. First, I have added an example for BLEU, which was not present before; it illustrates the metric and should help readers who are new to the topic. I have also fixed the URLs by wrapping them in markdown, making the linked resources easier to access.
Left a few comments!
keras_nlp/metrics/bleu.py
Outdated
Examples:
```python
bleu = keras_nlp.metrics.Bleu(max_order=4)
# reference sentence
ref_sentence = "the quick brown fox jumps over the lazy dog"
# predicted sentence
pred_sentence = "the quick brown fox jumps over the box"
# compute BLEU score
score = bleu([ref_sentence], [pred_sentence])
print("BLEU score:", score)
```
Oh, shoot! I must have forgotten to add examples. Instead of fenced doc-strings, can we have examples this way: https://github.com/keras-team/keras-nlp/blob/master/keras_nlp/metrics/rouge_l.py#L46-L113?
yeah sure!
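For reference, the doctest-style convention used in `rouge_l.py` (linked above) looks roughly like the sketch below. This is only an illustration of the format, not the wording that ended up in the PR, and the example values are placeholders.

```python
# Illustration of the doctest-style docstring format used by keras-nlp
# metrics such as rouge_l.py: examples live under an "Examples:" heading
# and use >>> prompts instead of fenced code blocks.
class Bleu:
    """BLEU metric (format sketch only).

    Examples:

    1. Calculate BLEU by calling the metric directly.
    >>> bleu = keras_nlp.metrics.Bleu(max_order=4)
    >>> references = ["the quick brown fox jumps over the lazy dog"]
    >>> translations = ["the quick brown fox jumps over the box"]
    >>> bleu(references, translations)
    """
```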
keras_nlp/metrics/bleu.py
Outdated
penalise short predictions. For more details, see the following article:
-https://cloud.google.com/translate/automl/docs/evaluate#bleu.
+[Link](https://cloud.google.com/translate/automl/docs/evaluate#bleu).
We can probably change this to
For more details, see [this article](https://cloud.google.com/translate/automl/docs/evaluate#bleu).
keras_nlp/metrics/bleu.py
Outdated
`"tokenizer_13a"` tokenizer | ||
(https://github.com/mjpost/sacrebleu/blob/v2.1.0/sacrebleu/tokenizers/tokenizer_13a.py). | ||
[Link](https://github.com/mjpost/sacrebleu/blob/v2.1.0/sacrebleu/tokenizers/tokenizer_13a.py). |
Here, as well, we can probably change this to
...
[SacreBLEU's `"tokenizer_13a"` tokenizer](https://github.com/mjpost/sacrebleu/blob/v2.1.0/sacrebleu/tokenizers/tokenizer_13a.py).
@@ -140,7 +140,7 @@ def __init__(
         x = keras.layers.LayerNormalization(
             name="encoder_embeddings_layer_norm",
             axis=-1,
-            epsilon=1e-5,
+            epsilon=1e-12,
Why are we changing the epsilon values here? BART uses epsilon=1e-5: https://github.com/huggingface/transformers/blob/main/src/transformers/models/bart/modeling_bart.py#L732. The PyTorch default is also 1e-5: https://pytorch.org/docs/stable/generated/torch.nn.LayerNorm.html.
I had read an issue about changing the epsilon value, which is why I changed it: #642.
I will update it as you said, from 1e-12 back to 1e-5.
Changed the epsilon value from 1e-12 to 1e-5.
Made the changes as suggested by the reviewer.
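For context on the epsilon discussion above, here is a quick, illustrative check (not part of the PR) showing that the two values produce nearly identical LayerNormalization outputs on typical activations; matching BART/PyTorch's 1e-5 is mainly about staying consistent with the reference implementation when porting weights.

```python
# Illustrative comparison, not from the PR: LayerNormalization with the two
# epsilon values under discussion. For ordinary activation magnitudes the
# outputs are essentially identical.
import numpy as np
import tensorflow as tf
from tensorflow import keras

x = tf.random.normal((2, 8, 16))
ln_1e5 = keras.layers.LayerNormalization(axis=-1, epsilon=1e-5)
ln_1e12 = keras.layers.LayerNormalization(axis=-1, epsilon=1e-12)
diff = np.max(np.abs(ln_1e5(x).numpy() - ln_1e12(x).numpy()))
print(f"max abs difference: {diff:.2e}")
```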
Thanks for the update, @Neeshamraghav012! A few minor comments, and one major comment about adding more examples.
Oh, and please run the code formatters (`./shell/format.sh`).
keras_nlp/metrics/bleu.py
Outdated
Examples:

        1. Calculate BLEU score by calling Bleu directly.
        >>> bleu = keras_nlp.metrics.Bleu(max_order=4)
        >>> ref_sentence = "the quick brown fox jumps over the lazy dog"
        >>> pred_sentence = "the quick brown fox jumps over the box"
        >>> score = bleu([ref_sentence], [pred_sentence])
        <tf.Tensor(0.7420885, shape=(), dtype=float32)>
No need for the extra indentation, it can be in line with "Examples:"!
-        **kwargs: Other keyword arguments.
+
+        **kwargs: Other keyword arguments.
+
+        Examples:
We need to add more examples, which cover the various input types and input shapes. Check this for reference: https://github.com/keras-team/keras-nlp/blob/master/keras_nlp/metrics/rouge_l.py#L46.
We also need to demonstrate that we can pass multiple references for the same sample, i.e., a (references, translation) pair. We need a separate example for this.
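A rough sketch of what such a multi-reference example might look like, assuming `keras_nlp.metrics.Bleu` accepts a nested list of references per translation; the exact accepted shapes should be taken from the final docstring and tests, not from this sketch.

```python
# Hypothetical sketch, not from the PR: passing several references for the
# same translation. Whether Bleu accepts this nested shape should be
# verified against the final keras_nlp docstring.
import keras_nlp

bleu = keras_nlp.metrics.Bleu(max_order=4)

references = [
    [
        "the quick brown fox jumps over the lazy dog",
        "a quick brown fox jumps over a lazy dog",
    ],
]  # multiple references for one sample
translations = ["the quick brown fox jumps over the box"]

score = bleu(references, translations)
print("BLEU score:", score)
```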
keras_nlp/metrics/bleu.py
Outdated
@@ -92,8 +92,15 @@ class Bleu(keras.metrics.Metric):
         dtype: string or tf.dtypes.Dtype. Precision of metric computation. If
             not specified, it defaults to tf.float32.
         name: string. Name of the metric instance.
-        **kwargs: Other keyword arguments.
+
+        **kwargs: Other keyword arguments.
Let's remove `**kwargs`... we don't actually document it anywhere in the library. And please add a newline after the last arg!
keras_nlp/metrics/bleu.py
Outdated
`"tokenizer_13a"` tokenizer, see | ||
[tokenizer details](https://github.com/mjpost/sacrebleu/blob/v2.1.0/sacrebleu/tokenizers/tokenizer_13a.py). |
This didn't work?
[SacreBLEU's `"tokenizer_13a"` tokenizer](https://github.com/mjpost/sacrebleu/blob/v2.1.0/sacrebleu/tokenizers/tokenizer_13a.py).
I think this looks cleaner, what do you think?
I have added more examples as suggested by the reviewer, and made some minor changes as @abheesht17 said.