decode_n_tokens clean up #1532
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1532
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 3816bca with merge base 701d826.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
```diff
@@ -535,7 +535,6 @@ def decode_n_tokens(
     attention_backend: SDPBackend = torch.nn.attention.SDPBackend.MATH,
     **sampling_kwargs,
 ):
-    new_tokens, new_probs = [], []
```
Not really used. This function yields individual tokens and probabilities one at a time; it never returns the accumulated lists.
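As a minimal sketch of the point above (all names here are illustrative stand-ins, not code from the PR): because the function is a generator, callers collect the tokens themselves, so a list accumulated inside the generator body is dead code.

```python
# Hypothetical sketch of a yield-based token decoder; the real
# decode_n_tokens samples from a model, this just yields integers.
def decode_n_tokens_sketch(num_new_tokens):
    for i in range(num_new_tokens):
        next_token, next_prob = i, None  # stand-ins for model sampling
        # Tokens are yielded one at a time; no internal list is needed.
        yield next_token, next_prob

# Callers accumulate the results themselves, which is why a
# `new_tokens` list inside the generator would go unused:
generated = [tok for tok, _ in decode_n_tokens_sketch(3)]
print(generated)  # [0, 1, 2]
```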
```diff
@@ -553,12 +552,10 @@ def decode_n_tokens(
         **sampling_kwargs,
     )
     input_pos += 1
-    new_tokens.append(next_token.clone())
-    callback(new_tokens[-1], done_generating=_i == num_new_tokens - 2)
-    if need_probs or next_prob is None:
```
This was backwards. It should be `if not need_probs or next_prob is None:`. Otherwise you are saying: if you need the probabilities, you get None, and if you don't need them, you get them.
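To make the inversion concrete, here is a small truth-table check (pure illustration; `old_check`/`fixed_check` are hypothetical names mirroring the conditions in the diff):

```python
def old_check(need_probs, next_prob):
    # Pre-fix condition from the diff
    return need_probs or next_prob is None

def fixed_check(need_probs, next_prob):
    # Corrected condition
    return (not need_probs) or next_prob is None

prob = 0.9  # stand-in for an available probability value

# When probabilities are needed and available, the old check still fired:
print(old_check(True, prob), fixed_check(True, prob))    # True False
# When they are not needed, the old check refused to fire:
print(old_check(False, prob), fixed_check(False, prob))  # False True
```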
```diff
@@ -788,7 +784,6 @@ def generate(
         input_pos = input_pos + num_added
         next_token = next_tokens[-1]
     else:
-        generated_tokens = []
```
Not used. Generated tokens are appended after the call to `decode_n_tokens`, but then nothing happens with them.
```diff
-    callback(new_tokens[-1], done_generating=_i == num_new_tokens - 2)
-    if need_probs or next_prob is None:
+    callback(next_token.clone(), done_generating=_i == num_new_tokens - 2)
+    if not need_probs or next_prob is None:
```
Everything else is just cleaning up unused code. `not need_probs` is the only real change, and in the non-speculative path `need_probs` is always false, so the old check is effectively just `if next_prob is None`:

torchchat/generate.py, line 799 in 701d826:
`need_probs=False,`
I think we should drop the check instead of negating here, so it becomes easier to rip spec decoding out completely. The returned prob doesn't get used either way:

torchchat/generate.py, line 792 in 701d826:
`for generated_token, _ in self.decode_n_tokens(`
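A sketch of that reasoning (not code from the PR): assuming the linked call site always passes `need_probs=False` on the non-speculative path, the pre-fix condition collapses to testing `next_prob` alone.

```python
need_probs = False  # as at the linked non-speculative call site

def pre_fix_check(next_prob):
    # The pre-fix condition, with need_probs pinned to False:
    return need_probs or next_prob is None

def reduced_check(next_prob):
    return next_prob is None

# The two agree for any next_prob, so the need_probs term can be dropped:
for p in (None, 0.5):
    assert pre_fix_check(p) == reduced_check(p)
```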
```diff
-    if not need_probs or next_prob is None:
+    if next_prob is None:
```
Honestly this is a nit we can just merge
Deletes unused Python lists.