kashif (Contributor) commented Oct 16, 2025

Summary

Adds support for returning the mean token accuracy metric when the cross-entropy loss is minimized without materializing the logits.

https://x.com/jeremyphoward/status/1703246293802586155
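To illustrate the idea in the summary, here is a minimal pure-Python sketch of computing the mean cross-entropy loss and the mean token accuracy chunk by chunk, so the full logits matrix never exists at once. It is not the kernel from this PR: the function name `chunked_ce_and_accuracy`, the `chunk_size` parameter, and the `IGNORE_INDEX = -100` masking convention are all assumptions made for illustration.

```python
import math

IGNORE_INDEX = -100  # assumed convention for masked-out target positions


def chunked_ce_and_accuracy(hidden, weight, targets, chunk_size=2):
    """Return (mean loss, mean token accuracy) over non-ignored tokens.

    hidden:  list of N vectors of length H (final hidden states)
    weight:  list of V vectors of length H (output projection rows)
    targets: list of N class ids, or IGNORE_INDEX for masked positions
    """
    total_loss = 0.0
    correct = 0
    counted = 0
    for start in range(0, len(hidden), chunk_size):
        chunk_h = hidden[start:start + chunk_size]
        chunk_t = targets[start:start + chunk_size]
        # Logits are built for this chunk only (chunk_size x V) and are
        # discarded on the next iteration; the N x V matrix is never stored.
        chunk_logits = [
            [sum(w_i * h_i for w_i, h_i in zip(w, h)) for w in weight]
            for h in chunk_h
        ]
        for logits, t in zip(chunk_logits, chunk_t):
            if t == IGNORE_INDEX:
                continue
            m = max(logits)
            # Numerically stable log-sum-exp for the softmax normalizer.
            lse = m + math.log(sum(math.exp(z - m) for z in logits))
            total_loss += lse - logits[t]
            # Accuracy: does the argmax (first maximum) match the target?
            correct += int(logits.index(m) == t)
            counted += 1
    if counted == 0:
        return 0.0, 0.0
    return total_loss / counted, correct / counted


# Toy usage: 3 tokens, hidden size 2, vocabulary size 2.
hidden = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
weight = [[2.0, 0.0], [0.0, 2.0]]
loss, accuracy = chunked_ce_and_accuracy(hidden, weight, [0, 1, IGNORE_INDEX])
```

Because the accuracy only needs the per-chunk argmax, it can be accumulated in the same pass that already computes the loss, at no extra memory cost.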

Testing Done

  • Hardware Type:
  • run make test to ensure correctness
  • run make checkstyle to ensure code style
  • run make test-convergence to ensure convergence

kashif (Contributor, Author) commented Oct 20, 2025

@vaibhavjindal would you be able to kindly review?
