-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Poor accuracy with non-US regional settings #4910
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi Damir, You bring up a very good edge case we shouldn't forget, which is that decimal points are represented with commas in a lot of regions outside the US, (For example, 3.5 * 10^1 is represented as 3.5 in the US and 3,5 in Europe, and 3.5 * 10^5 is represented as 350,000 in the US and 350.000 in Europe). We will look into this, thank you! Edit: Another good edge case is the negative (-) sign. In some cultures (e.g. Faroese in Denmark), |
We may want to setup a CI leg which tests another culture. Perhaps "de-DE" or "ru-RU". |
How about this or similar?
|
Here is a .NET Fiddle that demonstrates some of these cultural differences in .NET Core 3.1 (major thanks to @justinormont for the idea and the initial implementation): https://dotnetfiddle.net/LtAtoi |
Hey @ddobric , can you provide sample code and data for which you've used to obtain the difference in your results above? It'll be helpful in testing the validity of a PR to fix this issue. Thanks! |
Hi @ddobric, I'm reaching out again to ask if you can provide sample code and data with which you've encountered this issue. Thanks. |
System information
Issue
Getting poor accuracy running the training code on a system with non-US regional settings. The issue is number format. After replacing ',' as decimal symbol to '.' all works fine.
Accuracy with ',' decimal symbol:
Accuracy with '.' decimal symbol:
Is there some way to take a control of localization in .NET ML?
Thanks
Damir
The text was updated successfully, but these errors were encountered: