Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: sotirios4/fastText
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: main
Choose a base ref
...
head repository: facebookresearch/fastText
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: main
Choose a head ref
  • 10 commits
  • 34 files changed
  • 9 contributors

Commits on Apr 17, 2023

  1. Replace outdated url in the scripts

    Summary: Replace outdated url in the scripts
    
    Reviewed By: piotr-bojanowski
    
    Differential Revision: D43464784
    
    fbshipit-source-id: 51a98a9ad5a0939acd0d578126290909a613938b
    Celebio authored and facebook-github-bot committed Apr 17, 2023
    Configuration menu
    Copy the full SHA
    0622aad View commit details
    Browse the repository at this point in the history

Commits on Jun 7, 2023

  1. Add documentation about Hugging Face integration (facebookresearch#1335)

    Summary:
    [Word vectors](https://huggingface.co/facebook/fasttext-en-vectors) for 157 languages are now hosted on the Hugging Face Hub as well as the [language identification model](https://huggingface.co/facebook/fasttext-language-identification). (cc ajoulin)
    
    A newer language model [referred in the NLLB project](https://github.com/facebookresearch/fairseq/blob/nllb/README.md#lid-model) is not mentioned in the official website, so I updated the doc accordingly.
    
    Pull Request resolved: facebookresearch#1335
    
    Reviewed By: Celebio
    
    Differential Revision: D46507563
    
    Pulled By: jmp84
    
    fbshipit-source-id: 64883a6829c68b968acd980ba77a712b8e7a1365
    sheonhan authored and facebook-github-bot committed Jun 7, 2023
    Configuration menu
    Copy the full SHA
    48171ac View commit details
    Browse the repository at this point in the history

Commits on Jun 21, 2023

  1. Migrate "deeplearning/fastText" from LLVM-12 to LLVM-15

    Summary:
    fbcode is migrating to LLVM-15 for safer and more up-to-date code and new compiler features. All contbuilds in your directory have passed our build test with LLVM-15, and your directory does not host any packages. This diff will migrate it to LLVM-15.
    
    If you approve of this diff, please use the "Accept & Ship" button. If you have a reason for why it should not build with LLVM 15, please make a comment and send it back to author. Otherwise we will land this on Thursday 06/15/2023.
    
    See the [FAQ post](https://fb.workplace.com/groups/llvm15platform010/posts/749154386769776/)! Please also direct any questions to [this group](https://fb.workplace.com/groups/llvm15platform010).
    
     - If you approve of this diff, please use the "Accept & Ship" button :-)
    
    Reviewed By: meyering
    
    Differential Revision: D46661531
    
    fbshipit-source-id: 7278fbfcadec2392c94efd6deb710bdd5e9280f8
    generatedunixname89002005320047 authored and facebook-github-bot committed Jun 21, 2023
    Configuration menu
    Copy the full SHA
    440f46a View commit details
    Browse the repository at this point in the history

Commits on Aug 25, 2023

  1. Del (object) from 200 inc deeplearning/aicamera/trainer/utils/metri…

    …cs.py
    
    Summary: Python3 makes the use of `(object)` in class inheritance unnecessary. Let's modernize our code by eliminating this.
    
    Reviewed By: itamaro
    
    Differential Revision: D48673901
    
    fbshipit-source-id: 3e0ef05efe886b32a07bb58bd0725fa2ec934c14
    r-barnes authored and facebook-github-bot committed Aug 25, 2023
    Configuration menu
    Copy the full SHA
    166ce2c View commit details
    Browse the repository at this point in the history

Commits on Oct 4, 2023

  1. deeplearning, dcp (2972240286315620591)

    Reviewed By: r-barnes
    
    Differential Revision: D49677606
    
    fbshipit-source-id: ec5b375177586c76ecccb83a29b562bc6e9961f6
    generatedunixname89002005287564 authored and facebook-github-bot committed Oct 4, 2023
    Configuration menu
    Copy the full SHA
    789e328 View commit details
    Browse the repository at this point in the history

Commits on Nov 27, 2023

  1. Add pyproject.toml to comply with PEP-518 (facebookresearch#1292)

    Summary:
    Adds pyproject.toml to comply with PEP-518, which fixes the building of the library by poetry - See python-poetry/poetry#6113 . This is a copy of facebookresearch#1270 , but I have signed the CLA.
    
    Pull Request resolved: facebookresearch#1292
    
    Differential Revision: D51601444
    
    Pulled By: alexkosau
    
    fbshipit-source-id: 357d702281ca3519c3640483eba04d124d0744b4
    cfculhane authored and facebook-github-bot committed Nov 27, 2023
    Configuration menu
    Copy the full SHA
    de458dd View commit details
    Browse the repository at this point in the history
  2. fix compile error with gcc13 facebookresearch#1281 (facebookresearch#…

    …1340)
    
    Summary:
    Due to[ header dependency changes](https://gcc.gnu.org/gcc-13/porting_to.html#header-dep-changes) in GCC 13, we need to include the <cstdint> header.
    
    Pull Request resolved: facebookresearch#1340
    
    Reviewed By: jmp84
    
    Differential Revision: D51602433
    
    Pulled By: alexkosau
    
    fbshipit-source-id: cc9bffb276cb00f1db8ec97a36784c484ae4563a
    NiuBlibing authored and facebook-github-bot committed Nov 27, 2023
    Configuration menu
    Copy the full SHA
    6c2204b View commit details
    Browse the repository at this point in the history

Commits on Jan 9, 2024

  1. Predict 1.9-4.2x faster (facebookresearch#1341)

    Summary:
    I made prediction 1.9x to 4.2x faster than before.
    
    # Motivation
    I want to use https://tinyurl.com/nllblid218e and similarly parametrized models to run language classification on petabytes of web data.
    
    # Methodology
    The costliest operation is summing the rows for each model input.  I've optimized this in three ways:
    1. `addRowToVector` was a virtual function call for each row.  I've replaced this with one virtual function call per prediction by adding `averageRowsToVector` to `Matrix` calls.
    2. `Vector` and `DenseMatrix` were not 64-byte aligned so the CPU was doing a lot of unaligned memory access.  I've brought in my own `vector` replacement that does 64-byte alignment.
    3.  Write the `averageRowsToVector` in intrinsics for common vector sizes.  This works on SSE, AVX, and AVX512F.
    
    See the commit history for a breakdown of speed improvement from each change.
    
    # Experiments
    Test set [docs1000.txt.gz](https://github.com/facebookresearch/fastText/files/11832996/docs1000.txt.gz) which is a bunch of random documents https://data.statmt.org/heafield/classified-fasttext/
    CPU: AMD Ryzen 9 7950X 16-Core
    
    Model https://tinyurl.com/nllblid218e with 256-dimensional vectors
    Before
    real    0m8.757s
    user    0m8.434s
    sys     0m0.327s
    
    After
    real    0m2.046s
    user    0m1.717s
    sys     0m0.334s
    
    Model https://dl.fbaipublicfiles.com/fasttext/supervised-models/lid.176.bin with 16-dimensional vectors
    Before
    real    0m0.926s
    user    0m0.889s
    sys     0m0.037s
    
    After
    real    0m0.477s
    user    0m0.436s
    sys     0m0.040s
    
    Pull Request resolved: facebookresearch#1341
    
    Reviewed By: graemenail
    
    Differential Revision: D52134736
    
    Pulled By: kpuatfb
    
    fbshipit-source-id: 42067161f4c968c34612934b48a562399a267f3b
    kpu authored and facebook-github-bot committed Jan 9, 2024
    Configuration menu
    Copy the full SHA
    b733943 View commit details
    Browse the repository at this point in the history

Commits on Feb 20, 2024

  1. deeplearning/fastText 2/2

    Reviewed By: azad-meta
    
    Differential Revision: D53908330
    
    fbshipit-source-id: b2215f0522c32a82cd876633210befefe9317d76
    generatedunixname89002005287564 authored and facebook-github-bot committed Feb 20, 2024
    Configuration menu
    Copy the full SHA
    ae1fe80 View commit details
    Browse the repository at this point in the history

Commits on Mar 13, 2024

  1. Delete .circleci directory (facebookresearch#1366)

    Summary: Pull Request resolved: facebookresearch#1366
    
    Reviewed By: jailby
    
    Differential Revision: D54850920
    
    Pulled By: bigfootjon
    
    fbshipit-source-id: 9a3eec7b7cb42335a786fb247cb16be9ed3c2d59
    bigfootjon authored and facebook-github-bot committed Mar 13, 2024
    Configuration menu
    Copy the full SHA
    1142dc4 View commit details
    Browse the repository at this point in the history
Loading