Pull requests: NVIDIA-NeMo/Curator
Pull requests list
ci: Add placeholder codecov check for when tests do not run
#749
opened Jun 27, 2025 by
chtruong814
3 tasks
[Ray] DocumentFilter and Filter/Score/ScoreFilter
#746
opened Jun 24, 2025 by
sarahyurick
3 of 4 tasks
[Ray] Add Ray Data as an experimental backend
#740
opened Jun 18, 2025 by
praateekmahajan
3 tasks
Add classifier CLI script tests
gpuci
Run GPU CI/CD on PR
#684
opened Apr 22, 2025 by
sarahyurick
10 tasks done
SemDedup bug fix for single element cluster
gpuci
Run GPU CI/CD on PR
#683
opened Apr 22, 2025 by
praateekmahajan
3 tasks
Fail loudly for NeMo Curator Dask-Cuda cluster creation CUDA context issues
gpuci
Run GPU CI/CD on PR
#675
opened Apr 18, 2025 by
VibhuJawa
Add option to skip data by adding a flag instead of removing them
#566
opened Feb 22, 2025 by
shuoyangd
1 of 3 tasks
Add a way to pass expected language to FastTextLangId filter
#565
opened Feb 21, 2025 by
shuoyangd
2 of 3 tasks
Hard negative mining for Retriever fine-tuning
#523
opened Feb 5, 2025 by
vinay-raman
3 tasks done
Clean up Pandas, cuDF, Dask, and Dask-cuDF DocumentDataset type logic
gpuci
Run GPU CI/CD on PR
#494
opened Jan 23, 2025 by
sarahyurick
Standardize text_field and id_field terminology
gpuci
Run GPU CI/CD on PR
#485
opened Jan 17, 2025 by
sarahyurick