-
spark-rapids-tools Public
Forked from NVIDIA/spark-rapids-toolsUser tools for Spark RAPIDS
Scala Apache License 2.0 UpdatedOct 29, 2025 -
spark-rapids-benchmarks Public
Forked from NVIDIA/spark-rapids-benchmarksSpark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark
Python Apache License 2.0 UpdatedMay 15, 2024 -
spider Public
Forked from taoyds/spiderscripts and baselines for Spider: Yale complex and cross-domain semantic parsing and text-to-SQL challenge
Python UpdatedMay 15, 2024 -
tpch-spark Public
Forked from ssavvides/tpch-sparkTPC-H queries in Apache Spark SQL using native DataFrames API
C MIT License UpdatedJan 23, 2024 -
spark-rapids-ml Public
Forked from NVIDIA/spark-rapids-mlSpark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs
Jupyter Notebook Apache License 2.0 UpdatedNov 9, 2023 -
spark-rapids-examples Public
Forked from NVIDIA/spark-rapids-examplesA repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
Jupyter Notebook Apache License 2.0 UpdatedMay 23, 2023 -
sparkext Public
Spark DL Inferencing using external frameworks
-
spark Public
Forked from apache/sparkApache Spark - A unified analytics engine for large-scale data processing
-
horovod Public
Forked from horovod/horovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Python Other UpdatedFeb 17, 2023 -
mlflow Public
Forked from mlflow/mlflowOpen source platform for the machine learning lifecycle
Python Apache License 2.0 UpdatedAug 4, 2022 -
NVTabular Public
Forked from NVIDIA-Merlin/NVTabularNVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Python Apache License 2.0 UpdatedJun 22, 2022 -
TensorFlowOnSpark Public
Forked from yahoo/TensorFlowOnSparkTensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.


