Open Source Machine Learning Software - Page 5

1

GROBID

A machine learning software for extracting information

GROBID is a machine learning library for extracting, parsing, and restructuring raw documents such as PDFs into structured XML/TEI-encoded documents, with a particular focus on technical and scientific publications. Development started in 2008 as a hobby, and the tool was released as open source in 2011. Work on GROBID has been steady as a side project since the beginning and is expected to continue as such. Header extraction and parsing from articles in PDF format covers the usual bibliographical information (e.g. title, abstract, authors, affiliations, keywords). Reference extraction and parsing from articles in PDF format reaches around 0.87 F1-score on an independent PubMed Central set of 1,943 PDFs containing 90,125 references, and around 0.89 on a similar bioRxiv set of 2,000 PDFs (using the Deep Learning citation model). All the usual publication metadata are covered (including DOI, PMID, etc.).

Downloads: 3 This Week

Last Update: 2025-05-11
See Project
2

Hummingbird

Hummingbird compiles trained ML models into tensor computation

Hummingbird is a library for compiling trained traditional ML models into tensor computations. Hummingbird allows users to seamlessly leverage neural network frameworks (such as PyTorch) to accelerate traditional ML models. Thanks to Hummingbird, users can benefit from (1) all the current and future optimizations implemented in neural network frameworks; (2) native hardware acceleration; (3) having a unique platform to support both traditional and neural network models; and having all of this (4) without having to re-engineer their models.

Downloads: 3 This Week

Last Update: 2024-10-24
See Project
3

Lepton AI

A Pythonic framework to simplify AI service building

A Pythonic framework to simplify AI service building. Cutting-edge AI inference and training, unmatched cloud-native experience, and top-tier GPU infrastructure. Ensure 99.9% uptime with comprehensive health checks and automatic repairs.

Downloads: 3 This Week

Last Update: 2025-11-07
See Project
4

LightGBM

Gradient boosting framework based on decision tree algorithms

LightGBM, or Light Gradient Boosting Machine, is a high-performance, open source gradient boosting framework based on decision tree algorithms. Compared to other boosting frameworks, LightGBM offers several advantages in terms of speed, efficiency and accuracy. Parallel experiments have shown that LightGBM can attain linear speed-up through multiple machines for training in specific settings, all while consuming less memory. LightGBM supports parallel and GPU learning, and can handle large-scale data. It has become widely used for ranking, classification and many other machine learning tasks.

Downloads: 3 This Week

Last Update: 2025-02-15
See Project
5

MLRun

Machine Learning automation and tracking

MLRun is an open MLOps framework for quickly building and managing continuous ML and generative AI applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications, significantly reducing engineering efforts, time to production, and computation resources. MLRun breaks the silos between data, ML, software, and DevOps/MLOps teams, enabling collaboration and fast continuous improvements. In MLRun the assets, metadata, and services (data, functions, jobs, artifacts, models, secrets, etc.) are organized into projects. Projects can be imported/exported as a whole, mapped to git repositories or IDE projects (in PyCharm, VSCode, etc.), which enables versioning, collaboration, and CI/CD. Project access can be restricted to a set of users and roles.

Downloads: 3 This Week

Last Update: 2025-07-23
See Project
6

Machine Learning Yearning

Machine Learning Yearning

Artificial intelligence, machine learning and deep learning are transforming numerous industries. Professor Andrew Ng is currently writing a book on how to build machine learning projects. The point of this book is not to teach traditional machine learning algorithms, but to teach you how to make machine learning algorithms work. Some technical courses in AI give you tools; this book teaches you how to use them. If you aspire to be a technical leader in AI and want to learn how to set a direction for your team, this book will help. This book is still a sample draft. The translation was done quickly to make the corresponding Chinese content available as soon as possible, so some oversights are inevitable. You can reach the repository through the GitHub icon in the upper right corner and suggest corrections.

Downloads: 3 This Week

Last Update: 2022-08-01
See Project
7

Nixtla TimeGPT

TimeGPT-1: production ready pre-trained Time Series Foundation Model

TimeGPT is a production-ready generative pretrained transformer for time series. It can produce accurate forecasts across domains such as retail, electricity, finance, and IoT with just a few lines of code. Whether you're a bank forecasting market trends or a startup predicting product demand, TimeGPT democratizes access to cutting-edge predictive insights, eliminating the need for a dedicated team of machine learning engineers.

Downloads: 3 This Week

Last Update: 2025-11-19
See Project
8

OpenBB

Investment Research for Everyone, Everywhere

Customize and speed up your analysis, bring your own data, and create instant reports to gain a competitive edge. Bring in a CSV file, a private endpoint, an RSS feed, or even embed an SEC filing directly. Chat with financial data using large language models. Don't waste time reading: create summaries in seconds and ask how they impact investments. Create your dashboard with your favorite widgets. Create charts directly from raw data in seconds. Customize your dashboards to build your dream terminal, integrate with your private datasets and bring your own fine-tuned AI copilots.

Downloads: 3 This Week

Last Update: 2025-10-22
See Project
9

PaddlePaddle

PArallel Distributed Deep LEarning: Machine Learning Framework

PaddlePaddle is an open source deep learning industrial platform with advanced technologies and a rich set of features that make innovation and application of deep learning easier. It is the only independent R&D deep learning platform in China, and has been widely adopted in various sectors including manufacturing, agriculture and enterprise service. PaddlePaddle covers core deep learning frameworks, basic model libraries, end-to-end development kits and more, with support for both dynamic and static graphs.

Downloads: 3 This Week

Last Update: 2025-09-08
See Project
10

Pedalboard

A Python library for audio

pedalboard is a Python library for working with audio: reading, writing, rendering, adding effects, and more. It supports the most popular audio file formats and a number of common audio effects out of the box and also allows the use of VST3® and Audio Unit formats for loading third-party software instruments and effects. pedalboard was built by Spotify’s Audio Intelligence Lab to enable using studio-quality audio effects from within Python and TensorFlow. Internally at Spotify, pedalboard is used for data augmentation to improve machine learning models and to help power features like Spotify’s AI DJ and AI Voice Translation. pedalboard also helps in the process of content creation, making it possible to add effects to audio without using a Digital Audio Workstation.

Downloads: 3 This Week

Last Update: 2025-10-08
See Project
11

PyCaret

An open-source, low-code machine learning library in Python

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that speeds up the experiment cycle and makes you more productive. Compared with other open-source machine learning libraries, PyCaret is a low-code alternative that can replace hundreds of lines of code with only a few, which makes experiments much faster and more efficient. PyCaret is essentially a Python wrapper around several machine learning libraries and frameworks such as scikit-learn, XGBoost, LightGBM, CatBoost, Optuna, Hyperopt, Ray, and a few more. The design and simplicity of PyCaret are inspired by the emerging role of citizen data scientists, a term first used by Gartner. Citizen data scientists are power users who can perform both simple and moderately sophisticated analytical tasks that would previously have required more technical expertise.

Downloads: 3 This Week

Last Update: 2024-08-01
See Project
12

PyTorch/XLA

Enabling PyTorch on Google TPU

PyTorch/XLA is a Python package that uses the XLA deep learning compiler to connect the PyTorch deep learning framework and Cloud TPUs. You can try it right now, for free, on a single Cloud TPU with Google Colab, and use it in production and on Cloud TPU Pods with Google Cloud. Take a look at one of our Colab notebooks to quickly try different PyTorch networks running on Cloud TPUs and learn how to use Cloud TPUs as PyTorch devices. We are also introducing new TPU VMs for more transparent and easier access to the TPU hardware. This is our recommended way of running PyTorch/XLA on Cloud TPU. Please check out our Cloud TPU VM User Guide. Cloud TPU VM is currently generally available and provides direct access to the TPU host. The recommended setup for running distributed training on TPU Pods pairs Compute VM Instance Groups with TPU Pods. Each Compute VM in the instance group drives 8 cores on the TPU Pod.

Downloads: 3 This Week

Last Update: 2025-11-17
See Project
13

Ray

A unified framework for scalable computing

Modern workloads like deep learning and hyperparameter tuning are compute-intensive and require distributed or parallel execution. Ray makes it effortless to parallelize single-machine code: go from a single CPU to multi-core, multi-GPU or multi-node with minimal code changes. Accelerate your PyTorch and TensorFlow workloads with a more resource-efficient and flexible distributed execution framework powered by Ray. Accelerate your hyperparameter search workloads with Ray Tune. Find the best model and reduce training costs by using the latest optimization algorithms. Deploy your machine learning models at scale with Ray Serve, a Python-first and framework-agnostic model serving framework. Scale reinforcement learning (RL) with RLlib, a framework-agnostic RL library that ships with 30+ cutting-edge RL algorithms including A3C, DQN, and PPO. Easily build out scalable, distributed systems in Python with simple and composable primitives in Ray Core.

Downloads: 3 This Week

Last Update: 2025-11-19
See Project
14

Semantic Segmentation in PyTorch

Semantic segmentation models, datasets & losses implemented in PyTorch

Semantic segmentation models, datasets and losses implemented in PyTorch. PyTorch and Torchvision need to be installed before running the scripts, together with PIL and OpenCV for data preprocessing and tqdm for showing the training progress. PyTorch v1.1 is supported (using the newly supported TensorBoard); earlier versions can work too, but use tensorboardX instead of TensorBoard. Learning rate schedulers include the following. Poly learning rate: the learning rate is scaled down linearly from the starting value to zero during training; it is considered the go-to scheduler for semantic segmentation. One Cycle learning rate: for a learning rate LR, training starts from LR / 10 and rises to LR over the first 30% of the training time, then scales down to LR / 25 over the remaining time. The scaling follows cosine annealing, and the momentum is modified in the opposite manner, starting from 0.95, going down to 0.85, and back up to 0.95.

Downloads: 3 This Week

Last Update: 2022-08-18
See Project
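The two schedules described in the Semantic Segmentation in PyTorch entry above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the repository's code: the function names, the `power` parameter (1.0 reproduces the linear decay described), and the choice to apply cosine annealing to both the warm-up and the decay phase are all hypothetical.

```python
import math


def poly_lr(base_lr, step, total_steps, power=1.0):
    """Polynomial decay from base_lr to zero; power=1.0 is the linear case."""
    progress = min(step, total_steps) / total_steps
    return base_lr * (1.0 - progress) ** power


def _cos_anneal(start, end, fraction):
    """Cosine interpolation from start (fraction=0) to end (fraction=1)."""
    return end + (start - end) * 0.5 * (1.0 + math.cos(math.pi * fraction))


def one_cycle(max_lr, step, total_steps, warm_frac=0.3,
              mom_high=0.95, mom_low=0.85):
    """Return (lr, momentum) for a one-cycle schedule.

    Assumption: LR goes max_lr/10 -> max_lr over the first warm_frac of
    training, then max_lr -> max_lr/25; momentum moves the opposite way.
    """
    warm_steps = warm_frac * total_steps
    if step < warm_steps:
        f = step / warm_steps
        lr = _cos_anneal(max_lr / 10.0, max_lr, f)
        mom = _cos_anneal(mom_high, mom_low, f)
    else:
        f = (step - warm_steps) / (total_steps - warm_steps)
        lr = _cos_anneal(max_lr, max_lr / 25.0, f)
        mom = _cos_anneal(mom_low, mom_high, f)
    return lr, mom
```

In a training loop, either function would be called once per step and the returned values written into the optimizer's parameter groups.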
15

ZenML

Build portable, production-ready MLOps pipelines

A simple yet powerful open-source framework that scales your MLOps stack with your needs. Set up ZenML in a matter of minutes, and start with all the tools you already use. Gradually scale up your MLOps stack by switching out components whenever your training or deployment requirements change. Keep up with the latest changes in the MLOps world and easily integrate any new developments. Define simple and clear ML workflows without wasting time on boilerplate tooling or infrastructure code. Write portable ML code and switch from experimentation to production in seconds. Manage all your favorite MLOps tools in one place with ZenML's plug-and-play integrations. Prevent vendor lock-in by writing extensible, tooling-agnostic, and infrastructure-agnostic code. Run your ML workflows anywhere: local, on-premises, or in the cloud environment of your choice. Keep yourself open to new tools - ZenML is easily extensible and forever open-source!

Downloads: 3 This Week

Last Update: 2025-11-19
See Project
16

tsai

Time series Timeseries Deep Learning Machine Learning Pytorch fastai

tsai is an open-source deep learning package built on top of PyTorch and fastai, focused on state-of-the-art techniques for time series tasks like classification, regression, forecasting, and imputation. Starting with tsai 0.3.0, tsai only installs hard dependencies. Soft dependencies (which are only required for selected tasks) are not installed by default; this is the recommended approach, and if you require a dependency that is not installed, tsai will ask you to install it when necessary. A new PredictionDynamics callback displays the predictions during training. There is a new tutorial notebook on how to train your model with larger-than-memory datasets in less time, achieving up to 100% GPU usage, and another on how to track your experiments with Weights & Biases.

Downloads: 3 This Week

Last Update: 2025-07-29
See Project
17

Clustering Variation

Clustering Variation looks for a good subset of attributes in order to improve the classification accuracy of supervised learning techniques in classification problems with a huge number of attributes. It first ranks the attributes by their Variation value, then divides them into two groups, and finally uses a Verification method to select the best group.

2 Reviews

Downloads: 28 This Week

Last Update: 2014-12-20
See Project
18

weka neural network algorithms

This project contains Weka packages implementing neural network algorithms such as Learning Vector Quantizer (LVQ) and Self-Organizing Maps (SOM). For more information about Weka, please visit http://www.cs.waikato.ac.nz/~ml/weka/

Downloads: 72 This Week

Last Update: 2012-11-04
See Project
19

MEKA

A Multi-label Extension to Weka

Multi-label classifiers and evaluation procedures using the Weka machine learning framework.

3 Reviews

Downloads: 19 This Week

Last Update: 2024-09-06
See Project
20

AI-Job-Notes

AI algorithm position job search strategy

AI-Job-Notes is a pragmatic notebook for landing roles in machine learning, computer vision, and related engineering tracks. It assembles study paths, checklists, and interview prep materials, but also covers job-search mechanics—portfolio building, resume patterns, and communication tips. The emphasis is on doing: practicing with project ideas, setting up reproducible experiments, and showcasing results that convey impact. It ties technical study (ML/DL fundamentals) to real hiring signals like problem-solving, code quality, and experiment logging. The repository’s structure encourages progressive preparation—from fundamentals to mock interviews and post-interview retrospectives. It’s designed to reduce uncertainty and decision fatigue during the often lengthy job-hunt cycle.

Downloads: 2 This Week

Last Update: 2025-11-12
See Project
21

Adversarial Robustness Toolbox

Adversarial Robustness Toolbox (ART) - Python Library for ML security

Adversarial Robustness Toolbox (ART) is a Python library for Machine Learning Security. ART provides tools that enable developers and researchers to evaluate, defend, certify and verify Machine Learning models and applications against the adversarial threats of Evasion, Poisoning, Extraction, and Inference. ART supports all popular machine learning frameworks (TensorFlow, Keras, PyTorch, MXNet, scikit-learn, XGBoost, LightGBM, CatBoost, GPy, etc.), all data types (images, tables, audio, video, etc.) and machine learning tasks (classification, object detection, generation, certification, etc.).

Downloads: 2 This Week

Last Update: 2025-07-07
See Project
22

BentoML

Unified Model Serving Framework

BentoML simplifies ML model deployment and serves your models at production scale. It supports multiple ML frameworks natively: TensorFlow, PyTorch, XGBoost, scikit-learn and many more. Define custom serving pipelines with pre-processing, post-processing and ensemble models. Use the standard .bento format to package code, models and dependencies for easy versioning and deployment. Integrate with any training pipeline or ML experimentation platform. Parallelize compute-intensive model inference workloads to scale separately from the serving logic. Adaptive batching dynamically groups inference requests for optimal performance. Orchestrate distributed inference graphs with multiple models via Yatai on Kubernetes. Easily configure CUDA dependencies for running inference with GPU. Automatically generate Docker images for production deployment.

Downloads: 2 This Week

Last Update: 2025-11-17
See Project
23

Caffe Framework

Caffe, a fast open framework for deep learning

Caffe is a deep learning framework made with expression, speed, and modularity in mind. It is developed by Berkeley AI Research (BAIR) and by community contributors. Yangqing Jia created the project during his PhD at UC Berkeley. Caffe is released under the BSD 2-Clause license. Expressive architecture encourages application and innovation. Models and optimization are defined by configuration without hard-coding. Switch between CPU and GPU by setting a single flag to train on a GPU machine, then deploy to commodity clusters or mobile devices. Extensible code fosters active development. In Caffe’s first year, it was forked by over 1,000 developers and had many significant changes contributed back. Thanks to these contributors, the framework tracks the state of the art in both code and models.

Downloads: 2 This Week

Last Update: 2021-12-09
See Project
24

Chronos Forecasting

Pretrained (Language) Models for Probabilistic Time Series Forecasting

Chronos is a family of pretrained time series forecasting models based on language model architectures. A time series is transformed into a sequence of tokens via scaling and quantization, and a language model is trained on these tokens using the cross-entropy loss. Once trained, probabilistic forecasts are obtained by sampling multiple future trajectories given the historical context. Chronos models have been trained on a large corpus of publicly available time series data, as well as synthetic data generated using Gaussian processes.

Downloads: 2 This Week

Last Update: 6 days ago
See Project
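The "scaling and quantization" step mentioned in the Chronos Forecasting entry above can be illustrated with a small sketch. It is a hypothetical example, not Chronos's implementation: the function names, the mean-absolute-value scaling, the uniform bin layout, and the `num_bins`, `low`, and `high` defaults are all assumptions chosen for readability.

```python
def tokenize_series(values, num_bins=10, low=-5.0, high=5.0):
    """Scale a series by its mean absolute value, then map each point
    to the index of a uniform bin over [low, high]."""
    scale = sum(abs(v) for v in values) / len(values)
    if scale == 0:
        scale = 1.0  # avoid dividing by zero for an all-zero series
    width = (high - low) / num_bins
    tokens = []
    for v in values:
        idx = int((v / scale - low) / width)
        tokens.append(max(0, min(num_bins - 1, idx)))  # clamp outliers
    return tokens, scale


def detokenize(tokens, scale, num_bins=10, low=-5.0, high=5.0):
    """Map token ids back to bin centers and undo the scaling."""
    width = (high - low) / num_bins
    return [(low + (t + 0.5) * width) * scale for t in tokens]
```

The token ids play the role of a vocabulary for a language model; sampled ids are mapped back to numeric values with `detokenize`, which loses the precision discarded by the binning.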
25

CodeContests

Large dataset of coding contests designed for AI and ML model training

CodeContests, developed by Google DeepMind, is a large-scale competitive programming dataset designed for training and evaluating machine learning models on code generation and problem solving. This dataset played a central role in the development of AlphaCode, DeepMind’s model for solving programming problems at a human-competitive level, as published in Science. CodeContests aggregates problems and human-written solutions from multiple programming competition platforms, including AtCoder, Codeforces, CodeChef, Aizu, and HackerEarth. Each problem includes structured metadata, problem descriptions, paired input/output test cases, and multiple correct and incorrect solutions in various programming languages. The dataset is distributed in Riegeli format using Protocol Buffers, with separate training, validation, and test splits for reproducible machine learning experiments.

Downloads: 2 This Week

Last Update: 2025-11-19
See Project



© 2025 Slashdot Media. All Rights Reserved.
