Open Source Python Multimedia Software for BSD


Browse free open source Python Multimedia Software for BSD below. Use the filters to narrow the list by OS, license, translation, programming language, and project status.

1

OpenCV

Open Source Computer Vision Library

The Open Source Computer Vision Library has more than 2,500 algorithms, extensive documentation, and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, and iOS, and in your browser through JavaScript. Languages: C++, Python, Julia, JavaScript. Homepage: https://opencv.org Q&A forum: https://forum.opencv.org/ Documentation: https://docs.opencv.org Source code: https://github.com/opencv Please pay special attention to our tutorials: https://docs.opencv.org/master Books about OpenCV are described here: https://opencv.org/books.html

123 Reviews

Downloads: 3,772 This Week

Last Update: 2025-07-03
See Project
2

Animated Drawings

Code to accompany "A Method for Animating Children's Drawings"

AnimatedDrawings is a framework that converts user sketches or line drawings into fully animated 2D motion sequences using learned motion priors. The idea is that you draw a simple static figure (stick figure, silhouette, or contour lines), and the system produces plausible skeletal motion (walking, jumping, dancing) that adheres to the drawn shape constraints. The architecture separates shape embedding (to understand user-drawn geometry) from motion embedding / generation (to produce temporally coherent movement). Users can provide rough keyframes or control constraints (pose anchors), and the system fills intermediate frames with fluid animation. The repository includes demonstration apps and notebooks where you can upload or draw shapes and watch animations play. Because the approach is data-driven, it generalizes to new drawings even with varying proportions or stylizations.

Downloads: 15 This Week

Last Update: 2025-10-07
See Project
3

TimeSformer

The official PyTorch implementation of our paper

TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch provides configurations, pretrained models, and training scripts that make it straightforward to evaluate or fine-tune on video datasets. TimeSformer was influential in showing that pure transformer architectures—without convolutional backbones—can perform strongly on video classification tasks. Its flexible attention design allows experimenting with different factoring (spatial-then-temporal, joint, etc.) to trade off compute, memory, and accuracy.

Downloads: 3 This Week

Last Update: 2025-10-07
See Project
4

AudioCraft

AudioCraft is a library for audio processing and generation

AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.

Downloads: 1 This Week

Last Update: 2025-10-13
See Project
5

Video Pre-Training

Learning to Act by Watching Unlabeled Online Videos

The Video PreTraining (VPT) repository provides code and model artifacts for a project where agents learn to act by watching human gameplay videos—specifically, gameplay of Minecraft—using behavioral cloning. The idea is to learn general priors of control from large-scale, unlabeled video data, and then optionally fine-tune those priors for more goal-directed behavior via environment interaction. The repository contains demonstration models of different widths, fine-tuned variants (e.g. for building houses or early-game tasks), and inference scripts that instantiate agents from pretrained weights. Key modules include the behavioral cloning logic, the agent wrapper, and data loading pipelines (with an accessible skeleton for loading Minecraft demonstration data). The repo also includes a run_agent.py script for testing an agent interactively, and an agent.py module encapsulating the control logic.

Downloads: 1 This Week

Last Update: 2025-10-03
See Project
6

Linux-Intelligent-Ocr-Solution

Easy-OCR solution and Tesseract trainer for GNU/Linux

Linux-intelligent-ocr-solution (Lios) is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources, such as a PDF, an image, a folder containing images, or a screenshot. The program is fully accessible to visually impaired users. A Tesseract Trainer GUI is also shipped with this package. Forum: https://groups.google.com/forum/#!forum/lios Video tutorial: https://www.youtube.com/playlist?list=PLn29o8rxtRe1zS1r2-yGm1DNMOZCgdU0i Tesseract training tutorial (beta): https://www.youtube.com/watch?v=qLpCld4cdtk Source code: GitHub: https://github.com/Nalin-x-Linux/lios-3 GitLab: https://gitlab.com/Nalin-x-Linux/lios-3 A user guide is available on the download page.

5 Reviews

Downloads: 6 This Week

Last Update: 2020-10-19
See Project
7

Python Computer Vision Framework

The Python Computer Vision Framework is an open project designed for all those interested in computer vision. It aims to make computer vision easier, more structured, and MATLAB-free. It may also be used in other artistic and scientific areas.

1 Review

Downloads: 7 This Week

Last Update: 2013-04-10
See Project
8

SMILI

Scientific Visualisation Made Easy

The Simple Medical Imaging Library Interface (SMILI), pronounced 'smilie', is an open-source, lightweight, and easy-to-use medical imaging viewer and library for all major operating systems. The main sMILX application supports viewing n-D images, vector images, DICOMs, and models/surfaces, as well as anonymizing and shape analysis, all with easy drag and drop functions. It also features a number of standard processing algorithms for smoothing, thresholding, and masking images and models, among others, available through graphical user interfaces and/or the command line. See our YouTube channel, linked from the homepage, for tutorial videos. The applications are all built on a uniform user-interface framework that provides a very high-level (Qt) interface to powerful image processing and scientific visualisation algorithms from the Insight Toolkit (ITK) and Visualisation Toolkit (VTK). The framework allows one to build stand-alone medical imaging applications quickly and easily.

Downloads: 12 This Week

Last Update: 2025-09-05
See Project
9

AnimateDiff

Plug-n-play module turning text-to-image models into animation

AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. It supports various configurations to create animations with different visual styles, providing flexibility and ease of use for developers and artists interested in exploring dynamic, AI-generated animations.

1 Review

Downloads: 6 This Week

Last Update: 2025-03-06
See Project
10

AIChatbot

An extensible (by plugin) chatbot project

Downloads: 0 This Week

Last Update: 2015-07-02
See Project
11

AarTon

AarTon is an automated text-to-speech application. It lets users enter text in a web-based front end and renders that text through a multi-channel sound card.

Downloads: 0 This Week

Last Update: 2013-11-14
See Project
12

FrankMocap

A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

FrankMocap is a monocular 3D human capture system that estimates body, hand, and optionally face pose from a single RGB image or video. It regresses parametric human models (e.g., SMPL/SMPL-X) directly, producing temporally stable meshes and joint angles suitable for animation or analytics. The pipeline couples a robust 2D keypoint detector with 3D mesh regression networks and priors that keep results anatomically plausible. It can run frame-by-frame or with temporal smoothing, and includes demo apps for live webcam capture as well as batch processing. Outputs include textured meshes, joint locations, and model parameters that can be exported to common DCC tools and game engines. The codebase offers pretrained models, clear inference scripts, and utilities to visualize results, making single-camera motion capture approachable on commodity hardware. Researchers and creators use it for motion studies, AR/VR prototyping, character animation, and human-in-the-loop editing.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
13

Gamera

Gamera is a framework for the creation of structured document analysis applications by domain experts. It combines a programming library with GUI tools for the training and interactive development of recognition systems.

Downloads: 0 This Week

Last Update: 2016-05-11
See Project
14

Gaze at the landscape

Gaze at the landscape is a wallpaper switcher that uses on-line sources of pictures to provide a delightful desktop environment.

Downloads: 0 This Week

Last Update: 2013-04-19
See Project
15

InproTK

An Incremental Spoken Dialogue Processing Toolkit

InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/

Downloads: 0 This Week

Last Update: 2015-06-16
See Project
16

Ivolution

Timelapse creation using Face Recognition

Ivolution is a face timelapse generator. Feed it a set of images and it will generate a movie with your face centered on the screen. Ivolution uses face detection and modifies the images so that your face always keeps the same size and location throughout the movie. Images are processed in chronological order, so you can see your face evolve over time!
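The alignment step described above can be illustrated with a small sketch. This is not Ivolution's code: the `Box` type and the `align_transform` and `apply_transform` helpers are hypothetical names introduced here, and the sketch assumes a face detector has already returned a bounding box for each image. It only shows one plausible way to compute the scale and shift that place a detected face at a fixed size and position.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Axis-aligned bounding box (hypothetical helper type)."""
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height


def align_transform(face: Box, target: Box) -> tuple[float, float, float]:
    """Return (scale, dx, dy) that maps the face box onto the target box.

    The scale matches the face width to the target width; the offsets
    then move the scaled face center onto the target center.
    """
    scale = target.w / face.w
    face_cx, face_cy = face.x + face.w / 2, face.y + face.h / 2
    target_cx, target_cy = target.x + target.w / 2, target.y + target.h / 2
    dx = target_cx - scale * face_cx
    dy = target_cy - scale * face_cy
    return scale, dx, dy


def apply_transform(point: tuple[float, float],
                    transform: tuple[float, float, float]) -> tuple[float, float]:
    """Apply a (scale, dx, dy) transform to a single (x, y) point."""
    scale, dx, dy = transform
    return scale * point[0] + dx, scale * point[1] + dy


# Example: a 40x40 face detected at (100, 50), to be shown as 80x80 at (280, 200).
face = Box(100, 50, 40, 40)
target = Box(280, 200, 80, 80)
transform = align_transform(face, target)
```

Applying the same kind of transform to every frame, each with its own detected box, would keep the face at a constant size and location across the resulting movie.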

Downloads: 0 This Week

Last Update: 2012-09-18
See Project
17

NeuroLife

Simulator of virtual animals made up of biological neural networks for research in the Computational Neuroscience field.

Downloads: 0 This Week

Last Update: 2014-06-26
See Project
18

Phission

Concurrent vision processing system: a toolkit for easily implementing a concurrent vision processing software subsystem. Aimed at robotic applications with best-effort real-time performance. The system includes capture, processing, and display.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
19

R2D3 Robotic Development 3nvironment

Robotic Manipulator Development and Simulation Environment in Python and Blender. IMPORTANT: Development moved to github. http://github.com/ajnsit/r2d3

Downloads: 0 This Week

Last Update: 2013-04-09
See Project
20

Spectral Python

A python module for hyperspectral image processing

Spectral Python (SPy) is a python package for reading, viewing, manipulating, and classifying hyperspectral image (HSI) data. SPy includes functions for clustering, dimensionality reduction, supervised classification, and more.

4 Reviews

Downloads: 0 This Week

Last Update: 2020-08-01
See Project
21

Speect

Speect is a multilingual TTS system. It offers a full text-to-speech system with various APIs, as well as an environment for research and development of TTS systems and voices. It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas. These bindings are derived from SWIG interface files and can easily be extended to other languages supported by SWIG. Speect is free and open source software. As a collection it is distributed under an MIT license.

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
22

Swarm Stability SIMulator (SS-SIM)

This project simulates the behavior of a multi-agent system (swarm), with or without graphical output. Its purpose is to research the properties suggested in "Stability Analysis of Swarms" by V. Gazi and K. M. Passino. It uses the VPython library for 3D modeling.

Downloads: 0 This Week

Last Update: 2016-04-13
See Project
23

Training Image Operators from Samples

Tools to train Image Operators automatically from a set of samples.

TRIOS - Training Image Operators from Samples is a set of tools to bring Image Processing closer to scientists in general. It is capable of estimating an operator between two images using only pairs of samples that contain an input image and the desired output. The operator is saved to a file and can be applied to any image.

Downloads: 0 This Week

Last Update: 2017-07-31
See Project
24

aksharbodh

Handwriting recognition and OCR in Indic languages

Downloads: 0 This Week

Last Update: 2013-04-15
See Project
25

ftw. Text Modeller

Software to fit whole-sentence language models using the principle of maximum entropy. For developers of speech recognizers, text prediction interfaces, OCR, machine translation software.

Downloads: 0 This Week

Last Update: 2013-03-20
See Project

