Skip to content
View deepsworld's full-sized avatar
💻
Never underestimate the power of more data
💻
Never underestimate the power of more data

Organizations

@necla-ml

Block or report deepsworld

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Plug n Play GBNF Compiler for llama.cpp

Python 28 5 Updated Nov 8, 2023

A fully open-source humanoid arm for physical AI research and deployment in contact-rich environments.

MDX 1,464 156 Updated Nov 3, 2025

This repository provides training and evaluation code for `MCTR` using MMPTracking and MTMC_NVIDIA datasets.

Python 1 Updated Aug 28, 2025

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,569 259 Updated Sep 22, 2025

A ComfyUI extention for BAGEL(Unified Model for Multimodal Understanding and Generation)

Python 184 16 Updated Oct 13, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,230 407 Updated Oct 27, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,336 307 Updated Jun 21, 2025

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 19,439 1,844 Updated Nov 7, 2025

[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 4,080 321 Updated Sep 26, 2025

GRAB: A Dataset of Whole-Body Human Grasping of Objects

Python 331 31 Updated Mar 8, 2022

A mini, open-weights, version of our Proxy assistant.

Python 970 148 Updated Feb 26, 2025

The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs are General-Purpose Motion Generators"

Python 235 18 Updated Dec 28, 2023

[Technical Report 2023] PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction

Python 223 10 Updated Sep 4, 2024

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 8,595 654 Updated Nov 6, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 72,329 8,590 Updated Nov 9, 2025

WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild

Python 392 30 Updated Aug 1, 2025

A Library for Differentiable Logic Gate Networks

Python 743 84 Updated Mar 19, 2024
Shell 3 1 Updated Dec 19, 2024
Python 423 33 Updated Nov 4, 2024

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.

Python 58,797 10,405 Updated Nov 9, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,558 1,992 Updated Nov 9, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,993 695 Updated Feb 10, 2025

Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)

Python 222 13 Updated Jan 22, 2025

[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper

Python 159 8 Updated May 7, 2024

[NeurIPS 2023] Self-supervised Object-Centric Learning for Videos

Python 31 Updated Nov 28, 2024

[CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"

Python 80 3 Updated Jan 20, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,872 505 Updated May 31, 2024

Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"

Python 20 Updated Apr 20, 2023

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 6,978 694 Updated Jan 22, 2025
Next