JaeDukSeo

🙏

Praying

J JaeDukSeo

🙏

Praying

Exploring the intersection of AI, deep learning, and art. Passionate about pushing the boundaries of multi-media production and beyond. #AIArt

335 followers · 272 following

Achievements

Stars

rishabhc9 / Music-Collector

Python 8 2 Updated Mar 5, 2025

Breakthrough / PySceneDetect

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,362 469 Updated Nov 12, 2025

PaddlePaddle / PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

Python 707 222 Updated Nov 19, 2025

tangly1024 / NotionNext

使用 NextJS + Notion API 实现的，支持多种部署方案的静态博客，无需服务器、零门槛搭建网站，为Notion和所有创作者设计。 (A static blog built with NextJS and Notion API, supporting multiple deployment options. No server required, zero threshold t…

JavaScript 10,704 13,898 Updated Oct 15, 2025

bytedance / Sa2VA

Official Repo For "Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos"

Python 1,446 102 Updated Nov 28, 2025

Agentic-Web-Interfaces / concierge

Declarative framework for building Agentic AI Services. Build powerful AI apps allowing Agents to navigate, interact and transact with your service.

Python 52 5 Updated Nov 25, 2025

FluidInference / FluidAudio

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.

Swift 1,019 117 Updated Nov 29, 2025

EmilianPostolache / stable-audio-controlnet

Fine-tune Stable Audio Open with DiT ControlNet.

Python 249 9 Updated May 16, 2025

yihao-meng / HoloCine

Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Python 510 86 Updated Nov 26, 2025

Q-Bukold / TikTok-Content-Scraper

TikTok Content Scraper -- No API-Key needed, minimal dependencies, citable | Download videos (MP4), slides (JPEG) and metadata of author, music, file, hashtags, content, interactions etc.

Python 60 14 Updated Sep 21, 2025

VadlapatiKarthik / autoclipper

AI-powered Auto-Clipper: automatically detects highlight segments from YouTube channels, Twitch/Kick live streams (via audience-retention data, chat spikes and timestamped comments), clips them wit…

Python 5 Updated May 7, 2025

spotify / basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python 4,456 401 Updated Nov 13, 2025

jawah / charset_normalizer

Truly universal encoding detector in pure Python.

Python 719 62 Updated Nov 9, 2025

nanobrowser / nanobrowser

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 11,437 1,151 Updated Nov 24, 2025

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 3,558 356 Updated Oct 17, 2024

nomadkaraoke / python-audio-separator

Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)

Python 933 152 Updated Nov 30, 2025

narcotic-sh / zanshin

A novel media player that allows you to navigate by speaker

Svelte 67 4 Updated Nov 11, 2025

narcotic-sh / senko

Very fast, accurate speaker diarization

Python 176 16 Updated Nov 12, 2025

engasd999 / senko

⚡ Accelerate speaker diarization with Senko, processing 1 hour of audio in just 5 seconds on powerful hardware—boost your audio analysis efficiency.

Python 1 1 Updated Nov 30, 2025

datacrystals / AIStoryWriter

LLM story writer with a focus on high-quality long output based on a user provided prompt.

Python 199 55 Updated Nov 24, 2025

filliptm / ComfyUI_Fill-ChatterBox

TTS + Voice Cloning

Python 172 29 Updated Aug 16, 2025

diodiogod / TTS-Audio-Suite

A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio …

Python 422 29 Updated Nov 20, 2025

devnen / Chatterbox-TTS-Server

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale…

Python 618 180 Updated Jul 14, 2025

petermg / Chatterbox-TTS-Extended

Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.

Python 479 84 Updated Aug 23, 2025

randombk / chatterbox-vllm

VLLM Port of the Chatterbox TTS model

Python 340 44 Updated Oct 18, 2025

resemble-ai / chatterbox

SoTA open-source TTS

Python 14,805 2,043 Updated Sep 25, 2025

abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…

Python 5,176 491 Updated Oct 5, 2025

krillinai / KrillinAI

Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo…

Go 8,984 749 Updated Nov 5, 2025

Kedreamix / Linly-Dubbing

智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能，语言无界”

Jupyter Notebook 2,808 308 Updated Mar 5, 2025

kevinkoech357 / transkript

A flask built web app that leverages the power of OpenAI's whisper model to transcribe audio and video files. Has support for various file formats. Generates timestamped .srt files.

HTML 3 1 Updated Jun 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

J JaeDukSeo

Achievements

Achievements

Block or report JaeDukSeo

Stars

rishabhc9 / Music-Collector

Breakthrough / PySceneDetect

PaddlePaddle / PaddleMIX

tangly1024 / NotionNext

bytedance / Sa2VA

Agentic-Web-Interfaces / concierge

FluidInference / FluidAudio

EmilianPostolache / stable-audio-controlnet

yihao-meng / HoloCine

Q-Bukold / TikTok-Content-Scraper

VadlapatiKarthik / autoclipper

spotify / basic-pitch

jawah / charset_normalizer

nanobrowser / nanobrowser

Rikorose / DeepFilterNet

nomadkaraoke / python-audio-separator

narcotic-sh / zanshin

narcotic-sh / senko

engasd999 / senko

datacrystals / AIStoryWriter

filliptm / ComfyUI_Fill-ChatterBox

diodiogod / TTS-Audio-Suite

devnen / Chatterbox-TTS-Server

petermg / Chatterbox-TTS-Extended

randombk / chatterbox-vllm

resemble-ai / chatterbox

abus-aikorea / voice-pro

krillinai / KrillinAI

Kedreamix / Linly-Dubbing

kevinkoech357 / transkript