Skip to content
View jeradf's full-sized avatar

Block or report jeradf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official code of ICML 2025 paper "NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Prediction"

Python 129 21 Updated Oct 27, 2025
Python 1 1 Updated Nov 6, 2025
Python 10 1 Updated Feb 16, 2024

LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaboration

160 6 Updated Oct 31, 2025

Code for our NeurIPS 2022 paper

Python 369 21 Updated Jan 13, 2023

open-source coding LLM for software engineering tasks

Python 1,016 117 Updated Sep 30, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

8,493 561 Updated Nov 6, 2025

A resource to create a multi domain Dialog Act Tagger for conversational agents using publicly available data

HTML 51 10 Updated Sep 6, 2021

Vogent Turn: fast, open-source turn-detection for Voice AI applications

Python 34 2 Updated Oct 28, 2025

We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 141 9 Updated Nov 5, 2025

Text Normalization & Inverse Text Normalization

Python 685 90 Updated Oct 6, 2025

Evaluating the Moral Beliefs Encoded in LLMs

Python 31 6 Updated Dec 17, 2024

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

Python 514 38 Updated Jan 27, 2025

Simplifying reinforcement learning for complex game environments

C 4,109 299 Updated Nov 6, 2025

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,823 344 Updated Jan 4, 2024
Python 6 Updated Oct 4, 2025
Python 1 Updated Sep 29, 2025

The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.

Python 49 13 Updated Mar 12, 2021

This is an evolving repo for the paper “From Turn-Taking to Synchronous Dialogue: A Survey of Full-Duplex Spoken Language Models ”A comprehensive survey of Full-Duplex Spoken Language Models (FD-SL…

6 Updated Nov 1, 2025

(Realtime) Temporal Convolutions in PyTorch

Python 168 14 Updated Apr 7, 2025

portion, a Python library providing data structure and operations for intervals.

Python 508 37 Updated Nov 6, 2025
Python 4,541 362 Updated Jun 12, 2025

Official Implementation of NAACL 2025 Paper: Behavior-SD: Behaviorally Aware Spoken Dialogue Generation with Large Language Models

8 1 Updated Apr 30, 2025

Official PyTorch implementation of EMOVA in CVPR 2025 (https://arxiv.org/abs/2409.18042)

Python 74 7 Updated Mar 16, 2025

Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems

Python 47 2 Updated Oct 12, 2025

OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.

Python 449 29 Updated Oct 29, 2025

Code for the paper "Modelling Turn-taking in Multispeaker Parties for Realistic Data Simulation"

Python 6 Updated May 11, 2022

FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.

Python 46 6 Updated Sep 30, 2025

Python toolkit for speech processing

Python 72 21 Updated Oct 29, 2025
Next