Skip to content
View Jaykef's full-sized avatar

Block or report Jaykef

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in language modeling.

Jupyter Notebook 72 6 Updated Jun 30, 2025

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Python 265 32 Updated Aug 23, 2024

MiniCPM4: Ultra-Efficient LLMs on End Devices, achieving 5+ speedup on typical end-side chips

Jupyter Notebook 8,051 500 Updated Jul 1, 2025

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Python 4,360 480 Updated Nov 18, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,028 2,136 Updated Mar 11, 2025

Simple MPI implementation for prototyping or learning

C 258 9 Updated Jun 27, 2025

Handwriting Synthesis with RNNs ✏️

Python 4,554 626 Updated Jan 11, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,331 1,512 Updated Jun 26, 2025

Minimalistic large language model 3D-parallelism training

Python 1,963 202 Updated Jun 25, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,340 826 Updated May 15, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,244 253 Updated Jun 12, 2025

Minimal implementation of scalable rectified flow transformers, based on SD3's approach

Jupyter Notebook 593 52 Updated Jul 1, 2024

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,641 329 Updated Jun 30, 2025

A nascent multi-agent tool for learning anything the feynman way (Microsoft AI Agent Hackathon Submission)

Python 2 Updated May 21, 2025
Python 419 40 Updated May 6, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,565 107 Updated Jun 2, 2025

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Python 2,811 417 Updated May 16, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 17,301 1,422 Updated Jun 28, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,457 67 Updated Apr 18, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 41,128 5,325 Updated Aug 16, 2024

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 1,728 135 Updated Jun 5, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,009 49 Updated Jul 1, 2025

Dream 7B, a large diffusion language model

Python 799 39 Updated Jun 18, 2025
Python 18 3 Updated Aug 13, 2024

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 3,453 258 Updated Jun 24, 2025

This package contains the original 2012 AlexNet code.

Cuda 2,667 348 Updated Mar 12, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,243 321 Updated Jun 26, 2025

Repository to create traveling waves integrate special information through time

Jupyter Notebook 53 5 Updated Mar 7, 2025

Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]

Python 1,321 164 Updated Jun 25, 2025
Next