Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
OmniGen2: Exploration to Advanced Multimodal Generation.
🌐 WebWalker [ACL2025] & WebDancer [Preprint]
🍀 HuLa是一款基于Tauri v2+Vue3的跨平台即时通讯桌面应用(不仅仅是即时通讯),兼容Windows、MacOS、Linux、Android、IOS
Official PyTorch implementation of "Weakly Supervised Semantic Segmentation for Driving Scenes", AAAI2024
A Web Interface for chatting with your local LLMs via the ollama API
A minimal, easy-to-read PyTorch reimplementation of the Qwen2 series—without the complexity of larger frameworks.
keras implement of transformers for humans
win32ss / supermium
Forked from chromium/chromiumChromium fork for Windows XP/2003 and up
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力
Termux - a terminal emulator application for Android OS extendible by variety of packages.
TensorFlow's Visualization Toolkit
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’
Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.
markdown2: A fast and complete implementation of Markdown in Python
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Solve Visual Understanding with Reinforced VLMs
Witness the aha moment of VLM with less than $3.