-
University of Texas at Dallas
- Dallas, Texas, USA
-
10:09
(UTC -06:00)
Highlights
- Pro
Stars
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
🚀 Pre-process, annotate, evaluate, and train your Affect Computing (e.g., Multimodal Emotion Recognition, Sentiment Analysis) datasets ALL within MER-Factory! (LangGraph Based Agent Workflow)
[NeurIPS 2025] Toward Human Deictic Gesture Target Estimation
A high-performance Rust tool for generating MeiliSearch dump files from JSON data.
A CLI to import massive CSV and NdJson into Meilisearch
A collection of token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
一个简单易用的工具,帮助您重置 Cursor IDE 的机器ID。无任何依赖。支持 Windows、macOS 和 Linux。无限试用。
The repository for Springer IJCV 2025 (LR-ASD: Lightweight and Robust Network for Active Speaker Detection)
The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
一个基于 Rust 的 DNSLog 平台,无外部依赖,集成 DNS 服务和 Web 仪表盘,支持自动注册用户、生成唯一子域名以及实时展示 DNS 日志,适用于安全测试、信息外传及漏洞验证等场景。
✨First Open-Source R1-like Video-LLM [2025/02/18]
Witness the aha moment of VLM with less than $3.
Ollama负载均衡服务器 | 一款高性能、易配置的开源负载均衡服务器,优化Ollama负载。它能够帮助您提高应用程序的可用性和响应速度,同时确保系统资源的有效利用。
2026 AI/ML internship & new graduate job list updated daily
微信sqlite解密 | 仅支持v3版本微信,从内存中快速搜索指定数据。获取基址+偏移量与特征,从而达到微信版本每次更新不需要重新查找地址。可获取自己电脑上已登录微信的微信号,wxid,手机号,sqlite解密密钥。解密微信sqlite数据库中存放的历史消息记录
A soft raster renderer that uses GDI to draw on windows
In 2024, the strongest open-source implementation of asymmetric magvit_v2 supports inference code but excludes VQVAE. It supports the joint encoding of images and videos, accommodating arbitrary vi…
Powered by Vitepress,Vue,Gsap,Canvas,Integrating Algolia, and so on ...


