Phi-3.5 for Mac: Locally-run Vision and Language Models
TextWorld is a sandbox learning environment for the training
Guiding Instruction-based Image Editing via Multimodal Large Language
Code for Language models can explain neurons in language models paper
Implementation of RLHF (Reinforcement Learning with Human Feedback)
Open-source large language model family from Tencent Hunyuan
Machine Learning Systems: Design and Implementation
Refer and Ground Anything Anywhere at Any Granularity
DeepSeek Coder: Let the Code Write Itself
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A Family of Open Foundation Models for Code Intelligence
Beyond the Imitation Game collaborative benchmark for measuring
Towards Real-World Vision-Language Understanding
Evals is a framework for evaluating LLMs and LLM systems
Transformers4Rec is a flexible and efficient library
Framework that is dedicated to making neural data processing
Benchmarking Multimodal Agents for Open-Ended Tasks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
LLM powered fuzzing via OSS-Fuzz
The official Meta Llama 3 GitHub site
Utilities intended for use with Llama models
Set of tools to assess and improve LLM security
CLIP, Predict the most relevant text snippet given an image
Volcano Engine Reinforcement Learning for LLMs
This repository contains the official implementation of FastVLM