ChatGLM-6B: An Open Bilingual Dialogue Language Model
Implementation of Make-A-Video, new SOTA text to video generator
The official Meta Llama 3 GitHub site
A simple, secure MCP-to-OpenAPI proxy server
4M: Massively Multimodal Masked Modeling
A fast, powerful, and simple hierarchical vision transformer
PyTorch code and models for V-JEPA self-supervised learning from video
CLIP, Predict the most relevant text snippet given an image
PPTAgent: Generating and Evaluating Presentations
Utilities intended for use with Llama models
Set of tools to assess and improve LLM security
PyTorch code and models for VJEPA2 self-supervised learning from video
The repository provides code for running inference with SAM 2
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Guiding Instruction-based Image Editing via Multimodal Large Language
Code release for Cut and Learn for Unsupervised Object Detection
PyTorch code and models for the DINOv2 self-supervised learning
The official PyTorch implementation of Google's Gemma models
Implementation of "MobileCLIP" CVPR 2024
Anthropic's Interactive Prompt Engineering Tutorial
Code for Language models can explain neurons in language models paper
The ChatGPT Retrieval Plugin lets you easily find personal
Implementation of Vision Transformer, a simple way to achieve SOTA
Volcano Engine Reinforcement Learning for LLMs
Implementation of the Surya Foundation Model for Heliophysics