- Hyderabad
- shravankumar147.wordpress.com
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
A crowdsourced dataset of high-quality problem-solution code pairs.
leverages Hugging Face’s NLP models to build an intelligent ATS resume scorer and ranking tool. It parses and analyzes resumes, scoring them based on relevance and alignment with job descriptions,…
The official Python SDK for Model Context Protocol servers and clients
A tutorial that guides users through the process of fine-tuning a stable diffusion model using HuggingFace's diffusers library. The tutorial includes advice on suitable hardware requirements, data …
Wan: Open and Advanced Large-Scale Video Generative Models
This repository contains the Hugging Face Agents Course.
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
PicChronicle organizes your photos by year, month, and location. It extracts EXIF data and uses reverse geocoding to auto-categorize images, making browsing and archiving memories effortless.
Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
🤗 smolagents: a barebones library for agents that think in code.
Fully open reproduction of DeepSeek-R1
Janus-Series: Unified Multimodal Understanding and Generation Models
Self-paced bootcamp on Generative AI. Tutorials on ML fundamentals, LLMs, RAGs, LangChain, LangGraph, Fine-tuning Llama 3 & AI Agents (CrewAI)
Train high-quality text-to-image diffusion models in a data & compute efficient manner
High-resolution models for human tasks.
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
Experimenting and exploring Computer Vision with Deep Learning
PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations
Idefics3-8B-Llama3: A powerful multimodal AI model by Hugging Face that integrates image and text inputs to enhance visual reasoning and text generation
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
llama3 implementation one matrix multiplication at a time
[ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3
Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) …