Skip to content
View rishabh135's full-sized avatar
  • Japan

Block or report rishabh135

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Standardized Serverless ML Inference Platform on Kubernetes

Python 4,313 1,202 Updated Jul 2, 2025

AI Powered Knowledge Graph Generator

Python 994 125 Updated Jun 26, 2025

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"

Python 63 17 Updated Mar 27, 2025

Tools for merging pretrained large language models.

Python 5,937 571 Updated Jun 19, 2025

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

C++ 539 68 Updated May 3, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,522 445 Updated Jul 1, 2025
Python 194 28 Updated Jun 23, 2025

A profiling and performance analysis tool for machine learning

C++ 395 67 Updated Jul 2, 2025
HTML 237 16 Updated Jul 2, 2025

Library for reading and processing ML training data.

Python 471 40 Updated Jul 2, 2025

A flexible, adaptive classification system for dynamic text classification

Python 320 20 Updated Jun 20, 2025

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

Python 8,111 688 Updated Jun 5, 2025

GenMedia Creative Studio is a Vertex AI generative media example user experience to highlight the use of Imagen, Veo and other generative media APIs on Google Cloud.

Jupyter Notebook 132 44 Updated Jul 1, 2025

Jeo: Jax model training lib for Earth Observation

Python 128 18 Updated May 5, 2025

GeeFlow - generate and process large-scale geospatial datasets with Google Earth Engine.

Python 71 10 Updated Jun 23, 2025

Building blocks for rapid development of GenAI applications

Python 1,421 107 Updated Jul 2, 2025

Structured Outputs

Python 11,991 608 Updated Jul 1, 2025

JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training

Python 51 Updated May 22, 2025
Python 190 28 Updated Jun 23, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 14,617 1,762 Updated Jun 29, 2025

Set of tools to assess and improve LLM security.

Python 3,543 585 Updated Jul 1, 2025

An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization.

Python 536 62 Updated Jul 1, 2025

UniversalRAG: Retrieval-Augmented Generation over Corpora of Diverse Modalities and Granularities

Python 85 7 Updated May 21, 2025

Super-fast Structured Outputs

Rust 320 32 Updated Jun 23, 2025

Python library to use Pleias-RAG models

Python 58 5 Updated May 1, 2025

Anthropic's Interactive Prompt Engineering Tutorial

Jupyter Notebook 14,051 1,300 Updated Jul 11, 2024

🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.

TypeScript 29,330 2,560 Updated Jul 2, 2025
Python 128 17 Updated Jul 2, 2025

A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax

Python 16 10 Updated Jun 8, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 18,298 1,851 Updated Jul 1, 2025
Next