Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,805 757 Updated Mar 5, 2025

A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM …

Jupyter Notebook 7,886 1,250 Updated Jun 9, 2025

Fully open data curation for reasoning models

Python 1,956 165 Updated Jun 5, 2025

The Open All-in-One Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.

TypeScript 14,912 1,322 Updated Jul 2, 2025

Create Epic Math and Physics Animations From Text.

Python 997 113 Updated May 30, 2025

eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.

C 5,236 1,049 Updated Jun 11, 2025

This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or look…

355 32 Updated Feb 22, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 46,843 5,993 Updated Jul 2, 2025

🍏 + 🎯 + 🐍 = Query Apple's FindMy Network with Python!

Python 2,163 77 Updated Jul 1, 2025

Examples and guides for using the Gemini API

Jupyter Notebook 13,878 1,897 Updated Jul 2, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 309,450 50,956 Updated May 21, 2025

On-device Speech Recognition for Apple Silicon

Swift 4,777 420 Updated Jun 24, 2025

This SDK is now deprecated, use the unified Firebase SDK.

Swift 1,075 174 Updated May 12, 2025

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Jupyter Notebook 28,963 13,081 Updated Jun 13, 2024

Play with neural networks!

TypeScript 12,417 2,627 Updated Jun 24, 2025

100 numpy exercises (with solutions)

Python 12,982 6,091 Updated May 9, 2025

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 6,078 1,217 Updated Mar 28, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 100,984 13,430 Updated Jul 2, 2025

Deep Learning Visualization Toolkit (PaddlePaddle deep learning visualization tool)

HTML 4,846 625 Updated Jan 22, 2025
Python 36 2 Updated Dec 11, 2024

This repo contains the sample code of the Azure Search and Cognitive Services used to provide insights and analysis around the JFK Files.

TypeScript 408 227 Updated Jul 9, 2024

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 4,755 479 Updated Jun 6, 2025

A machine learning image inpainting task that removes watermarks from images, producing results indistinguishable from the ground-truth image.

Python 3,793 443 Updated Jun 19, 2025

[AAAI 2021] Split then Refine: Stacked Attention-guided ResUNets for Blind Single Image Visible Watermark Removal

Python 251 58 Updated Dec 10, 2023

An open-source RAG-based tool for chatting with your documents.

Python 22,721 1,813 Updated Jul 2, 2025

Speech To Speech: an effort toward an open-source, modular GPT-4o

Python 4,090 465 Updated Apr 15, 2025

Stable Diffusion web UI

Python 7,900 877 Updated Aug 14, 2024

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,165 93 Updated Jun 30, 2025

🚀 Awesome list of open source applications for macOS. https://t.me/s/opensourcemacosapps

44,375 2,383 Updated Apr 26, 2025