Stars
Your AI Operator for Web, Android, Automation & Testing.
Record audio from the user's microphone in apps deployed to the web (via the browser Media API; React-based Streamlit custom component).
Parkinson's Disease Detection using Parkinson's Spiral Drawing test
Localization of Handwritten and Printed Text in doctors' prescriptions
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…
This project is a digital human that can talk and listen to you. It uses OpenAI's GPT to generate responses, OpenAI's Whisper to transcribe the audio, Eleven Labs to generate voice and Rhubarb Lip …
[CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians"
Analyze PDFs. With colors. And Yara.
Video, image and GIF upscaling/enlarging (super-resolution) and video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IF…
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Re-implementation of ligoudaner377/font_translator_gan for our team project (CS492I, KAIST 2022F)
Scalable end-to-end extraction of information from receipts using OCR and semi-supervised GCNs
Extract structured data from PDF invoices
An AI-powered Personally Identifiable Information (PII) scanner.
An open-source digital image forensic toolset