-
SUSTech
- ShenZhen China
-
10:26
(UTC +08:00) - https://www.mayseee.com
- https://zhenghao.tiiny.site/
- https://scholar.google.com.sg/citations?hl=zh-CN&pli=1&user=NIrmfTIAAAAJ
Highlights
-
-
-
openlrc Public
Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。
-
interSeg3D-Studio Public
A web-based interactive 3D point cloud segmentation and annotation tool, using the AGILE3D click‑based segmentation algorithm and Gemini for 3D object recognition.
-
-
-
-
-
-
faster-whisper Public
Forked from SYSTRAN/faster-whisperFaster Whisper transcription with CTranslate2
Python MIT License UpdatedDec 17, 2024 -
img2dataset Public
Forked from rom1504/img2datasetEasily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Python MIT License UpdatedAug 27, 2023 -
aggregator Public
Forked from wzdnzd/aggregator自动签到、自动注册、订阅聚合及爬取脚本等
Python Apache License 2.0 UpdatedJul 20, 2023 -
video-to-pose3D Public
Convert video to 3D pose in one-key.
-
faker-openai Public
Generate fake data with OpenAI's GPT-3 API.
-
Caption-Anything Public
Forked from ttengwang/Caption-AnythingCaption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedApr 14, 2023 -
-
SUSTech-CS308 Public
Course materials of SUSTech CS308 Computer Vision (2021 fall)
-
accelerate Public
Forked from huggingface/accelerate🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python Apache License 2.0 UpdatedAug 5, 2022 -
-
Vision-Language Pretraining & Efficient Transformer Papers.
-
mmsegmentation Public
Forked from open-mmlab/mmsegmentationOpenMMLab Semantic Segmentation Toolbox and Benchmark.
Python Apache License 2.0 UpdatedNov 23, 2021 -
-
-
-
awesome-vision-language-pretraining-papers Public
Forked from yuewang-cuhk/awesome-vision-language-pretraining-papersRecent Advances in Vision and Language PreTrained Models (VL-PTMs)
1 UpdatedNov 24, 2020 -
-
CMR-CNN-New-Baseline Public
Pytorch implementation of "Cross-Modal Retrieval With CNN Visual Features: A New Baseline".
-
learnopencv Public
Forked from spmallick/learnopencvLearn OpenCV : C++ and Python Examples
-
-
pose-group-work Public
Group work of a group in SUSTech Vision.





