Stars
A Library for Advanced Deep Time Series Models.
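A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, covering base models, domain-specific fine-tuning and applications, datasets, and tutorials.
This project shares the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications in production).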
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Effective Modern C++ (Chinese translation, complete)
Repo for the Deep Learning Nanodegree Foundations program.
NeurIPS 2016. Linear-time interpretable nonparametric two-sample test.
NeurIPS 2017 best paper. An interpretable linear-time kernel goodness-of-fit test.
An unofficial PyTorch implementation of "Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains".
An empirical study on evaluation metrics of generative adversarial networks.
TATTER (Two-sAmple TesT EstimatoR) is a tool for performing two-sample hypothesis tests.
🔠Foreign language reading and translation assistant based on copy and translate.
Best Practices on Recommendation Systems
A TensorFlow-based distributed training framework optimized for large-scale sparse data.
Code for the 2020 Tencent College Algorithm Contest; the online result ranked 1st.
Header-only C++/Python library for fast approximate nearest neighbors
An Abstractive Summarization Implementation with Transformer and Pointer-generator
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Chinese NER using Lattice LSTM. Code for ACL 2018 paper.
Chinese named entity recognition, with concrete implementations of several models: HMM, CRF, BiLSTM, and BiLSTM+CRF.
Via is a simple browser; this repository is set up for its localization.
MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database.
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
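2019 SOTA spell-checking tool for Simplified and Traditional Chinese: FASPell Chinese Spell Checker (Chinese Spell Check / Chinese spelling error detection / Chinese spelling correction / Chinese spelling checking)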