Skip to content
View YangRonghai's full-sized avatar

Block or report YangRonghai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Natural Language Processing Tutorial for Deep Learning Researchers

Jupyter Notebook 14,656 3,965 Updated Feb 21, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,896 277 Updated Jun 9, 2025

Collections of Orange Tsai's public presentation slides.

734 76 Updated Jan 1, 2025

API Security Project aims to present unique attack & defense methods in API Security field

285 50 Updated Mar 6, 2022

A secure low code honeypot framework, leveraging LLM for System Virtualization.

Go 1,190 85 Updated Jul 1, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 14,173 2,046 Updated Jul 1, 2025

专注于JVM的运行时防御系统RASP

283 68 Updated Jun 14, 2024

🔥 Web-application firewalls (WAFs) from security standpoint.

Python 6,727 1,097 Updated Oct 28, 2024

Small and highly portable detection tests based on MITRE's ATT&CK.

C 10,712 2,936 Updated Jun 30, 2025

Neo-reGeorg is a project that seeks to aggressively refactor reGeorg

Python 3,109 460 Updated Feb 18, 2025

ShellCheck, a static analysis tool for shell scripts

Haskell 37,626 1,824 Updated May 17, 2025

Java 1-21 Parser and Abstract Syntax Tree for Java with advanced analysis functionalities.

Java 5,854 1,201 Updated Jul 1, 2025

Grammars written for ANTLR v4; expectation that the grammars are free of actions.

ANTLR 10,658 3,760 Updated Jun 30, 2025

An incremental parsing system for programming tools

Rust 21,129 1,901 Updated Jul 1, 2025

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 1 Updated Apr 12, 2019

An advanced memory forensics framework

Python 7,738 1,322 Updated May 16, 2025

每日分享免费的ss、vmess、vless、trojan、hysteria2等各类节点

Dockerfile 3,507 446 Updated Mar 4, 2025

Protocol Analysis/Decoder Framework

Python 493 111 Updated Dec 19, 2022

all kinds of text classification models and more with deep learning

Python 7,925 2,569 Updated Sep 28, 2023

A little tool to play with Windows security

C 20,413 3,912 Updated May 11, 2025

Impacket is a collection of Python classes for working with network protocols.

Python 14,497 3,737 Updated Jul 1, 2025

Not Suitable for Work (NSFW) classification using deep neural network Caffe models.

Python 5,960 1,051 Updated Nov 21, 2018

Applying text model to Detection Task

Python 73 23 Updated May 9, 2017

Simplified/Traditional Chinese Converter

Python 2 Updated Apr 23, 2015
HTML 3,449 2,015 Updated Dec 27, 2024

对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字

Python 461 137 Updated Mar 28, 2024

Lime: Explaining the predictions of any machine learning classifier

JavaScript 11,922 1,840 Updated Jul 25, 2024

Tutorial for Sentiment Analysis using Doc2Vec in gensim (or "getting 87% accuracy in sentiment analysis in under 100 lines of code")

Jupyter Notebook 692 245 Updated Mar 27, 2019

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Python 9,380 1,164 Updated Jun 30, 2025

Python library for processing Chinese text

Python 6,558 1,370 Updated Jan 19, 2020
Next