Stars
Natural Language Processing Tutorial for Deep Learning Researchers
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Collections of Orange Tsai's public presentation slides.
API Security Project aims to present unique attack & defense methods in API Security field
A secure low code honeypot framework, leveraging LLM for System Virtualization.
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
🔥 Web-application firewalls (WAFs) from security standpoint.
Small and highly portable detection tests based on MITRE's ATT&CK.
Neo-reGeorg is a project that seeks to aggressively refactor reGeorg
ShellCheck, a static analysis tool for shell scripts
Java 1-21 Parser and Abstract Syntax Tree for Java with advanced analysis functionalities.
Grammars written for ANTLR v4; expectation that the grammars are free of actions.
An incremental parsing system for programming tools
xiaoshuzh / funNLP
Forked from fighting41love/funNLP中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
An advanced memory forensics framework
每日分享免费的ss、vmess、vless、trojan、hysteria2等各类节点
all kinds of text classification models and more with deep learning
A little tool to play with Windows security
Impacket is a collection of Python classes for working with network protocols.
Not Suitable for Work (NSFW) classification using deep neural network Caffe models.
Applying text model to Detection Task
对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字
Lime: Explaining the predictions of any machine learning classifier
Tutorial for Sentiment Analysis using Doc2Vec in gensim (or "getting 87% accuracy in sentiment analysis in under 100 lines of code")
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
Python library for processing Chinese text