Stars
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
📚 Freely available programming books
The biggest collection of R books (and maybe later some other resources too)
Ravin515 / r-data-practice
Forked from renkun-ken/r-data-practice
R language data manipulation exercises
Code for Tiny Python Projects (Manning, 2020, ISBN 1617297518). Learning Python through test-driven development of games and puzzles.
Python for Data Science. This repository hosts the code behind the online book that teaches you how to use Python for data science.
A lightweight, modern and flexible, log4j and futile.logger inspired logging utility for R
Introduction to DuckDB and Polars
More efficient tidyverse code, using polars in the background
Extremely fast Query Engine for DataFrames, written in Rust
duckdblabs / db-benchmark
Forked from h2oai/db-benchmark
Reproducible benchmark of database-like ops
🎨 Visualisation toolbox for beautiful and publication-ready figures
Awesome resources for learning more about Apache Arrow
🎓 A collection of interactive courses for the swirl R package.
Easily generate information-rich, publication-quality tables from R
R's data.table package extends data.frame:
Python for Data Analysis, 2nd Edition (Chinese-language edition)
dplyr-style piping operations for pandas dataframes
Source code for my collection of articles on using pandas.