View geooo109's full-sized avatar
Python 2 Updated Jul 23, 2025

PyTorch distributed-native training library for LLMs/VLMs with OOTB Hugging Face support

Python 164 19 Updated Nov 12, 2025

Styx: Transactional Stateful Functions on Streaming Dataflows

Python 37 4 Updated Aug 2, 2025

Build resilient language agents as graphs.

Python 20,898 3,674 Updated Nov 10, 2025

SIGMOD Contest 2025 Winning Solution

C++ 50 8 Updated Jun 2, 2025

Implementation of a cache-efficient, multithreaded, lock-free, hash-based join pipeline utilizing a memory-efficient hash table optimized for joins. This project was created for the SIGMOD 2025 Pro…

C++ 12 1 Updated Oct 9, 2025

Official Implementation of Poly2vec presented @ [ICML '25]

Python 15 3 Updated Aug 27, 2025

Python 1 Updated Feb 3, 2025

TPC-DS benchmark kit with some modifications/fixes

C 347 228 Updated Apr 16, 2024

Mirror of the official PostgreSQL GIT repository. Note that this is just a *mirror* - we don't work with pull requests on github. To contribute, please see https://wiki.postgresql.org/wiki/Submitti…

C 19,030 5,200 Updated Nov 12, 2025

QuestDB is a high-performance, open-source, time-series database

Java 16,348 1,508 Updated Nov 11, 2025

Learn Low Level Design (LLD) and prepare for interviews using free resources.

Java 19,309 4,788 Updated Oct 25, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 22,897 2,603 Updated Oct 30, 2025

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Rust 10,512 242 Updated Nov 10, 2025

A list of learning materials to understand database internals

10,450 1,177 Updated Aug 29, 2024

Apache DataFusion SQL Query Engine

Rust 8,002 1,750 Updated Nov 12, 2025

Extremely fast non-cryptographic hash algorithm

C 20 1 Updated May 30, 2024

Learn System Design concepts and prepare for interviews using free resources.

Java 27,560 6,414 Updated Oct 15, 2025

Learn how to design systems at scale and prepare for system design interviews

38,557 4,770 Updated Apr 10, 2024

CMU-DB's Cascades optimizer framework

Rust 404 29 Updated Jan 6, 2025

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

77,881 8,464 Updated Apr 4, 2025

Thesis class for undergraduate theses at the University of Athens

TeX 1 Updated Oct 21, 2021

Papers for database systems powered by artificial intelligence (machine learning for database)

757 94 Updated Sep 16, 2025

C 65 3 Updated Mar 9, 2023

Database system for AI-powered apps

Python 2,686 262 Updated May 17, 2024

PVLDB Artifact Availability for the paper "Asymptotically Better Query Optimization Using Indexed Algebra"

Shell 4 Updated Aug 1, 2023

Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up-to-date material for the hands-on part. This tutorial will be given at SIGMOD 2023.

21 Updated Jun 29, 2023

Jupyter Notebook 4 Updated Oct 19, 2022

StreamDQ is a library built on top of Apache Flink for defining "unit tests for data", which measure data quality in large data streams.

Kotlin 12 4 Updated Aug 4, 2023

Data-Centric What-If Analysis for Native Machine Learning Pipelines

Jupyter Notebook 16 3 Updated Jun 14, 2023