Lists (1)
Sort Name ascending (A-Z)
Stars
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
🧮 A collection of resources to learn mathematics for machine learning
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
📚 Parameterize, execute, and analyze notebooks
The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports comp…
A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.
Python library for interactive topic model visualization. Port of the R LDAvis package.
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
General purpose unsupervised sentence representations
Deep Learning Pipelines for Apache Spark
Public facing notes page
120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
Area-weighted venn-diagrams for Python/matplotlib
A pure-python implementation of the UpSet suite of visualisation methods by Lex, Gehlenborg et al.
Columnar storage extension for Postgres built as a foreign data wrapper. Check out https://github.com/citusdata/citus for a modernized columnar storage implementation built as a table access method.
Python port of Google's libphonenumber
📙 Amazon Web Services — a practical guide
Ansible Provisioner for Test Kitchen
Test Kitchen is an integration tool for developing and testing infrastructure code and software on isolated target platforms