Lists (1)
Sort Name ascending (A-Z)
Stars
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
🧮 A collection of resources to learn mathematics for machine learning
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
📚 Parameterize, execute, and analyze notebooks
The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports comp…
A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.
Python library for interactive topic model visualization. Port of the R LDAvis package.
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
General purpose unsupervised sentence representations
Deep Learning Pipelines for Apache Spark
Public facing notes page
120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
Area-weighted venn-diagrams for Python/matplotlib
A pure-python implementation of the UpSet suite of visualisation methods by Lex, Gehlenborg et al.
Columnar storage extension for Postgres built as a foreign data wrapper. Check out https://github.com/citusdata/citus for a modernized columnar storage implementation built as a table access method.
(Deprecated) Scikit-learn integration package for Apache Spark
Python port of Google's libphonenumber
📙 Amazon Web Services — a practical guide
Ansible Provisioner for Test Kitchen