Biomedical engineering - Big Data analytics platforms
Anaconda: Conda cheatsheet
Git: Git cheatsheet, Git tutorial
Jupyter Notebook: Tips & Tricks
Using Markdown: 3-minute tutorial
Data manipulation: NumPy Quick tutorial
LAB: Simple data exploration and making notes
CPU, GPU, Multinode clusters
AWS, Google Cloud
Chef, Puppet
Docker, Vagrant
Python, Dask
LAB: Design and execute algorithms on a cluster
Hadoop ecosystem: HDFS, MapReduce, Impala, HBase
Hadoop ecosystem: Pig, Hive, Sqoop, Flume
Hadoop ecosystem: Hue, Mahout
Apache Spark, Apache Storm
Cloudera, Databricks
HDFS, HBase
Neo4j, FlockDB
Cassandra
Redis, Riak KV, Riak TS
scikit-learn
TensorFlow
Spark MLlib
LAB: ML with Large Datasets
LAB: Real-time data processing
Stack: Redis, Apache Storm, Flask, D3.js, TensorFlow