Production-Ready Infrastructure as Code + Comprehensive Data Engineering Course
A complete Databricks learning platform combining:
- 🎓 27 hands-on notebooks (fundamentals to production apps + data modelling)
- 🏗️ Terraform automation for Unity Catalog, users, permissions, and RBAC
- 🚀 Zero-setup learning for students + 15-minute deployment for admins
👉 Visit the Web UI:
- 🌍 Live Site: https://datatribe-collective-labs.github.io/databricks-infra (GitHub Pages)
- 📁 Local: Open index.html
The web UI provides:
- Student Guide - Get workspace access and start learning in 3 steps
- Admin Guide - Deploy complete infrastructure in 15 minutes
- Course Curriculum - Browse all 27 notebooks organized by module
- 🌙 Day/Night Mode - Toggle theme for comfortable viewing
- 📱 Mobile-Friendly - Responsive design with hamburger menu
databricks-infra/
├── web/ # Web UI (START HERE)
│ ├── index.html # Main landing page
│ ├── data-engineer.html # Student guide
│ ├── platform-engineer.html # Admin guide
│ ├── curriculum.html # Course curriculum
│ └── styles.css # Shared styles
├── README.md # This file
├── CLAUDE.md # Technical docs for AI assistance
├── docs/ # Reference documentation
│ ├── DataEngineer-readme.md # Detailed student guide
│ ├── DataPlatformEngineer-readme.md # Detailed admin guide
│ ├── USER_SCHEMA_GUIDE.md # User isolation technical guide
│ └── assets/ # Logo and images
├── course/ # Course content
│ ├── notebooks/ # 27 Databricks notebooks
│ └── datasets/ # Sample data files
├── terraform/ # Infrastructure as Code
│ ├── main.tf, groups.tf, catalogs.tf
│ ├── users.json # User configuration
│ └── versions.tf # Provider config
├── src/ # Python package
│ ├── cli.py # CLI tools
│ └── utils.py # Utilities
└── tests/ # Test suite
- 🌐 Web Guide: web/data-engineer.html
- 📖 Detailed Docs: docs/DataEngineer-readme.md
- 🔗 Workspace: https://dbc-d8111651-e8b1.cloud.databricks.com
- 🌐 Web Guide: web/platform-engineer.html
- 📖 Detailed Docs: docs/DataPlatformEngineer-readme.md
- 🔧 Technical Reference: CLAUDE.md
- 🌐 Curriculum: web/curriculum.html
- 📁 Notebooks: course/notebooks/
Foundational Knowledge:
- Week 1: Databricks Fundamentals (5 notebooks)
- Foundations: Data Modelling Patterns (4 notebooks)
Applied Learning:
- Week 2: Data Ingestion (5 notebooks)
- Week 3: Advanced Transformations (4 notebooks)
- Week 4: End-to-End Workflows (3 notebooks)
- Week 5: Production Deployment (4 notebooks)
Advanced Topics:
- Advanced: Databricks Apps with Streamlit (2 notebooks)
- 8 users with role-based access control
- 5 Unity Catalogs (sales, marketing, course)
- 24 schemas (medallion architecture: bronze, silver, gold)
- User isolation - each student gets personal workspace
- CI/CD pipeline - automated deployment via GitHub Actions
# Open the web UI to get workspace access
open web/index.html
# Then navigate to: /Shared/terraform-managed/course/notebooks/ in Databricks# Clone and setup
git clone https://github.com/chanukyapekala/databricks-infra
cd databricks-infra
poetry install
# Configure authentication (requires workspace admin)
databricks configure --token --profile datatribe
# Deploy infrastructure
cd terraform
terraform init
terraform applyVisit the web UI and click "Request Workspace Access" - you'll be directed to a Google Form to submit your request. We'll respond within 24 hours.
The web UI uses a Google Form for access requests (replacing the old broken mailto: link). To set up:
-
Create Google Form at https://forms.google.com with fields:
- Full Name (short answer, required)
- Email Address (short answer with email validation, required)
- Background (multiple choice: Student/Professional/Academic/Other)
- Why do you want to learn Databricks? (paragraph, required)
- GDPR Consent checkbox (required)
-
Configure Settings:
- Enable "Collect email addresses"
- Enable "Limit to 1 response" (optional)
- Set confirmation message: "Thank you! We'll contact you within 24 hours."
-
Get Form URL: Click Send → Link icon → Copy URL
-
Update Web UI: Edit
web/data-engineer.htmlline 220 and replaceYOUR_FORM_ID_HEREwith your actual form URL -
Enable Notifications: In form settings, enable email notifications for new responses
Benefits: No email client required, mobile-friendly, structured data collection, GDPR compliant
- 🐛 Issues: Use GitHub Issues
- 📖 Technical Docs: See CLAUDE.md for AI-assisted development
- 🔍 Troubleshooting: Check guides in docs/
- 💬 Contact: Reach out via Discord data-engg channel here: DataTribe Discord
🎓 Ready to learn? 🏗️ Ready to deploy? Start your Databricks journey with DataTribe today! 🚀