Statistician • Data Scientist • Research Analyst
M.S. in Statistics (UCLA) | B.S. in Applied Mathematics (UCLA)
I recently completed my M.S. in Statistics at UCLA and am currently searching for research-oriented statistician or data scientist/analyst opportunities, particularly in the public sector. I’m passionate about using my mathematical background and programming skills to build tools like data dashboards and web applications and production-level statistical learning models. I have experience in statistical and machine learning research in academia and a diverse set of data analysis/science skills In particular, I specialize in:
- Reproducible data workflows in Python, R, and SQL
- Statistical modeling, machine learning, and spatial data analysis
- Data visualization and Interactive dashboards using Dash, Plotly, and Streamlit
- Privacy-aware modeling and synthetic data evaluation
I recently conducted research at the UCLA Trustworthy AI Lab, developing tools to audit privacy risks in synthetic data models — especially in the context of generative models in the healthcare domain.
Languages:
Python · R · PostgreSQL · MySQL · HTML/CSS · LaTeX
Libraries & Frameworks:
pandas · NumPy · scikit-learn · PyTorch · XGBoost · tidyverse
Dashboards & Visualization:
Dash · Streamlit · Plotly · Matplotlib · ggplot2 · Excel
Tools & Technologies:
FastAPI · Flask · AWS · Git · Jupyter
An interactive basketball analytics platform with video playback
A full-stack web app for visualizing NBA play-by-play data using Dash and FastAPI.
- Heatmaps, pass networks, scoring timelines
- Filter by player, game, or play type
- Dynamically loads relevant video clips
Tech stack: Python, Plotly, Dash, FastAPI, MoviePy, SQLite
Visualizing correspondence networks and document metadata from the Founding Era
An interactive Dash app that aggregates and analyzes metadata from letters written by and to key American founders.
- Interactive graphs showing communication frequency, relationships, and geographic trends
- Historical document metadata enrichment and network analysis
- Spatial and temporal visualizations of early U.S. political discourse
Tech stack: Python, Dash, Plotly, NetworkX, spaCy, pandas
Evaluating privacy risks in synthetic health records
A research pipeline assessing de-identification and attack resilience in generative health data.
- Membership inference simulations
- Differential privacy scoring
- Visualizations of trade-offs between utility and privacy
Tech stack: Python, NumPy, pandas, scikit-learn, PyTorch
- Website / Portfolio: nicklauskim.github.io
- LinkedIn: linkedin.com/in/nicklauskim
- Résumé: Available on request or via portfolio site
Thanks for stopping by! Feel free to explore my pinned projects or reach out if you’re interested in collaborating or hiring.
