Skip to content
View YannPhamVan's full-sized avatar

Highlights

  • Pro

Block or report YannPhamVan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
YannPhamVan/README.md

👋 Welcome to My GitHub Profile

🚀 About Me

I am a Data Scientist specializing in Machine Learning, with a strong focus on anomaly detection and scoring. My goal is to help businesses leverage the power of their data to enhance decision-making and automate processes.

With a background in industrial engineering and teaching, I have developed a rigorous approach to data, the ability to explain complex concepts clearly, and expertise in cloud tools for deploying models into production.

🛠️ Skills & Experience

🔹 Machine Learning (Scikit-learn, XGBoost, TensorFlow, PyTorch)
🔹 Data Engineering (SQL, BigQuery, dbt, dlt, Kestra)
🔹 MLOps & Cloud Computing (AWS, GCP, Docker, Kubernetes)
🔹 Development & Automation (Python, FastAPI, Flask)
🔹 Data Visualization & Storytelling (Matplotlib, Seaborn, Streamlit, Looker Studio)

📌 Notable Projects

🌟 Industrial Equipment Failure Prediction
➡️ Predicting industrial equipment failures with a machine learning pipeline, deployed on AWS Elastic Beanstalk.

🌟 Financial Distress Prediction
➡️ Classification model for predicting corporate bankruptcies, with a Flask API and cloud deployment.

🌟 OptiFund: Data-Driven Portfolio Optimization
➡️ A full-stack data engineering project using Kestra, GCS, and BigQuery to ingest and transform global stock indices. Business dashboards built in Looker Studio to compare performance and correlations.

🏆 Certifications & Achievements

Machine Learning Zoomcamp - DataTalksClub
Data Engineering Zoomcamp - DataTalksClub
Data Scientist - OpenClassrooms/CentraleSupélec
Publications & Sharing on LinkedIn about Machine Learning, Data Engineering, and MLOps


💡 Always looking for new challenges in data science and data engineering! Feel free to explore my projects and reach out.

Pinned Loading

  1. OptiFund-Data-Driven-Portfolio-Optimization OptiFund-Data-Driven-Portfolio-Optimization Public

    Jupyter Notebook 1 1

  2. Industrial-Equipment-Failure-Prediction Industrial-Equipment-Failure-Prediction Public

    Jupyter Notebook

  3. financial-distress-prediction financial-distress-prediction Public

    Jupyter Notebook

  4. Projet7-Implementez_un_modele_de_scoring Projet7-Implementez_un_modele_de_scoring Public

    HTML 1

  5. Projet5-Segmentez_des_clients_d_un_site_e-commerce Projet5-Segmentez_des_clients_d_un_site_e-commerce Public

    Jupyter Notebook

  6. Projet4-Anticipez_les_besoins_en_consommation_de_batiments Projet4-Anticipez_les_besoins_en_consommation_de_batiments Public

    Jupyter Notebook