đź‘‹ Hi, I'm Diego

🧠LLMs & RAG🤖AI / Machine Learning🛠️MLOps & ML Systems🧩Data Engineering

Welcome to my portfolio.

I am a Machine Learning Engineer / Data Scientist with a strong interest in applied mathematics, statistical learning, and model design. My work focuses on understanding, building, and evaluating machine learning models from first principles, while ensuring clean data pipelines and reproducible experimentation.

I enjoy bridging theory and practice, from mathematical foundations to end-to-end ML systems.

Objectives

Objectives

My objective is to contribute to research-oriented or applied machine learning teams, where mathematical rigor, model understanding, and careful evaluation matter.

I am particularly interested in roles involving:

  • Machine learning and statistical modeling
  • Model evaluation, optimization, and interpretability
  • MLOps practices for reproducible and maintainable ML systems

I am currently seeking a 6-month internship in Machine Learning or Data Science in 2026, with a flexible start date.

Additional Activities

Additional Activities

Beyond my core academic and technical projects, I actively explore modern AI systems, including LLMs, RAG, and practical MLOps patterns for building reproducible ML workflows.

I value continuous learning through personal projects, technical reading, and participation in hackathons or collaborative initiatives, with a strong emphasis on clarity, reproducibility, and engineering discipline.

In the medium term, I plan to:

  • Participate in Kaggle competitions to strengthen my applied ML skills and benchmarking mindset
  • Share learnings through short technical write-ups (e.g., Medium / Substack) and community initiatives (e.g., DeepLearning.AI)
  • Contribute to open-source projects when relevant, especially around data pipelines and ML tooling

Outside of tech, I also enjoy guitar and singing, which helps me stay creative and consistent 🎸

Education

École 42 Paris

Expert in IT Architecture · ML / Data / Systems

Intensive, project-based training focused on software engineering and data systems (algorithms, C/C++, Python, SQL, Linux), with an emphasis on rigor, autonomy, and reproducibility. Hands-on projects also cover Machine Learning fundamentals (preprocessing, training, evaluation).

2023 – 2026

Education

MSc – Civil Engineering (IMRO)

Université de Limoges

Master’s degree in structural analysis and infrastructure diagnostics, with a strong emphasis on data-driven engineering and applied mathematics, building a rigorous foundation in quantitative methodology.

2018 – 2020

Background

Engineering Background

Infrastructure & Structural Engineering

Several years of experience on large-scale infrastructure projects, combining analysis, quality control, stakeholder coordination, and technical reporting in high-constraint environments.

2014 – 2023

Location

Rueil-Malmaison, France

GMT+1

Professional Journey

A timeline of my education, professional experience, and transition into machine learning.

2026

Machine Learning Projects & Applied Practice

CURRENT

@ Personal & Academic Projects

🚀 Project

2025 – Present

Designed and implemented end-to-end ML projects, covering data preprocessing, model training, evaluation, and reproducible experimentation.

Machine LearningModel EvaluationExperimentationReproducibilityEnd-to-End Projectsscikit-learnTensorFlow/Keras
2026

École 42 – Software, Data & ML Systems

CURRENT

@ École 42 Paris

🎓 Education

2023 - 2026

Project-based training focused on algorithms, systems programming, data systems, and Machine Learning fundamentals, with strong emphasis on rigor, autonomy, and reproducibility.

AlgorithmsSystems ProgrammingData SystemsSoftware EngineeringMachine Learning Foundations
2024

Transition to Computer Science & Python

@ Self-study · Online courses · Projects

🔄 Transition

2023 - 2024

Initiated a self-directed transition into computer science, starting with Python fundamentals and progressively building strong programming habits through online courses and hands-on practice.

I then expanded into C and C++, focusing on low-level concepts, problem-solving, and software engineering foundations.

Self-learningComputer ScienceProblem SolvingProgramming FoundationsAlgorithms
2023

Engineering Foundations

@ Civil & Structural Engineering

đź’Ľ Employment

2014 - 2023

Built strong foundations in applied mathematics, engineering analysis, and methodology through infrastructure and structural engineering projects in high-constraint environments.

ExcelVBAData AnalysisQuality ControlTechnical ReportingStakeholder Coordination
2020

MSc – Applied Mathematics & Data-Driven Engineering

@ Université de Limoges

🎓 Education

2018 - 2020

Strengthened quantitative reasoning through applied mathematics, structural analysis, diagnostics, and data-driven engineering methodologies.

Applied MathematicsModelingQuantitative ReasoningData AnalysisStatistics

Featured Projects

A selection of my recent work across software engineering and machine learning.

Multilayer Perceptron (From Scratch)

Built a multilayer perceptron from scratch for breast cancer diagnosis (benign vs malignant), implementing forward/backpropagation and gradient-based training. Inspired by the Keras/TensorFlow API, including a custom Sequential-like design to understand deep learning internals.

PythonNumPyDeep LearningBackpropagationpandasKeras/TensorFlow-inspired classes

Leaffliction (Computer Vision)

Built a modular and reproducible deep learning pipeline for image classification using PyTorch, including preprocessing, data augmentation, transformation and CNN training with performance evaluation.

PythonOpenCVPyTorchmatplotlibFeature engineeringReproducible ML pipelines

Total Perspective Vortex (BCI / EEG)

EEG signal processing pipeline with CSP and logistic regression for motor imagery classification.

PythonNumPySciPyscikit-learnSignal ProcessingmatplotlibFeature engineeringData modelingReproducible ML pipelines

DSLR

Built a complete multiclass Logistic Regression pipeline from scratch (One-vs-All) to classify Hogwarts houses from student scores. Implemented data cleaning, feature selection, visualizations, and custom gradient descent variants (BGD / SGD / Mini-batch), with systematic evaluation — no pandas, no scikit-learn.

PythonLogistic RegressionGradient DescentData VisualizationFeature EngineeringFrom ScratchPythonPandasmatplotlibData modelingReproducible ML pipelinesBusiness Intelligence (BI)

Inception-of-Things

Hands-on DevOps project to learn Kubernetes fundamentals through progressive cluster setups (K3s/K3d), automated provisioning (Vagrant), and application deployments. Implemented a GitOps workflow with ArgoCD, reinforcing an MLOps-ready mindset around reproducibility, automation, and scalable deployment patterns.

Kubernetes (K3s/K3d)VagrantArgoCDGitOpsDocker

Technical Skills

A snapshot of the tools, frameworks, and systems I work with.

Machine Learning

Project-basedscikit-learn
Project-basedTensorFlow
Project-basedKeras
Hands-onPyTorch
Project-basedClassical ML
Project-basedDeep Learning
Project-basedFeature engineering

Mathematics

Project-basedLinear Algebra
Project-basedStatistics

Tools

Project-basedGit & GitHub
Hands-onVS Code
Hands-onJupyter

DevOps & Infrastructure

Project-basedLinux
Project-basedBash
Hands-onNginx
Project-basedELK Stack

MLOps & Cloud

Project-basedDocker
Hands-onKubernetes (K3s/K3d)
Hands-onGitOps - ArgoCD
FoundationsCI/CD
Project-basedCloud fundamentals

Data Science

Hands-onSQL
Hands-onPandas
Project-basedNumPy
Project-basedMatplotlib
Project-basedElasticsearch (search & indexing)
Project-basedData preprocessing
Project-basedData modeling

Programming

Project-basedPython
Project-basedC
Project-basedC++
Hands-onJavaScript
Hands-onTypeScript
Hands-onFastAPI

More Projects

Explore my collection of personal projects and creative experiments

Writing & Content

Sharing knowledge through articles, tutorials, and data science notebooks

Kaggle

Data Science

Data science notebooks and competitions

Coming

Medium

Technical Articles

AI/ML insights and technical deep dives

Coming

DeepLearning.AI

Forum

Technical discussions within the global AI learning community

Coming

Substack

Newsletter

Personal newsletter on AI, data science, and learning notes.

Coming

Open Source & Community Contributions

Community and open-source contributions through technical collaboration and learning.

42AI

Technical Pole Member

Planned participation in the 42AI technical pole, contributing to internal AI/ML initiatives and tooling.

PythonMachine LearningDocumentation
Planned