About Skills Projects Experience Education Contact
Open to new opportunities

Saad
Driouech

Machine Learning Engineer bridging research and production: from training diffusion models and pre-training transformers to building RAG pipelines, agentic systems, and end-to-end ML automation. 5 peer-reviewed publications. Based in Fürth, Germany.

5 Publications
3+ Years ML exp.
1.2 M.Sc. grade
4 Languages
Saad Driouech

Researcher. Engineer.
Builder.

I'm a Machine Learning Engineer with a research background in generative AI, NLP, and applied ML. Currently pursuing my M.Sc. in Artificial Intelligence at Friedrich-Alexander-Universität Erlangen-Nürnberg (grade 1.2, top 1%), while working as a Generative AI Research Engineer at Fraunhofer IIS.

My work spans the full spectrum: training diffusion models for signal reconstruction, pre-training transformer language models for low-resource languages, building production RAG systems, and designing agentic LLM pipelines. I care deeply about rigorous experimentation and shipping systems that actually work.

I have 5 peer-reviewed publications across NLP, TTS, and applied ML. I love the challenge of translating research insights into reliable, maintainable software.

🇬🇧 English — Fluent 🇫🇷 French — Advanced 🇲🇦 Arabic — Native 🇩🇪 German — Learning
🧠

Deep Learning

Diffusion models, transformers, TTS: trained and evaluated across diverse domains and modalities.

🔗

LLM Systems

RAG pipelines, agentic workflows with LangGraph, multi-LLM orchestration, and streaming backends.

⚙️

MLOps & Automation

Airflow, n8n, FastAPI, Docker: end-to-end pipelines from data to deployed, monitored systems.

📄

Published Researcher

5 papers in IEEE, Springer, MDPI spanning NLP, TTS, and e-commerce ML.

Tools of the trade

🔬

ML & Deep Learning

PyTorch TensorFlow Scikit-learn XGBoost Diffusion Models Transformers TensorBoard
💬

NLP & Speech

LLMs Hugging Face Fine-tuning (LoRA/PEFT) TTS (FastSpeech 2) HiFi-GAN spaCy
🤖

LLM Systems & RAG

LangChain LangGraph LlamaIndex RAG Qdrant FAISS OpenAI API Anthropic API Groq API

Automation & Backend

n8n Apache Airflow FastAPI REST APIs SSE Streaming Docker Docker Compose
🛠

Infrastructure

Python SQL Git CI/CD Linux
🌍

Languages

English (Fluent) French (Advanced) Arabic (Native) German (Learning)

Things I've built

Side projects exploring the frontier of LLM systems, agentic AI, and ML automation.

✍️

MotiGen

2026

Agentic motivation letter generator built on a LangGraph 4-node pipeline with typed state: CV parsing → JD parsing → live company research (dual Tavily queries for culture + role context) → letter generation. Supports Claude, Groq, and OpenAI via a unified model factory. FastAPI backend with SSE streaming for real-time output; Streamlit UI for file-upload workflows.

LangGraph LangChain Claude / Groq / OpenAI FastAPI + SSE Tavily Pydantic v2 Streamlit

Where I've worked

Generative AI Research Engineer

Mar 2025 — Present

Fraunhofer IIS · Nuremberg, Germany · Working Student

  • Training diffusion-based generative models for GNSS signal reconstruction under real-world interference conditions; iterating on architectures and loss functions to stabilise training and prevent mode collapse.
  • Designed systematic evaluation frameworks measuring model robustness under noise and distribution shift; experimented with spectrogram and complex IQ data representations.
  • Managed the full experimental lifecycle: hypothesis formulation, implementation, ablation studies, and documentation. Tracked all experiments with TensorBoard.
  • Collaborated with signal processing engineers to translate experimental findings into concrete modelling decisions.

Applied Machine Learning Engineer

Dec 2023 — Feb 2025

August-Wilhelm Scheer Institut · Saarbrücken, Germany · Working Student

  • Built end-to-end ML pipelines for garment return prediction on cold-start products with no transaction history, achieving 86% balanced accuracy; addressed class imbalance and feature sparsity.
  • Refactored and parallelized preprocessing pipelines, achieving a 5× runtime speedup and significantly improving iteration speed and reproducibility.
  • Conducted feature importance analysis to identify key return drivers, enabling interpretable recommendations for business stakeholders.

Development Engineer

Jun 2023 — Aug 2023

Hightech Payment Systems · Casablanca, Morocco

  • Enhanced PowerCARD, HPS's global payment switching and card management platform, to meet VISA and Mastercard compliance requirements.
  • Worked with SQL databases, Docker containers, CI/CD pipelines, and Linux environments in a professional engineering setting.

Applied NLP Research Engineer

Sep 2022 — May 2023

Al Akhawayn University · Ifrane, Morocco · Part-time

  • Pre-trained two transformer language models (DarELECTRA 52M, DarRoBERTa 80M) for low-resource Moroccan Darija on a 1 GB code-mixed corpus; fine-tuned on three downstream tasks.
  • Text summarization: DarELECTRA achieved ROUGE-1 19.25 / ROUGE-L 18.01, state-of-the-art among all tested models including ARBERT and MARBERT.
  • Topic classification: F1 0.84 / accuracy 0.86; offensive language detection: 90% accuracy / 85% F1; published at IEEE CiSt 2023 and MDPI 2024.

Applied ML Engineer (Intern)

May 2022 — Jul 2022

Wenov, Attijariwafa Bank Innovation Lab · Casablanca, Morocco

  • Built a transformer-based intent classification system to automatically route multilingual client inquiries to relevant departments, replacing a legacy rule-based system.
  • Outperformed traditional ML baselines by +7% accuracy / +4% F1; evaluated across multiple language variants and edge cases.

Academic background

M.Sc. Artificial Intelligence

Friedrich-Alexander-Universität Erlangen-Nürnberg

Grade 1.2 · Top 1% Oct 2023 — May 2026

Thesis: Spatial Control Mechanisms for Scale-Wise Transformers (SWITTI). ViT-based spatial encoder with cross-attention conditioning for autoregressive image generation.

Pattern Recognition ML for Time Series Advanced Programming

B.Sc. Computer Science

Al Akhawayn University

GPA 3.83 / 4.0 · Summa Cum Laude Sep 2018 — Jul 2022

Capstone: Darija Text-to-Speech Synthesis using FastSpeech 2 + HiFi-GAN on a 2-hour low-resource dataset. MOS 3.905; published at ICDTA 2025.

Probability & Statistics Database Systems Big Data Data Structures

Let's connect

Open to full-time roles, research collaborations, and interesting problems. Based in Fürth, Germany, open to relocation.