IA Générative

Building a recommendation engine that doesn't trust the LLM

This is the engineering companion to the production architecture piece. Instead of re-arguing why open-ended agents are risky in commerce, it walks through the actual implementation choices in `ai-florist`: FastAPI boundaries, LangGraph orchestration, pgvector retrieval, learned scoring weights, deterministic fallbacks, and runtime observability.

Par

Cyril Noirot

12 avril 2026

11 min de lecture

The easiest way to build an AI recommendation system is to let the model browse the catalog, reason in a loop, and pick products.

When I built ai-florist, I wanted the opposite property: if a recommendation is wrong, I want to know which layer was wrong. Was intent parsing bad? Did delivery filtering remove too much? Did vector retrieval miss the right candidates? Were the scoring weights off? Did the rationale overstate the fit?

That requirement leads to a very different design from the usual "agent with tools" pattern. The LLM is still there, but it is treated like a narrow component with typed inputs and typed outputs. The engine that decides what to show is deterministic, observable, and debuggable.

This article is the engineering companion to What agentic commerce actually requires in production. That piece makes the architectural argument. This one walks through the implementation.

À propos de l'auteur

Cyril Noirot

Lead Data Scientist

Data scientist freelance. Je conçois et déploie des systèmes de décision — prévision, pricing, marketing measurement, optimisation.

Plus sur Cyril Réalisations LinkedIn

En pratique

Études de cas anonymisées où ces idées ont été appliquées à de vrais problèmes de décision.

Generative AIPremium Florist (reference implementation)

AI-guided recommendation engine for premium floral e-commerce

A production-oriented recommendation system that guides customers through emotionally loaded floral purchases — using a deterministic state machine with LLM components constrained to intent parsing and rationale generation only.

Lire l'étude

From SaaS to intelligence native: the feedback loop.

Intelligence-native systems need agent access to decision artefacts and feedback loops. Why context, not models, is the differentiator — and how MCP, traditional ML, and versioned artefacts fit together.

IA Générative8 min

Who controls what your AI recommends?

Seasons, campaigns, and weekly events — retail runs on overlapping cycles, and the AI recommender has to keep up with all of them. Notes on the business-rules control surface that lets merchandising teams steer a conversational recommender without editing prompts, filing tickets, or waiting for a deploy.

IA Générative11 min

Ce que le commerce agentique exige vraiment en production

Macy's annonce que les clients utilisant son assistant IA dépensent 4,75x plus. Sephora vient de lancer une app dans ChatGPT. Zalando déploie son assistant dans 25 marchés. La question pour tous les autres retailers n'est plus 'faut-il le faire ?' mais 'comment l'architecturer pour que ça tienne en production ?'

Newsletter

Articles techniques sur la prévision, le pricing et les systèmes de décision. Aucune fréquence imposée.

Enter your email