Flantier-Nuclear-Reglementation-1

A specialized vision-language model optimized for nuclear regulatory document analysis and retrieval.

Overview

Flantier-Nuclear-Reglementation-1 is a fine-tuned version of HuggingFaceTB/SmolVLM-Instruct, specifically optimized for nuclear regulatory document retrieval tasks. The model demonstrates exceptional performance in analyzing technical documents, diagrams, and regulatory content in both English and French, achieving state-of-the-art results in nuclear domain applications.

Read our blog post here.

Key Features

  • Nuclear Domain Expertise: Fine-tuned on 40,000 nuclear regulatory examples from IAEA, NEA/OECD, WENRA, EU directives, and French nuclear authorities

  • Multimodal Analysis: Simultaneously processes regulatory text, technical diagrams, safety flowcharts, and parameter tables

  • Bilingual Performance: Optimized for both English and French nuclear documentation

  • High Precision Retrieval: Achieves 74% accuracy (EN) and 61% accuracy (FR) on nuclear regulatory document retrieval tasks

  • European Sovereignty: Built on European open-source architecture for strategic autonomy in critical sectors

Benchmark

Nuclear Regulatory Document Retrieval Performance

Nuclear Regulatory Document Retrieval Performance

Performance on nuclear regulatory document retrieval (NDCG@1):

Model English French
HuggingFaceTB/SmolVLM-Instruct 0.17 0.04
llamaindex/vdr-2b-multi-v1 0.66 0.48
racineai/Flantier-SmolVLM-2B-dse 0.69 0.57
Flantier-Nuclear-Reglementation-1 0.74 0.61

Applications

  • Nuclear Regulatory Compliance: Retrieve relevant safety standards and regulatory requirements

  • Technical Documentation Analysis: Process nuclear technical diagrams, safety flowcharts, and parameter tables

  • Multilingual Regulatory Search: Handle international nuclear documentation in English and French

  • Safety Assessment Support: Assist in nuclear safety evaluations and compliance verification

Training Methodology

This model was fine-tuned using LoRA (Low-Rank Adaptation) on our specialized OGC Nuclear Dataset, which includes:

  • Regulatory documents from IAEA, NEA/OECD, WENRA
  • European Union nuclear safety directives
  • French nuclear regulatory framework (ASN orders, IRSN guides)
  • Technical documentation from nuclear operators

The training focused on nuclear-specific terminologies including criticality, containment, radiation protection, and regulatory compliance requirements.

Dataset

Our training utilized the Organized Grouped Cleaned (OGC) Nuclear Dataset, available as an open-source resource for nuclear AI research and development.

Citation

@misc{flantier-nuclear-reglementation-1,
    author = {Appourchaux, Léo and Brandolini, Noé and Ye, Yumeng},
    title = {Flantier-Nuclear-Reglementation-1: European Vision-Language Model for Nuclear Regulatory Data},
    year = {2025},
    organization = {Racine.ai, TW3 Partners and École Centrale d'Électronique},
    url = {https://huggingface.co/racineai/Flantier-Nuclear-Reglementation-1}
}

Acknowledgments

This work was developed in collaboration with the Intelligence Lab of École Centrale d'Électronique and built upon Hugging Face's foundational SmolVLM architecture. We thank the nuclear regulatory organizations whose public documentation enabled this research.

License

This model is released under the Apache 2.0 license.

Authors

  • Yumeng Ye: R&D at Racine.ai (Project Lead)
  • Léo Appourchaux: AI Developer at TW3 Partners
  • Noé Brandolini: R&D at TW3 Partners
Downloads last month
12
Safetensors
Model size
2.25B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support