Ce que vous devez savoir avant
Vous commencez

Débute 4 June 2026 04:36

Se termine 4 June 2026

00 Jours
00 Heures
00 Minutes
00 Secondes
course image

Maîtriser l'Intelligence Artificielle Générative et le Déploiement des Modèles de Langage de Grande Taille.

Compétence sur OpenAI, LangChain, MidJourney, LLama3 ; Applications Javascript pour des prototypes d'inférence 20X plus rapides. Soyez embauché.
via Udemy

4160 Cours


14 hours 57 minutes

Amélioration optionnelle disponible

Not Specified

Progressez à votre rythme

Paid Course

Amélioration optionnelle disponible

Aperçu

Proficiency on OpenAI,LangChain,MidJourney,LLama3,;Javascript Applications for 20X Fast Inference Prototypes.Get Hired What you'll learn:

What is Docker and How to use DockerAdvance Docker UsageWhat are OpenCL and OpenGL and when to use ?(LAB) Tensorflow and Pytorch Installation, Configuration with Docker(LAB)DockerFile, Docker Compile and Docker Compose Debug file configuration(LAB)Different YOLO version, comparisons, and when to use which version of YOLO according to your problem(LAB)Jupyter Notebook Editor as well as Visual Studio Coding Skills(LAB)Learn and Prepare yourself for full stack and c++ coding exercies(LAB)TENSORRT PRECISION FLOAT 32/16 MODEL QUANTIZIATIONKey Differences:

Explicit vs. Implicit Batch Size(LAB)TENSORRT PRECISION INT8 MODEL QUANTIZIATION(LAB) Visual Studio Code Setup and Docker Debugger with VS and GDB Debugger(LAB) what is ONNX framework C Plus and how to use apply onnx to your custom C ++ problems(LAB) What is TensorRT Framework and how to use apply to your custom problems(LAB) Custom Detection, Classification, Segmentation problems and inference on images and videos(LAB) Advance C ++ Object Oriented Programming(LAB) Deep Learning Problem Solving Skills on Edge Devices, and Cloud Computings with C++ Programming Language(LAB) How to generate High Performance Inference Models on Embedded Device, in order to get high precision, FPS detection as well as less gpu memory consumption(LAB) Visual Studio Code with Docker(LAB) GDB Debugger with SonarLite and SonarCube Debuggers(LAB) yolov4 onnx inference with opencv c++ dnn libraries(LAB) yolov5 onnx inference with opencv c++ dnn libraries(LAB) yolov5 onnx inference with Dynamic C++ TensorRT Libraries(LAB) C++(11/14/17) compiler programming exerciesKey Differences:

OpenCV AND CUDA/ OPENCV AND TENSORRT(LAB) Deep Dive on React Development with Axios Front End Rest API(LAB) Deep Dive on Flask Rest API with REACT with MySqlUnderstand model optimization techniques:

Pruning, Distillation, and Quantization(LAB) Deep Dive on Text Summarization Inference on Web App(LAB) Prompt Penetration Testing(LAB) Deep Dive on BERT (LLM) Fine tunning and Emotion Analysis on Web App(LAB) Deep Dive On Distributed GPU Programming with Natural Language Processing (Large Language Models))(LAB) Deep Dive on BERT (LLM) Fine tunning and Emotion Analysis on Web App(LAB) Prompt Engineering from basics to advance(LAB) Deep Dive on Generative AI use cases, project lifecycle, and model pre-training(LAB) OPENAI GPT models with Specific prompt Engineering techniques(LAB) Fine-tuning and evaluating large language models(LAB) Reinforcement learning and LLM-powered applications, ALIGN Fine tunning with User Feedback(LAB) Quantization of Large Language Models with Modern Nvidia GPU's(LAB) C++ OOP TensorRT Quantization and Fast Inference(LAB) Deep Dive on Hugging FACE Library(LAB)Translation ● Text summarization ● Question answering(LAB)Sequence-to-sequence models, ONLY Encoder Based Models, Only Decoder Based Models(LAB)Define the terms Generative AI, large language models, prompt, and describe the transformer architecture that powers LLMs(LAB)Discuss computational challenges during model pre-training and determine how to efficiently reduce memory footprint(LAB)Describe how fine-tuning with instructions using prompt datasets can improve performance on one or more tasks(LAB)Explain how PEFT decreases computational cost and overcomes catastrophic forgetting(LAB)Describe how RLHF uses human feedback to improve the performance and alignment of large language models(LAB)Discuss the challenges that LLMs face with knowledge cut-offs, and explain how information retrieval and augmentation techniques can overcome these challenRecognize and understand the various strategies and techniques used in fine-tuning language models for specialized applications.Master the skills necessary to preprocess datasets effectively, ensuring they are in the ideal format for AI training.Investigate the vast potential of fine-tuned AI models in practical, real-world scenarios across multiple industries.Acquire knowledge on how to estimate and manage the costs associated with AI model training, making the process efficient and economicDistributing Computing for (DDP) Distributed Data Parallelization and Fully Shared Data Parallel across multi GPU/CPU with Pytorch together with Retrieval AugmeThe RoBERTa model was proposed in RoBERTa:

A Robustly Optimized BERT Pretraining ApproachMaster downcasting from FP32 to BF16 and FP32 to INT8Learn the difference between symmetric and asymmetric quantizationImplement quantization techniques in Python with real examplesApply quantization to make models more efficient and deployment-readyLearn the basics of data types like FP32, FP16, BFloat16, and INT8Gain practical skills to optimize models for edge devices and resource-constrained environmentsAdvance Image Generation and Editing This course is diving into Generative AI State-Of-Art Scientific Challenges.

It helps to uncover ongoing problems and develop or customize your Own Large Models Applications. Course mainly is suitable for any candidates(students, engineers,experts) that have great motivation to Large Language Models with Todays-Ongoing Challenges as well as their deeployment with Python Based and Javascript Web Applications, as well as with C/C++ Programming Languages.

Candidates will have deep knowledge on TensorFlow , Pytorch, Keras models, HuggingFace with Docker Service. In addition, one will be able to optimize and quantize TensorRT frameworks for deployment in variety of sectors.

Moreover, They will learn deployment of LLM quantized model to Web Pages developed with React, Javascript and FLASKHere you will also learn how to integrate Reinforcement Learning(PPO) to Large Language Model, in order to fine them with Human Feedback based. Candidates will learn how to code and debug in C/C++ Programming languages at least in intermediate level.LLM Models used:

The Falcon, LLAMA2, BLOOM, MPT, Vicuna,FLAN-T5, GPT2/GPT3, GPTNEOXBERT 101, Distil BERTFINE-Tuning Small Models under supervision of BIG ModelsImage Generation :

LLAMA modelsGemini Dall-E OpenAIHugging face ModelsLearning and Installation of Docker from scratchKnowledge of Javscript, HTML ,CSS, BootstrapReact Hook,DOM and Javacscript Web DevelopmentDeep Dive on Deep Learning Transformer based Natural Language ProcessingPython FLASK Rest API along with MySqlPreparation of DockerFiles, Docker Compose as well as Docker Compose Debug fileConfiguration and Installation of Plugin packages in Visual Studio CodeLearning, Installation and Confguration of frameworks such as Tensorflow, Pytorch, Kears with docker images from scratchPreprocessing and Preparation of Deep learning datasets for training and testingOpenCV DNNwith C++ InferenceTraining, Testing and Validation of Deep Learning frameworksConversion of prebuilt models to Onnx and Onnx Inference on images with C++ ProgrammingConversion of onnx model to TensorRT engine with C++ RunTime and Compile Time APITensorRT engine Inference on images and videosComparison of achieved metrices and result between TensorRT and Onnx InferencePrepare Yourself for C++ Object Oriented Programming Inference!Ready to solve any programming challenge with C/C++ Read to tackle Deployment issues on Edge Devices as well as CloudAreasLarge Language Models Fine TunningLarge Language Models Hands-On-Practice:

BLOOM, GPT3-GPT3.5, FLAN-T5 familyLarge Language Models Training, Evaluation and User-Defined Prompt IN-Context Learning/On-Line LearningHuman FeedBack Alignment on LLM with Reinforcement Learning (PPO) with Large Language Model :

BERT and FLAN-T5How to Avoid Catastropich Forgetting Program on Large Multi-Task LLM Models.How to prepare LLM for Multi-Task Problems such as Code Generation, Summarization, Content Analizer, Image Generation.Quantization of Large Language Models with various existing state-of-art techniquesImportante Note:

In this course, there is not nothing to copy &paste, you will put your hands in every line of project to be successfully LLM and Web Application Developer!You DO NOT need any Special Hardware component.

You will be delivering project either on CLOUD or on Your Local Computer.

Programme

  • Introduction à l'IA Générative et Déploiement des MLL
  • Vue d'ensemble des Concepts de l'IA Générative
    Introduction aux Grands Modèles de Langage (MLL)
    Objectifs et Résultats du Cours
  • Notions de Base de Docker et Utilisation Avancée
  • Qu'est-ce que Docker et Comment l'Utiliser
    DockerFile, Compilation Docker, et Docker Compose
    Configuration et Débogage des Fichiers Docker Compose
  • OpenCL, OpenGL, et Cadres de Développement pour le Deep Learning
  • Vue d'ensemble d'OpenCL et OpenGL
    Installation et Configuration de TensorFlow et PyTorch avec Docker
    Jupyter Notebook et Visual Studio Code pour le Développement
  • Techniques de Déploiement et d'Optimisation
  • Précision TensorRT : Quantification des Modèles FP32/FP16 et INT8
    Optimisation de Modèle : Pruning, Distillation, et Quantification
    Framework ONNX : Application dans les Problèmes Personnalisés en C++
  • Programmation C++ Avancée pour l'IA
  • Programmation Orientée Objet en C++
    Inférence Rapide sur les Dispositifs Edge et le Cloud
    OpenCV DNN avec Inférence en C++
  • Modèles YOLO pour la Détection d'Objets
  • Comparaison des Versions de YOLO
    Inférence ONNX de YOLOv4 et YOLOv5 avec C++ et TensorRT
  • Grands Modèles de Langage et Ajustement Fin
  • Vue d'ensemble des Modèles : Falcon, LLAMA2, BLOOM, MPT, Vicuna, FLAN-T5, GPT-2/3
    Ajustement Fin et Évaluation des Grands Modèles avec Retour d'Utilisateurs
    Techniques de Quantification pour les MLL
  • Développement Frontend et Backend pour le Déploiement IA
  • Développement React avec Axios et APIs REST
    Développement API REST Flask avec MySQL
    Intégration et Déploiement des Modèles vers les Applications Web
  • Traitement Automatique du Langage avec Transformateurs
  • Exploration Approfondie de l'Architecture Transformateur
    Ingénierie de Prompt des Bases aux Avancées
    Modèles Séquence-à-séquence : Basés sur l'Encodeur et le Décodeur
  • Cas d'Utilisation Avancés en IA Générative
  • IA Générative en Génération et Édition d'Images
    Défis à la Pointe de l'IA Générative
    Cycles de Vie des Projets et Pré-entraînement des Modèles
  • Calcul Distribué et Traitement Multi-GPU
  • Parallélisation des Données Distribuées et Parallèle Entièrement Partagé
    Techniques d'Augmentation de la Récupération avec le Calcul Distribué
  • Application Pratique et Laboratoires Pratiques
  • Exercices de Programmation en C++
    Exploration Approfondie sur Synthèse de Texte et Ajustement Fin de BERT
    Débogage dans Visual Studio Code avec GDB et SonarCube
  • Apprentissage par Renforcement avec Retour Humain
  • Application de PPO aux MLL
    Traiter l'Oubli Catastrophique dans les Modèles Multi-Tâches
  • Scénarios Réels et Applications Industrielles
  • Prétraitement des Données pour l'Entraînement IA
    Optimisation des Modèles pour les Dispositifs Edge
    Gestion Économique des Coûts d'Entraînement des Modèles
  • Conclusion et Prochaines Étapes
  • Récapitulatif du Cours et Points Clés
    Chemins pour un Apprentissage et un Développement Supplémentaires
    Projets Finaux et Présentations des Projets de Fin d'Études

Enseigné par

PhD Researcher AI & Robotics Scientist Fikrat Gasimov


Matières

Computer Science