Author Topic: Llm Model Quantization: An Overview  (Read 302 times)


Offline mitsumi

  • Sub-Administrator
  • Posts: 130685
  • Karma: +0/-0
Llm Model Quantization: An Overview
« on: 16 November 2023, 11:16 »

Llm Model Quantization: An Overview
Published 11/2023
MP4 | Video: h264, 1920x1080 | Audio: AAC, 44.1 kHz
Language: English | Size: 242.65 MB | Duration: 0h 44m

A General Introduction and Overview of LLM Model Quantization Techniques and Practices

What you'll learn
Understand the fundamental principles of model quantization and its critical role in optimizing Large Language Models (LLMs) for diverse applications.
Explore and differentiate between various types of model quantization methods, including post-training quantization and quantization-aware training.
Gain proficiency in implementing model quantization using major frameworks like TensorFlow, PyTorch, ONNX, and NVIDIA TensorRT.
Develop skills to effectively evaluate the performance and quality of quantized LLMs using standard metrics and real-world testing scenarios.
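
As a rough illustration of the "fundamental principles" item above, here is a minimal sketch of affine (asymmetric) 8-bit quantization in plain Python. It is not taken from the course; the function names `quantize_int8` and `dequantize_int8` and the sample weights are made up for the example, and real frameworks apply the same idea per tensor or per channel with optimized kernels.

```python
from typing import List, Tuple


def quantize_int8(values: List[float]) -> Tuple[List[int], float, int]:
    """Map floats to signed 8-bit integers with a scale and a zero point."""
    qmin, qmax = -128, 127
    # The represented range must contain 0.0 so that zero maps to an exact integer.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    # Fall back to a scale of 1.0 when every input is zero.
    scale = (hi - lo) / (qmax - qmin) or 1.0
    zero_point = round(qmin - lo / scale)
    quantized = [
        max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values
    ]
    return quantized, scale, zero_point


def dequantize_int8(quantized: List[int], scale: float, zero_point: int) -> List[float]:
    """Recover approximate floats from the 8-bit integers."""
    return [(q - zero_point) * scale for q in quantized]


# Hypothetical weights, chosen only to exercise the functions.
weights = [-1.2, -0.3, 0.0, 0.45, 2.1]
q, scale, zp = quantize_int8(weights)
restored = dequantize_int8(q, scale, zp)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, scale, zp, max_error)
```

The round trip loses at most about one quantization step (`scale`) per value, which is the precision given up in exchange for storing each weight in one byte instead of four.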
Requirements
Understanding of Python, Neural Networks, and Hugging Face Libraries is recommended for this course.
Description
Course Description:
This course offers a deep dive into the world of model quantization, specifically focusing on its application in Large Language Models (LLMs). It is tailored for students, professionals, and enthusiasts interested in machine learning, natural language processing, and the optimization of AI models for various platforms. The course covers fundamental concepts, practical methodologies, various frameworks, and real-world applications, providing a well-rounded understanding of model quantization in LLMs.

Course Objectives:
Understand the basic principles and necessity of model quantization in LLMs.
Explore different types and methods of model quantization, such as post-training quantization, quantization-aware training, and dynamic quantization.
Gain proficiency in using major frameworks like PyTorch, TensorFlow, ONNX, and NVIDIA TensorRT for model quantization.
Learn to evaluate the performance and quality of quantized models in real-world scenarios.
Master the deployment of quantized LLMs on both edge devices and cloud platforms.

Course Structure:
Lecture 1: Introduction to Model Quantization
  • Overview of model quantization
  • Significance in LLMs
  • Basic concepts and benefits
Lecture 2: Types and Methods of Model Quantization
  • Post-training quantization
  • Quantization-aware training
  • Dynamic quantization
  • Comparative analysis of each type
Lecture 3: Frameworks for Model Quantization
  • PyTorch's quantization tools
  • TensorFlow and TensorFlow Lite
  • ONNX quantization capabilities
  • NVIDIA TensorRT's role in quantization
Lecture 4: Evaluating Quantized Models
  • Performance metrics: accuracy, latency, and throughput
  • Quality metrics: perplexity, BLEU, ROUGE
  • Human evaluation and auto-evaluation techniques
Lecture 5: Deploying Quantized Models
  • Strategies for edge device deployment
  • Cloud platform deployment: OpenAI and Azure OpenAI
  • Trade-offs, benefits, and challenges in deployment

Target Audience:
AI and Machine Learning enthusiasts
Data Scientists and Engineers
Students in Computer Science and related fields
Professionals in AI and NLP industries
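
The description lists perplexity among the quality metrics for quantized models. A minimal sketch of how that metric is computed from per-token log-probabilities follows; the `perplexity` helper and the probability values are illustrative assumptions, not material from the course, and in practice the log-probabilities would come from running the original and the quantized model on the same evaluation text.

```python
import math
from typing import Sequence


def perplexity(token_log_probs: Sequence[float]) -> float:
    """Perplexity is exp of the negative mean natural-log probability per token."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


# Hypothetical probabilities each model assigned to the correct next token.
baseline_log_probs = [math.log(p) for p in (0.50, 0.25, 0.40, 0.10)]
quantized_log_probs = [math.log(p) for p in (0.45, 0.25, 0.35, 0.08)]

ppl_baseline = perplexity(baseline_log_probs)
ppl_quantized = perplexity(quantized_log_probs)
relative_increase = (ppl_quantized - ppl_baseline) / ppl_baseline
print(ppl_baseline, ppl_quantized, relative_increase)
```

Lower perplexity is better, so the relative increase after quantization gives one number to weigh against the gains in size and latency.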
Overview
Section 1: Introduction
Lecture 1 Introduction
Lecture 2 Types and Methods of Model Quantization
Lecture 3 Frameworks and Libraries That Can Be Used to Apply Model Quantization to LLMs
Lecture 4 Performance and Quality Evaluation of Quantized LLMs
Lecture 5 Deploying Quantized LLMs on Edge Devices and Cloud Platforms
Lecture 6 Summary
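
For the edge-deployment theme of Lecture 5, a back-of-the-envelope estimate of weight memory at different bit widths shows why quantization matters. This is a sketch under simplifying assumptions: the 7-billion-parameter count is hypothetical, and the estimate ignores quantization metadata (scales, zero points), activations, and the KV cache.

```python
def weight_memory_gib(num_params: int, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in GiB."""
    return num_params * bits_per_weight / 8 / 2**30


# Hypothetical model size, used only for illustration.
params = 7_000_000_000

for bits in (32, 16, 8, 4):
    print(f"{bits:>2}-bit weights: {weight_memory_gib(params, bits):.2f} GiB")
```

Halving the bit width halves the weight footprint, which is often the difference between a model fitting on a device or not.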
Who this course is for:
Anyone interested in learning about model quantization, its steps, and its process.

Download link

rapidgator.net:
https://rapidgator.net/file/410f264f19290502defd3dfabc567ab6/athqi.Llm.Model.Quantization.An.Overview.rar.html

uploadgig.com:
https://uploadgig.com/file/download/d30212F3c7e869ad/athqi.Llm.Model.Quantization.An.Overview.rar

ddownload.com:
https://ddownload.com/jgpim7mqsrw6/athqi.Llm.Model.Quantization.An.Overview.rar