  1. Quantization (signal processing) - Wikipedia

    In mathematics and digital signal processing, quantization is the process of mapping input values from a large set (often a continuous set) to output values in a (countable) smaller set, often …

  2. What is model quantization? Why is model quantization needed? Read this …

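    1. What exactly is quantization (model quantization)? Model quantization is a parameter compression and acceleration technique. Its core idea is to convert a model's high-precision floating-point numbers (such as 32-bit floating point, FP32) into low-precision integers (such as 8-bit integers, INT8, or 4-bit …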

  3. What is quantization? - IBM

    Quantization is the process of reducing the precision of a digital signal, typically from a higher-precision format to a lower-precision format. This technique is widely used in various fields, …

  4. Model Quantization: Concepts, Methods, and Why It Matters

    Nov 24, 2025 · Quantization has emerged as a crucial technique to address this challenge, enabling resource-intensive models to run on constrained hardware. The NVIDIA TensorRT …

  5. A panoramic guide to large-model quantization techniques - CSDN Blog

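    9 hours ago · In the era of generative AI, large language models (LLMs) routinely have tens or even hundreds of billions of parameters, which places enormous strain on GPU memory and compute bandwidth. Quantization, a core method for compressing large models, converts high-precision weights and activations …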

  6. [Explainer] What exactly is "quantization" in artificial intelligence? | FisherAI

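    Feb 6, 2025 · Quantization emerged against this background and has played a major role in practical applications. This article takes an in-depth look at quantization in AI, covering its principles, its advantages, and the challenges it faces.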

  7. What is quantization in machine learning? - Cloudflare

    What is quantization in machine learning? Quantization is a technique for lightening the load of executing machine learning and artificial intelligence (AI) models. It aims to reduce the …

  8. What is Quantization - GeeksforGeeks

    Nov 6, 2025 · Quantization is a model optimization technique that reduces the precision of numerical values such as weights and activations in models to make them faster and more …

  9. What Is Quantization? | How It Works & Applications

    Quantization is the process of mapping continuous infinite values to a smaller set of discrete finite values. In the context of simulation and embedded computing, it is about approximating real …

  10. Quantization concepts - Hugging Face

    There are two main types of quantization techniques. Post-Training Quantization (PTQ): Quantization is applied after the model is fully trained. Quantization-Aware Training (QAT): …
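
Several of the snippets above describe quantization as mapping high-precision floating-point values (such as FP32) onto low-precision integers (such as INT8). A minimal sketch of that idea follows, assuming a simple asymmetric (affine) scheme with one scale and zero point per list. The function names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical, chosen for illustration, and do not correspond to any library's API.

```python
# Illustrative sketch only: asymmetric (affine) quantization of floats to INT8.
# Function names and sample values are assumptions made for this example.

def quantize_int8(values):
    """Map floats to int8 codes; return the codes plus the scale and zero point."""
    qmin, qmax = -128, 127
    # Widen the range to include 0.0 so that zero is exactly representable.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    # Real-valued width of one integer step; fall back to 1.0 for an all-zero input.
    scale = (hi - lo) / (qmax - qmin) or 1.0
    # Integer code that represents the real value 0.0.
    zero_point = round(qmin - lo / scale)
    # Round to the nearest step, shift by the zero point, clamp to the int8 range.
    codes = [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]
    return codes, scale, zero_point


def dequantize_int8(codes, scale, zero_point):
    """Recover approximate floats from int8 codes."""
    return [(c - zero_point) * scale for c in codes]


weights = [-1.0, -0.25, 0.0, 0.5, 1.0]
codes, scale, zero_point = quantize_int8(weights)
restored = dequantize_int8(codes, scale, zero_point)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

In this sketch each restored value differs from the original by at most about one quantization step (`scale`), which is the precision given up in exchange for storing one byte per value instead of four. Applying such a mapping to an already trained model corresponds to the post-training setting mentioned in the last snippet; production toolchains typically add calibration data and per-channel scales, which are not shown here.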