模型开发接口
推理部署接口
API 相关说明
Apply fake quantization to bias, with the special scale from input tensor and weight tensor, the quantized type set to qint32 also.
bias (Tensor) – the bias tensor which need to be faked.
Tensor
inp (Tensor) – the input tensor which contain the quantization parameters.
w_qat (Tensor) – the weight tensor which contain the quantization parameters.
Warning
Only work for symmetric quantization method now.
previous
megengine.quantization.create_qparams
next
megengine.quantization.fake_quant_tensor