furiosa.quantizer.frontend.onnx.transformer.experimental package

Submodules

furiosa.quantizer.frontend.onnx.transformer.experimental.eliminate_detection_postprocess module

class furiosa.quantizer.frontend.onnx.transformer.experimental.eliminate_detection_postprocess.EliminateSSDDetectionPostprocess(ssd_outputs: List)

Bases: furiosa.quantizer.interfaces.transformer.Transformer

from: Softmax -> graph_output
to: graph_output

Assumes NCHW input.

get_postprocess_nodes(model, ssd_output_tensors)
transform(model: onnx.onnx_ml_pb2.ModelProto) -> onnx.onnx_ml_pb2.ModelProto
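
As a rough illustration of the idea suggested by get_postprocess_nodes and transform, the sketch below truncates a toy graph at a chosen tensor, so that everything downstream of it is dropped and the tensor itself becomes the graph output. It does not use the furiosa or onnx packages. The tuple-based graph, the tensor names, and the helpers downstream_nodes and truncate_at are all hypothetical, and the real transformer may behave differently.

```python
from collections import deque

# Toy graph, NOT an ONNX ModelProto: each node is (name, op_type, inputs, outputs).
NODES = [
    ("backbone", "Conv", ["image"], ["class_logits"]),
    ("softmax", "Softmax", ["class_logits"], ["scores"]),
    ("nms", "NonMaxSuppression", ["scores"], ["kept_indices"]),
    ("gather", "Gather", ["scores", "kept_indices"], ["detections"]),
]


def downstream_nodes(nodes, cut_tensors):
    """Return the names of all nodes that depend, directly or transitively, on cut_tensors."""
    consumers = {}
    for name, _op, inputs, _outputs in nodes:
        for tensor in inputs:
            consumers.setdefault(tensor, []).append(name)
    outputs_of = {name: outputs for name, _op, _inputs, outputs in nodes}

    found, queue = set(), deque(cut_tensors)
    while queue:
        tensor = queue.popleft()
        for name in consumers.get(tensor, []):
            if name not in found:
                found.add(name)
                queue.extend(outputs_of[name])
    return found


def truncate_at(nodes, cut_tensors):
    """Drop every node downstream of cut_tensors; the cut tensors become the new outputs."""
    removed = downstream_nodes(nodes, cut_tensors)
    kept = [node for node in nodes if node[0] not in removed]
    return kept, list(cut_tensors)


kept_nodes, new_outputs = truncate_at(NODES, ["class_logits"])
print([name for name, *_ in kept_nodes], new_outputs)
```

In this toy graph only the backbone node survives and class_logits becomes the sole output; the Softmax and the detection postprocessing after it are gone.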

furiosa.quantizer.frontend.onnx.transformer.experimental.fuse_div_for_bert module

class furiosa.quantizer.frontend.onnx.transformer.experimental.fuse_div_for_bert.FuseDivForBert

Bases: furiosa.quantizer.interfaces.transformer.Transformer

Only works for some BERT models.

transform(model: onnx.onnx_ml_pb2.ModelProto) -> onnx.onnx_ml_pb2.ModelProto
transform_matmul_add(model)
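
This page does not spell out what FuseDivForBert fuses. Judging only by the class name and the transform_matmul_add helper, one plausible reading is that a Div by a constant following MatMul + Add is folded into the weight and bias. The sketch below checks only the arithmetic identity behind such a fusion, (x·W + b) / c = x·(W/c) + (b/c), in pure Python. The function name, the values, and the assumed pattern are illustrative and are not taken from the library.

```python
import math


def matmul_add(x, w, b):
    """Compute x @ w + b for nested lists: x is [rows][in], w is [in][out], b is [out]."""
    return [
        [sum(row[k] * w[k][j] for k in range(len(w))) + b[j] for j in range(len(b))]
        for row in x
    ]


# Hypothetical values; c = 8.0 is e.g. sqrt(64) for an attention head size of 64.
x = [[0.5, -1.25], [2.0, 3.0]]
w = [[1.0, 2.0], [-0.5, 4.0]]
b = [0.25, -1.0]
c = 8.0

# Assumed original pattern: MatMul -> Add -> Div.
unfused = [[v / c for v in row] for row in matmul_add(x, w, b)]

# Fused pattern: the divisor is folded into the constants ahead of time,
# so no Div node is needed at run time.
w_scaled = [[v / c for v in row] for row in w]
b_scaled = [v / c for v in b]
fused = matmul_add(x, w_scaled, b_scaled)

# Both patterns produce the same values.
assert all(
    math.isclose(u, f, rel_tol=1e-9, abs_tol=1e-12)
    for u_row, f_row in zip(unfused, fused)
    for u, f in zip(u_row, f_row)
)
```

A rewrite of this kind is only valid when the divisor and the MatMul weight are constants known at transformation time, which may be one reason a transformer like this would apply to some models and not others.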

furiosa.quantizer.frontend.onnx.transformer.experimental.reify_conv_for_bert module

class furiosa.quantizer.frontend.onnx.transformer.experimental.reify_conv_for_bert.ReifyConvForBert

Bases: furiosa.quantizer.interfaces.transformer.Transformer

from: MatMul + Add
to: Conv

Assumes NCHW input.

transform(model: onnx.onnx_ml_pb2.ModelProto) -> onnx.onnx_ml_pb2.ModelProto
transform_matmul_add(model)
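
The MatMul + Add to Conv rewrite relies on the fact that a 1x1 convolution over an NCHW tensor computes the same values as a matrix multiplication plus bias applied at each spatial position. The sketch below demonstrates that equivalence in pure Python on a tiny example. The layout choice (features on C, rows on H, W = 1), the helper names, and the values are assumptions made for illustration; the layout the transformer actually uses is not stated on this page.

```python
def matmul_add(x, w, b):
    """Compute x @ w + b for nested lists: x is [rows][in], w is [in][out], b is [out]."""
    return [
        [sum(row[k] * w[k][j] for k in range(len(w))) + b[j] for j in range(len(b))]
        for row in x
    ]


def conv1x1_nchw(x, weight, bias):
    """Naive 1x1 convolution: x is [N][C][H][W], weight is [O][C][1][1], bias is [O]."""
    out = []
    for sample in x:
        channels, height, width = len(sample), len(sample[0]), len(sample[0][0])
        out.append([
            [
                [
                    sum(weight[o][c][0][0] * sample[c][h][v] for c in range(channels)) + bias[o]
                    for v in range(width)
                ]
                for h in range(height)
            ]
            for o in range(len(weight))
        ])
    return out


# Hypothetical data: 3 rows (e.g. tokens), 2 input features, 3 output features.
x = [[1, 2], [3, 4], [5, 6]]
w = [[1, 0, -1], [2, 1, 0]]
b = [10, 20, 30]
y = matmul_add(x, w, b)

# Re-express the same data for Conv: N=1, C=in_features, H=rows, W=1.
x_nchw = [[[[x[h][c]] for h in range(len(x))] for c in range(len(w))]]
# The Conv weight is the transposed MatMul weight with two trailing 1x1 kernel dims.
weight = [[[[w[c][o]]] for c in range(len(w))] for o in range(len(b))]
y_conv = conv1x1_nchw(x_nchw, weight, b)

# Every output element matches: y[h][o] == y_conv[0][o][h][0].
assert all(
    y[h][o] == y_conv[0][o][h][0]
    for h in range(len(x))
    for o in range(len(b))
)
```

Integer inputs are used so the comparison is exact; with floating-point weights the two forms agree up to rounding.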

Module contents