pytorch模型轉trt部署
阿新 • • 發佈:2022-05-06
pytorch 轉onnx
首先載入pytorch模型
# load model
import torch

def load_model(ckpt):
    """Build the model and restore its weights from a checkpoint.

    Args:
        ckpt: Path to a checkpoint produced by torch.save; expected to
            contain a "model_state" entry holding the state dict.

    Returns:
        The model with the checkpoint weights loaded.
    """
    # build model — build_model() is project-specific; depends on your own
    # model build function.
    model = build_model()
    # load ckpt; map_location='cpu' lets this work on machines without a GPU.
    checkpoint = torch.load(ckpt, map_location=torch.device('cpu'))
    model.load_state_dict(checkpoint["model_state"])
    return model
使用torch.onnx將pytorch 模型轉為onnx
def export_onnx(model, onnx_name, batch_size, height=224, width=224):
    """Export a PyTorch model to an ONNX file with a fixed input shape.

    Args:
        model: Model to export (must be on CUDA, matching the dummy input).
        onnx_name: Output path for the .onnx file.
        batch_size: Batch dimension baked into the exported graph.
        height: Input image height. The original code read free globals
            ``height``/``width`` (NameError unless defined elsewhere); they
            are now explicit parameters with placeholder defaults —
            adjust to your model's expected input size.
        width: Input image width (see ``height``).
    """
    # Dummy input: tracing fixes the graph's input shape to this tensor.
    img = torch.randn((batch_size, 3, height, width)).cuda()
    torch.onnx.export(
        model,
        img,
        onnx_name,
        export_params=True,       # store the trained weights in the file
        opset_version=11,
        input_names=["input"],
        output_names=["output"],
        do_constant_folding=True,
        verbose=True,
    )
onnx 轉 trt
首先要安裝 TensorRT,安裝教程可以參考 NVIDIA 官方安裝文件。安裝之後可以選擇以下兩種方式進行轉換:1. 使用 trtexec 命令;2. 使用 Python 指令碼轉換。
- trtexec命令
trtexec --onnx=path/to/onnx --saveEngine=path/to/save/trt --explicitBatch --fp16 --workspace=15000
如果提示trtexec command not found, 找到你的tensorrt安裝目錄,例如/usr/local/tensorrt, 將上述中的trtexec替換為/usr/local/tensorrt/bin/trtexec,如果嫌麻煩的話可以在~/.bashrc
中新增下邊一句
alias trtexec="/usr/local/tensorrt/bin/trtexec"
儲存退出然後source ~/.bashrc就可以使用trtexec命令了
- python指令碼
# Logger shared by every TensorRT builder/parser/runtime object below.
TRT_LOGGER = trt.Logger(trt.Logger.INFO)
# Bitmask for create_network() selecting explicit-batch mode.
EXPLICIT_BATCH = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
def get_engine(onnx_file_path, engine_file_path, using_half):
    """Attempts to load a serialized engine if available, otherwise builds a new TensorRT engine and saves it.

    Args:
        onnx_file_path: Path to the source ONNX model.
        engine_file_path: Path to read/write the serialized .trt engine.
        using_half: If True, enable FP16 precision for the build.

    Returns:
        A deserialized or freshly built trt.ICudaEngine.

    Raises:
        FileNotFoundError: If the ONNX file does not exist.
        RuntimeError: If the ONNX file fails to parse.
        AssertionError: If TensorRT fails to build the engine.
    """
    def build_engine():
        """Takes an ONNX file and creates a TensorRT engine to run inference with"""
        device = torch.device('cuda:{}'.format(0))
        with trt.Builder(TRT_LOGGER) as builder, \
                builder.create_network(EXPLICIT_BATCH) as network, \
                trt.OnnxParser(network, TRT_LOGGER) as parser:
            config = builder.create_builder_config()
            # 1 GiB of workspace for tactic selection during the build.
            config.max_workspace_size = 1 << 30
            if using_half:
                config.set_flag(trt.BuilderFlag.FP16)
            # Parse model file
            if not os.path.exists(onnx_file_path):
                # BUG FIX: the original called exit(0), which reports
                # success to the shell; a missing input is an error.
                raise FileNotFoundError(
                    'ONNX file {} not found, please first to generate it.'.format(onnx_file_path))
            with open(onnx_file_path, 'rb') as model:
                print('Beginning ONNX file parsing')
                # BUG FIX: parse() returns False on failure; the original
                # ignored it and went on to build from a half-parsed
                # network. Surface the parser errors and abort instead.
                if not parser.parse(model.read()):
                    for i in range(parser.num_errors):
                        print(parser.get_error(i))
                    raise RuntimeError(
                        'Failed to parse ONNX file {}'.format(onnx_file_path))
            with torch.cuda.device(device):
                engine = builder.build_engine(network, config)
                assert engine is not None, 'Failed to create TensorRT engine'
                # Serialize to disk so subsequent runs can skip the build.
                with open(engine_file_path, "wb") as f:
                    f.write(engine.serialize())
                return engine

    if os.path.exists(engine_file_path):
        # If a serialized engine exists, use it instead of building an engine.
        print("Reading engine from file {}".format(engine_file_path))
        with open(engine_file_path, "rb") as f, trt.Runtime(TRT_LOGGER) as runtime:
            return runtime.deserialize_cuda_engine(f.read())
    else:
        return build_engine()
if __name__ == '__main__':
    # only works for TRT. perf reported by torch is working on non-batched data.
    trt_batch = 1
    fp16 = True
    net_name = 'your_model_name'
    ckpt_path = 'path/to/pth'
    onnx_file = '{name}.onnx'.format(name=net_name)
    # Inference-only workflow: disable autograd for the whole pipeline.
    with torch.no_grad():
        net = load_model(ckpt_path)
        export_onnx(net, onnx_file, trt_batch)
        engine = get_engine(
            onnx_file,
            '{name}.trt'.format(name=net_name),
            fp16,
        )
加速前處理一張圖片大約需要 50ms,加速後的推理速度為 10ms。