PyTorch: Autoencoders and Variational Autoencoders for Lossy Image Compression

阿新 • Published: 2020-09-08

import torch
from torch import nn, optim
from torch.utils.data import DataLoader
from torchvision import transforms, datasets

import  visdom

1. Auto-Encoder

class AE(nn.Module):

    def __init__(self):
        super(AE, self).__init__()

        # [b, 784] => [b, 20]
        self.encoder = nn.Sequential(
            nn.Linear(784, 256),
            nn.ReLU(),
            nn.Linear(256, 64),
            nn.ReLU(),
            nn.Linear(64, 20),
            nn.ReLU()
        )
        # [b, 20] => [b, 784]
        self.decoder = nn.Sequential(
            nn.Linear(20, 64),
            nn.ReLU(),
            nn.Linear(64, 256),
            nn.ReLU(),
            nn.Linear(256, 784),
            nn.Sigmoid()
        )

    def forward(self, x):                 #x.shape=[b, 1, 28, 28]

        batchsz = x.size(0)
        x = x.view(batchsz, 784)          #flatten
        x = self.encoder(x)               #encoder [b, 20]
        x = self.decoder(x)               #decoder [b, 784]
        x = x.view(batchsz, 1, 28, 28)    #reshape [b, 1, 28, 28]

        return x, None
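
To see why this counts as lossy compression, it helps to look at the size of the bottleneck. The snippet below is only a sanity-check sketch: it rebuilds the same layer sizes as the AE above so that it runs on its own, and it feeds in random tensors instead of real MNIST images (an assumption made purely for illustration).

```python
import torch
from torch import nn

# Same layer sizes as the AE above, rebuilt here so the snippet runs on its own.
encoder = nn.Sequential(
    nn.Linear(784, 256), nn.ReLU(),
    nn.Linear(256, 64), nn.ReLU(),
    nn.Linear(64, 20), nn.ReLU(),
)
decoder = nn.Sequential(
    nn.Linear(20, 64), nn.ReLU(),
    nn.Linear(64, 256), nn.ReLU(),
    nn.Linear(256, 784), nn.Sigmoid(),
)

x = torch.rand(32, 1, 28, 28)            # random stand-in for a batch of MNIST images
code = encoder(x.view(x.size(0), 784))   # compressed representation, [32, 20]
x_hat = decoder(code).view_as(x)         # lossy reconstruction, [32, 1, 28, 28]

ratio = x[0].numel() / code[0].numel()   # 784 / 20 = 39.2 input values per stored value
print(code.shape, x_hat.shape, ratio)
```

Each image is squeezed from 784 values down to 20 before being reconstructed, so whatever the decoder cannot recover from those 20 numbers is lost.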

2. Variational Auto-Encoder

The h in the code and the c_i in the figure are computed slightly differently: the code does not apply an exponential.
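
The difference can be sketched roughly as follows. This assumes the figure uses the common form c_i = exp(sigma_i) * e_i + m_i, where the encoder output is exponentiated before scaling the noise; the function names `reparam_direct` and `reparam_exp` are made up for this illustration and do not appear in the original code.

```python
import torch

def reparam_direct(mu, sigma, eps):
    # Form used in the VAE code of this post: sigma is taken as-is.
    return mu + sigma * eps

def reparam_exp(mu, log_sigma, eps):
    # Assumed form of the figure: the encoder output is exponentiated,
    # so the effective scale exp(log_sigma) is always positive.
    return mu + torch.exp(log_sigma) * eps

torch.manual_seed(0)
mu = torch.zeros(4, 10)
raw = torch.randn(4, 10)        # stand-in for the second half of the encoder output
eps = torch.randn_like(raw)     # epsilon ~ N(0, 1)

h_direct = reparam_direct(mu, raw, eps)
h_exp = reparam_exp(mu, raw, eps)

# Both forms agree when the direct sigma equals exp(log_sigma).
assert torch.allclose(reparam_direct(mu, torch.exp(raw), eps), h_exp)
print(h_direct.shape, h_exp.shape)
```

With the exponential form the scale can never be negative, whatever the encoder outputs. The code in this post skips the exponential; its encoder ends in a ReLU, so sigma there is non-negative but can be exactly zero.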

KL divergence formula (the torch.randn_like(sigma) that is multiplied by sigma in the code is drawn from a standard normal distribution):
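
Written out, the closed form for the KL divergence between a diagonal Gaussian N(mu, sigma^2) and the standard normal N(0, 1) is shown below. It matches the kld line in the VAE code term by term, apart from two implementation details: the code adds 1e-8 inside the logarithm for numerical stability and divides the sum by batchsz*28*28.

```latex
\mathrm{KL}\big(\mathcal{N}(\mu, \sigma^2) \,\|\, \mathcal{N}(0, 1)\big)
  = \frac{1}{2} \sum_{i} \left( \mu_i^2 + \sigma_i^2 - \log \sigma_i^2 - 1 \right)
```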

class VAE(nn.Module):

    def __init__(self):
        super(VAE, self).__init__()

        # [b, 784] => [b, 20]
        self.encoder = nn.Sequential(
            nn.Linear(784, 256),
            nn.ReLU(),
            nn.Linear(256, 64),
            nn.ReLU(),
            nn.Linear(64, 20),
            nn.ReLU()
        )
        # [b, 10] => [b, 784]
        self.decoder = nn.Sequential(
            nn.Linear(10, 64),
            nn.ReLU(),
            nn.Linear(64, 256),
            nn.ReLU(),
            nn.Linear(256, 784),
            nn.Sigmoid()
        )

        self.criteon = nn.MSELoss()

    def forward(self, x):              #x.shape=[b, 1, 28, 28]

        batchsz = x.size(0)
        x = x.view(batchsz, 784)                 #flatten

        h_ = self.encoder(x)                     #encoder  [b, 20], including mean and sigma
        mu, sigma = h_.chunk(2, dim=1)           #[b, 20] => mu[b, 10] and sigma[b, 10]
        h = mu + sigma * torch.randn_like(sigma) #reparameterization trick, epsilon ~ N(0, 1)
        x_hat = self.decoder(h)                  #decoder  [b, 784]
        x_hat = x_hat.view(batchsz, 1, 28, 28)   #reshape  [b, 1, 28, 28]

        kld = 0.5 * torch.sum(mu**2 + sigma**2 - torch.log(1e-8 + sigma**2) - 1) / (batchsz*28*28)   #KL divergence

        return x_hat, kld

3. Running both encoders on the MNIST dataset

def main():
    mnist_train = datasets.MNIST('mnist', train=True, transform=transforms.Compose([transforms.ToTensor()]), download=True)
    mnist_train = DataLoader(mnist_train, batch_size=32, shuffle=True)

    mnist_test = datasets.MNIST('mnist', train=False, transform=transforms.Compose([transforms.ToTensor()]), download=True)
    mnist_test = DataLoader(mnist_test, batch_size=32, shuffle=True)

    x, _ = next(iter(mnist_train))     #x: torch.Size([32, 1, 28, 28]) _: torch.Size([32])

    model = AE()
    # model = VAE()

    criteon = nn.MSELoss()             #mean squared error loss
    optimizer = optim.Adam(model.parameters(), lr=1e-3)
    print(model)

    viz = visdom.Visdom()

    for epoch in range(20):

        for batchidx, (x, _) in enumerate(mnist_train):

            x_hat, kld = model(x)
            loss = criteon(x_hat, x)        #x_hat and x both have shape [b, 1, 28, 28]

            if kld is not None:
                elbo = - loss - 1.0 * kld   #elbo is the evidence lower bound
                loss = - elbo

            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

        print(epoch, 'loss:', loss.item())
        # print(epoch, 'loss:', loss.item(), 'kld:', kld.item())

        x, _ = next(iter(mnist_test))

        with torch.no_grad():
            x_hat, kld = model(x)
        viz.images(x, nrow=8, win='x', opts=dict(title='x'))
        viz.images(x_hat, nrow=8, win='x_hat', opts=dict(title='x_hat'))


if __name__ == '__main__':
    main()

Start the Visdom server: python -m visdom.server

Then open: http://localhost:8097

Results when using AE:

Results when using VAE:
