Pytorch風格遷移

阿新 • • 發佈：2022-02-16

最近研究了一下風格遷移，主要是想應用於某些主題節日時動態融合背景，生成一些抽象的藝術圖片，這裡給大家分享一個現成的程式碼，我本地把環境搭建好後跑了試試，有興趣的可以直接拿去執行：

  1 import torch
  2 import torch.nn as nn
  3 import torch.nn.functional as F
  4 import torch.optim as optim
  5 
  6 from PIL import Image
  7 import matplotlib.pyplot as plt
  8 
  9 import torchvision.transforms as transforms
 
 10 import torchvision.models as models
 11 import datetime
 12 
 13 device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
 14 
 15 
 16 num_steps = 10000
 17 save_path = "data/drew/img/end_%s.jpg" % datetime.datetime.now().strftime("%Y%m%d%H%M%S")
 18 content_img_path = "data/drew/img/dancing.jpg 
"
 19 style_img_path = "data/drew/img/picasso.jpg"
 20 
 21 
 22 def get_img_size(img_name):
 23     im = Image.open(img_name).convert('RGB')
 24     return im, im.height, im.width
 25 
 26 
 27 def image_loader(img, im_h, im_w):
 28     loader = transforms.Compose([transforms.Resize([im_h, im_w]), transforms.ToTensor()])
 
 29     im_l = loader(img).unsqueeze(0)
 30     return im_l.to(device, torch.float)
 31 
 32 
 33 c_image, c_im_h, c_im_w = get_img_size(content_img_path)
 34 s_image, s_im_h, s_im_w = get_img_size(style_img_path)
 35 content_img = image_loader(c_image, c_im_h, c_im_w)
 36 style_img = image_loader(s_image, c_im_h, c_im_w)
 37 
 38 
 39 assert style_img.size() == content_img.size(), "we need to import style and content images of the same size"
 40 unloader = transforms.ToPILImage()
 41 
 42 plt.ion()
 43 
 44 
 45 def imshow(tensor, title=None):
 46     image = tensor.cpu().clone()  # we clone the tensor to not do changes on it
 47     image = image.squeeze(0)      # remove the fake batch dimension
 48     image = unloader(image)
 49     plt.imshow(image)
 50     if title is not None:
 51         plt.title(title)
 52     plt.pause(0.001) # pause a bit so that plots are updated
 53 
 54 
 55 # plt.figure()
 56 # imshow(style_img, title='Style Image')
 57 #
 58 # plt.figure()
 59 # imshow(content_img, title='Content Image')
 60 
 61 
 62 class ContentLoss(nn.Module):
 63 
 64     def __init__(self, target,):
 65         super(ContentLoss, self).__init__()
 66         self.target = target.detach()
 67 
 68     def forward(self, input):
 69         self.loss = F.mse_loss(input, self.target)
 70         return input
 71 
 72 
 73 def gram_matrix(input):
 74     a, b, c, d = input.size()  # a=batch size(=1)
 75 
 76     features = input.view(a * b, c * d)  # resise F_XL into \hat F_XL
 77 
 78     G = torch.mm(features, features.t())  # compute the gram product
 79 
 80     return G.div(a * b * c * d)
 81 
 82 
 83 class StyleLoss(nn.Module):
 84 
 85     def __init__(self, target_feature):
 86         super(StyleLoss, self).__init__()
 87         self.target = gram_matrix(target_feature).detach()
 88 
 89     def forward(self, input):
 90         G = gram_matrix(input)
 91         self.loss = F.mse_loss(G, self.target)
 92         return input
 93 
 94 
 95 cnn = models.vgg19(pretrained=True).features.to(device).eval()
 96 
 97 
 98 cnn_normalization_mean = torch.tensor([0.485, 0.456, 0.406]).to(device)
 99 cnn_normalization_std = torch.tensor([0.229, 0.224, 0.225]).to(device)
100 
101 
102 class Normalization(nn.Module):
103     def __init__(self, mean, std):
104         super(Normalization, self).__init__()
105         self.mean = mean.clone().detach().view(-1, 1, 1)
106         self.std = std.clone().detach().view(-1, 1, 1)
107 
108     def forward(self, img):
109         # normalize img
110         return (img - self.mean) / self.std
111 
112 
113 content_layers_default = ['conv_4']
114 style_layers_default = ['conv_1', 'conv_2', 'conv_3', 'conv_4', 'conv_5']
115 
116 
117 def get_style_model_and_losses(cnn, normalization_mean, normalization_std, style_img, content_img,
118                                content_layers=content_layers_default, style_layers=style_layers_default):
119     normalization = Normalization(normalization_mean, normalization_std).to(device)
120 
121     content_losses = []
122     style_losses = []
123 
124     model = nn.Sequential(normalization)
125 
126     i = 0  # increment every time we see a conv
127     for layer in cnn.children():
128         if isinstance(layer, nn.Conv2d):
129             i += 1
130             name = 'conv_{}'.format(i)
131         elif isinstance(layer, nn.ReLU):
132             name = 'relu_{}'.format(i)
133             layer = nn.ReLU(inplace=False)
134         elif isinstance(layer, nn.MaxPool2d):
135             name = 'pool_{}'.format(i)
136         elif isinstance(layer, nn.BatchNorm2d):
137             name = 'bn_{}'.format(i)
138         else:
139             raise RuntimeError('Unrecognized layer: {}'.format(layer.__class__.__name__))
140 
141         model.add_module(name, layer)
142 
143         if name in content_layers:
144             # add content loss:
145             target = model(content_img).detach()
146             content_loss = ContentLoss(target)
147             model.add_module("content_loss_{}".format(i), content_loss)
148             content_losses.append(content_loss)
149 
150         if name in style_layers:
151             # add style loss:
152             target_feature = model(style_img).detach()
153             style_loss = StyleLoss(target_feature)
154             model.add_module("style_loss_{}".format(i), style_loss)
155             style_losses.append(style_loss)
156 
157     # now we trim off the layers after the last content and style losses
158     for i in range(len(model) - 1, -1, -1):
159         if isinstance(model[i], ContentLoss) or isinstance(model[i], StyleLoss):
160             break
161 
162     model = model[:(i + 1)]
163 
164     return model, style_losses, content_losses
165 
166 
167 input_img = content_img.clone()
168 
169 # plt.figure()
170 # imshow(input_img, title='Input Image')
171 
172 
173 def get_input_optimizer(input_img):
174     optimizer = optim.LBFGS([input_img])
175     return optimizer
176 
177 
178 def run_style_transfer(cnn, normalization_mean, normalization_std,
179                        content_img, style_img, input_img, num_steps=num_steps,
180                        style_weight=1000000, content_weight=1):
181     """Run the style transfer."""
182     print('Building the style transfer model..')
183     model, style_losses, content_losses = get_style_model_and_losses(cnn,
184         normalization_mean, normalization_std, style_img, content_img)
185 
186     # We want to optimize the input and not the model parameters so we
187     # update all the requires_grad fields accordingly
188     input_img.requires_grad_(True)
189     model.requires_grad_(False)
190 
191     optimizer = get_input_optimizer(input_img)
192 
193     print('Optimizing..')
194     run = [0]
195     while run[0] <= num_steps:
196 
197         def closure():
198             # correct the values of updated input image
199             with torch.no_grad():
200                 input_img.clamp_(0, 1)
201 
202             optimizer.zero_grad()
203             model(input_img)
204             style_score = 0
205             content_score = 0
206 
207             for sl in style_losses:
208                 style_score += sl.loss
209             for cl in content_losses:
210                 content_score += cl.loss
211 
212             style_score *= style_weight
213             content_score *= content_weight
214 
215             loss = style_score + content_score
216             loss.backward()
217 
218             run[0] += 1
219             if run[0] % 50 == 0:
220                 print("run {}:".format(run))
221                 print('Style Loss : {:4f} Content Loss: {:4f}'.format(
222                     style_score.item(), content_score.item()))
223                 print()
224 
225             return style_score + content_score
226 
227         optimizer.step(closure)
228 
229     # a last correction...
230     with torch.no_grad():
231         input_img.clamp_(0, 1)
232 
233     return input_img
234 
235 
236 begin_time = datetime.datetime.now()
237 print("******************開始時間*****************", begin_time)
238 output = run_style_transfer(cnn, cnn_normalization_mean, cnn_normalization_std,
239                             content_img, style_img, input_img)
240 try:
241     plt.figure()
242     imshow(output, title='Output Image')
243 
244     # sphinx_gallery_thumbnail_number = 4
245     plt.ioff()
246     plt.savefig(save_path)
247 except Exception as e:
248     print(e)
249 print("******************結束時間*****************", datetime.datetime.now())
250 print("******************耗時*****************", datetime.datetime.now()-begin_time)
251 # plt.show()

dancing.jpg

picasso.jpg

我這遷移後的影象，還是不錯的。

風格：

內容：

遷移融合後：

有興趣的可以去研究一下原文：

原文地址：

https://pytorch.org/tutorials/advanced/neural_style_tutorial.html

原GitHub程式碼地址：

https://github.com/pytorch/tutorials/blob/master/advanced_source/neural_style_tutorial.py

需要準備：

有顯示卡並且支援pytorch訓練的伺服器，只是cpu的話就算了，GPU伺服器跑幾分鐘，cpu伺服器跑跑一小時，cpu還100%！

Pytorch風格遷移

《深度學習框架PyTorch入門與實踐》示例——AI藝術家：神經網路風格遷移

這是我在學習《深度學習框架PyTorch入門與實踐》第九章的筆記。原書實現了Fast Neural Style，實現將輸入圖片轉換為對應圖片風格的型別。

基於神經網路的風格遷移目標損失解析

今天我想談談神經型別的轉移和卷積神經網路。已有相當多的文章和教程可供使用。有時內容只是複製，有些則提供了一種新穎的實現。它們的共同之處在於對細節的快速鑽研。在我看來太具體了。不僅如此，通常還

風格遷移程式碼復現

原始論文 Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes 專案地址 tensorflow版本

風格遷移訓練實踐

前一篇文章分享了Pytorch簡單風格遷移的程式碼，本著不跑掛伺服器不死心的態度，不停的增加計算步驟，看看圖片融合生成的效果，

風格遷移網路（vgg19提取特徵，gram矩陣提取風格特徵）

from __future__ import division from torchvision import models from torchvision import transforms from PIL import Image

把vgg-face.mat權重遷移到pytorch模型示例

最近使用pytorch時，需要用到一個預訓練好的人臉識別模型提取人臉ID特徵，想到很多人都在用用vgg-face，但是vgg-face沒有pytorch的模型，於是寫個vgg-face.mat轉到pytorch模型的程式碼

Pytorch-影象分類和CNN模型的遷移學習

導包： 1 import torch 2 import torch.nn as nn 3 import torch.nn.functional as F 4 import torch.optim as optim

基於遷移學習的 PyTorch 狗狗分類器

技術標籤：遷移學習機器學習、深度學習python演算法pythontensorflow機器學習人工智慧

【pytorch->mindspore】1.自定義運算元遷移

要遷移的專案為影象壓縮演算法https://github.com/ywz978020607/HESIC 1.自定義運算元遷移--LowerBoundFunction類

【譯】遷移至微服務——比你想象的更加簡單

原文連結遷移到微服務的路線圖遷移到微服務聽起來似乎是一項巨大而複雜的任務。雖然過程可能稍顯複雜，但實際上比你想象的更為簡單。這篇部落格以一個標準的J2EE應用程式為例建立了一個基礎遷移路線圖，從一體化架

restful風格詳解

一.概念 RESTful架構，就是目前最流行的一種網際網路軟體架構。它結構清晰、符合標準、易於理解、擴充套件方便，所以正得到越來越多網站的採用。

一篇文章告訴你什麼是架構模式和架構風格

本文探討如下幾個問題：架構模式和架構風格有區別嗎？什麼是架構模式？什麼是架構風格？

iOS FMDB遷移到WCDB

移動端的資料庫，除了使用\"SQLite\"這個共識，基本各自為政。 iOS這邊之前使用的是基於SQLite封裝的FMDB。一開始使用並無問題。但在長期的使用中反映出，有效能瓶頸，比如說某個使用者長期未登入，在登入時收到大量

阿里雲開源 image-syncer 工具，容器映象遷移同步的終極利器

為什麼要做這個工具？由於阿里雲上的容器服務 ACK 在使用成本、運維成本、方便性、長期穩定性上大大超過公司自建自維護 Kubernets 叢集，有不少公司紛紛想把之前自己維護 Kubernetes 負載遷移到阿里雲 ACK 服務上。

web專案遷移到k8s

這篇文章應該會很長。前言小說是我以前寫的一個專案，主要是為了練手vue，後端用lumen（php框架）寫的。附上碼雲的連結 vue-novel，前後端程式碼都很簡單。最近把它遷移到了k8s，配合gitlab-runner和rancher-ui實

面試官：兩個Redis叢集如何平滑資料遷移

專案推薦:Spring Cloud 、Spring Security OAuth2的RBAC許可權管理系統歡迎關注問題由於生產環境的各種原因，我們需要對現有伺服器進行遷移，包括線上正在執行的 redis 叢集環境如何去做?

擼一個Java腳手架，一統團隊專案結構風格

雖然maven已經提供了maven-archetype-webapp、maven-archetype-quickstart等專案骨架幫助我們快速構建專案架構，但是預設提供的archetype初始化的專案架構並不能滿足開發需求，這時候就有必要自己寫一個滿足專案需求

使用指令碼對資料庫進行遷移

為什麼要進行資料遷移與修復在日常工作開發中，隨著我們產品不斷迭代發展，我們希望在重構功能的同時，還需要保證在版本迭代之前操作資料保留並且變更得能夠適應新的功能結構，這個時候往往會存在資料表的大量修改與

Java幾種常用的斷言風格你怎麼選

日常工作中，不管你是寫Unit Test，還是採用TDD的程式設計方式進行開發，都會遇到斷言。而斷言的風格常見的會有Assert、BDD風格，對於這些常見的斷言風格你怎麼選擇呢？

Pytorch風格遷移

相關推薦