
A Summary of Building Neural Networks with NumPy, TensorFlow, and PyTorch

Why use Tensors, and the difference between static and dynamic graphs

Tensors behave like NumPy arrays, but they can run on a GPU and, with requires_grad=True, have their gradients computed automatically by autograd. TensorFlow 1.x defines a static computation graph once and then runs it repeatedly inside a Session, whereas PyTorch builds a dynamic graph on every forward pass, so ordinary Python control flow can be used inside the model.

Implement the network using NumPy and PyTorch

import numpy as np
import torch

dtype = torch.float
device = torch.device("cpu")
# device = torch.device("cuda:0")

# N: batch size
# D_in: input dimension
# H: hidden dimension
# D_out: output dimension
N, D_in, H, D_out = 64, 1000, 100, 10

# input and output data
# x = np.random.randn(N, D_in)
# y = np.random.randn(N, D_out)
# tensor input and output
x = torch.randn(N, D_in, device=device, dtype=dtype)
y = torch.randn(N, D_out, device=device, dtype=dtype)

# Randomly initialize weights
# w1 = np.random.randn(D_in, H)
# w2 = np.random.randn(H, D_out)
# tensor
# w1 = torch.randn(D_in, H, device=device, dtype=dtype)
# w2 = torch.randn(H, D_out, device=device, dtype=dtype)
# tensor and autograd
w1 = torch.randn(D_in, H, device=device, dtype=dtype, requires_grad=True)
w2 = torch.randn(H, D_out, device=device, dtype=dtype, requires_grad=True)

learning_rate = 1e-6
for t in range(3):
    # Forward pass
    # h = x.dot(w1)
    # mm(): matrix multiplication
    # h = x.mm(w1)
    # h_relu = np.maximum(h, 0)
    # clamp(input, min, max): clamps input to [min, max]
    # h_relu = h.clamp(min=0)
    # y_pred = h_relu.dot(w2)
    # y_pred = h_relu.mm(w2)
    y_pred = x.mm(w1).clamp(min=0).mm(w2)

    # loss = np.square(y_pred - y).sum()
    loss = (y_pred - y).pow(2).sum()
    # print(t, loss)
    print(t, loss.item())

    # Backprop (manual NumPy / Tensor versions, kept for reference)
    # grad_y_pred = 2.0 * (y_pred - y)
    # grad_w2 = h_relu.T.dot(grad_y_pred)
    # grad_w2 = h_relu.t().mm(grad_y_pred)
    # grad_h_relu = grad_y_pred.dot(w2.T)
    # grad_h_relu = grad_y_pred.mm(w2.t())
    # grad_h = grad_h_relu.copy()
    # grad_h = grad_h_relu.clone()
    # grad_h[h < 0] = 0
    # grad_w1 = x.T.dot(grad_h)
    # grad_w1 = x.t().mm(grad_h)
    # w1 -= learning_rate * grad_w1
    # w2 -= learning_rate * grad_w2

    # Autograd backward pass and weight update
    loss.backward()
    with torch.no_grad():
        w1 -= learning_rate * w1.grad
        w2 -= learning_rate * w2.grad
        w1.grad.zero_()
        w2.grad.zero_()
0 24548232.0
1 19390818.0
2 19421688.0
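
For reference, the pure NumPy version that the commented-out lines above correspond to can be assembled as follows. This is a minimal sketch; the manual backprop mirrors the commented gradient lines in the PyTorch code above.

import numpy as np

N, D_in, H, D_out = 64, 1000, 100, 10

x = np.random.randn(N, D_in)
y = np.random.randn(N, D_out)

w1 = np.random.randn(D_in, H)
w2 = np.random.randn(H, D_out)

learning_rate = 1e-6
for t in range(3):
    # Forward pass
    h = x.dot(w1)
    h_relu = np.maximum(h, 0)
    y_pred = h_relu.dot(w2)

    loss = np.square(y_pred - y).sum()
    print(t, loss)

    # Manual backprop through the two-layer net
    grad_y_pred = 2.0 * (y_pred - y)
    grad_w2 = h_relu.T.dot(grad_y_pred)
    grad_h_relu = grad_y_pred.dot(w2.T)
    grad_h = grad_h_relu.copy()
    grad_h[h < 0] = 0
    grad_w1 = x.T.dot(grad_h)

    # Gradient descent update
    w1 -= learning_rate * grad_w1
    w2 -= learning_rate * grad_w2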

PyTorch: Defining new autograd functions

import torch


class MyReLU(torch.autograd.Function):
    """
    We can implement our own custom autograd Functions by subclassing
    torch.autograd.Function and implementing the forward and backward passes
    which operate on Tensors.
    """

    @staticmethod
    def forward(ctx, input):
        """
        In the forward pass we receive a Tensor containing the input and return
        a Tensor containing the output. ctx is a context object that can be used
        to stash information for backward computation. You can cache arbitrary
        objects for use in the backward pass using the ctx.save_for_backward method.
        """
        ctx.save_for_backward(input)
        return input.clamp(min=0)

    @staticmethod
    def backward(ctx, grad_output):
        """
        In the backward pass we receive a Tensor containing the gradient of the loss
        with respect to the output, and we need to compute the gradient of the loss
        with respect to the input.
        """
        input, = ctx.saved_tensors
        grad_input = grad_output.clone()
        grad_input[input < 0] = 0
        return grad_input


dtype = torch.float
device = torch.device("cpu")
# device = torch.device("cuda:0") # Uncomment this to run on GPU

# N is batch size; D_in is input dimension;
# H is hidden dimension; D_out is output dimension.
N, D_in, H, D_out = 64, 1000, 100, 10

# Create random Tensors to hold input and outputs.
x = torch.randn(N, D_in, device=device, dtype=dtype)
y = torch.randn(N, D_out, device=device, dtype=dtype)

# Create random Tensors for weights.
w1 = torch.randn(D_in, H, device=device, dtype=dtype, requires_grad=True)
w2 = torch.randn(H, D_out, device=device, dtype=dtype, requires_grad=True)

learning_rate = 1e-6
for t in range(3):
    # To apply our Function, we use Function.apply method. We alias this as 'relu'.
    relu = MyReLU.apply

    # Forward pass: compute predicted y using operations; we compute
    # ReLU using our custom autograd operation.
    y_pred = relu(x.mm(w1)).mm(w2)

    # Compute and print loss
    loss = (y_pred - y).pow(2).sum()
    print(t, loss.item())

    # Use autograd to compute the backward pass.
    loss.backward()

    # Update weights using gradient descent
    with torch.no_grad():
        w1 -= learning_rate * w1.grad
        w2 -= learning_rate * w2.grad

        # Manually zero the gradients after updating weights
        w1.grad.zero_()
        w2.grad.zero_()
0 24658138.0
1 19849594.0
2 19743964.0
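
The correctness of a custom Function's backward can be checked numerically with torch.autograd.gradcheck, which compares the analytical gradient against finite differences. A minimal sketch, assuming MyReLU from above is in scope (gradcheck expects double-precision inputs with requires_grad=True):

import torch

x_check = torch.randn(8, 5, dtype=torch.double, requires_grad=True)
# Returns True if the analytical and numerical gradients agree within tolerance
print(torch.autograd.gradcheck(MyReLU.apply, (x_check,), eps=1e-6, atol=1e-4))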

Use TensorFlow to fit a simple two-layer net

import tensorflow as tf
import numpy as np

N, D_in, H, D_out = 64, 1000, 100, 10

x = tf.placeholder(tf.float32, shape=(None, D_in))
y = tf.placeholder(tf.float32, shape=(None, D_out))

w1 = tf.Variable(tf.random_normal((D_in, H)))
w2 = tf.Variable(tf.random_normal((H, D_out)))

h = tf.matmul(x, w1)
h_relu = tf.maximum(h, tf.zeros(1))
y_pred = tf.matmul(h_relu, w2)

loss = tf.reduce_sum((y - y_pred)**2.0)

grad_w1, grad_w2 = tf.gradients(loss, [w1, w2])

learning_rate = 1e-6
new_w1 = w1.assign(w1 - learning_rate * grad_w1)
new_w2 = w2.assign(w2 - learning_rate * grad_w2)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    x_value = np.random.randn(N, D_in)
    y_value = np.random.randn(N, D_out)
    for _ in range(3):
        loss_value, _, _ = sess.run([loss, new_w1, new_w2], feed_dict={x:x_value, y:y_value})
        print(loss_value)
29360906.0
24105692.0
22037684.0
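
The TensorFlow 1.x code above builds a static graph (placeholders, explicit tf.gradients, a Session) that is defined once and then executed repeatedly. In TensorFlow 2.x the same network can be written eagerly with tf.GradientTape, which is closer in spirit to the PyTorch dynamic-graph examples. A minimal sketch, assuming TensorFlow 2.x:

import numpy as np
import tensorflow as tf

N, D_in, H, D_out = 64, 1000, 100, 10

x = tf.constant(np.random.randn(N, D_in), dtype=tf.float32)
y = tf.constant(np.random.randn(N, D_out), dtype=tf.float32)

w1 = tf.Variable(tf.random.normal((D_in, H)))
w2 = tf.Variable(tf.random.normal((H, D_out)))

learning_rate = 1e-6
for t in range(3):
    with tf.GradientTape() as tape:
        # Forward pass and loss, recorded on the tape
        h_relu = tf.maximum(tf.matmul(x, w1), 0.0)
        y_pred = tf.matmul(h_relu, w2)
        loss = tf.reduce_sum(tf.square(y_pred - y))
    # Gradients w.r.t. the weights, then in-place gradient descent update
    grad_w1, grad_w2 = tape.gradient(loss, [w1, w2])
    w1.assign_sub(learning_rate * grad_w1)
    w2.assign_sub(learning_rate * grad_w2)
    print(t, float(loss))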

Use the torch.nn package to implement our two-layer network

import random
import torch
import torch.nn as nn

N, D_in, H, D_out = 64, 1000, 100, 10

x = torch.randn(N, D_in)
y = torch.randn(N, D_out)

# sequential model
# model = nn.Sequential(nn.Linear(D_in, H), 
#                      nn.ReLU(), 
#                      nn.Linear(H, D_out))


# custom model
# class TwoLayerNet(torch.nn.Module):
#     def __init__(self, D_in, H, D_out):
#         super(TwoLayerNet, self).__init__()
#         self.linear1 = nn.Linear(D_in, H)
#         self.linear2 = nn.Linear(H, D_out)
        
#     def forward(self, x):
#         h_relu = self.linear1(x).clamp(min=0)
#         y_pred = self.linear2(h_relu)
#         return y_pred
    
# dynamic graphs and weight sharing
class DynamicNet(torch.nn.Module):
    def __init__(self, D_in, H, D_out):
        super(DynamicNet, self).__init__()
        self.input_linear = torch.nn.Linear(D_in, H)
        self.middle_linear = torch.nn.Linear(H, H)
        self.output_linear = torch.nn.Linear(H, D_out)

    def forward(self, x):
        """
        Since each forward pass builds a dynamic computation graph, we can use normal
        Python control-flow operators like loops or conditional statements when
        defining the forward pass of the model.

        Here we also see that it is perfectly safe to reuse the same Module many
        times when defining a computational graph. 
        """
        h_relu = self.input_linear(x).clamp(min=0)
        for _ in range(random.randint(0, 3)):
            h_relu = self.middle_linear(h_relu).clamp(min=0)
        y_pred = self.output_linear(h_relu)
        return y_pred

# model = TwoLayerNet(D_in, H, D_out)
model = DynamicNet(D_in, H, D_out)

criterion = nn.MSELoss(reduction='sum')

learning_rate = 1e-4

optimizer = torch.optim.Adam(model.parameters(), lr=learning_rate)

for t in range(3):
    y_pred = model(x)
    loss = criterion(y_pred, y)
    
    print(t, loss.item())
    
#     model.zero_grad()
    optimizer.zero_grad()
    
    loss.backward()
    
    # torch.no_grad(): builds a context in which operations are not tracked by autograd
    # with torch.no_grad() or .data to avoid tracking history in autograd

#     with torch.no_grad():
#         # SGD
#         for param in model.parameters():
#             param -= learning_rate * param.grad

    optimizer.step()
        
0 663.4624633789062
1 650.722900390625
2 641.6253051757812
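
Once training is done, predictions can be made without tracking gradients, exactly as the torch.no_grad() comment above describes. A minimal inference sketch, assuming the model trained above is still in scope (x_new is a hypothetical new input):

model.eval()                      # switch modules to evaluation mode (a no-op for plain Linear layers)
with torch.no_grad():             # do not record operations for autograd
    x_new = torch.randn(1, D_in)
    y_new = model(x_new)
    print(y_new.shape)            # torch.Size([1, 10])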