pytorch基础学习（一）

　　在炼丹师的路上越走越远，开始入手pytorch框架的学习，越炼越熟吧。。。

1. 张量的创建和操作

　　创建为初始化矩阵，并初始化

a = torch.empty(, )    #创建一个5*3的未初始化矩阵

nn.init.zeros_(a)        #初始化a为0

nn.init.constant_(a, )  # 初始化a为3

nn.init.uniform_(a)      #初始化为uniform分布

　　随机数矩阵

torch.rand(, )              # * ， [, )的随机数
torch.rand_like(m)            #创建和m的size一样的随机数矩阵

torch.rand(, )              # * ， mean=， variance=，的正态分布

torch.randint(, , (,))   #*3的整数矩阵（-10之间）

　　tensor类型和形状

a = torch.Tensor([1, 2, 3])   #通过列表创建Tensor
a = torch.eye(,)            #对角矩形
a = torch.ones(3,4)
b = torch.ones_like(a)        #创建和a一样size

a = torch.zeros(, , dtype = torch.long)

a.dtype  #查看数据类型

    32位浮点型：torch.Float (默认的就是这种类型, float32)

    64位整型：torch.Long     (int64 )

    32位整型：torch.Int      (int32)

    16位整型：torch.Short    (int16)

    64位浮点型：torch.Double  (float64)

a.size()

    h, w = torch.Size([, ])   #*3维 ,h=, w=

a.view(, )        #将a进行reshape成3*5的矩阵

a.view(-, )      # -1表示自动计算， 即转变为1*

　　tensor和numpy（array）的相互转换

b = a.numpy()    #Tensor转numpy

c = np.ones((,))

d = torch.from_numpy(c)  #numpy 转tensor

2. 张量的操作

　　索引：支持numpy的常用索引和切片操作

　　加法：和numpy类似的广播原则

索引操作

y = torch.rand(,)

y[:, ]  切片和索引

y[y>0.5] 花式索引

加法操作(和numpy一样的广播原则)

result = x+y

reslut = torch.add(x, y)

y.add_(x)  #直接对y的值进行修改,

（以_结尾的方法会直接在原地修改变量， 如x.copy_(y), x.t_()会修改x.）

result = torch.empty(,)

torch.add(x, y, out=result)  #这里的result必须先定义

对于一个元素的张量，可以直接通过x.item()拿到元素值

x = torch.ones(,)

y = torch.sum(x)

print(y.item())     #得到整数12.

　　cuda Tensor: pytorch 支持Gpu操作，可以在Gpu上创建tensor，通过to()方法可以在cpu和Gpu上间转换tensor

if torch.cuda.is_available():

    device = torch.device("cuda")

    y = torch.ones_like(x, device=device)   #直接在Gpu上创建tensor

    x = x.to(device)  #从cpu上转移到gpu

    z = x+y

    print(z.to("cpu", torch.double))  #转回到cpu，并改变数据类型

3. 自动求导（Autograd）

　　在pytorch搭建的神经网络中，Tensor 和Function为最主要的两个类，一起组成了一个无环图。在前向传播时，Function操作tensor的值，而进行反向传播时，需要计算function的导数来更新参数tensor， pytorch为我们自动实现了求导。每一个tensor都有一个requires_grad属性，若tensor.requires_grad=True, 则无环图会记录对该tensor的所有操作，当进行backward时，pytorch就会自动计算其导数值，并保存在tensor的grad属性中。

x = torch.ones(, , requires_grad=True)  #设置requires_grad=True， backward时会计算导数

y = x+

    属性值

        y.requirs_grad     是否autograd, 会自动继承x的requires_grad

        y.grad               导数或梯度值

        y.grad_fn           对x的操作function，grad_fn=<AddBackward0>

tensor.detach()           将tensor从计算历史（无环图）中脱离出来？

with torch.no_grad():     从计算历史（无环图）中脱离, backward时不求导

with torch.set_grad_enabled(phase == 'train')：  （phase == 'train'）为True时求导

tensor.backward()    #反向传播， 计算梯度，如果tensor只包含一个数时，backward不需要参数， 否则需要指明参数

backward:

#out为标量，所以backward时不带参数

x = torch.ones(, , requires_grad=True)

y = x+

z = y*y*

out = z.mean()

out.backward()

print(x.grad)     #tensor([[4.5000, 4.5000],[4.5000, 4.5000]])

print(y.grad)     #None

backward计算过程如下：

#y不为为标量，backward时需要带参数

x = torch.ones(, , requires_grad=True)

y = *x+

y.backward(torch.tensor([[,],[,]], dtype=torch.float))  #可以理解为tensor([1, 1, 1, 1]) * dy/dx

print(x.grad)    # tensor([[.,.],[.,.]])

#y不为为标量，backward时需要带参数

x = torch.ones(, , requires_grad=True)

y = *x+

y.backward(torch.tensor([[,0.1],[,0.1]], dtype=torch.float))  #可以理解为tensor([1, 0.1, 1, 0.1]) * dy/dx

print(x.grad)    # tensor([[2.0000,0.2000],[2.0000,0.2000]])

4. 神经网络(Neutral Networks)

　　训练神经网络的典型步骤如下：

　  定义神经网络(权重参数)

    在数据集上进行迭代

    前向传播，神经网络逐层计算输入的数据

    计算loss（神经网络的计算值和正确值的距离）

    计算梯度，反向传递到神经网络中，更新神经网络的权重(weight = weight - learning_rate * gradient)

4.1 定义神经网络

　　定义一个神经网络，需要继承torch.nn.Module, 并实现初始化方法和前向传播。下面代码为AlexNet的实现(通过三种方式定义网络结构)：

#coding:utf-

#pytorch implementation of AlexNet

import torch.nn as nn

import torch.nn.functional as F

import torch

class AlexNet(nn.Module):

    def __init__(self):               # image size *

        super(AlexNet, self).__init__()

        self.features = nn.Sequential(

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(,)),

            nn.ReLU(inplace=True),

            nn.MaxPool2d(kernel_size=, stride=, padding=, dilation=, ceil_mode=False),    # when ceil_mode is False, floor will be used to calculate the shape, else ceil

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, )),

            nn.ReLU(inplace=True),

            nn.MaxPool2d(kernel_size=, stride=, padding=, dilation=, ceil_mode=False),

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, )),

            nn.ReLU(inplace=True),

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, )),

            nn.ReLU(inplace=True),

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, )),

            nn.ReLU(inplace=True),

            nn.MaxPool2d(kernel_size=, stride=, padding=, dilation=, ceil_mode=False)

        )

        # self.avgpool = nn.AdaptiveAvgPool2d(output_size=(,))       #使输出的形状变成6*

        self.classifier=nn.Sequential(

            nn.Dropout(p=0.5),

            nn.Linear(in_features=, out_features=, bias=True),

            nn.ReLU(inplace=True),

            nn.Dropout(p=0.5),

            nn.Linear(in_features=, out_features=, bias=True),

            nn.ReLU(inplace=True),

            nn.Linear(in_features=, out_features=, bias=True)

        )

    def forward(self, x):

        x = self.features(x)

        x = x.view(-, **)

        x = self.classifier(x)

        return x

class AlexNet1(nn.Module):

    def __init__(self):

        super(AlexNet1, self).__init__()

        self.conv1 = nn.Sequential(

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, )),

            nn.ReLU(inplace=True),

            nn.MaxPool2d(kernel_size=, stride=, padding=, dilation=, ceil_mode=False),

        )

        self.conv2 = nn.Sequential(

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, )),

            nn.ReLU(inplace=True),

            nn.MaxPool2d(kernel_size=, stride=, padding=, dilation=, ceil_mode=False),

        )

        self.conv3 = nn.Sequential(

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, )),

            nn.ReLU(inplace=True),

        )

        self.conv4 = nn.Sequential(

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, )),

            nn.ReLU(inplace=True),

        )

        self.conv5 = nn.Sequential(

            nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, )),

            nn.ReLU(inplace=True),

            nn.MaxPool2d(kernel_size=, stride=, padding=, dilation=, ceil_mode=False)

        )

        self.classifier=nn.Sequential(

            nn.Dropout(p=0.5),

            nn.Linear(in_features=, out_features=, bias=True),

            nn.ReLU(inplace=True),

            nn.Dropout(p=0.5),

            nn.Linear(in_features=, out_features=, bias=True),

            nn.ReLU(inplace=True),

            nn.Linear(in_features=, out_features=, bias=True)

        )

    def forward(self, x):

        x = self.conv1(x)

        x = self.conv2(x)

        x = self.conv3(x)

        x = self.conv4(x)

        x = self.conv5(x)

        x = x.view(-, **)

        x = self.classifier(x)

        return x

class AlexNet2(nn.Module):

    def __init__(self):

        super(AlexNet2, self).__init__()

        self.conv1 = nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(,))

        self.relu1 = nn.ReLU(inplace=True)

        self.pool1 = nn.MaxPool2d(kernel_size=, stride=, padding=, dilation=, ceil_mode=False)   # when ceil_mode is False, floor will be used to calculate the shape, else ceil

        self.conv2 = nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, ))

        self.relu2 = nn.ReLU(inplace=True)

        self.pool2 = nn.MaxPool2d(kernel_size=, stride=, padding=, dilation=, ceil_mode=False)

        self.conv3 = nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, ))

        self.relu3 = nn.ReLU(inplace=True)

        self.conv4 = nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, ))

        self.relu4 = nn.ReLU(inplace=True)

        self.conv5 = nn.Conv2d(, , kernel_size=(, ), stride=(, ), padding=(, ))

        self.relu5 = nn.ReLU(inplace=True)

        self.pool5 = nn.MaxPool2d(kernel_size=, stride=, padding=, dilation=, ceil_mode=False)

        self.dropout = nn.Dropout(p=0.5)

        self.linear1 = nn.Linear(in_features=, out_features=, bias=True)

        self.relu6 = nn.ReLU(inplace=True)

        self.linear2 = nn.Linear(in_features=, out_features=, bias=True)

        self.relu7 = nn.ReLU(inplace=True)

        self.linear3 = nn.Linear(in_features=, out_features=, bias=True)

    def forward(self, x):

        x = self.pool1(self.relu1(self.conv1(x)))

        x = self.pool2(self.relu2(self.conv2(x)))

        x = self.relu3(self.conv3(x))

        x = self.relu4(self.conv4(x))

        x = self.pool5(self.relu5(self.conv5(x)))

        x = x.view(-, **)

        x = self.dropout(x)

        x = self.dropout(self.relu6(self.linear1(x)))

        x = self.relu7(self.linear2(x))

        x = self.linear3(x)

        return x

if __name__=="__main__":

    # net = AlexNet()

    # net = AlexNet1()

    net = AlexNet2()

    input = torch.randn(,,,)  #torch中net输入为四维的： batch_size*channel*W*H

    output = net(input)

    print(output)

AlexNet

4.2 定义loss函数

　　pytorch中常用的loss函数有：

　　nn.L1Loss()　　

nn.L1Loss(): 取预测值和真实值差的绝对值，最后平均数

x = torch.tensor([[,],[,]], dtype=torch.float)  #计算loss时需要float类型

y = torch.tensor([[,],[,]], dtype=torch.float)

criterion = nn.L1Loss()

loss = criterion(x, y)   #(|-|+|-|+|-|+|-|)/=2.0

print(loss.item()) # 2.0

　　nn.SmoothL1Loss(size_average=None, reduce=True,reduction='mean')

　　#reduce为False时，返回向量，返回整个bacth的每一个loss

　　# reduce 默认为True，返回标量， size_average为True时返回batch_loss的平均值，为False时返回batch_loss的和（size_average废弃，由reduction取代）

nn.SmoothL1Loss():  在(-, )范围内是平方loss(L2 loss), 其他范围内是L1 loss

x = torch.tensor([[,],[,]], dtype=torch.float)  #计算loss时需要float类型

y = torch.tensor([[1.5,1.5],[,]], dtype=torch.float)

criterion = nn.SmoothL1Loss()

loss = criterion(x, y)   #(((1.5-)**)/+((1.5-)**)/+|-|-0.5+|-|-0.5)/=2.0

print(loss.item()) # 0.8125

　　nn.MSELoss()

nn.MSELoss():  平方loss(L2 loss), 最后平均数

x = torch.tensor([[,],[,]], dtype=torch.float)  #计算loss时需要float类型

y = torch.tensor([[,],[,]], dtype=torch.float)

criterion = nn.MSELoss()

loss = criterion(x, y)   #((-)**+(-)**+(-)**+-)**)/=4.0

print(loss.item()) #

　　nn.NLLLoss() : 负对数似然损失函数（Negative Log Likelihood）

　　　　和CrossEntropyLoss()的唯一区别是，不会对输入值进行softmax计算。（因此model计算输出时，最后一层需要加上LogSoftmax）

　　　　　　　　　　　　　　　　　　　　　　　　　　　　（假如x=[1, 2, 3], class=0; 则f=x[0]=1）

nn.NLLLoss()  : 负对数似然损失函数（Negative Log Likelihood）

input = torch.randn(, , requires_grad=True)

target = torch.empty(, dtype=torch.long).random_()  #必须是torch.long类型

criterion = nn.NLLLoss()

loss = criterion(nn.LogSoftmax(dim=)(input), target)

print(loss.item())

　　nn.CrossEntropyLoss() 交叉熵函数
　　nn.LogSoftmax()和nn.NLLLoss()结合体：会对输入值使用softmax，再进行计算（因此model计算输出时不需要进行softmax）

　　　　（参考： https://www.cnblogs.com/marsggbo/p/10401215.html）

loss = nn.CrossEntropyLoss()

#input = torch.randn(, , requires_grad=True)

#target = torch.empty(, dtype=torch.long).random_()  #必须是torch.long类型

loss = nn.CrossEntropyLoss()

input = torch.tensor([[-0.7678,  0.2773, -0.9249,  1.4503,  0.5256],

        [-0.8529, -1.4283, -0.3284,  1.8608, -0.3206],

        [ 0.1201, -0.7239,  0.6798, -0.8335, -2.1710]], requires_grad=True)

target = torch.tensor([, , ], dtype=torch.long)  #必须是torch.long类型, 且input中每一行对应的target中的一个数字（不是one-hot）

output = loss(input, target)

print(output.item())

#nn.LogSoftmax()和nn.NLLLoss()分开计算如下：

input = torch.tensor([[-0.7678,  0.2773, -0.9249,  1.4503,  0.5256],

        [-0.8529, -1.4283, -0.3284,  1.8608, -0.3206],

        [ 0.1201, -0.7239,  0.6798, -0.8335, -2.1710]], requires_grad=True)

target = torch.tensor([, , ], dtype=torch.long)  #必须是torch.long类型

sft = nn.LogSoftmax(dim=)(input)

nls = nn.NLLLoss()(sft, target)

　　　numpy实现交叉熵函数

def label_encoder(target, nclass):

    label = np.zeros((target.shape[],nclass))

    for i in range(target.shape[]):

        label[i][target[i]]=

    print(label)

    return label

def cross_entropy_loss(pred, target):

    target = label_encoder(target, pred.shape[])  #one-hot编码

    pred_exp = np.exp(pred)

    pred_sft = pred_exp/(np.sum(pred_exp, axis=)[:,None])

    print(np.log(pred_sft))

    loss = -np.sum(np.log(pred_sft)*target)/pred.shape[]  #取一个batch的平均值

    print(loss)

    return loss

if __name__=="__main__":

    input = np.array([[-0.7678,  0.2773, -0.9249,  1.4503,  0.5256],

        [-0.8529, -1.4283, -0.3284,  1.8608, -0.3206],

        [ 0.1201, -0.7239,  0.6798, -0.8335, -2.1710]])

    target = np.array([, , ])

    loss = cross_entropy_loss(input,target)

numpy实现交叉熵

　　nn.BCELoss() 二分类时的交叉熵(Bianry cross entropy),

　　　　nn.CrossEntropyLoss()的特例,即分类限定为二分类，label必须为0,1; 模型输出最后一层需要用sigmoid函数

criterion = nn.BCELoss()

input = torch.randn(, , requires_grad=True)

target = torch.empty(,).random_()

pre = nn.Sigmoid()(input)

loss = criterion(pre, target)

print(loss.item())

　　nn.BCEWithLogitsLoss(): 将nn.sigmoid()和nn.BCELoss()结合

criterion = nn.BCEWithLogitsLoss()

input = torch.randn(, , requires_grad=True)

target = torch.empty(,).random_()

loss = criterion(input, target)

print(loss.item())

4.3 定义优化器

　　通过loss函数计算出网络的预测值和真实值之间的loss后，loss.backward()能将梯度反向传播，需要根据梯度来更新网络的权重系数。优化器能帮我们实现权重系数的更新。

　　不采用优化器：

#不用优化器，更新系数
criterion = nn.CrossEntropyLoss()

input = torch.randn(, , requires_grad=True)

target = torch.empty(,).random_()

pre = net(input)

loss = criterion(pre, target)

net.zero_grad()

loss.backward()

learing_rate=0.01

for f in net.parameters():

    f.data.sub_(f.grad.data*learing_rate)  #更新系数

　　采用优化器：

#采用优化器，更新系数

import torch.optim as optim

optimizer = optim.SGD(net.parameters(), lr=0.01)

criterion = nn.CrossEntropyLoss()

input = torch.randn(, , requires_grad=True)

target = torch.empty(,).random_()

optimizer.zero_grad()

pre = net(input)

loss = criterion(pre, target)

loss.backward()

optimizer.step()    #更新系数

4.4 定义DataLoader

　　进行网络训练时，可以一次性导入所有数据(BGD，batch gradient descent), 可以一次导入一条数据（SGD, stochastic gradient descent），还可以一次导入部分数据（MBGD, Mini-batch gradient descent），目前MBGD为较为常用的方法。而且一般情况，无特殊说明时，论文里面提及SGD，都指代的时MGBD的方式进行数据导入和网络训练。

　　采用pytorch自带的数据集：

#采用pytorch自带的数据集

import torchvision

import torchvision.transform as transforms

transform = transforms.Compose(

    [transforms.ToTensor(),

     transforms.Normalize((0.5, 0.5, 0.5),(0.5, 0.5, 0.5))

    ]

    )

trainset = torchvision.datasets.CIFAR10(root="./data", train=True, download=True, transform = transform)  #下载数据集

trainloader = torch.utils.data.DataLoader(trainset, batch_size=, shuffle=True, num_workers=)             #每一个mini-batch，导入4条数据

　　采用自定义的数据集：

class RandomDataset(Dataset):

    def __init__(self, size, length):

        self.len = length

        self.data = torch.randn(length, size)

    def __getitem__(self, index):

        return self.data[index]

    def __len__(self):

        return self.len

rand_loader = DataLoader(dataset=RandomDataset(input_size, data_size),

                         batch_size=batch_size, shuffle=True)

自定义数据集一

class MyDataset(Dataset):

    def __init__(self, root_dir, annotations_file, transform=None):

        self.root_dir = root_dir

        self.annotations_file = annotations_file

        self.transform = transform

        if not os.path.isfile(self.annotations_file):

            print(self.annotations_file + 'does not exist!')

        self.file_info = pd.read_csv(annotations_file, index_col=)

        self.size = len(self.file_info)

    def __len__(self):

        return self.size

    def __getitem__(self, idx):

        image_path = self.file_info['path'][idx]

        if not os.path.isfile(image_path):

            print(image_path + '  does not exist!')

            return None

        image = Image.open(image_path).convert('RGB')

        label_species = int(self.file_info.iloc[idx]['species'])

        sample = {'image': image, 'species': label_species}

        if self.transform:

            sample['image'] = self.transform(image)

        return sample

train_transforms = transforms.Compose([transforms.Resize((, )),

                       transforms.RandomCrop(),

                                       transforms.RandomHorizontalFlip(),

                                       transforms.ToTensor(),

                                       ])

val_transforms = transforms.Compose([transforms.Resize((, )),

                                     transforms.ToTensor()

                                     ])

train_dataset = MyDataset(root_dir= ROOT_DIR + TRAIN_DIR,

                          annotations_file= TRAIN_ANNO,

                          transform=train_transforms)

test_dataset = MyDataset(root_dir= ROOT_DIR + VAL_DIR,

                         annotations_file= VAL_ANNO,

                         transform=val_transforms)

train_loader = DataLoader(dataset=train_dataset, batch_size=, shuffle=True)

test_loader = DataLoader(dataset=test_dataset)

自定义数据集二

4.5 神经网络训练：

　　准备好数据集，定义好模型，loss函数，优化器，数据迭代器，便可以进行网络训练了。下面是一个简单的图片分类器，将图片分成三类：兔子，老鼠，鸡

　　A. 准备数据集的label文件：

import pandas as pd

import os

from PIL import Image

# ROOTS = '../Dataset/'

ROOTS = '/home/ai/project/data/project_I/Dataset/'

PHASE = ['train', 'val']

SPECIES = ['rabbits', 'rats', 'chickens']  # [,,]

DATA_info = {'train': {'path': [], 'species': []},

             'val': {'path': [], 'species': []}

             }

for p in PHASE:

    for s in SPECIES:

        DATA_DIR = ROOTS + p + '/' + s

        DATA_NAME = os.listdir(DATA_DIR)

        for item in DATA_NAME:

            try:

                img = Image.open(os.path.join(DATA_DIR, item))

            except OSError:

                pass

            else:

                DATA_info[p]['path'].append(os.path.join(DATA_DIR, item))

                if s == 'rabbits':

                    DATA_info[p]['species'].append()

                elif s == 'rats':

                    DATA_info[p]['species'].append()

                else:

                    DATA_info[p]['species'].append()

    ANNOTATION = pd.DataFrame(DATA_info[p])

    ANNOTATION.to_csv('Species_%s_annotation.csv' % p)

    print('Species_%s_annotation file is saved.' % p)

生成label文件

　　生成的label文件格式如下，包括图片路径，以及对应的分类（0，1， 2依次代表'rabbits', 'rats', 'chickens'）

　　B. 定义网络结构：

　　　　可以自己搭建全新的网络结构，也可以采用成熟的网络架构，如AlexNet，VGG，GoogleNet, Resnet等。下面分别展示了自定义和借用Resnet网络结构：

import torch.nn as nn

import torchvision

import torch.nn.functional as F

class Net(nn.Module):

    def __init__(self):

        super(Net, self).__init__()

        self.conv1 = nn.Conv2d(, , )

        self.maxpool1 = nn.MaxPool2d(kernel_size=)

        self.relu1 = nn.ReLU(inplace=True)

        self.conv2 = nn.Conv2d(, , )

        self.maxpool2 = nn.MaxPool2d(kernel_size=)

        self.relu2 = nn.ReLU(inplace=True)

        self.conv3 = nn.Conv2d(,,)

        self.fc1 = nn.Linear( *  * , )

        self.relu3 = nn.ReLU(inplace=True)

        self.drop = nn.Dropout2d()

        self.fc2 = nn.Linear(, )

        # self.softmax1 = nn.Softmax(dim=)

    def forward(self, x):

        x = self.conv1(x)

        x = self.maxpool1(x)

        x = self.relu1(x)

        x = self.conv2(x)

        x = self.maxpool2(x)

        x = self.relu2(x)

        x =nn.ReLU(nn.MaxPool2d(self.conv3(x),kernel_size=))

        # print(x.shape)

        x = x.view(-,  *  * )

        x = self.fc1(x)

        x = self.relu3(x)

        x = F.dropout(x, training=self.training)

        x_species = self.fc2(x)

        # x_species = self.softmax1(x_species)

        return x_species

自定义网络结构

import torch.nn as nn

import torchvision

import torch.nn.functional as F

from torchvision import models

class Net(nn.Module):

    def __init__(self, model):

        super(Net, self).__init__()

        # self.conv1 = nn.Conv2d(, , , padding=)

        # self.conv2 = nn.Conv2d(, , , padding=)

        # self.conv3 = nn.Conv2d(, , , padding=)

        # self.conv4 = nn.Conv2d(, , , padding=)

        # self.conv5 = nn.Conv2d(, , , padding=)

        # self.conv6 = nn.Conv2d(, , , padding=)

        # self.conv7 = nn.Conv2d(, , , padding=)

        self.resnet18_layer = nn.Sequential(*list(model.children())[:-])    

        self.fc1 = nn.Linear( *  * , )

        self.relu3 = nn.ReLU(inplace=True)

        # self.drop = nn.Dropout2d()

        self.fc2 = nn.Linear(, )

        self.softmax1 = nn.Softmax(dim=)

    def forward(self, x):

        # x = F.relu(self.conv1(x))

        # x = F.max_pool2d(F.relu(self.conv2(x)),)

        # x = F.relu(self.conv3(x))

        # x = F.max_pool2d(F.relu(self.conv4(x)),)

        # x = F.relu(self.conv5(x))

        # x = F.relu(self.conv6(x))

        # x = F.max_pool2d(F.relu(self.conv7(x)),)

        x = self.resnet18_layer(x)    

        # x = F.dropout(x, self.training)

        #print(x.shape)

        x = x.view(-,  *  * )

        x = self.fc1(x)

        x = self.relu3(x)

        # x = F.dropout(x, training=self.training)

        x_species = self.fc2(x)

        #x_species = self.softmax1(x_species)

        return x_species

采用Resnet

　　C. 定义训练主函数：

　　训练网络时，可以对自定义的网络从头训练，也可以采用在ImageNet上预训练好的网络，进行finetune，这里采用预训练好的Resnet16.

import os

import matplotlib.pyplot as plt

from torch.utils.data import DataLoader

import torch

from Species_Network import *

from torchvision.transforms import transforms

from PIL import Image

import pandas as pd

import random

from torch import optim

from torch.optim import lr_scheduler

import copy

ROOT_DIR = '../Dataset/'

TRAIN_DIR = 'train/'

VAL_DIR = 'val/'

TRAIN_ANNO = 'Species_train_annotation.csv'

VAL_ANNO = 'Species_val_annotation.csv'

CLASSES = ['Mammals', 'Birds']

SPECIES = ['rabbits', 'rats', 'chickens']

class MyDataset():

    def __init__(self, root_dir, annotations_file, transform=None):

        self.root_dir = root_dir

        self.annotations_file = annotations_file

        self.transform = transform

        if not os.path.isfile(self.annotations_file):

            print(self.annotations_file + 'does not exist!')

        self.file_info = pd.read_csv(annotations_file, index_col=)

        self.size = len(self.file_info)

    def __len__(self):

        return self.size

    def __getitem__(self, idx):

        image_path = self.file_info['path'][idx]

        if not os.path.isfile(image_path):

            print(image_path + '  does not exist!')

            return None

        image = Image.open(image_path).convert('RGB')

        label_species = int(self.file_info.iloc[idx]['species'])

        sample = {'image': image, 'species': label_species}

        if self.transform:

            sample['image'] = self.transform(image)

        return sample

train_transforms = transforms.Compose([transforms.Resize((, )),

                                       transforms.RandomHorizontalFlip(),

                                       transforms.ToTensor(),

                                       ])

val_transforms = transforms.Compose([transforms.Resize((, )),

                                     transforms.ToTensor()

                                     ])

train_dataset = MyDataset(root_dir= ROOT_DIR + TRAIN_DIR,

                          annotations_file= TRAIN_ANNO,

                          transform=train_transforms)

test_dataset = MyDataset(root_dir= ROOT_DIR + VAL_DIR,

                         annotations_file= VAL_ANNO,

                         transform=val_transforms)

train_loader = DataLoader(dataset=train_dataset, batch_size=, shuffle=True)

test_loader = DataLoader(dataset=test_dataset)

data_loaders = {'train': train_loader, 'val': test_loader}

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

print(device)

def visualize_dataset():

    print(len(train_dataset))

    idx = random.randint(, len(train_dataset))

    sample = train_loader.dataset[idx]

    print(idx, sample['image'].shape, SPECIES[sample['species']])

    img = sample['image']

    plt.imshow(transforms.ToPILImage()(img))

    plt.show()

visualize_dataset()

def train_model(model, criterion, optimizer, scheduler, num_epochs=):

    Loss_list = {'train': [], 'val': []}

    Accuracy_list_species = {'train': [], 'val': []}

    best_model_wts = copy.deepcopy(model.state_dict())

    best_acc = 0.0

    for epoch in range(num_epochs):

        print('Epoch {}/{}'.format(epoch, num_epochs - ))

        print('-*' * )

        # Each epoch has a training and validation phase

        for phase in ['train', 'val']:

            if phase == 'train':

                model.train()

            else:

                model.eval()

            running_loss = 0.0

            corrects_species = 

            for idx,data in enumerate(data_loaders[phase]):

                #print(phase+' processing: {}th batch.'.format(idx))

                inputs = data['image'].to(device)

                labels_species = data['species'].to(device)

                optimizer.zero_grad()

                with torch.set_grad_enabled(phase == 'train'):

                    x_species = model(inputs)

                    x_species = x_species.view(-,)

                    _, preds_species = torch.max(x_species, )

                    loss = criterion(x_species, labels_species)

                    if phase == 'train':

                        loss.backward()

                        optimizer.step()

                running_loss += loss.item() * inputs.size()

                corrects_species += torch.sum(preds_species == labels_species)

            epoch_loss = running_loss / len(data_loaders[phase].dataset)

            Loss_list[phase].append(epoch_loss)

            epoch_acc_species = corrects_species.double() / len(data_loaders[phase].dataset)

            epoch_acc = epoch_acc_species

            Accuracy_list_species[phase].append( * epoch_acc_species)

            print('{} Loss: {:.4f}  Acc_species: {:.2%}'.format(phase, epoch_loss,epoch_acc_species))

            if phase == 'val' and epoch_acc > best_acc:

                best_acc = epoch_acc_species

                best_model_wts = copy.deepcopy(model.state_dict())

                print('Best val species Acc: {:.2%}'.format(best_acc))

    model.load_state_dict(best_model_wts)

    torch.save(model.state_dict(), 'best_model.pt')

    print('Best val species Acc: {:.2%}'.format(best_acc))

    return model, Loss_list,Accuracy_list_species

network = Net().to(device)

optimizer = optim.SGD(network.parameters(), lr=0.005, momentum=0.9, weight_decay=1e-)  #weight_decay：L2正则项惩罚

criterion = nn.CrossEntropyLoss()

exp_lr_scheduler = lr_scheduler.StepLR(optimizer, step_size=, gamma=0.1) # Decay LR by a factor of 0.1 every  epochs

model, Loss_list, Accuracy_list_species = train_model(network, criterion, optimizer, exp_lr_scheduler, num_epochs=)

x = range(, )

y1 = Loss_list["val"]

y2 = Loss_list["train"]

plt.plot(x, y1, color="r", linestyle="-", marker="o", linewidth=, label="val")

plt.plot(x, y2, color="b", linestyle="-", marker="o", linewidth=, label="train")

plt.legend()

plt.title('train and val loss vs. epoches')

plt.ylabel('loss')

plt.savefig("train and val loss vs epoches.jpg")

plt.close('all') # 关闭图 

y5 = Accuracy_list_species["train"]

y6 = Accuracy_list_species["val"]

plt.plot(x, y5, color="r", linestyle="-", marker=".", linewidth=, label="train")

plt.plot(x, y6, color="b", linestyle="-", marker=".", linewidth=, label="val")

plt.legend()

plt.title('train and val Species acc vs. epoches')

plt.ylabel('Species accuracy')

plt.savefig("train and val Species acc vs epoches.jpg")

plt.close('all')

######################################## Visualization ##################################

def visualize_model(model):

    model.eval()

    with torch.no_grad():

        for i, data in enumerate(data_loaders['val']):

            inputs = data['image']

            labels_species = data['species'].to(device)

            x_species = model(inputs.to(device))

            x_species = x_species.view( -,)

            _, preds_species = torch.max(x_species, )

            print(inputs.shape)

            plt.imshow(transforms.ToPILImage()(inputs.squeeze()))

            plt.title('predicted species: {}\n ground-truth species:{}'.format(SPECIES[preds_species],SPECIES[labels_species]))

            plt.show()

visualize_model(model)

自定义网络训练

import os

import matplotlib.pyplot as plt

from torch.utils.data import Dataset, DataLoader

import torch

from Spe_Network import *

from torchvision.transforms import transforms

from PIL import Image

import pandas as pd

import random

from torch import optim

from torch.optim import lr_scheduler

import copy

from torchvision import models

import logging

logging.basicConfig(level=logging.DEBUG, filename="train.log", filemode="a+")

ROOT_DIR = '/home/ai/project/data/project_I/Dataset/'

TRAIN_DIR = 'train/'

VAL_DIR = 'val/'

TRAIN_ANNO = 'Species_train_annotation.csv'

VAL_ANNO = 'Species_val_annotation.csv'

CLASSES = ['Mammals', 'Birds']

SPECIES = ['rabbits', 'rats', 'chickens']

class MyDataset(Dataset):

    def __init__(self, root_dir, annotations_file, transform=None):

        self.root_dir = root_dir

        self.annotations_file = annotations_file

        self.transform = transform

        if not os.path.isfile(self.annotations_file):

            print(self.annotations_file + 'does not exist!')

        self.file_info = pd.read_csv(annotations_file, index_col=)

        self.size = len(self.file_info)

    def __len__(self):

        return self.size

    def __getitem__(self, idx):

        image_path = self.file_info['path'][idx]

        if not os.path.isfile(image_path):

            print(image_path + '  does not exist!')

            return None

        image = Image.open(image_path).convert('RGB')

        label_species = int(self.file_info.iloc[idx]['species'])

        sample = {'image': image, 'species': label_species}

        if self.transform:

            sample['image'] = self.transform(image)

        return sample

train_transforms = transforms.Compose([transforms.Resize((, )),

                       transforms.RandomCrop(),

                                       transforms.RandomHorizontalFlip(),

                                       transforms.ToTensor(),

                                       ])

val_transforms = transforms.Compose([transforms.Resize((, )),

                                     transforms.ToTensor()

                                     ])

train_dataset = MyDataset(root_dir= ROOT_DIR + TRAIN_DIR,

                          annotations_file= TRAIN_ANNO,

                          transform=train_transforms)

test_dataset = MyDataset(root_dir= ROOT_DIR + VAL_DIR,

                         annotations_file= VAL_ANNO,

                         transform=val_transforms)

train_loader = DataLoader(dataset=train_dataset, batch_size=, shuffle=True)

test_loader = DataLoader(dataset=test_dataset)

data_loaders = {'train': train_loader, 'val': test_loader}

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

print(device)

def visualize_dataset():

    print(len(train_dataset))

    idx = random.randint(, len(train_dataset))

    sample = train_loader.dataset[idx]

    print(idx, sample['image'].shape, SPECIES[sample['species']])

    img = sample['image']

    plt.imshow(transforms.ToPILImage()(img))

    plt.show()

visualize_dataset()

def train_model(model, criterion, optimizer, scheduler, num_epochs=):

    Loss_list = {'train': [], 'val': []}

    Accuracy_list_species = {'train': [], 'val': []}

    best_model_wts = copy.deepcopy(model.state_dict())

    best_acc = 0.0

    for epoch in range(num_epochs):

        print('Epoch {}/{}'.format(epoch, num_epochs - ))

        print('-*' * )

        # Each epoch has a training and validation phase

        for phase in ['train', 'val']:

            if phase == 'train':

                model.train()

            else:

                model.eval()

            running_loss = 0.0

            corrects_species = 

            for idx,data in enumerate(data_loaders[phase]):

                #print(phase+' processing: {}th batch.'.format(idx))

                inputs = data['image'].to(device)

                labels_species = data['species'].to(device)

                optimizer.zero_grad()

                with torch.set_grad_enabled(phase == 'train'):

                    x_species = model(inputs)

                    x_species = x_species.view(-,)

                    _, preds_species = torch.max(x_species, )

                    loss = criterion(x_species, labels_species)

                    if phase == 'train':

                        loss.backward()

                        optimizer.step()

                running_loss += loss.item() * inputs.size()

                corrects_species += torch.sum(preds_species == labels_species)

            epoch_loss = running_loss / len(data_loaders[phase].dataset)

            Loss_list[phase].append(epoch_loss)

            epoch_acc_species = corrects_species.double() / len(data_loaders[phase].dataset)

            epoch_acc = epoch_acc_species

            Accuracy_list_species[phase].append( * epoch_acc_species)

            print('{} Loss: {:.4f}  Acc_species: {:.2%}'.format(phase, epoch_loss,epoch_acc_species))

            logging.info('{} Loss: {:.4f}  Acc_species: {:.2%}'.format(phase, epoch_loss,epoch_acc_species))

            if phase == 'val' and epoch_acc > best_acc:

                best_acc = epoch_acc_species

                best_model_wts = copy.deepcopy(model.state_dict())

                print('Best val species Acc: {:.2%}'.format(best_acc))

                logging.info('Best val species Acc: {:.2%}'.format(best_acc))

    model.load_state_dict(best_model_wts)

    torch.save(model.state_dict(), 'best_model.pt')

    print('Best val species Acc: {:.2%}'.format(best_acc))

    logging.info('Best val species Acc: {:.2%}'.format(best_acc))

    return model, Loss_list,Accuracy_list_species

# vgg16 = models.vgg16(pretrained=True)

res18 = models.resnet18(pretrained=True)

network = Net(res18).to(device)

optimizer = optim.SGD(network.parameters(), lr=0.005, momentum=0.9, weight_decay=1e-)  #weight_decay：L2正则项惩罚

criterion = nn.CrossEntropyLoss()

exp_lr_scheduler = lr_scheduler.StepLR(optimizer, step_size=, gamma=0.1) # Decay LR by a factor of 0.1 every  epochs

model, Loss_list, Accuracy_list_species = train_model(network, criterion, optimizer, exp_lr_scheduler, num_epochs=)

# x = range(, )

# y1 = Loss_list["val"]

# y2 = Loss_list["train"]

# plt.plot(x, y1, color="r", linestyle="-", marker="o", linewidth=, label="val")

# plt.plot(x, y2, color="b", linestyle="-", marker="o", linewidth=, label="train")

# plt.legend()

# plt.title('train and val loss vs. epoches')

# plt.ylabel('loss')

# plt.savefig("train and val loss vs epoches.jpg")

# plt.close('all') # 关闭图 

# y5 = Accuracy_list_species["train"]

# y6 = Accuracy_list_species["val"]

# plt.plot(x, y5, color="r", linestyle="-", marker=".", linewidth=, label="train")

# plt.plot(x, y6, color="b", linestyle="-", marker=".", linewidth=, label="val")

# plt.legend()

# plt.title('train and val Species acc vs. epoches')

# plt.ylabel('Species accuracy')

# plt.savefig("train and val Species acc vs epoches.jpg")

# plt.close('all')

######################################## Visualization ##################################

def visualize_model(model):

    model.eval()

    with torch.no_grad():

        for i, data in enumerate(data_loaders['val']):

            inputs = data['image']

            labels_species = data['species'].to(device)

            x_species = model(inputs.to(device))

            x_species = x_species.view( -,)

            _, preds_species = torch.max(x_species, )

            print(inputs.shape)

            plt.imshow(transforms.ToPILImage()(inputs.squeeze()))

            plt.title('predicted species: {}\n ground-truth species:{}'.format(SPECIES[preds_species],SPECIES[labels_species]))

            plt.show()

# visualize_model(model)

finetune Resnet网络

参考：

　　https://github.com/Hong-Bo/hands-on-pytorch/blob/master/alex_net/alex_net.py

　　https://pytorch.org/tutorials/

pytorch基础学习（一）

pytorch基础学习（一）的相关教程结束。

相关推荐

如何在anaconda中安装pytorch1.7.1

学习Linux，要把握哪些重点？

Nginx常用基础模块

shell学习总结

SwiftUI学习（一）

SSM整合之基础xml文件以及基础理解

python基础：重新认识装饰器

学习 Linux，101: 使用基本 SQL 命令