Swin Transformer簡(jiǎn)介

目標(biāo)檢測(cè)刷到58.7 AP！

實(shí)例分割刷到51.1 Mask AP！

語(yǔ)義分割在ADE20K上刷到53.5 mIoU！

今年，微軟亞洲研究院的Swin Transformer又開(kāi)啟了吊打CNN的模式，在速度和精度上都有很大的提高。這篇文章帶你實(shí)現(xiàn)Swin Transformer圖像分類(lèi)。

資料匯總

論文: https://arxiv.org/abs/2103.14030

代碼: https://github.com/microsoft/Swin-Transformer

論文翻譯：https://wanghao.blog.csdn.net/article/details/120724040

一些大佬的B站視頻：

1、霹靂吧啦Wz：https://www.bilibili.com/video/BV1yg411K7Yc?from=search&seid=18074716460851088132&spm_id_from=333.337.0.0

2、ClimbingVision社區(qū)：震驚！這個(gè)關(guān)于Swin Transformer的論文分享講得太透徹了！_嗶哩嗶哩_bilibili

關(guān)于Swin Transformer的資料有很多，在這里就不一一列舉了，我覺(jué)得理解這個(gè)模型的最好方式：源碼+論文。

環(huán)境配置

1、電腦環(huán)境：

操作系統(tǒng)：win10

CUDA版本：11.2

2、創(chuàng)建虛擬環(huán)境swin

conda create -n swin python=3.7 -y conda activate swin

3、安裝pytorch

conda install pytorch==1.7.1 torchvision==0.8.2 torchaudio==0.7.2 cudatoolkit=11.0 -c pytorch

4、安裝timm

pip install timm==0.3.2

5、安裝apex

APEX是英偉達(dá)開(kāi)源的，完美支持PyTorch框架，用于改變數(shù)據(jù)格式來(lái)減小模型顯存占用的工具。其中最有價(jià)值的是amp（Automatic Mixed Precision），將模型的大部分操作都用Float16數(shù)據(jù)類(lèi)型測(cè)試，一些特別操作仍然使用Float32。并且用戶僅僅通過(guò)三行代碼即可完美將自己的訓(xùn)練代碼遷移到該模型。實(shí)驗(yàn)證明，使用Float16作為大部分操作的數(shù)據(jù)類(lèi)型，并沒(méi)有降低參數(shù)，在一些實(shí)驗(yàn)中，反而由于可以增大Batch size，帶來(lái)精度上的提升，以及訓(xùn)練速度上的提升。

5.1 下載apex

網(wǎng)址 https://github.com/NVIDIA/apex，下載到本地文件夾。解壓后進(jìn)入到apex的目錄安裝依賴。在執(zhí)行命令；

cd C:\Users\WH\Downloads\apex-master #進(jìn)入apex目錄
pip install -r requirements.txt

5.2 安裝apex

依賴安裝完后，打開(kāi)cmd，cd進(jìn)入到剛剛下載完的apex-master路徑下，運(yùn)行：

python setup.py install

然后跑了一堆東西，最后是這樣的：

安裝完成！

6、安裝一些其他的包

pip install opencv-python==4.4.0.46 termcolor==1.1.0 yacs==0.1.8

數(shù)據(jù)集

數(shù)據(jù)集采用最經(jīng)典的貓狗大戰(zhàn)數(shù)據(jù)集。數(shù)據(jù)集地址：鏈接：https://pan.baidu.com/s/1ZM8vDWEzgscJMnBrZfvQGw 提取碼：48c3

如果連接失效請(qǐng)聯(lián)系我，或者你也可以從別的途徑獲得。

cat.3

dog.2

項(xiàng)目結(jié)構(gòu)

使用tree命令打印整個(gè)項(xiàng)目的結(jié)構(gòu)

Swin-Transformer-main
    ├─configs#配置文件 ├─data#處理數(shù)據(jù)集相關(guān)的操作 │ 
    ├─dataset #數(shù)據(jù)集結(jié)構(gòu) │  ├─test │  ├─train
    │  │  ├─cat
    │  │  └─dog
    │  └─val
    │      ├─cat
    │      └─dog
    ├─figures
    ├─models#Swin的模型文件 │ 
    ├─output#訓(xùn)練模型的輸出

訓(xùn)練

1、獲取代碼和預(yù)訓(xùn)練模型

從https://github.com/microsoft/Swin-Transformer下載代碼，然后放到本地。然后解壓。

在get_started.md找到預(yù)訓(xùn)練模型下載路徑，下載下來(lái)然后放到Swin-Transformer根目錄。

2、制作數(shù)據(jù)集

構(gòu)建數(shù)據(jù)集，數(shù)據(jù)集結(jié)構(gòu)如下：

dataset #數(shù)據(jù)集結(jié)構(gòu)
    ├─test ├─train
    │  ├─cat │  └─dog
    └─val
           ├─cat └─dog

從原數(shù)據(jù)集中取出一部分?jǐn)?shù)據(jù)集放入train對(duì)應(yīng)的類(lèi)別中，一部分放入val對(duì)應(yīng)的類(lèi)別中。把原數(shù)據(jù)集中的test直接復(fù)制到test中。

3、修改config.py文件

_C.DATA.DATA_PATH = 'dataset' # Dataset name _C.DATA.DATASET = 'imagenet' # Model name _C.MODEL.NAME = 'swin_tiny_patch4_window7_224' # Checkpoint to resume, could be overwritten by command line argument
_C.MODEL.RESUME ='swin_tiny_patch4_window7_224.pth' # Number of classes, overwritten in data preparation
_C.MODEL.NUM_CLASSES = 2

對(duì)上面參數(shù)的解釋?zhuān)?

_C.DATA.DATA_PATH ：數(shù)據(jù)集路徑的根目錄，我定義為dataset。

_C.DATA.DATASET：數(shù)據(jù)集的類(lèi)型，這里只有一種類(lèi)型imagenet。

_C.MODEL.NAME：模型的名字，對(duì)應(yīng)configs下面yaml的名字，會(huì)在模型輸出的root目錄創(chuàng)建對(duì)應(yīng)MODEL.NAME的目錄。

_C.MODEL.RESUME：預(yù)訓(xùn)練模型的目錄。

_C.MODEL.NUM_CLASSES：模型的類(lèi)別，默認(rèn)是1000，按照數(shù)據(jù)集的類(lèi)別數(shù)量修改。

4、修改build.py

將nb_classes =1000改為nb_classes = config.MODEL.NUM_CLASSES

5、修改utils.py

由于類(lèi)別默認(rèn)是1000，所以加載模型的時(shí)候會(huì)出現(xiàn)類(lèi)別對(duì)不上的問(wèn)題，所以需要修改load_checkpoint方法。在加載預(yù)訓(xùn)練模型之前增加修改預(yù)訓(xùn)練模型的方法：

if checkpoint['model']['head.weight'].shape[0] == 1000:
    checkpoint['model']['head.weight'] = torch.nn.Parameter(
        torch.nn.init.xavier_uniform(torch.empty(config.MODEL.NUM_CLASSES, 768)))
    checkpoint['model']['head.bias'] = torch.nn.Parameter(torch.randn(config.MODELNUM_CLASSES))
msg = model.load_state_dict(checkpoint['model'], strict=False)

6、修改main.py

將92-94注釋?zhuān)缦聢D：

將312行修改為：torch.distributed.init_process_group('gloo', init_method='file://tmp/somefile', rank=0, world_size=1)

7、運(yùn)行訓(xùn)練命令

打開(kāi)Terminal，運(yùn)行如下命令：

python main.py --cfg configs/swin_tiny_patch4_window7_224.yaml --local_rank 0 --batch-size 16

如果想單獨(dú)驗(yàn)證，運(yùn)行命令：

python  main.py --eval --cfg configs/swin_tiny_patch4_window7_224.yaml --resume ./output/swin_tiny_patch4_window7_224/default/ckpt_epoch_1.pth --data-path dataset --local_rank 0

推理

這個(gè)項(xiàng)目沒(méi)有推理腳本，我自己寫(xiě)了一個(gè)。寫(xiě)這部分需要看懂驗(yàn)證部分的代碼即可。

1、導(dǎo)入包和配置參數(shù)

import torch.utils.data.distributed
import torchvision.transforms as transforms from PIL import Image from torch.autograd import Variable
import os from models import build_model from config import get_config
import argparse

def parse_option():
    parser = argparse.ArgumentParser('Swin Transformer Test script', add_help=False)
    parser.add_argument('--cfg', default='configs/swin_tiny_patch4_window7_224.yaml', type=str, metavar="FILE", help='path to config file', )
    parser.add_argument( "--opts", help="Modify config options by adding 'KEY VALUE' pairs. ", default=None, nargs='+',
    )

    # easy config modification
    parser.add_argument('--batch-size', type=int, help="batch size for single GPU")
    parser.add_argument('--data-path', type=str, help='path to dataset')
    parser.add_argument('--zip', action='store_true', help='use zipped dataset instead of folder dataset')
    parser.add_argument('--cache-mode', type=str, default='part', choices=['no', 'full', 'part'], help='no: no cache, ' 'full: cache all data, ' 'part: sharding the dataset into nonoverlapping pieces and only cache one piece')
    parser.add_argument('--resume', default='output/swin_tiny_patch4_window7_224/default/ckpt_epoch_1.pth', help='resume from checkpoint')
    parser.add_argument('--accumulation-steps', type=int, help="gradient accumulation steps")
    parser.add_argument('--use-checkpoint', action='store_true', help="whether to use gradient checkpointing to save memory")
    parser.add_argument('--amp-opt-level', type=str, default='O1', choices=['O0', 'O1', 'O2'], help='mixed precision opt level, if O0, no amp is used')
    parser.add_argument('--output', default='output', type=str, metavar='PATH', help='root of output folder, the full path is <output>/<model_name>/<tag> (default: output)')
    parser.add_argument('--tag', help='tag of experiment')
    parser.add_argument('--eval', action='store_true', help='Perform evaluation only')
    parser.add_argument('--throughput', action='store_true', help='Test throughput only')
    parser.add_argument("--local_rank", default='0', type=int, help='local rank for DistributedDataParallel')
    args, unparsed = parser.parse_known_args()  config = get_config(args)

    return args, config

這個(gè)配置參數(shù)是為了創(chuàng)建模型，從main.py中復(fù)制過(guò)來(lái)，然后將required=True這樣的字段刪除。

定義class、創(chuàng)建transform

transform_test = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize([0.5, 0.5, 0.5], [0.5, 0.5, 0.5])
])
classes = ("cat", "dog")

將圖像resize為224×224大小

定義類(lèi)別，順序和數(shù)據(jù)集對(duì)應(yīng)。

2、創(chuàng)建模型

DEVICE = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
_, config = parse_option()
model = build_model(config) checkpoint = torch.load('output/swin_tiny_patch4_window7_224/default/ckpt_epoch_1.pth', map_location='cpu')
model.load_state_dict(checkpoint['model'], strict=False)
model.eval()
model.to(DEVICE)

判斷gpu是否可用，如果不可以使用cpu。

獲取config參數(shù)

創(chuàng)建模型

加載訓(xùn)練的模型權(quán)重

將權(quán)重放入model中。

3、開(kāi)始推理

定義測(cè)試集的路徑，然后循環(huán)預(yù)測(cè)每張圖片

path = 'dataset/test/'
testList = os.listdir(path) for file in testList:
    img = Image.open(path + file)
    img = transform_test(img)
    img.unsqueeze_(0)
    img = Variable(img).to(DEVICE) out = model(img)
    # Predict _, pred = torch.max(out.data, 1) print('Image Name:{},predict:{}'.format(file, classes[pred.data.item()]))

結(jié)果如下：

4、完整代碼：

import torch.utils.data.distributed
import torchvision.transforms as transforms from PIL import Image from torch.autograd import Variable
import os from models import build_model from config import get_config
import argparse

def parse_option():
    parser = argparse.ArgumentParser('Swin Transformer Test script', add_help=False)
    parser.add_argument('--cfg', default='configs/swin_tiny_patch4_window7_224.yaml', type=str, metavar="FILE", help='path to config file', )
    parser.add_argument( "--opts", help="Modify config options by adding 'KEY VALUE' pairs. ", default=None, nargs='+',
    )

    # easy config modification
    parser.add_argument('--batch-size', type=int, help="batch size for single GPU")
    parser.add_argument('--data-path', type=str, help='path to dataset')
    parser.add_argument('--zip', action='store_true', help='use zipped dataset instead of folder dataset')
    parser.add_argument('--cache-mode', type=str, default='part', choices=['no', 'full', 'part'], help='no: no cache, ' 'full: cache all data, ' 'part: sharding the dataset into nonoverlapping pieces and only cache one piece')
    parser.add_argument('--resume', default='output/swin_tiny_patch4_window7_224/default/ckpt_epoch_1.pth', help='resume from checkpoint')
    parser.add_argument('--accumulation-steps', type=int, help="gradient accumulation steps")
    parser.add_argument('--use-checkpoint', action='store_true', help="whether to use gradient checkpointing to save memory")
    parser.add_argument('--amp-opt-level', type=str, default='O1', choices=['O0', 'O1', 'O2'], help='mixed precision opt level, if O0, no amp is used')
    parser.add_argument('--output', default='output', type=str, metavar='PATH', help='root of output folder, the full path is <output>/<model_name>/<tag> (default: output)')
    parser.add_argument('--tag', help='tag of experiment')
    parser.add_argument('--eval', action='store_true', help='Perform evaluation only')
    parser.add_argument('--throughput', action='store_true', help='Test throughput only')
    parser.add_argument("--local_rank", default='0', type=int, help='local rank for DistributedDataParallel')
    args, unparsed = parser.parse_known_args()  config = get_config(args)

    return args, config  transform_test = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize([0.5, 0.5, 0.5], [0.5, 0.5, 0.5])
])
classes = ("cat", "dog")




DEVICE = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
_, config = parse_option()
model = build_model(config)
checkpoint = torch.load('output/swin_tiny_patch4_window7_224/default/ckpt_epoch_1.pth', map_location='cpu')
model.load_state_dict(checkpoint['model'], strict=False)
model.eval()
model.to(DEVICE)

path = 'dataset/test/' testList = os.listdir(path) for file in testList:
    img = Image.open(path + file)
    img = transform_test(img)
    img.unsqueeze_(0)
    img = Variable(img).to(DEVICE)
    out = model(img)
    # Predict
    _, pred = torch.max(out.data, 1) print('Image Name:{},predict:{}'.format(file, classes[pred.data.item()]))

總結(jié)

本文帶領(lǐng)大家學(xué)習(xí)了如何使用Swin Transformer實(shí)現(xiàn)圖像分類(lèi)。通過(guò)這篇文章你學(xué)習(xí)到了Swin Transformer的環(huán)境配置和一些參數(shù)配置，學(xué)會(huì)了如何寫(xiě)推理的腳本。

亚洲一区精品自拍_2021年国内精品久久_男同十八禁gv在线观看_免费观看a级性爱黄片

Article / 文章中心

文章分類(lèi)

Swin Transformer實(shí)戰(zhàn)：使用 Swin Transformer實(shí)現(xiàn)圖像分類(lèi)

Swin Transformer簡(jiǎn)介

資料匯總

環(huán)境配置

1、電腦環(huán)境：

2、創(chuàng)建虛擬環(huán)境swin

3、安裝pytorch

4、安裝timm

5、安裝apex

5.1 下載apex

5.2 安裝apex

6、安裝一些其他的包

數(shù)據(jù)集

項(xiàng)目結(jié)構(gòu)

訓(xùn)練

1、獲取代碼和預(yù)訓(xùn)練模型

2、制作數(shù)據(jù)集

3、修改config.py文件

4、修改build.py

5、修改utils.py

6、修改main.py

7、運(yùn)行訓(xùn)練命令

推理

1、導(dǎo)入包和配置參數(shù)

2、創(chuàng)建模型

3、開(kāi)始推理

4、完整代碼：

總結(jié)

亚洲一区精品自拍_2021年国内精品久久_男同十八禁gv在线观看_免费观看a级性爱黄片

Article / 文章中心

文章分類(lèi)

Swin Transformer實(shí)戰(zhàn)：使用 Swin Transformer實(shí)現(xiàn)圖像分類(lèi)

Swin Transformer簡(jiǎn)介

資料匯總

環(huán)境配置

1、電腦環(huán)境：

2、創(chuàng)建虛擬環(huán)境swin

3、安裝pytorch

4、安裝timm

5、安裝apex

5.1 下載apex

5.2 安裝apex

6、安裝一些其他的包

數(shù)據(jù)集

項(xiàng)目結(jié)構(gòu)

訓(xùn)練

1、獲取代碼和預(yù)訓(xùn)練模型

2、制作數(shù)據(jù)集

3、修改config.py文件

4、修改build.py

5、修改utils.py

6、修改main.py

7、運(yùn)行訓(xùn)練命令

推理

1、導(dǎo)入包和配置參數(shù)

2、創(chuàng)建模型

3、開(kāi)始推理

4、完整代碼：

總結(jié)

1、電腦環(huán)境：

2、創(chuàng)建虛擬環(huán)境swin

3、安裝pytorch

4、安裝timm

6、安裝一些其他的包

2、制作數(shù)據(jù)集

3、修改config.py文件

4、修改build.py

5、修改utils.py

6、修改main.py

1、導(dǎo)入包和配置參數(shù)

2、創(chuàng)建模型

4、完整代碼：