torch transform.resize（）与cv2.resize（）的比较

import cv2 import numpy as np from PIL import image import torch import torchvision from torchvision import transforms as trans # device for pytorch device = torch.device('cuda:0') torch.set_default_tensor_type('torch.cuda.FloatTensor') model = torch.jit.load("traced_facelearner_model_new.pt") model.eval() # read the example image used for tracing image=cv2.imread("videos/example.jpg") test_transform = trans.Compose([ trans.ToTensor(), trans.Normalize([0.5, 0.5, 0.5], [0.5, 0.5, 0.5]) ]) test_transform2 = trans.Compose([ trans.Resize([int(112), int(112)]), trans.ToTensor(), trans.Normalize([0.5, 0.5, 0.5], [0.5, 0.5, 0.5]) ]) resized_image = cv2.resize(image, (112, 112)) tensor1 = test_transform(resized_image).to(device).unsqueeze(0) tensor2 = test_transform2(Image.fromarray(image)).to(device).unsqueeze(0) output1 = model(tensor1) output2 = model(tensor2)

1条回答

网友

1楼 · 发布于 2024-09-21 08:37:21

基本上torchvision.transforms.Resize()默认情况下使用PIL.Image.BILINEAR插值

而在代码中，您只需使用cv2.resize，它不使用任何插值

比如说

import cv2
from PIL import Image
import numpy as np

a = cv2.imread('videos/example.jpg')
b = cv2.resize(a, (112, 112))
c = np.array(Image.fromarray(a).resize((112, 112), Image.BILINEAR))

您将看到b和c略有不同

编辑：

实际上opencv文档说

INTER_LINEAR - a bilinear interpolation (used by default)

但是是的，它不会给出与PIL相同的结果

编辑2：

这也在文档中

To shrink an image, it will generally look best with INTER_AREA interpolation

显然

d = cv2.resize(a, (112, 112), interpolation=cv2.INTER_AREA)

给出与c几乎相同的结果。但不幸的是，这些并不能回答这个问题

编辑：

编辑2：

相关问题更多 >

编程相关推荐

热门问题

热门文章