回答此问题可获得 20 贡献值,回答如果被采纳可获得 50 分。
<p>首先,对不起,这篇文章太长了</p>
<p>我正在做一个项目,根据叶子的图像对植物进行分类。为了减少数据的差异,我需要旋转图像,使茎在图像底部水平对齐(270度)</p>
<p><strong>到目前为止我所处的位置…</strong></p>
<p>到目前为止,我所做的是创建一个阈值图像,然后从中找到轮廓并围绕对象绘制一个椭圆(在许多情况下,它无法涉及整个对象,因此忽略了茎…),然后,我创建4个区域(带有椭圆的边),并尝试计算最小值区域,这是因为假设在任何一点必须找到茎,因此它将是人口较少的区域(主要是因为它将被0包围),这显然不是我想要的工作</p>
<p>之后,我以两种不同的方式计算旋转角度,第一种涉及<code>atan2</code>函数,这只需要我想从(人口最少区域的质心)移动的点,以及<code>x=image width / 2</code>和<code>y = height</code>的点。这种方法在某些情况下有效,但在大多数情况下,我没有得到所需的角度,有时需要一个负角度,它会产生一个正角度,最终茎在顶部。在其他一些情况下,它只是以一种可怕的方式失败</p>
<p>我的第二种方法是尝试根据3个点计算角度:图像中心、人口最少区域的重心和270º点。然后使用<code>arccos</code>函数,并将其结果转换为度</p>
<p>这两种方法对我来说都失败了</p>
<p><strong>问题</strong></p>
<ul>
<li>你认为这是一个正确的方法,还是我只是把事情弄得比我应该做的更复杂</李>
<li>我如何找到叶子的茎(这不是可选的,它必须是茎)?因为我的想法不太管用</李>
<li>如何以稳健的方式确定角度?因为第二个问题中的相同原因</李>
</ul>
<p>下面是一些示例和我得到的结果(二进制掩码)。矩形表示我正在比较的区域,穿过椭圆的红线是椭圆的长轴,粉色圆圈是最小区域内的质心,红色圆圈表示270º参考点(角度),白色圆点表示图像的中心</p>
<p><a href="https://i.stack.imgur.com/yApSj.png" rel="noreferrer"><img src="https://i.stack.imgur.com/yApSj.png" alt="Original image"/></a>
<a href="https://i.stack.imgur.com/JG5z4.png" rel="noreferrer"><img src="https://i.stack.imgur.com/JG5z4.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/QMr9v.png" rel="noreferrer"><img src="https://i.stack.imgur.com/QMr9v.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/eyCbk.png" rel="noreferrer"><img src="https://i.stack.imgur.com/eyCbk.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/3qLYc.png" rel="noreferrer"><img src="https://i.stack.imgur.com/3qLYc.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/5w46l.png" rel="noreferrer"><img src="https://i.stack.imgur.com/5w46l.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/PHJeK.png" rel="noreferrer"><img src="https://i.stack.imgur.com/PHJeK.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/TqXkM.png" rel="noreferrer"><img src="https://i.stack.imgur.com/TqXkM.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/dUNj5.png" rel="noreferrer"><img src="https://i.stack.imgur.com/dUNj5.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/hywsd.png" rel="noreferrer"><img src="https://i.stack.imgur.com/hywsd.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/aDtnr.png" rel="noreferrer"><img src="https://i.stack.imgur.com/aDtnr.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/2y3CQ.png" rel="noreferrer"><img src="https://i.stack.imgur.com/2y3CQ.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/Z3OBo.png" rel="noreferrer"><img src="https://i.stack.imgur.com/Z3OBo.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/llfy1.png" rel="noreferrer"><img src="https://i.stack.imgur.com/llfy1.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/gK2BE.png" rel="noreferrer"><img src="https://i.stack.imgur.com/gK2BE.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/Eenl0.png" rel="noreferrer"><img src="https://i.stack.imgur.com/Eenl0.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/JOljP.png" rel="noreferrer"><img src="https://i.stack.imgur.com/JOljP.png" alt="enter image description here"/></a></p>
<p><a href="https://i.stack.imgur.com/0h6NZ.png" rel="noreferrer"><img src="https://i.stack.imgur.com/0h6NZ.png" alt="enter image description here"/></a></p>
<p><strong>我当前的解决方案</strong></p>
<pre class="lang-py prettyprint-override"><code> def brightness_distortion(I, mu, sigma):
return np.sum(I*mu/sigma**2, axis=-1) / np.sum((mu/sigma)**2, axis=-1)
def chromacity_distortion(I, mu, sigma):
alpha = brightness_distortion(I, mu, sigma)[...,None]
return np.sqrt(np.sum(((I - alpha * mu)/sigma)**2, axis=-1))
def bwareafilt ( image ):
image = image.astype(np.uint8)
nb_components, output, stats, centroids = cv2.connectedComponentsWithStats(image, connectivity=4)
sizes = stats[:, -1]
max_label = 1
max_size = sizes[1]
for i in range(2, nb_components):
if sizes[i] > max_size:
max_label = i
max_size = sizes[i]
img2 = np.zeros(output.shape)
img2[output == max_label] = 255
return img2
def get_thresholded_rotated(im_path):
#read image
img = cv2.imread(im_path)
img = cv2.resize(img, (600, 800), interpolation = Image.BILINEAR)
sat = cv2.cvtColor(img, cv2.COLOR_BGR2HSV)[:,:,1]
val = cv2.cvtColor(img, cv2.COLOR_BGR2HSV)[:,:,2]
sat = cv2.medianBlur(sat, 11)
val = cv2.medianBlur(val, 11)
#create threshold
thresh_S = cv2.adaptiveThreshold(sat , 255, cv2.ADAPTIVE_THRESH_MEAN_C, cv2.THRESH_BINARY, 401, 10);
thresh_V = cv2.adaptiveThreshold(val , 255, cv2.ADAPTIVE_THRESH_MEAN_C, cv2.THRESH_BINARY, 401, 10);
#mean, std
mean_S, stdev_S = cv2.meanStdDev(img, mask = 255 - thresh_S)
mean_S = mean_S.ravel().flatten()
stdev_S = stdev_S.ravel()
#chromacity
chrom_S = chromacity_distortion(img, mean_S, stdev_S)
chrom255_S = cv2.normalize(chrom_S, chrom_S, alpha=0, beta=255, norm_type=cv2.NORM_MINMAX).astype(np.uint8)[:,:,None]
mean_V, stdev_V = cv2.meanStdDev(img, mask = 255 - thresh_V)
mean_V = mean_V.ravel().flatten()
stdev_V = stdev_V.ravel()
chrom_V = chromacity_distortion(img, mean_V, stdev_V)
chrom255_V = cv2.normalize(chrom_V, chrom_V, alpha=0, beta=255, norm_type=cv2.NORM_MINMAX).astype(np.uint8)[:,:,None]
#create different thresholds
thresh2_S = cv2.adaptiveThreshold(chrom255_S , 255, cv2.ADAPTIVE_THRESH_MEAN_C, cv2.THRESH_BINARY, 401, 10);
thresh2_V = cv2.adaptiveThreshold(chrom255_V , 255, cv2.ADAPTIVE_THRESH_MEAN_C, cv2.THRESH_BINARY, 401, 10);
#thresholded image
thresh = cv2.bitwise_and(thresh2_S, cv2.bitwise_not(thresh2_V))
#find countours and keep max
contours = cv2.findContours(thresh, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
contours = contours[0] if len(contours) == 2 else contours[1]
big_contour = max(contours, key=cv2.contourArea)
# fit ellipse to leaf contours
ellipse = cv2.fitEllipse(big_contour)
(xc,yc), (d1,d2), angle = ellipse
print('thresh shape: ', thresh.shape)
#print(xc,yc,d1,d2,angle)
rmajor = max(d1,d2)/2
rminor = min(d1,d2)/2
origi_angle = angle
if angle > 90:
angle = angle - 90
else:
angle = angle + 90
#calc major axis line
xtop = xc + math.cos(math.radians(angle))*rmajor
ytop = yc + math.sin(math.radians(angle))*rmajor
xbot = xc + math.cos(math.radians(angle+180))*rmajor
ybot = yc + math.sin(math.radians(angle+180))*rmajor
#calc minor axis line
xtop_m = xc + math.cos(math.radians(origi_angle))*rminor
ytop_m = yc + math.sin(math.radians(origi_angle))*rminor
xbot_m = xc + math.cos(math.radians(origi_angle+180))*rminor
ybot_m = yc + math.sin(math.radians(origi_angle+180))*rminor
#determine which region is up and which is down
if max(xtop, xbot) == xtop :
x_tij = xtop
y_tij = ytop
x_b_tij = xbot
y_b_tij = ybot
else:
x_tij = xbot
y_tij = ybot
x_b_tij = xtop
y_b_tij = ytop
if max(xtop_m, xbot_m) == xtop_m :
x_tij_m = xtop_m
y_tij_m = ytop_m
x_b_tij_m = xbot_m
y_b_tij_m = ybot_m
else:
x_tij_m = xbot_m
y_tij_m = ybot_m
x_b_tij_m = xtop_m
y_b_tij_m = ytop_m
print('-----')
print(x_tij, y_tij)
rect_size = 100
"""
calculate regions of edges of major axis of ellipse
this is done by creating a squared region of rect_size x rect_size, being the edge the center of the square
"""
x_min_tij = int(0 if x_tij - rect_size < 0 else x_tij - rect_size)
x_max_tij = int(thresh.shape[1]-1 if x_tij + rect_size > thresh.shape[1] else x_tij + rect_size)
y_min_tij = int(0 if y_tij - rect_size < 0 else y_tij - rect_size)
y_max_tij = int(thresh.shape[0] - 1 if y_tij + rect_size > thresh.shape[0] else y_tij + rect_size)
x_b_min_tij = int(0 if x_b_tij - rect_size < 0 else x_b_tij - rect_size)
x_b_max_tij = int(thresh.shape[1] - 1 if x_b_tij + rect_size > thresh.shape[1] else x_b_tij + rect_size)
y_b_min_tij = int(0 if y_b_tij - rect_size < 0 else y_b_tij - rect_size)
y_b_max_tij = int(thresh.shape[0] - 1 if y_b_tij + rect_size > thresh.shape[0] else y_b_tij + rect_size)
sum_red_region = np.sum(thresh[y_min_tij:y_max_tij, x_min_tij:x_max_tij])
sum_yellow_region = np.sum(thresh[y_b_min_tij:y_b_max_tij, x_b_min_tij:x_b_max_tij])
"""
calculate regions of edges of minor axis of ellipse
this is done by creating a squared region of rect_size x rect_size, being the edge the center of the square
"""
x_min_tij_m = int(0 if x_tij_m - rect_size < 0 else x_tij_m - rect_size)
x_max_tij_m = int(thresh.shape[1]-1 if x_tij_m + rect_size > thresh.shape[1] else x_tij_m + rect_size)
y_min_tij_m = int(0 if y_tij_m - rect_size < 0 else y_tij_m - rect_size)
y_max_tij_m = int(thresh.shape[0] - 1 if y_tij_m + rect_size > thresh.shape[0] else y_tij_m + rect_size)
x_b_min_tij_m = int(0 if x_b_tij_m - rect_size < 0 else x_b_tij_m - rect_size)
x_b_max_tij_m = int(thresh.shape[1] - 1 if x_b_tij_m + rect_size > thresh.shape[1] else x_b_tij_m + rect_size)
y_b_min_tij_m = int(0 if y_b_tij_m - rect_size < 0 else y_b_tij_m - rect_size)
y_b_max_tij_m = int(thresh.shape[0] - 1 if y_b_tij_m + rect_size > thresh.shape[0] else y_b_tij_m + rect_size)
#value of the regions, the names of the variables are related to the color of the rectangles drawn at the end of the function
sum_red_region_m = np.sum(thresh[y_min_tij_m:y_max_tij_m, x_min_tij_m:x_max_tij_m])
sum_yellow_region_m = np.sum(thresh[y_b_min_tij_m:y_b_max_tij_m, x_b_min_tij_m:x_b_max_tij_m])
#print(sum_red_region, sum_yellow_region, sum_red_region_m, sum_yellow_region_m)
min_arg = np.argmin(np.array([sum_red_region, sum_yellow_region, sum_red_region_m, sum_yellow_region_m]))
print('min: ', min_arg)
if min_arg == 1: #sum_yellow_region < sum_red_region :
left_quartile = x_b_tij < thresh.shape[0] /2
upper_quartile = y_b_tij < thresh.shape[1] /2
center_x = x_b_min_tij + ((x_b_max_tij - x_b_min_tij) / 2)
center_y = y_b_min_tij + (y_b_max_tij - y_b_min_tij / 2)
center_x = x_b_min_tij + np.argmax(thresh[y_b_min_tij:y_b_max_tij, x_b_min_tij:x_b_max_tij].mean(axis=0))
center_y = y_b_min_tij + np.argmax(thresh[y_b_min_tij:y_b_max_tij, x_b_min_tij:x_b_max_tij].mean(axis=1))
elif min_arg == 0:
left_quartile = x_tij < thresh.shape[0] /2
upper_quartile = y_tij < thresh.shape[1] /2
center_x = x_min_tij + ((x_b_max_tij - x_b_min_tij) / 2)
center_y = y_min_tij + ((y_b_max_tij - y_b_min_tij) / 2)
center_x = x_min_tij + np.argmax(thresh[y_min_tij:y_max_tij, x_min_tij:x_max_tij].mean(axis=0))
center_y = y_min_tij + np.argmax(thresh[y_min_tij:y_max_tij, x_min_tij:x_max_tij].mean(axis=1))
elif min_arg == 3:
left_quartile = x_b_tij_m < thresh.shape[0] /2
upper_quartile = y_b_tij_m < thresh.shape[1] /2
center_x = x_b_min_tij_m + ((x_b_max_tij_m - x_b_min_tij_m) / 2)
center_y = y_b_min_tij_m + (y_b_max_tij_m - y_b_min_tij_m / 2)
center_x = x_b_min_tij_m + np.argmax(thresh[y_b_min_tij_m:y_b_max_tij_m, x_b_min_tij_m:x_b_max_tij_m].mean(axis=0))
center_y = y_b_min_tij_m + np.argmax(thresh[y_b_min_tij_m:y_b_max_tij_m, x_b_min_tij_m:x_b_max_tij_m].mean(axis=1))
else:
left_quartile = x_tij_m < thresh.shape[0] /2
upper_quartile = y_tij_m < thresh.shape[1] /2
center_x = x_min_tij_m + ((x_b_max_tij_m - x_b_min_tij_m) / 2)
center_y = y_min_tij_m + ((y_b_max_tij_m - y_b_min_tij_m) / 2)
center_x = x_min_tij_m + np.argmax(thresh[y_min_tij_m:y_max_tij_m, x_min_tij_m:x_max_tij_m].mean(axis=0))
center_y = y_min_tij_m + np.argmax(thresh[y_min_tij_m:y_max_tij_m, x_min_tij_m:x_max_tij_m].mean(axis=1))
# draw ellipse on copy of input
result = img.copy()
cv2.ellipse(result, ellipse, (0,0,255), 1)
cv2.line(result, (int(xtop),int(ytop)), (int(xbot),int(ybot)), (255, 0, 0), 1)
cv2.circle(result, (int(xc),int(yc)), 10, (255, 255, 255), -1)
cv2.circle(result, (int(center_x),int(center_y)), 10, (255, 0, 255), 5)
cv2.circle(result, (int(thresh.shape[1] / 2),int(thresh.shape[0] - 1)), 10, (255, 0, 0), 5)
cv2.rectangle(result,(x_min_tij,y_min_tij),(x_max_tij,y_max_tij),(255,0,0),3)
cv2.rectangle(result,(x_b_min_tij,y_b_min_tij),(x_b_max_tij,y_b_max_tij),(255,255,0),3)
cv2.rectangle(result,(x_min_tij_m,y_min_tij_m),(x_max_tij_m,y_max_tij_m),(255,0,0),3)
cv2.rectangle(result,(x_b_min_tij_m,y_b_min_tij_m),(x_b_max_tij_m,y_b_max_tij_m),(255,255,0),3)
plt.imshow(result)
plt.figure()
#rotate the image
rot_img = Image.fromarray(thresh)
#180
bot_point_x = int(thresh.shape[1] / 2)
bot_point_y = int(thresh.shape[0] - 1)
#poi
poi_x = int(center_x)
poi_y = int(center_y)
#image_center
im_center_x = int(thresh.shape[1] / 2)
im_center_y = int(thresh.shape[0] - 1) / 2
#a - adalt, b - abaix, c - dreta
#ba = a - b
#bc = c - a(b en realitat)
ba = np.array([im_center_x, im_center_y]) - np.array([bot_point_x, bot_point_y])
bc = np.array([poi_x, poi_y]) - np.array([im_center_x, im_center_y])
#angle 3 punts
cosine_angle = np.dot(ba, bc) / (np.linalg.norm(ba) * np.linalg.norm(bc))
cos_angle = np.arccos(cosine_angle)
cos_angle = np.degrees(cos_angle)
print('cos angle: ', cos_angle)
print('print: ', abs(poi_x- bot_point_x))
m = (int(thresh.shape[1] / 2)-int(center_x) / int(thresh.shape[0] - 1)-int(center_y))
ttan = math.tan(m)
theta = math.atan(ttan)
print('theta: ', theta)
result = Image.fromarray(result)
result = result.rotate(cos_angle)
plt.imshow(result)
plt.figure()
#rot_img = rot_img.rotate(origi_angle)
rot_img = rot_img.rotate(cos_angle)
return rot_img
rot_img = get_thresholded_rotated(im_path)
plt.imshow(rot_img)
</code></pre>
<p>提前谢谢
<strong>---编辑---</strong></p>
<p>我根据要求在这里留下一些原始图像。
<a href="https://i.stack.imgur.com/e7FsS.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/e7FsS.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/OyVV4.jpg" rel="noreferrer">sample</a></p>
<p><a href="https://i.stack.imgur.com/4eaQU.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/4eaQU.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/wmScL.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/wmScL.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/qu3BG.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/qu3BG.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/WwUlW.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/WwUlW.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/nbF1b.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/nbF1b.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/xhAN9.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/xhAN9.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/gb8Op.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/gb8Op.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/Zlg9F.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/Zlg9F.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/D4Pxu.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/D4Pxu.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/1ypHk.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/1ypHk.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/hiIhk.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/hiIhk.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/XUFWN.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/XUFWN.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/T9bbI.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/T9bbI.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/4MR5r.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/4MR5r.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/dxZzX.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/dxZzX.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/KfguT.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/KfguT.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/4Bfk3.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/4Bfk3.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/czRWY.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/czRWY.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/nEc2z.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/nEc2z.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/l2tHX.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/l2tHX.jpg" alt="sample"/></a></p>
<p><a href="https://i.stack.imgur.com/eMIdD.jpg" rel="noreferrer"><img src="https://i.stack.imgur.com/eMIdD.jpg" alt="sample"/></a></p>