How do I apply tf.control_dependencies() correctly when training a GAN-like graph with multiple networks?

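I think I can generalize this question to: "How do I use batch normalization when I have two distinct networks?"

I am training what is essentially a GAN, where both the discriminator and the generator have batch norm layers. It differs from the usual GAN setup in that each network has its own loss function, completely independent of the other. The second network is basically only used to measure how "wrong" the generator was at its task, but the two networks should be updated completely independently.

Both networks are defined on separate GPUs because they are quite large. I place each network on its GPU and assign the dependencies with the following code:

<pre><code># Generator (uNet2D) on the first GPU
with tf.device("/gpu:0"):
    pred = uNet2D(X, BETA[j], KERNEL_SIZE, is_training)
    cost = tf.reduce_sum(tf.nn.sigmoid_cross_entropy_with_logits(labels=tf.reshape(Y,[-1]),logits=tf.reshape(pred,[-1])))
    update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
    with tf.control_dependencies(update_ops):
        optimizer = tf.train.AdamOptimizer(learning_rate=LR[i]).minimize(W*cost)

# Discriminator (attentionNetwork) on the second GPU
with tf.device("/gpu:1"):
    attention = attentionNetwork(X_ATTN, BETA[j], KERNEL_SIZE, is_training)
    cost_d = tf.reduce_sum(tf.nn.sigmoid_cross_entropy_with_logits(labels=Y_ATTN,logits=attention))
    update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
    with tf.control_dependencies(update_ops):
        optimizer_d = tf.train.AdamOptimizer(learning_rate=0.2*LR[i]).minimize(cost_d)
</code></pre>

I am a little worried, though, because my TensorBoard graph suggests that the output of the uNet (my generator) is an input to the gradients used to update the attentionNetwork (my discriminator).

Can anyone help me decide how to structure these blocks? I am also worried that optimizing the attentionNetwork requires feeding the placeholders defined in uNet2D() and cost on gpu:0.

Thanks! My TensorBoard graph is attached below.

<a href="https://i.stack.imgur.com/8GgVd.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/8GgVd.png" alt="enter image description here"/></a>

EDIT: When I run this without batch norm, and therefore without control_dependencies(), I get a TensorBoard graph that looks like this, which I am pretty sure is what I want.

<a href="https://i.stack.imgur.com/YfinL.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/YfinL.png" alt="enter image description here"/></a>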
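One likely cause of the unexpected edge in the graph: `tf.get_collection(tf.GraphKeys.UPDATE_OPS)` called without a `scope` argument returns every update op registered so far, so the second call also picks up the generator's batch norm updates and makes `optimizer_d` depend on them. Below is a minimal sketch of one possible fix, assuming each network builds its layers under its own variable scope. The `toy_net` helper, the scope names `generator` and `discriminator`, and the hand-rolled moving-mean update are hypothetical stand-ins for `uNet2D`, `attentionNetwork`, and real batch norm layers. Device placement is omitted so the sketch runs on any machine.

```python
import tensorflow.compat.v1 as tf

tf.disable_eager_execution()  # TF1-style graph mode, as in the question


def toy_net(x, scope):
    """Hypothetical stand-in for uNet2D / attentionNetwork.

    Holds one trainable weight and a batch-norm-style moving mean whose
    update op is registered in the UPDATE_OPS collection.
    """
    with tf.variable_scope(scope):
        w = tf.get_variable("w", shape=[], initializer=tf.ones_initializer())
        moving_mean = tf.get_variable(
            "moving_mean", shape=[], initializer=tf.zeros_initializer(),
            trainable=False)
        batch_mean = tf.reduce_mean(x)
        update = tf.assign(
            moving_mean, 0.9 * moving_mean + 0.1 * batch_mean,
            name="update_moving_mean")
        # Register the update the way batch norm layers do.
        tf.add_to_collection(tf.GraphKeys.UPDATE_OPS, update.op)
        return w * (x - batch_mean), moving_mean


def make_train_op(loss, scope, learning_rate):
    """Build a train op tied only to the update ops and variables in `scope`."""
    update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS, scope=scope)
    var_list = tf.get_collection(tf.GraphKeys.TRAINABLE_VARIABLES, scope=scope)
    with tf.control_dependencies(update_ops):
        return tf.train.AdamOptimizer(learning_rate).minimize(
            loss, var_list=var_list)


x_g = tf.placeholder(tf.float32, [None], name="x_g")
x_d = tf.placeholder(tf.float32, [None], name="x_d")

out_g, mean_g = toy_net(x_g, "generator")
out_d, mean_d = toy_net(x_d, "discriminator")

train_g = make_train_op(tf.reduce_sum(tf.square(out_g)), "generator", 1e-3)
train_d = make_train_op(tf.reduce_sum(tf.square(out_d)), "discriminator", 2e-4)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    # Only the generator placeholder is fed. This succeeds because train_g
    # does not depend on the discriminator's update ops or its placeholder.
    sess.run(train_g, feed_dict={x_g: [1.0, 2.0, 3.0]})
    g_mean, d_mean = sess.run([mean_g, mean_d])

print("generator moving mean:", g_mean)
print("discriminator moving mean:", d_mean)
```

With the real networks, the same filtering would apply as long as `uNet2D` and `attentionNetwork` create their layers under distinct scopes. Passing `var_list` additionally keeps each optimizer from touching the other network's weights.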
