Expected TensorFlow model size from learned variables

# Store layers weight & bias
weights = {
    # 5x5 conv, 1 input, 32 outputs
    'wc1': tf.Variable(tf.random_normal([5, 5, 1, 32]), dtype=tf.float32),
    # 5x5 conv, 32 inputs, 64 outputs
    'wc2': tf.Variable(tf.random_normal([5, 5, 32, 64]), dtype=tf.float32),
    # fully connected, 7*7*64 inputs, 1024 outputs
    'wd1': tf.Variable(tf.random_normal([7*7*64, 1024]), dtype=tf.float32),
    # 1024 inputs, 10 outputs (class prediction)
    'out': tf.Variable(tf.random_normal([1024, num_classes]), dtype=tf.float32)
}

biases = {
    'bc1': tf.Variable(tf.random_normal([32]), dtype=tf.float32),
    'bc2': tf.Variable(tf.random_normal([64]), dtype=tf.float32),
    'bd1': tf.Variable(tf.random_normal([1024]), dtype=tf.float32),
    'out': tf.Variable(tf.random_normal([num_classes]), dtype=tf.float32)
}

1 answer

User

#1 · Posted on 2024-05-19 22:25:51

Adding up all those variables we would expect to get a model.ckpt.data file of size 12.45Mb

Typically, most of the model parameters sit in the first fully connected layer, in this case wd1. Computing its size alone gives:

7*7*128 * 1024 * 4 = 25690112

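... or 25.6 MB. Note the factor of 4: the variables have dtype=tf.float32, i.e. 4 bytes per parameter. The other layers also contribute to the model size, but not nearly as much.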

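As you can see, your estimate of 12.45 MB is a bit off (did you assume 16 bits per parameter?). The checkpoint also stores some general information, hence an overhead of around 25%, which is still large, but not 300%.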
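The arithmetic above can be sketched as a small helper. This is only an illustration: the name variable_bytes is hypothetical, and it counts raw parameter storage only, not any checkpoint metadata.

```python
from math import prod

BYTES_PER_FLOAT32 = 4  # tf.float32 stores each parameter in 4 bytes


def variable_bytes(shape, bytes_per_param=BYTES_PER_FLOAT32):
    """Raw storage size in bytes of a dense variable with the given shape.

    Hypothetical helper for illustration; it ignores checkpoint metadata.
    """
    return prod(shape) * bytes_per_param


# Shape used in the calculation above for the first fully connected layer.
wd1_bytes = variable_bytes([7 * 7 * 128, 1024])
print(wd1_bytes)  # 25690112
```

Passing a different shape or bytes_per_param (for example 2 for a 16-bit dtype) shows how strongly the estimate depends on those two inputs.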

[Update]

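The model in question actually has an FC1 layer of shape [7*7*64, 1024], as was clarified. So the size calculated above should be about 12.5 MB. This made me look at the saved checkpoint more carefully.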

After inspecting it, I noticed other big variables that I had originally missed:

[checkpoint variable listing not preserved in the source]

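Variable_2 is exactly wd1, but there are 2 more copies of it for the Adam optimizer. These variables are created by the Adam optimizer; they are called slots and hold the m and v accumulators for all trainable variables. Now the total size makes sense.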
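As a rough cross-check of the slot explanation, the sketch below adds up the shapes from the question and multiplies by three (each variable plus its m and v slots). It assumes num_classes = 10, taken from the "10 outputs" comment in the question, and it ignores any small extra variables the optimizer may create, so treat the result as an estimate.

```python
from math import prod

# Assumption: num_classes is 10, per the "10 outputs" comment in the question.
NUM_CLASSES = 10

# Shapes copied from the question's weights and biases dictionaries.
SHAPES = {
    'wc1': [5, 5, 1, 32],
    'wc2': [5, 5, 32, 64],
    'wd1': [7 * 7 * 64, 1024],
    'out_w': [1024, NUM_CLASSES],
    'bc1': [32],
    'bc2': [64],
    'bd1': [1024],
    'out_b': [NUM_CLASSES],
}

BYTES_PER_FLOAT32 = 4
COPIES_WITH_ADAM = 3  # the variable itself plus Adam's m and v slots
MIB = 1024 ** 2

trainable_bytes = sum(prod(shape) for shape in SHAPES.values()) * BYTES_PER_FLOAT32
total_bytes = trainable_bytes * COPIES_WITH_ADAM

print('trainable only: %.2f MB' % (trainable_bytes / MIB))
print('with Adam slots: %.2f MB' % (total_bytes / MIB))
```

The trainable variables alone come to roughly 12.5 MB, and tripling them lands within rounding of the 37.47 MB total reported for the graph.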

You can run the following code to compute the total size of the graph variables, which comes to 37.47 MB:

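import numpy as np
import tensorflow as tf
# Bytes per variable: number of elements times the size of its dtype.
var_sizes = [np.prod(list(map(int, v.shape))) * v.dtype.size
             for v in tf.get_collection(tf.GraphKeys.GLOBAL_VARIABLES)]
print(sum(var_sizes) / (1024 ** 2), 'MB')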

So the overhead is actually quite small. The extra size is due to the optimizer.
