Tensorflow：如何在QueueRunn中使用“新”数据集API

1条回答

网友

1楼 · 发布于 2024-10-06 13:35:06

如果使用数据集API，则不必使用QueueRunner来拥有队列/缓冲区。可以使用数据集API创建队列/缓冲区，并对数据进行预处理并并行训练网络。如果有数据集，可以使用prefetch function或shuffle function创建队列/缓冲区。在

有关更多信息，请参阅official tutorial on the Dataset API。在

以下是在CPU上使用预处理的预取缓冲区的示例：

 NUM_THREADS = 8
 BUFFER_SIZE = 100

 data = ...
 labels = ...
 inputs = (data, labels)

 def pre_processing(data_, labels_):
     with tf.device("/cpu:0"):
         # do some pre-processing here
         return data_, labels_

 dataset_source = tf.data.Dataset.from_tensor_slices(inputs)
 dataset = dataset_source.map(pre_processing, num_parallel_calls=NUM_THREADS)

 dataset = dataset.repeat(1)  # repeats for one epoch
 dataset = dataset.prefetch(BUFFER_SIZE)

 iterator = tf.data.Iterator.from_structure(dataset.output_types,
                                            dataset.output_shapes)
 next_element = iterator.get_next()
 init_op = iterator.make_initializer(dataset)

 with tf.Session() as sess:
     sess.run(init_op)
     while True:
         try:
             sess.run(next_element)
         except tf.errors.OutOfRangeError:
             break

相关问题更多 >

编程相关推荐

热门问题

热门文章

Tensorflow：如何在QueueRunn中使用“新”数据集API

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >