
To train a model, I have wrapped it in a class. I use a tf.RandomShuffleQueue to enqueue a list of filenames. However, when I dequeue elements, the size of the queue does not decrease: dequeuing from the RandomShuffleQueue does not reduce its size.

Here are my specific questions about the code snippet below:

  1. If I have, for example, only 5 images, but the step range goes up to 100, will this result in addfilenames being called multiple times automatically? It does not give any dequeue error, so I think it is being called automatically.
  2. Why does the size of the tf.RandomShuffleQueue never change? It stays constant.

    import os 
    import time 
    import functools 
    import tensorflow as tf 
    from Read_labelclsloc import readlabel 
    
    
    def ReadTrain(traindir): 
        # Returns a list of training images, their labels and a dictionary. 
        # The dictionary maps label names to integer numbers.                  
        return trainimgs, trainlbls, classdict 
    
    
    def ReadVal(valdir, classdict): 
        # Reads the validation image labels. 
        # Returns a dictionary with filenames as keys and 
        # corresponding labels as values. 
        return valdict 
    
    def lazy_property(function): 
        # Just a decorator to make sure that on repeated calls to 
        # member functions, ops don't get created repeatedly. 
        # Acknowledgements : https://danijar.com/structuring-your-tensorflow-models/ 
        attribute = '_cache_' + function.__name__ 
        @property 
        @functools.wraps(function) 
        def decorator(self): 
         if not hasattr(self, attribute): 
          setattr(self, attribute, function(self)) 
         return getattr(self, attribute) 
    
        return decorator  
    
    class ModelInitial: 
    
        def __init__(self, traindir, valdir): 
         self.graph 
         self.traindir = traindir 
         self.valdir = valdir 
         self.traininginfo() 
         self.epoch = 0 
    
    
    
        def traininginfo(self): 
         self.trainimgs, self.trainlbls, self.classdict = ReadTrain(self.traindir) 
         self.valdict = ReadVal(self.valdir, self.classdict) 
         with self.graph.as_default(): 
          self.trainimgs_tensor = tf.constant(self.trainimgs) 
          self.trainlbls_tensor = tf.constant(self.trainlbls, dtype=tf.uint16) 
          self.trainimgs_dict = {} 
          self.trainimgs_dict["ImageFile"] = self.trainimgs_tensor 
         return None 
    
        @lazy_property 
        def graph(self): 
         g = tf.Graph() 
         with g.as_default(): 
          # Layer definitions go here 
          pass 
         return g 
    
    
        @lazy_property 
        def addfilenames(self): 
         # This is the function where filenames are pushed to a RandomShuffleQueue. 
         filename_queue = tf.RandomShuffleQueue(capacity=len(self.trainimgs), min_after_dequeue=0,\ 
                   dtypes=[tf.string], names=["ImageFile"],\ 
                   seed=0, name="filename_queue") 
    
         sz_op = filename_queue.size() 
    
         dq_op = filename_queue.dequeue() 
    
         enq_op = filename_queue.enqueue_many(self.trainimgs_dict) 
         return filename_queue, enq_op, sz_op, dq_op 
    
        def Train(self): 
         # The function for training. 
         # I have not written the training part yet. 
         # Still struggling with preprocessing. 
         with self.graph.as_default(): 
          filename_q, filename_enqueue_op, sz_op, dq_op = self.addfilenames 
    
          qr = tf.train.QueueRunner(filename_q, [filename_enqueue_op]) 
          filename_dequeue_op = filename_q.dequeue() 
          init_op = tf.global_variables_initializer() 
    
         sess = tf.Session(graph=self.graph) 
         sess.run(init_op) 
         coord = tf.train.Coordinator() 
         enq_threads = qr.create_threads(sess, coord=coord, start=True) 
         counter = 0 
         for step in range(100): 
          print(sess.run(dq_op["ImageFile"])) 
          print("Epoch = %d "%(self.epoch)) 
          print("size = %d"%(sess.run(sz_op))) 
          counter+=1 
    
         names = [n.name for n in self.graph.as_graph_def().node] 
         coord.request_stop() 
         coord.join(enq_threads) 
         print("Counter = %d"%(counter)) 
         return None 
    
    
    
    
    
    if __name__ == "__main__": 
        modeltrain = ModelInitial(<Path to training images>,\ 
                <Path to validation images>) 
        a = modeltrain.graph 
        print(a) 
        modeltrain.Train() 
        print("Success") 
    

Answer


This is caused by the tf.train.QueueRunner that you created for your queue, which fills it in the background.

  1. The following lines cause a background "queue runner" thread to be created:

    qr = tf.train.QueueRunner(filename_q, [filename_enqueue_op]) 
    # ... 
    enq_threads = qr.create_threads(sess, coord=coord, start=True) 
    

    This thread calls filename_enqueue_op in a loop, which causes the queue to be refilled as you remove elements from it.

  2. The background thread from step 1 will almost always have a pending enqueue operation (filename_enqueue_op) on the queue. This means that after you dequeue a filename, the pending enqueue will run and fill the queue back up to capacity. (Technically there is a race condition here, and you could observe a size of capacity - 1, but this is quite unlikely.)
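
A minimal standalone sketch of the effect described above (assuming the TensorFlow 1.x queue API; the five-element filename list is a made-up stand-in for the asker's data): the queue-runner thread keeps a pending enqueue on the queue, so the reported size snaps back to capacity after every dequeue.

    import tensorflow as tf

    # Toy stand-in for the asker's filename list.
    filenames = ["img%d.jpg" % i for i in range(5)]

    q = tf.RandomShuffleQueue(capacity=len(filenames), min_after_dequeue=0,
                              dtypes=[tf.string], seed=0)
    enq_op = q.enqueue_many([filenames])
    dq_op = q.dequeue()
    sz_op = q.size()

    # The queue runner spawns a thread that calls enq_op in a loop.
    qr = tf.train.QueueRunner(q, [enq_op])
    with tf.Session() as sess:
        coord = tf.train.Coordinator()
        threads = qr.create_threads(sess, coord=coord, start=True)
        for step in range(20):
            name = sess.run(dq_op)
            # The pending enqueue fills the freed slot almost immediately,
            # so size() reports (nearly) full capacity every time.
            print(step, name, sess.run(sz_op))
        coord.request_stop()
        # Closing the queue cancels the blocked enqueue so the thread can exit.
        sess.run(q.close(cancel_pending_enqueues=True))
        coord.join(threads)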


Thanks. I was expecting this too. I find 'tf.train.QueueRunner' quite restrictive: I have to use an 'enqueue_op' and cannot put a generic function there. I would like to keep strict track of the current 'epoch' by incrementing it after each full 'enqueue_many' finishes. How can I achieve that? Unfortunately, I don't see any such functionality in 'QueueRunner'; if it exists, it isn't properly documented. Dataset handling and preprocessing are currently TensorFlow's biggest headache. Any suggestions? – Ujjwal


@Ujjwal, the queue-runner logic is quite small; I sometimes find it useful to skip the queue runners and run the queue ops myself from a separate Python thread. That gives you more control; see some explanations at https://github.com/yaroslavvb/stuff/tree/master/queues_talk –
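
A rough sketch of that manual-thread approach, which also addresses the epoch-tracking question from the earlier comment (assuming the TensorFlow 1.x API; the boxed epoch counter and worker structure are illustrative, not taken from the linked talk):

    import threading
    import tensorflow as tf

    filenames = ["img%d.jpg" % i for i in range(5)]  # made-up stand-in data

    q = tf.RandomShuffleQueue(capacity=len(filenames), min_after_dequeue=0,
                              dtypes=[tf.string], seed=0)
    enq_op = q.enqueue_many([filenames])
    close_op = q.close(cancel_pending_enqueues=True)
    dq_op = q.dequeue()

    sess = tf.Session()
    coord = tf.train.Coordinator()
    epoch = [0]  # boxed int so the worker thread can update it in place

    def enqueue_worker():
        try:
            while not coord.should_stop():
                # One completed enqueue_many pushes the whole list exactly
                # once, so it marks one epoch (a hook QueueRunner does not
                # offer). Note: the counter advances when the push completes,
                # not when the elements are consumed.
                sess.run(enq_op)
                epoch[0] += 1
        except tf.errors.CancelledError:
            pass  # the queue was closed during shutdown

    t = threading.Thread(target=enqueue_worker)
    t.start()
    for step in range(12):
        print(sess.run(dq_op), "epoch =", epoch[0])
    coord.request_stop()
    sess.run(close_op)  # unblock a pending enqueue, if any
    coord.join([t])

Counting after each completed 'enqueue_many' gives an exact epoch boundary, which is exactly the kind of control a plain Python thread offers and 'QueueRunner' does not.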


Yeah. I finally found a solution using a class structure and the threading module. It is cleaner than QueueRunner and gives more control. In fact, for complex preprocessing, QueueRunners proved useless. On GitHub I saw a new input pipeline being offered for version 2.0. TensorFlow really needs a complete overhaul of its input pipeline. – Ujjwal