The bug about using hooks and MirroredStrategy in tf.estimator.Estimator

阿新 • • 發佈：2018-12-23

When I was using MirroedStrategy in my tf.estimator.Estimator:

Python

distribution = tf.contrib.distribute.MirroredStrategy(
      ["/device:GPU:0", "/device:GPU:1"])
config = tf.estimator.RunConfig(train_distribute=distribution,
                                  eval_distribute=distribution)
estimator = tf.estimator.Estimator(
      model_fn=build_model_fn_optimizer(), config=config)
estimator.train(input_fn=input_fn, steps=10)

1234567

distribution=tf.contrib.distribute.MirroredStrategy(["/device:GPU:0","/device:GPU:1"])config=tf.estimator.RunConfig(train_distribute=distribution,eval_distribute=distribution)estimator=tf.estimator.Estimator(model_fn=build_model_fn_optimizer(),config=

config)estimator.train(input_fn=input_fn,steps=10)

and add hooks for training:

Python

logging_hook = tf.train.LoggingTensorHook({'logits' : logits})
    return tf.estimator.EstimatorSpec(mode, loss=loss_fn(), train_op=train_op, training_hooks = [logging_hook])

12	logging_hook=tf.train.LoggingTensorHook({'logits':logits})returntf.estimator.EstimatorSpec(mode,loss=loss_fn(),train_op=train_op,training_hooks=[logging_hook])

The tensorflow report errors:

  File "/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py", line 356, in train
    loss = self._train_model(input_fn, hooks, saving_listeners)
  File "/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py", line 1179, in _train_model
    return self._train_model_distributed(input_fn, hooks, saving_listeners)
  File "/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py", line 1309, in _train_model_distributed
    grouped_estimator_spec.training_hooks)
  File "/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py", line 1305, in get_hooks_from_the_first_device
    for per_device_hook in per_device_hooks
  File "/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py", line 1305, in <listcomp>
    for per_device_hook in per_device_hooks
AttributeError: 'Estimator' object has no attribute '_distribution'

1234567891011

File"/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py",line356,intrainloss=self._train_model(input_fn,hooks,saving_listeners)File"/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py",line1179,in_train_modelreturnself._train_model_distributed(input_fn,hooks,saving_listeners)File"/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py",line1309,in_train_model_distributedgrouped_estimator_spec.training_hooks)File"/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py",line1305,inget_hooks_from_the_first_deviceforper_device_hook inper_device_hooksFile"/usr/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py",line1305,in<listcomp>forper_device_hook inper_device_hooksAttributeError:'Estimator'objecthas no attribute'_distribution'

Without finding any answers on google, I have to look into the code of ‘estimator.py’ in tensorflow. Fortunately, the code defect is obvious:

Python

        scaffold = _combine_distributed_scaffold(
            grouped_estimator_spec.scaffold, self._train_distribution)

        # TODO(yuefengz): add a test for unwrapping per_device_hooks.
        def get_hooks_from_the_first_device(per_device_hooks):
          return [
              self._distribution.unwrap(per_device_hook)[0]
              for per_device_hook in per_device_hooks
          ]
            
        training_hooks = get_hooks_from_the_first_device(
            grouped_estimator_spec.training_hooks)

123456789101112

scaffold=_combine_distributed_scaffold(grouped_estimator_spec.scaffold,self._train_distribution)# TODO(yuefengz): add a test for unwrapping per_device_hooks.defget_hooks_from_the_first_device(per_device_hooks):return[self._distribution.unwrap(per_device_hook)[0]forper_device_hook inper_device_hooks]training_hooks=get_hooks_from_the_first_device(grouped_estimator_spec.training_hooks)

class Estimator havn’t any private argument named ‘_distribution’ but only have ‘_train_distribution’ and ‘_eval_distribution’. So the fix is just change ‘self._distribution.unwrap(per_device_hook)[0]’ to ‘self._train_distribution.unwrap(per_device_hook)[0]’.

I had submitted a request pull for tensorflow to fix this bug in branch 1.11

The bug about using hooks and MirroredStrategy in tf.estimator.Estimator

Related

The bug about using hooks and MirroredStrategy in tf.estimator.Estimator

About the diffrence of wait timed_wait and block in java

What's the difference between using “let” and “var” to declare a variable in JavaScript?

The issue about using Git bash for Docker in window

Transfer learning & The art of using Pre-trained Models in Deep Learning

The fusion of AI, ML, and Voice in the Contact Center

Everyone Is Missing the Point About Brian Wansink and P

Train your own ML model using Scikit and use in iOS app with CoreML (and probably with Augmented…

3 Ways to Enhance the Customer Experience Using AI and Machine Learning

A comprehensive Machine Learning workflow with multiple modelling using caret and caretEnsemble in…

Getting Started With the Slack API Using Python and Flask

[Preact] Use State and Props in the Component Render Function

The confusion about jsp four scopes and ServletContext,HttpSession,HttpServletReqest,PageContext

The Usage of Lambda and Heap in the C++ STL

Qualcomm platform, the commonly used parameters of charger and battery in device tree file

Edge-assisted Tra?ic Engineering and applications in the IoT

What is the difference between static func and class func in Swift?

session store list and show in the jsp

Six golden A Global Leader in Industrial IoT rules for creating the ideal German cover letter and r

Name Disambiguation in AMiner-Clustering, Maintenance, and Human in the Loop

The bug about using hooks and MirroredStrategy in tf.estimator.Estimator

Related

相關推薦