Loading weights in TH format when keras is set to TF format

I asked Francois Chollet about this (he doesn't have an SO account) and he kindly passed along this reply:

"th" format means that the convolutional kernels will have the shape (depth, input_depth, rows, cols)

"tf" format means that the convolutional kernels will have the shape (rows, cols, input_depth, depth)

Therefore you can convert from the former to the later via np.transpose(x, (2, 3, 1, 0)) where x is the value of the convolution kernel.

Here's some code to do the conversion:

from keras import backend as K

K.set_image_dim_ordering('th')

# build model in TH mode, as th_model
th_model = ...
# load weights that were saved in TH mode into th_model
th_model.load_weights(...)

K.set_image_dim_ordering('tf')

# build model in TF mode, as tf_model
tf_model = ...

# transfer weights from th_model to tf_model
for th_layer, tf_layer in zip(th_model.layers, tf_model.layers):
   if th_layer.__class__.__name__ == 'Convolution2D':
      kernel, bias = layer.get_weights()
      kernel = np.transpose(kernel, (2, 3, 1, 0))
      tf_layer.set_weights([kernel, bias])
  else:
      tf_layer.set_weights(tf_layer.get_weights())

In case the model contains Dense layers downstream of the Convolution2D layers, then the weight matrix of the first Dense layer would need to be shuffled as well.

Loading weights in TH format when keras is set to TF format

Tags:

Tensorflow

Keras

Related

Recent Posts