-
Notifications
You must be signed in to change notification settings - Fork 19.6k
Keras 2.0 release notes
This document details changes, in particular API changes, occurring from Keras 1 to Keras 2.
- The
nb_epochargument has been renamedepochseverywhere. - The methods
fit_generator,evaluate_generatorandpredict_generatornow work by drawing a number of batches from a generator (number of training steps), rather than a number of samples.-
samples_per_epochwas changed tosteps_per_epochinfit_generator. It now refers to the number of batches an epoch is considered as done. -
nb_val_sampleswas renamedvalidation_stepsinfit_generator. -
val_sampleswas renamedstepsinevaluate_generatorandpredict_generator.
-
- It is now possible to manually add a loss to a model by calling
model.add_loss(loss_tensor). The loss is added to the other losses of the model and minimized during training. - It is also possible to not apply any loss to a specific model output. If you pass
Noneas thelossargument for an output (e.g. in compile,loss={'output_1': None, 'output_2': 'mse'}, the model will expect no Numpy arrays to be fed for this output when usingfit,train_on_batch, orfit_generator. The output values are still returned as usual when usingpredict. - In TensorFlow, models can now be trained using
fitif some of their inputs (or even all) are TensorFlow queues or variables, rather than placeholders. See this test for specific examples.
- The
objectivesmodule has been renamedlosses. - Several legacy metric functions have been removed, namely
matthews_correlation,precision,recall,fbeta_score,fmeasure. - Custom metric functions can no longer return a dict, they must return a single tensor.
- Constructor arguments for
Modelhave been renamed:-
input->inputs -
output->outputs
-
- The
Sequentialmodel not longer supports theset_inputmethod. - For any model saved with Keras 2.0 or higher, weights trained with backend X will be converted to work with backend Y without any manual conversion step.
Deprecated layers MaxoutDense, Highway and TimedistributedDense have been removed.
- All layers that use the learning phase now support a
trainingargument incall(Python boolean or symbolic tensor), allowing to specify the learning phase on a layer-by-layer basis. E.g. by calling aDropoutinstance asdropout(inputs, training=True)you obtain a layer that will always apply dropout, regardless of the current global learning phase. Thetrainingargument defaults to the global Keras learning phase everywhere. - The
callmethod of layers can now take arbitrary keyword arguments, e.g. you can define a custom layer with a call signature likecall(inputs, alpha=0.5), and then pass aalphakeyword argument when calling the layer (only with the functional API, naturally). -
__call__now makes use of TensorFlowname_scope, so that your TensorFlow graphs will look pretty and well-structured in TensorBoard.
dim_ordering has been renamed data_format. It now takes two values: "channels_first" (formerly "th") and "channels_last" (formerly "tf").
Changed interface:
-
output_dim->units -
init->kernel_initializer - added
bias_initializerargument -
W_regularizer->kernel_regularizer -
b_regularizer->bias_regularizer -
b_constraint->bias_constraint -
bias->use_bias
Changed interface:
-
p->rate
- The
AtrousConvolution1DandAtrousConvolution2Dlayer have been deprecated. Their functionality is instead supported via thedilation_rateargument inConvolution1DandConvolution2Dlayers. -
Convolution*layers are renamedConv*. - The
Deconvolution2Dlayer is renamedConv2DTranspose. - The
Conv2DTransposelayer no longer requires anoutput_shapeargument, making its use much easier.
Interface changes common to all convolutional layers:
-
nb_filter->filters - 'nb_filter' is renamed as 'filters'.
- float kernel dimension arguments become a single tuple argument,
kernelsize. E.g. a legacy callConv2D(10, 3, 3)becomesConv2D(10, (3, 3)) -
kernel_sizecan be set to an integer instead of a tuple, e.g.Conv2D(10, 3)is equivalent toConv2D(10, (3, 3)). -
subsample->strides. Can also be set to an integer. -
border_mode->padding -
init->kernel_initializer - added
bias_initializerargument -
W_regularizer->kernel_regularizer -
b_regularizer->bias_regularizer -
b_constraint->bias_constraint -
bias->use_bias -
dim_ordering->data_format - In the
SeparableConv2Dlayers,initis split intodepthwise_initializerandpointwise_initializer. - Added
dilation_rateargument inConv2DandConv1D. - 1D convolution kernels are now saved as a 3D tensor (instead of 4D as before).
- 2D and 3D convolution kernels are now saved in format
spatial_dims + (input_depth, depth)), even withdata_format="channels_first".
the -> indicates that the terms has been renamed.
-
pool_length->pool_size -
stride->strides -
border_mode->padding
-
border_mode->padding -
dim_ordering->data_format
The padding argument of the ZeroPadding2D and ZeroPadding3D layers must be a tuple of length 2 and 3 respectively. Each entry i contains by how much to pad the spatial dimension i. If it's an integer, symmetric padding is applied. If it's a tuple of integers, asymmetric padding is applied.
-
length->size
The mode argument of BatchNormalization has been removed; BatchNorm now only supports mode 0 (use batch metrics for feature-wise normalization during training, and use moving metrics for feature-wise normalization during testing).
-
beta_init->beta_initializer -
gamma_init->gamma_initializer - added arguments
center,scale(booleans, whether to use abetaandgammarespectively) - added arguments
moving_mean_initializer,moving_variance_initializer - added arguments
beta_regularizer,gamma_regularizer - added arguments
beta_constraint,gamma_constraint - attribute
running_meanis renamedmoving_mean - attribute
running_stdis renamedmoving_variance(it is in fact a variance with the current implementation).
Same changes as for convolutional layers and recurrent layers apply.
-
init->alpha_initializer
-
sigma->stddev
-
output_dim->units -
init->kernel_initializer -
inner_init->recurrent_initializer - added argument
bias_initializer -
W_regularizer->kernel_regularizer -
b_regularizer->bias_regularizer - added arguments
kernel_constraint,recurrent_constraint,bias_constraint -
dropout_W->dropout -
dropout_U->recurrent_dropout -
consume_less->implementation. String values have been replaced with integers: implementation 0 (default), 1 or 2. - LSTM only: the argument
forget_bias_inithas been removed. Instead there is a boolean argumentunit_forget_bias, defaulting toTrue.
The Lambda layer now supports a mask argument.
Utilities should now be imported from keras.utils rather than from specific submodules (e.g. no more keras.utils.np_utils...).
-
std->stddev
- In the backend,
set_image_orderingandimage_orderingare nowset_data_formatanddata_format. - Any arguments (other than
nb_epoch) prefixed withnb_has been renamed to be prefixed withnum_instead. This affects two datasets and one preprocessing utility.