soumith · miqbal23 · Jul 4, 2019 · Jul 4, 2019
diff --git a/README.md b/README.md
@@ -38,13 +38,16 @@ In practice, works well:
 - Tom White's [Sampling Generative Networks](https://arxiv.org/abs/1609.04468) ref code https://github.com/dribnet/plat has more details
 
 
-## 4: BatchNorm
+## 4.a Batch normalization
 
 - Construct different mini-batches for real and fake, i.e. each mini-batch needs to contain only all real images or all generated images.
-- when batchnorm is not an option use instance normalization (for each sample, subtract mean and divide by standard deviation).
 
 ![batchmix](images/batchmix.png "BatchMix")
 
+## 4.b Layer normalization
+- when batchnorm is not an option use instance normalization (for each sample, subtract mean and divide by standard deviation).
+- use spectral normalization on discriminator (https://arxiv.org/abs/1802.05957)
+
 ## 5: Avoid Sparse Gradients: ReLU, MaxPool
 - the stability of the GAN game suffers if you have sparse gradients
 - LeakyReLU = good (in both G and D)
@@ -110,10 +113,12 @@ while lossG > B:
 - adding gaussian noise to every layer of generator (Zhao et. al. EBGAN)
   - Improved GANs: OpenAI code also has it (commented out)
 
-## 14: [notsure] Train discriminator more (sometimes)
+## 14: Train discriminator more
 
 - especially when you have noise
 - hard to find a schedule of number of D iterations vs G iterations
+- train discriminator in n times using Wasserstein distance 
+  - also makes losses correlates with sample quality
 
 ## 15: [notsure] Batch Discrimination