wasserstein-gan/README.md

Wasserstein GAN
===============

Code accompanying the paper ["Wasserstein GAN"](https://arxiv.org/abs/1701.07875)

## A few notes

- The first time running on the LSUN dataset it can take a long time (up to an hour) to create the dataloader. After the first run a small cache file will be created and the process should take a matter of seconds. The cache is a list of indices in the lmdb database (of LSUN)
- The only addition to the code (that we forgot, and will add, on the paper) are the [lines 163-166 of main.py](https://github.com/martinarjovsky/WassersteinGAN/blob/master/main.py#L163-L166). These lines act only on the first 25 generator iterations or very sporadically (once every 500 generator iterations). In such a case, they set the number of iterations on the critic to 100 instead of the default 5. This helps to start with the critic at optimum even in the first iterations. There shouldn't be a major difference in performance, but it can help, especially when visualizing learning curves (since otherwise you'd see the loss going up until the critic is properly trained). This is also why the first 25 iterations take significantly longer than the rest of the training as well.
- If your learning curve suddenly takes a big drop take a look at [this](https://github.com/martinarjovsky/WassersteinGAN/issues/2). It's a problem when the critic fails to be close to optimum, and hence its error stops being a good Wasserstein estimate. Known causes are high learning rates and momentum, and anything that helps the critic get back on track is likely to help with the issue.

## Prerequisites

- Computer with Linux or OSX
- [PyTorch](http://pytorch.org)
- For training, an NVIDIA GPU is strongly recommended for speed. CPU is supported but training is very slow.

Two main empirical claims:

### Generator sample quality correlates with discriminator loss

![gensample](imgs/w_combined.png "sample quality correlates with discriminator loss")

### Improved model stability

![stability](imgs/compare_dcgan.png "stability")


## Reproducing LSUN experiments

**With DCGAN:**

```bash
python main.py --dataset folder --dataroot data/maps --cuda
```

**With MLP:**

```bash
python main.py --mlp_G --ngf 512
```

Generated samples will be in the `samples` folder.

If you plot the value `-Loss_D`, then you can reproduce the curves from the paper. The curves from the paper (as mentioned in the paper) have a median filter applied to them:

```python
med_filtered_loss = scipy.signal.medfilt(-Loss_D, dtype='float64'), 101)
```

More improved README in the works.
readme 2017-01-30 14:29:17 +00:00			`Wasserstein GAN`
			`===============`

			`Code accompanying the paper ["Wasserstein GAN"](https://arxiv.org/abs/1701.07875)`

Update README.md 2017-03-23 14:47:18 +00:00			`## A few notes`
Update README.md 2017-01-30 19:40:30 +00:00
Update README.md 2017-01-30 23:28:45 +00:00			`- The first time running on the LSUN dataset it can take a long time (up to an hour) to create the dataloader. After the first run a small cache file will be created and the process should take a matter of seconds. The cache is a list of indices in the lmdb database (of LSUN)`
Update README.md 2017-03-30 15:24:07 +00:00			- The only addition to the code (that we forgot, and will add, on the paper) are the [lines 163-166 of main.py](https://github.com/martinarjovsky/WassersteinGAN/blob/master/main.py#L163-L166). These lines act only on the first 25 generator iterations or very sporadically (once every 500 generator iterations). In such a case, they set the number of iterations on the critic to 100 instead of the default 5. This helps to start with the critic at optimum even in the first iterations. There shouldn't be a major difference in performance, but it can help, especially when visualizing learning curves (since otherwise you'd see the loss going up until the critic is properly trained). This is also why the first 25 iterations take significantly longer than the rest of the training as well.
Update README.md 2017-08-24 15:55:46 +00:00			`- If your learning curve suddenly takes a big drop take a look at [this](https://github.com/martinarjovsky/WassersteinGAN/issues/2). It's a problem when the critic fails to be close to optimum, and hence its error stops being a good Wasserstein estimate. Known causes are high learning rates and momentum, and anything that helps the critic get back on track is likely to help with the issue.`
Update README.md 2017-01-30 19:40:30 +00:00
Update README.md 2017-03-23 14:47:18 +00:00			`## Prerequisites`
readme 2017-01-30 14:29:17 +00:00
			`- Computer with Linux or OSX`
			`- [PyTorch](http://pytorch.org)`
			`- For training, an NVIDIA GPU is strongly recommended for speed. CPU is supported but training is very slow.`

add readme 2017-01-30 14:41:11 +00:00			`Two main empirical claims:`

Update README.md 2017-03-23 14:47:18 +00:00			`### Generator sample quality correlates with discriminator loss`
add readme 2017-01-30 14:41:11 +00:00
			`![gensample](imgs/w_combined.png "sample quality correlates with discriminator loss")`

Update README.md 2017-03-23 14:47:18 +00:00			`### Improved model stability`
add readme 2017-01-30 14:41:11 +00:00
			`![stability](imgs/compare_dcgan.png "stability")`

readme 2017-01-30 14:29:17 +00:00
Update README.md 2017-03-23 14:47:18 +00:00			`## Reproducing LSUN experiments`
readme 2017-01-30 14:29:17 +00:00
			`With DCGAN:`

add readme 2017-01-30 14:41:11 +00:00			```bash
feat(upgrade): Solve issues from previous PyTorch version 2023-12-16 11:35:13 +00:00			`python main.py --dataset folder --dataroot data/maps --cuda`
readme 2017-01-30 14:29:17 +00:00			```

			`With MLP:`

add readme 2017-01-30 14:41:11 +00:00			```bash
readme 2017-01-30 14:29:17 +00:00			`python main.py --mlp_G --ngf 512`
			```

add readme 2017-01-30 14:41:11 +00:00			Generated samples will be in the `samples` folder.

			If you plot the value `-Loss_D`, then you can reproduce the curves from the paper. The curves from the paper (as mentioned in the paper) have a median filter applied to them:

			```python
			`med_filtered_loss = scipy.signal.medfilt(-Loss_D, dtype='float64'), 101)`
			```

readme 2017-01-30 14:29:17 +00:00			`More improved README in the works.`