PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications
Paper β’ 1701.05517 β’ Published β’ 9
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
A complete from-scratch implementation of the Gated PixelCNN autoregressive generative model, trained on CIFAR-10. Built using Huggingface's Ml-intern.
Gated_PixelCNN_CIFAR10.ipynb in Google ColabImplemented entirely from scratch using only torch.nn primitives:
y = tanh(W_f * x) β Ο(W_g * x)| Component | Specification |
|---|---|
| Model | Gated PixelCNN |
| Layers | 15 gated layers |
| Filters | 128 per stack |
| Kernels | 7Γ7 input, 3Γ3 body |
| Parameters | ~5.4M |
| Loss | Cross-Entropy (256 classes) |
| Metric | BPD + FID |
| Metric | Our Model (50 epochs) | Paper [2] (converged) |
|---|---|---|
| BPD (test) | ~3.5-4.0 | 3.03 |
| FID | Reported in notebook | β |