uoft-cs/cifar10
Viewer • Updated • 60k • 126k • 106
This model combines a Vision Transformer (ViT) backbone with a JAX Logical Neural Network (JLNN) layer. It provides high-accuracy image classification with built-in interpretability and uncertainty quantification.
To ensure stable training and prevent binary collapse, we use a custom FuzzyGrounding layer with the following parameters:
1.4 (softens the sigmoid gradients)-1.2 (starts predicates in a cautious, non-binary state)This configuration ensures that the Vision Transformer and the Logical layer converge smoothly, as seen in the training logs.
Refer to the official JLNN Repository for inference scripts.