VAE trained on Banking 77 Open Intent Classification Dataset
This is a Variational Autoencoder (VAE) trained on the PolyAI/banking77 dataset.
Architecture
- input_dim: 768
- hidden_dim: 256
- latent_dim: 64
Encoder
The encoder maps the input to a latent space distribution.
encoder = nn.Sequential(
nn.Linear(input_dim, hidden_dim),
nn.ReLU()
)
mu = nn.Linear(hidden_dim, latent_dim)
logvar = nn.Linear(hidden_dim, latent_dim)
Decoder
The decoder reconstructs the input from a sample of the latent space.
decoder = nn.Sequential(
nn.Linear(latent_dim, hidden_dim),
nn.ReLU(),
nn.Linear(hidden_dim, input_dim)
)
Metrics
The model was trained and evaluated using the following metrics:
- Training set: VAE Loss
- 50% reconstruction loss between original input vs reconstructed output
- 50% KL divergence between Latent Z vs standard normal distribution
- Validation set: 100% reconstruction loss -> used to find the best model (with the lowest reconstruction loss)
- Downloads last month
- 17
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support