KaiquanMah
/

VAE-Banking77-OpenIntentClassification

Model card Files Files and versions

VAE trained on Banking 77 Open Intent Classification Dataset

This is a Variational Autoencoder (VAE) trained on the PolyAI/banking77 dataset.

Architecture

input_dim: 768
hidden_dim: 256
latent_dim: 64

Encoder

The encoder maps the input to a latent space distribution.

encoder = nn.Sequential(
            nn.Linear(input_dim, hidden_dim),
            nn.ReLU()
        )

mu = nn.Linear(hidden_dim, latent_dim)
logvar = nn.Linear(hidden_dim, latent_dim)

Decoder

The decoder reconstructs the input from a sample of the latent space.

decoder = nn.Sequential(
            nn.Linear(latent_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, input_dim)
        )

Metrics

The model was trained and evaluated using the following metrics:

Training set: VAE Loss
- 50% reconstruction loss between original input vs reconstructed output
- 50% KL divergence between Latent Z vs standard normal distribution
Validation set: 100% reconstruction loss -> used to find the best model (with the lowest reconstruction loss)

Downloads last month: 17

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train KaiquanMah/VAE-Banking77-OpenIntentClassification