Aesthetic Scorer

This model predicts 7 different aesthetic metrics for images:

  • Overall aesthetic score
  • Technical quality score
  • Composition score
  • Lighting score
  • Color harmony score
  • Depth of field score
  • Content score

Model Details

  • Based on CLIP ViT-B/32 visual encoder
  • Fine-tuned on the PARA dataset
  • Returns scores between 0-5 for each aesthetic dimension

Usage

from transformers import CLIPProcessor
from aesthetic_scorer import AestheticScorer
import torch
from PIL import Image

# Load the model
processor = CLIPProcessor.from_pretrained("YOUR_USERNAME/aesthetic-scorer")
model = torch.load("YOUR_USERNAME/aesthetic-scorer/model.pt")

# Process an image
image = Image.open("your_image.jpg")
inputs = processor(images=image, return_tensors="pt")["pixel_values"]

# Get scores
with torch.no_grad():
    scores = model(inputs)

# Print results
aesthetic_categories = ["Overall", "Quality", "Composition", "Lighting", "Color", "Depth of Field", "Content"]
for category, score in zip(aesthetic_categories, scores):
    print(f"{category}: {score.item():.2f}/10")
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for rsinema/aesthetic-scorer

Finetuned
(66)
this model