ModernBERT for Norwegian

#6
by hoxmark - opened

Have you thought about training a Norwegian ModernBERT (https://huggingface.co/answerdotai/ModernBERT-base) model?

That would be very useful.

Language Technology Group (University of Oslo) org

Yes, we are planning to release a collection of new NorBERTs that will be more optimized for inference speed :)

Great news! Thank you.

Will that also include the possible token length?

Language Technology Group (University of Oslo) org

What exactly do you mean by that? :)

Thank you for the answer, and my apologies, I somehow stopped mid-sentence.

My question was supposed to say:
The blog post introducing ModernBERT mentions support for sequence lengths of up to 8192 tokens. Is this something you will look into doing? :)

Language Technology Group (University of Oslo) org

I see :) Yes, we will increase the sequence length. But note that even the current NorBERT3 can already accept sequences longer than the 512 tokens it was trained on, thanks to its bucketed relative positional encoding.
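To illustrate why bucketed relative positions generalize beyond the training length, here is a minimal sketch of T5-style relative position bucketing (the exact NorBERT3 scheme may differ; the bucket counts and distances below are illustrative assumptions): any pairwise distance, including ones far beyond 512, is mapped into a fixed set of buckets, so the model never sees an out-of-range position index.

```python
import math

def relative_position_bucket(rel_pos, num_buckets=32, max_distance=128):
    """Map a signed relative position to a bucket index in [0, num_buckets).

    T5-style bidirectional bucketing: half the buckets cover each direction;
    small distances get exact buckets, larger distances share log-scaled ones.
    """
    num_buckets //= 2                       # split buckets between directions
    bucket = num_buckets if rel_pos > 0 else 0
    n = abs(rel_pos)
    max_exact = num_buckets // 2            # exact buckets for short distances
    if n < max_exact:
        bucket += n
    else:
        # log-scaled bucket for larger distances, capped at the last bucket
        val = max_exact + int(
            math.log(n / max_exact) / math.log(max_distance / max_exact)
            * (num_buckets - max_exact)
        )
        bucket += min(val, num_buckets - 1)
    return bucket

# A distance of 600 or 5000 tokens (far beyond a 512-token training window)
# still lands in a valid bucket, which is why longer inputs remain usable.
print(relative_position_bucket(5))     # short distance: exact bucket
print(relative_position_bucket(600))   # long distance: shared log bucket
print(relative_position_bucket(5000))  # even longer: same capped bucket
```

Because all large distances collapse into the last few buckets, the embedding table stays the same size regardless of input length; quality at long range is limited, but nothing breaks.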
