Samuele Colombo

FinancialSupport

https://www.linkedin.com/in/samuele-colombo-ml/

FinancialSupport's activity

updated a dataset 1 day ago

mii-llm/requests

Updated about 5 hours ago • 4.28k

updated a dataset 3 days ago

mii-llm/results

Viewer • Updated 3 days ago • 1 • 2.53k

reacted to anakin87's post with 👍 18 days ago

𝐍𝐞𝐰 𝐈𝐭𝐚𝐥𝐢𝐚𝐧 𝐒𝐦𝐚𝐥𝐥 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐌𝐨𝐝𝐞𝐥𝐬: 𝐆𝐞𝐦𝐦𝐚 𝐍𝐞𝐨𝐠𝐞𝐧𝐞𝐬𝐢𝐬 𝐜𝐨𝐥𝐥𝐞𝐜𝐭𝐢𝐨𝐧 💎🌍🇮🇹

I am happy to release two new language models for the Italian Language!

💪 Gemma 2 9B Neogenesis ITA
https://huggingface.co/anakin87/gemma-2-9b-neogenesis-ita
Building on the impressive work by VAGO Solutions, I applied Direct Preference Optimization with a mix of Italian and English data.
Using Spectrum, I trained 20% of model layers.

📊 Evaluated on the Open ITA LLM leaderboard (https://huggingface.co/spaces/mii-llm/open_ita_llm_leaderboard), this model achieves strong performance.
To beat it on this benchmark, you'd need a 27B model 😎

🤏 Gemma 2 2B Neogenesis ITA
https://huggingface.co/anakin87/gemma-2-2b-neogenesis-ita
This smaller variant is fine-tuned from Google's original Gemma 2 2B it (the instruction-tuned release).
Through a combination of Supervised Fine-Tuning and Direct Preference Optimization, I trained 25% of the layers using Spectrum.

📈 Compared to the original model, it shows improved Italian proficiency, good for its small size.

Both models were developed during the recent #gemma competition on Kaggle.
📓 Training code: https://www.kaggle.com/code/anakin87/post-training-gemma-for-italian-and-beyond

🙏 Thanks @FinancialSupport and mii-llm for the help during evaluation.
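
The post mentions training only 20% or 25% of the layers. As a rough illustration of that idea, the sketch below picks the top fraction of layers by a per-layer score and treats the rest as frozen. It is not the author's training code and does not reproduce how Spectrum scores layers: the function name `select_trainable_layers`, the layer names, and the score values are all hypothetical placeholders.

```python
from typing import Dict, List


def select_trainable_layers(scores: Dict[str, float], fraction: float) -> List[str]:
    """Return the names of the top `fraction` of layers, ranked by score."""
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    # Keep at least one layer so the selection is never empty.
    keep = max(1, round(len(scores) * fraction))
    # Highest score first; ties keep their original order.
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:keep]


# Hypothetical per-layer scores (assumption); a real run would derive
# them from the model weights rather than from this toy formula.
example_scores = {f"layers.{i}.mlp": float((i * 7) % 10) for i in range(10)}

trainable = select_trainable_layers(example_scores, fraction=0.20)
frozen = [name for name in example_scores if name not in trainable]
```

In an actual fine-tuning setup, the names in `frozen` would typically have gradient updates disabled, so only the selected layers change during training.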

updated a dataset 21 days ago

mii-llm/requests

Updated about 5 hours ago • 4.28k

updated 2 datasets 22 days ago

mii-llm/results

Viewer • Updated 3 days ago • 1 • 2.53k

mii-llm/requests

Updated about 5 hours ago • 4.28k

updated a dataset 27 days ago

mii-llm/results

Viewer • Updated 3 days ago • 1 • 2.53k