license: cc-by-nc-sa-4.0 | |
datasets: | |
- QCRI/LlamaLens-English | |
- QCRI/LlamaLens-Arabic | |
- QCRI/LlamaLens-Hindi | |
language: | |
- ar | |
- en | |
- hi | |
base_model: | |
- meta-llama/Llama-3.1-8B-Instruct | |
pipeline_tag: text-generation | |
tags: | |
- Social-Media | |
- Hate-Speech | |
- Summarization | |
- offensive-language | |
- News-Genre | |
# LlamaLens: Specialized Multilingual LLM forAnalyzing News and Social Media Content | |
## Overview | |
LlamaLens is a specialized multilingual LLM designed for analyzing news and social media content. It focuses on 19 NLP tasks, leveraging 52 datasets across Arabic, English, and Hindi. | |
<p align="center"> | |
<picture> | |
<img width="352" alt="capablities_tasks_datasets" src="./llamalens-avatar.png"> | |
</picture> | |
</p> | |
## Model Inference | |
TBA | |
# License | |
This model is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0). | |
# Citation | |
Please cite [our paper](https://arxiv.org/pdf/2410.15308) when using this model: | |
``` | |
@article{kmainasi2024llamalensspecializedmultilingualllm, | |
title={LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content}, | |
author={Mohamed Bayan Kmainasi and Ali Ezzat Shahroor and Maram Hasanain and Sahinur Rahman Laskar and Naeemul Hassan and Firoj Alam}, | |
year={2024}, | |
journal={arXiv preprint arXiv:2410.15308}, | |
volume={}, | |
number={}, | |
pages={}, | |
url={https://arxiv.org/abs/2410.15308}, | |
eprint={2410.15308}, | |
archivePrefix={arXiv}, | |
primaryClass={cs.CL} | |
} | |
``` | |