A collection of Salesforce's finance-specific models
Papers
LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
MMPersuade: A Dataset and Evaluation Framework for Multimodal Persuasion
Organization Card
At Salesforce AI Research, we drive research advancements in the field of AI. We apply this research to develop AI products that you can trust, and solutions that benefit everyone.
spaces 9
GIFT Eval 🥇 • pinned • Running • 156 • GIFT-Eval: A Benchmark for General Time Series Forecasting
CRMArena Leaderboard 🥇 • pinned • Running • 7 • A realistic benchmark with real CRM tasks for LLM agents.
ContextualBench-Leaderboard 🥇 • pinned • Running • 14 • View and submit LLM benchmark evaluations
LLM Leaderboard for CRM 🥇 • pinned • Running • 22 • Filter and view LLM benchmark data
Elastic Reasoning 💬 • Sleeping • 5 • Explore efficient reasoning techniques with large language models
models 178
Salesforce/UniDoc-Bench • Updated
Salesforce/BLIP3o-NEXT-EDIT-ENSEMBLE-DATASETS • Updated
Salesforce/xRouter • Text Generation • 8B • Updated • 15
Salesforce/BLIP3o-NEXT-SFT-3B • 5B • Updated • 11 • 1
Salesforce/BLIP3o-NEXT-GRPO-Geneval-3B • 4B • Updated • 9 • 2
Salesforce/BLIP3o-NEXT-Pretrain-3B • Updated • 2
Salesforce/Llama-Fin-8b • Updated • 138 • 4
Salesforce/FARE-8B • 8B • Updated • 116 • 3
Salesforce/FARE-20B • 4.76M • Updated • 14 • 3
Salesforce/BLIP3o-NEXT-edit-VAE • 5B • Updated • 6 • 2
datasets 51
Salesforce/UniDoc-Bench • Viewer • Updated • 1.74k • 789 • 7
Salesforce/3d_optical_flow_droid • Viewer • Updated • 28.5M • 1.37k
Salesforce/ConvoMem • Updated • 659 • 2
Salesforce/LiveResearchBench • Viewer • Updated • 623 • 248 • 3
Salesforce/LiveResearchBenchFull • Viewer • Updated • 772 • 178 • 4
Salesforce/BLIP3o-NEXT-EDIT-ENSEMBLE-DATASETS • Updated • 1.02k
Salesforce/FinEval • Viewer • Updated • 33.8k • 229 • 8
Salesforce/FinTrain • Viewer • Updated • 25.1M • 1.01k • 5
Salesforce/EDR-200 • Viewer • Updated • 201 • 274 • 12
Salesforce/Hard2Verify • Viewer • Updated • 200 • 301 • 6