Lino Giger

LinoGiger

LinoGiger's activity

reacted to jasoncorkill's post with 👀 5 days ago

Post

2108

Benchmarking Google's Veo2: How Does It Compare?

The results did not meet expectations. Veo2 struggled with style consistency and temporal coherence, falling behind competitors like Runway, Pika, Tencent, and even Alibaba. While the model shows promise, its alignment and quality are not yet there.

Google recently launched Veo2, its latest text-to-video model, through select partners like fal.ai. As part of our ongoing evaluation of state-of-the-art generative video models, we rigorously benchmarked Veo2 against industry leaders.

We generated a large set of Veo2 videos, spending hundreds of dollars in the process, and systematically evaluated them using our Python-based API for human and automated labeling.

Check out the ranking here: https://www.rapidata.ai/leaderboard/video-models

Rapidata/text-2-video-human-preferences-veo2

liked 2 datasets 5 days ago

Rapidata/text-2-video-human-preferences-veo2

Viewer • Updated 6 days ago • 760 • 311 • 10

Rapidata/text-2-video-human-preferences-wan2.1

Viewer • Updated 6 days ago • 787 • 331 • 13

updated a dataset 6 days ago

Rapidata/text-2-video-human-preferences-veo2

Viewer • Updated 6 days ago • 760 • 311 • 10

updated a dataset 10 days ago

Rapidata/Translation-deepseek-llama-mixtral-v-deepl

Viewer • Updated 6 days ago • 845 • 320 • 14

liked a dataset 10 days ago

Rapidata/Translation-deepseek-llama-mixtral-v-deepl

Viewer • Updated 6 days ago • 845 • 320 • 14

liked a dataset 18 days ago

Rapidata/OpenGVLab_Lumina_t2i_human_preference

Viewer • Updated 18 days ago • 13k • 1.16k • 13

published a dataset 18 days ago

Rapidata/OpenGVLab_Lumina_t2i_human_preference

Viewer • Updated 18 days ago • 13k • 1.16k • 13

updated a dataset 18 days ago

Rapidata/OpenGVLab_Lumina_t2i_human_preference

Viewer • Updated 18 days ago • 13k • 1.16k • 13

reacted to jasoncorkill's post with 🤯 25 days ago

Post

2847

Integrating human feedback is vital for evolving AI models. Boost quality, scalability, and cost-effectiveness with our crowdsourcing tool!

Or run A/B tests and gather thousands of responses in minutes. Upload two images, ask a question, and watch the insights roll in!

Check it out here and let us know your feedback: https://app.rapidata.ai/compare

reacted to jasoncorkill's post with 🚀 27 days ago

Post

2507

This dataset was collected in roughly 4 hours using the Rapidata Python API, showcasing how quickly large-scale annotations can be performed with the right tooling!

All of that for less than the cost of a single hour of a typical ML engineer's time in Zurich!

The new dataset contains ~22,000 human annotations evaluating AI-generated videos along several dimensions, such as Prompt-Video Alignment, Word for Word Prompt Alignment, Style, Speed of Time flow, and Quality of Physics.

Rapidata/text-2-video-Rich-Human-Feedback

reacted to jasoncorkill's post with ❤️🚀 about 1 month ago

Post

4553

Runway Gen-3 Alpha: The Style and Coherence Champion

Runway's latest video generation model, Gen-3 Alpha, is something special. It ranks #3 overall on our text-to-video human preference benchmark, but in terms of style and coherence, it outperforms even OpenAI Sora.

However, it struggles with alignment, making it less predictable for controlled outputs.

We've released a new dataset with human evaluations of Runway Gen-3 Alpha: Rapidata's text-2-video human preferences dataset. If you're working on video generation and want to see how your model compares to the biggest players, we can benchmark it for you.

🚀 DM us if you’re interested!

Dataset: Rapidata/text-2-video-human-preferences-runway-alpha