1 16 33

Gautier Evennou

Gevennou

AI & ML interests

PhD in ML on Multimodal

Recent Activity

liked a Space 2 days ago

nielsr/vilt-nlvr

liked a model 2 days ago

microsoft/Florence-2-large

new activity 21 days ago

facebook/emu_edit_test_set_generations:[ISSUE] What's up with "a train station in city" captions ?

View all activity

Organizations

Gevennou's activity

liked a Space 2 days ago

Vilt Nlvr

🚀

Compare two images with a sentence

liked a model 2 days ago

microsoft/Florence-2-large

Image-Text-to-Text • Updated Dec 8, 2024 • 593k • 1.38k

New activity in facebook/emu_edit_test_set_generations 21 days ago

[ISSUE] What's up with "a train station in city" captions ?

#3 opened about 1 year ago by

Gevennou

liked a model 30 days ago

microsoft/phi-4

Text Generation • Updated 3 days ago • 506k • 1.69k

upvoted a paper about 2 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 126

liked a model 4 months ago

stabilityai/stable-diffusion-3.5-large

Text-to-Image • Updated Oct 22, 2024 • 252k • • 2.12k

upvoted 2 papers 4 months ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 94

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 106

liked 2 Spaces 5 months ago

Gradio Lipsync Wav2lip

👄

Combine audio with a video or image to create a lip-synched video

786

Face to All

👨

AI filter for your portraits

liked 2 Spaces 6 months ago

791

Parler-TTS

🥖

High-fidelity Text-To-Speech

460

Florence2 + SAM2

🔥

Segment objects in images and videos using text prompts

liked a model 7 months ago

BleachNick/SD3_UltraEdit_w_mask

Text-to-Image • Updated Jun 30, 2024 • 1.08k • 12

upvoted a paper 8 months ago

StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

Paper • 2406.13735 • Published Jun 19, 2024 • 5

liked a model 8 months ago

AIRI-Institute/StyleFeatureEditor

Image-to-Image • Updated Jul 19, 2024 • 10

upvoted a paper 8 months ago

The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing

Paper • 2406.10601 • Published Jun 15, 2024 • 66

liked a Space 8 months ago

720

Florence 2

📉

Analyze images to generate captions, detect objects, or perform OCR

liked a model 8 months ago

microsoft/Florence-2-large-ft

Image-Text-to-Text • Updated Jul 20, 2024 • 118k • 322

liked a dataset 8 months ago

UCSC-VLAA/Recap-DataComp-1B

Viewer • Updated 29 days ago • 1.88B • 3.72k • 165

liked a Space 8 months ago

120

MimicBrush

🐨

Transfers textures from a reference image to a masked region in a source image