1 4

Marco Gaido

mgaido91

https://mgaido91.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP

authored a paper 6 days ago

Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023

authored a paper 6 days ago

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection

View all activity

Organizations

mgaido91's activity

authored 19 papers 6 days ago

When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP

Paper • 2303.16166 • Published Mar 28, 2023 • 1

Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023

Paper • 2309.15554 • Published Sep 27, 2023 • 1

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection

Paper • 2310.15752 • Published Oct 24, 2023 • 1

Dealing with training and test segmentation mismatch: FBK@IWSLT2021

Paper • 2106.12607 • Published Jun 23, 2021 • 1

Speechformer: Reducing Information Loss in Direct Speech Translation

Paper • 2109.04574 • Published Sep 9, 2021 • 1

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?

Paper • 2402.12025 • Published Feb 19, 2024

How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena

Paper • 2402.13208 • Published Feb 20, 2024

Does Simultaneous Speech Translation need Simultaneous Models?

Paper • 2204.03783 • Published Apr 8, 2022 • 1

Efficient yet Competitive Speech Translation: FBK@IWSLT2022

Paper • 2205.02629 • Published May 5, 2022 • 1

Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation

Paper • 2206.05807 • Published Jun 12, 2022 • 1

StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection

Paper • 2406.06097 • Published Jun 10, 2024

SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation

Paper • 2406.14177 • Published Jun 20, 2024

How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not

Paper • 2409.17044 • Published Sep 25, 2024 • 1

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Paper • 2410.01036 • Published Oct 1, 2024 • 15

Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection

Paper • 2412.11978 • Published Dec 16, 2024

NUTSHELL: A Dataset for Abstract Generation from Scientific Talks

Paper • 2502.16942 • Published 16 days ago

updated a dataset 20 days ago

FBK-MT/mosel

Viewer • Updated 20 days ago • 57.5M • 1.25k • 71