-
Tweets115
-
Followers157
-
Following498
-
Likes432
We’re back with a new series of Conversational AI Talks. Everyone’s invited! Feel free to share with your network. 🗓 Every Thursday, 11:00 AM – 12:00 PM EDT 🚀 Kicking off on September 18th with an exciting lineup of speakers. 🔗 More details: poonehmousavi.github.io/rg
I’m happy to share that our paper, "Discrete Audio Tokens: More Than a Survey!", has been accepted at TMLR. 🎉 📄 Read: arxiv.org/pdf/2506.10274 🔎 Explore our tokenizer database & submit yours: poonehmousavi.github.io/dates-website/…
I’m happy to share that our paper, "Discrete Audio Tokens: More Than a Survey!", has been accepted at TMLR. 🎉 📄 Read: arxiv.org/pdf/2506.10274 🔎 Explore our tokenizer database & submit yours: poonehmousavi.github.io/dates-website/…
📢 Presenting our paper “LiSTEN: Learning Soft Token Embeddings for Neural Audio LLMs” — an interpretable fine-tuning method for spoken language understanding. 🗓 Wed, Aug 20 | 08:30–10:30 📍 A11-P2B-03 Hope to see you there! 📄 arxiv.org/pdf/2505.18517 @ISCAInterspeech
Our pick of the week by @beomseok_lee_: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, @yingzhi_wang, @mirco_ravanelli, and @CemSubakan (2025) arxiv.org/abs/2505.19937 #SLU #speech #multimodal #LLM
Our pick of the week by @beomseok_lee_: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, @yingzhi_wang, @mirco_ravanelli, and @CemSubakan (2025) arxiv.org/abs/2505.19937 #SLU #speech #multimodal #LLM
📢 Join our Conversational AI Reading Group! 📅 Thursday, June 19th | 11 AM - 12 PM EST 🎙 Speaker: Yuki Mitsufuji (@mittu1204) - SonyAI 📖 Topic: "AI for Creators: Pushing Creative Abilities to the Next Level" 🔗 Details: (poonehmousavi.github.io/rg)
``Discrete Audio Tokens: More Than a Survey!,'' Pooneh Mousavi, Gallil Maimon, Adel Moumen, Darius Petermann, Jiatong Shi, Haibin Wu, Haici Yang, Anastasia Kuznetsova, Artem Ploujnikov, Ricard Marxer, Bhuvana Ramabhadran, Benjamin Elizalde, Loren Lugosch… ift.tt/GA4ZC6u
🎵💬 If you are interested in Audio Tokenisers, you should check out our new work! We empirically analysed existing tokenisers from every way - reconstruction, downstream, LMs and more. Grab yourself a ☕/🍺 and sit down for a read!
🌟🌟 Great collaboration, with a diverse all-star team led by @MousaviPooneh - check it out👇 📄Paper - arxiv.org/abs/2506.10274 🌐Website (+updating tokeniser DB!) - poonehmousavi.github.io/dates-website/
🚀 We're excited to announce our latest work: "Discrete Audio Tokens: More Than a Survey!" It presents a comprehensive survey and benchmark of audio tokenizers across speech, music, and general audio. preprint: arxiv.org/pdf/2506.10274 website: poonehmousavi.github.io/dates-website/
📢 Join our Conversational AI Reading Group! 📅 Thursday, June 12th | 11 AM - 12 PM EST 🎙 Speaker: Andros Tjandra 📖 Topic: "Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound" 🔗 Details: (poonehmousavi.github.io/rg)
📢 Join our Conversational AI Reading Group! 📅 Thursday, May 29th | 11 AM - 12 PM EST 🎙 Speaker: Yossi Adi @adiyossLC 📖 Topic: "On The Landscape of Spoken Language Models" 🔗 Details: (poonehmousavi.github.io/rg)
Learn about speaker diarization, the science behind it, and the future of diarization at @pyannoteAI research labs youtu.be/ECqxZgVevuI?fe…
... in which I'll talk about my decade-old love for speaker diarization and the loss functions used to train underlying neural networks
... in which I'll talk about my decade-old love for speaker diarization and the loss functions used to train underlying neural networks https://t.co/f5WHG4UMVO
🗣️🧠 Speech Language Models require lots of compute to train, right? In our new paper, we test is it possible to train an SLM on 1xA5000 gpu in 24 hours? The results may surprise you (they even surprised us)! Tips, open source resources, full paper 👇🏻
@convAI2024 Thank you for having me, and thank you all the listeners! I had a great time 🙌 If you missed it, here's the recording and the slides! Recording: youtube.com/watch?v=REH034… Slides: poonehmousavi.github.io/assets/slides/…
@convAI2024 Thank you for having me, and thank you all the listeners! I had a great time 🙌 If you missed it, here's the recording and the slides! Recording: youtube.com/watch?v=REH034… Slides: poonehmousavi.github.io/assets/slides/…
📢 Join our Conversational AI Reading Group! 📅 Thursday, May 15th | 11 AM - 12 PM EST 🎙 Speaker: Wen-Chin Huang (@unilightwf) 📖 Topic: "Automatic Quality Assessment for Speech and Beyond" 🔗 Details: (poonehmousavi.github.io/rg) , (youtube.com/@CONVAI_RG)
📢 Join our Conversational AI Reading Group! 📅 Thursday, May 8th | 11 AM - 12 PM EST 🎙 Speaker: Leda Sari 📖 Topic: "The Voicebox Model and Its Applications" 🔗 Details: (poonehmousavi.github.io/rg)
We’re really excited to have Dan Povey join us for our next Conversational AI Reading Group. He is the creator of the Kaldi toolkit and author of many well-known papers. Don’t miss his talk!
We’re really excited to have Dan Povey join us for our next Conversational AI Reading Group. He is the creator of the Kaldi toolkit and author of many well-known papers. Don’t miss his talk!
🚨I am honored to give an online invited talk at the Conversational AI Reading Group, MILA @convAI2024 on 5/15 11am-12pm EDT (5/16 0-1am Japan time), titled "Automatic Quality Assessment for Speech and Beyond"! Please find more info on the website: poonehmousavi.github.io/rg
📢 Join our Conversational AI Reading Group! 📅 Thursday, April 24th | 11 AM - 12 PM EST 🎙 Speaker: Oriol Nieto(@urinieto) from Adobe Research 📖 Topic: "GenAI for Sound Design" 🔗 Details: (poonehmousavi.github.io/rg)

傅丰元 Bob Fu @fm100
6K Followers 3K Following i make content&context, build in community. real-time ai/voice/video @rtedevcommunity & @AgoraIo丨灵感买家俱乐部丨离线丨利器 🐦帮助彼此完成各自项目: https://t.co/OHxgXPINRJ
DG. @dataghees
1K Followers 6K Following scaling speech native LLMs @rimelabs the future is willed into existence. bioML, discovering new science, housing, industrial policy, local politics.
Parshin Shojaee @ParshinShojaee
3K Followers 1K Following PhD student @VT_CS | AI for Science, Math, Code, Reasoning | Intern @Apple | prev @Adobe
Sathvik Udupa @SathvikUdupa
65 Followers 566 Following Graduate Student, BUT Speech@FIT. Previously, SPIRE Lab, IISc.
Mori Kiyotada @KiyotadaMr
109 Followers 145 Following 2-year Master’s student specializing in speech recognition and perception. 日本で日本人として生きていく。
Stefano Perna, Ph.D. @st3p_dot_io
72 Followers 309 Following AI Research Scientist @Translation and PhD student in Multimodal AI | Speech and Language Processing
Maryam Afshari @AfshariMaryam95
11 Followers 494 Following
chen zarfati @chenzarf
32 Followers 200 Following
Yigitcan Ozer @yiit_ozer_
291 Followers 633 Following postdoc @yamagishilab, NII | prev. research intern @SonyAI_global, Ph.D. at AudioLabs Erlangen, researcher @FraunhoferIIS
yingzhi wang @yingzhi_wang
38 Followers 93 Following Research on Speech & Audio, collaborator @SpeechBrain1
Enno Hermann @enno_hermann
193 Followers 529 Following Postdoc at @Idiap_ch - Speech. Coqui TTS fork maintainer.
Nonlinear Camel @nonlinear_camel
3 Followers 53 Following
Nima Nooshiri @nimanzik
473 Followers 995 Following Data Scientist at BDiM GmbH | PhD in Seismology | Digital Signal Processing | Applied Deep Learning | AI Dev | Prev.: @GFZ_Potsdam and @DIAS_Dublin
Yossi Adi @adiyossLC
872 Followers 370 Following Assistant Professor @ The Hebrew University of Jerusalem, CSE; Research Scientist @ Meta AI (FAIR); Drummer @ Lucille Crew 🤖🥁🎤🎧🌊
Julien Hauret @jhauret33
19 Followers 132 Following Ph.D. Student - Deep Learning & Speech Processing @LeCnam
ryu @ryu0000000001
314 Followers 186 Following Nothing is boring. No knowledge is irrelevant: only not relevant *yet*. - Jonathan Gorard
Ace Jiachen Luo @jiachenluo96
133 Followers 4K Following keep it simple and humble 😀 # multimodal foundation model, healthcare, human, society, ecology @QMUL @Cambridge @UCAS
Avihu Dekel @AvihuDkl
287 Followers 568 Following Deep Learning Researcher at IBM. Sharing works I find interesting. Might also write about: Food, Cello, Cute animals, Israel and...
Ivan @_fentropy
401 Followers 1K Following Interested in Speech Recognition/Computer Vision/NLP/Bayesian ML. Wrote a bit in these languages: Python/R/C++. Lots of shitposting. RU (mainly)/EN
Loren Lugosch @lorenlugosch
2K Followers 994 Following Machine learning @ ; audio & language; Freigeisterei und Vielgeisterei; "at once a man of business and a man of rhyme"
kodhandarama(shreeram... @cricketrasika
126 Followers 1K Following PhD in speech synthesis, carnatic rasika, on a quest to visit all national parks
Anya @anyapiunova
27 Followers 528 Following
Baylee Schneider @SchneiderB4061
70 Followers 3K Following
Seshiwr @SeshiwrklwI73
129 Followers 6K Following
Alexa Carroll @CarrollAle77698
55 Followers 4K Following
あまねゆみこ @amaneyumik6343
73 Followers 2K Following
VenusGrote @b53JS4LC2f16928
80 Followers 2K Following
armin zd @armin__zd
0 Followers 241 Following
Shaseighs @shaseighs25912
93 Followers 6K Following
Maryam Eslami @Maryam_Eslami
3K Followers 3K Following Research Scientist @UofIllinois & @FBK_research Materials Science & Electrochemistry (Cover photo by @mahedmousavi)
Srija Anand @srija_anand
176 Followers 1K Following MS Student @ AI4Bharat, IIT Madras Volunteer @MAD Chennai IIITD'21
Tereyth @tereyth89260
111 Followers 5K Following
Hamid Rouhani @hamid914
78 Followers 1K Following Hamid Rouhani is a https://t.co/1BTLsnJl8o student of Software Engineering at Ferdowsi University of Mashhad.
Musfiqur Rahman @mrsumit2010
141 Followers 4K Following aka Mushi. Abba to Faatiha. Bangladeshi-Canadian. PhD student at @DASLabConcordia. @CREATE_SE4AI trainee. He/Him/His
Roswell @nonomiyara78454
92 Followers 7K Following
傅丰元 Bob Fu @fm100
6K Followers 3K Following i make content&context, build in community. real-time ai/voice/video @rtedevcommunity & @AgoraIo丨灵感买家俱乐部丨离线丨利器 🐦帮助彼此完成各自项目: https://t.co/OHxgXPINRJ
MT Group at FBK @fbk_mt
1K Followers 443 Following #MachineTranslation Research Unit @FBK_research. #nlproc #deeplearning #ai
Parshin Shojaee @ParshinShojaee
3K Followers 1K Following PhD student @VT_CS | AI for Science, Math, Code, Reasoning | Intern @Apple | prev @Adobe
DG. @dataghees
1K Followers 6K Following scaling speech native LLMs @rimelabs the future is willed into existence. bioML, discovering new science, housing, industrial policy, local politics.
Sathvik Udupa @SathvikUdupa
65 Followers 566 Following Graduate Student, BUT Speech@FIT. Previously, SPIRE Lab, IISc.
Mori Kiyotada @KiyotadaMr
109 Followers 145 Following 2-year Master’s student specializing in speech recognition and perception. 日本で日本人として生きていく。
Stefano Perna, Ph.D. @st3p_dot_io
72 Followers 309 Following AI Research Scientist @Translation and PhD student in Multimodal AI | Speech and Language Processing
Beomseok LEE @beomseok_lee_
47 Followers 144 Following PhD student @uniTrento. Affiliated in @naverlabseurope and @fbk_mt. Ex research engineer @samsungresearch
Matteo Negri @negri_teo
425 Followers 512 Following Researcher at Fondazione Bruno Kessler, mainly on #machinetranslation and #NLProc.
Luisa Bentivogli @luisabentivogli
329 Followers 197 Following Head of the @fbk_mt research unit at @fbk_research Interested in #machinetranslation #nlproc #FairnessML • She/her • Views are my own
Hervé "pyannote" Bre... @hbredin
2K Followers 701 Following Hervé Bredin /👨🏻💻 Creator of 🎹 pyannote / ⚒️ Co-founder and CSO @pyannoteAI /👨🏼🔬 Researcher @CNRS (on leave)
Ivan @_fentropy
401 Followers 1K Following Interested in Speech Recognition/Computer Vision/NLP/Bayesian ML. Wrote a bit in these languages: Python/R/C++. Lots of shitposting. RU (mainly)/EN
الجزيرة - عا... @AJABreaking
3.0M Followers 1 Following تغطية الجزيرة للأخبار العاجلة على مدار الساعة، للاطلاع على التقارير والتغطيات للأحداث على الساحتين العربية والدولية، تابعوا حسابنا @AJArabic
قناة الجزير... @AJArabic
24.1M Followers 26 Following الجزيرة.. الرأي والرأي الآخر.. تابع أخبارنا العاجلة على @AJABreaking
BBC Dari @bbcafghanistan
762K Followers 5 Following حساب رسمی بیبیسیدری. شماره واتساپ ما: 00448000121010 خبرها و داستانهای شخصی، اجتماعی، اقتصادی و هنری از افغانستان و جهان.
Maryam Eslami @Maryam_Eslami
3K Followers 3K Following Research Scientist @UofIllinois & @FBK_research Materials Science & Electrochemistry (Cover photo by @mahedmousavi)
Alexa Carroll @CarrollAle77698
55 Followers 4K Following
あまねゆみこ @amaneyumik6343
73 Followers 2K Following
Seshiwr @SeshiwrklwI73
129 Followers 6K Following
Baylee Schneider @SchneiderB4061
70 Followers 3K Following
kodhandarama(shreeram... @cricketrasika
126 Followers 1K Following PhD in speech synthesis, carnatic rasika, on a quest to visit all national parks
Anya @anyapiunova
27 Followers 528 Following
Avihu Dekel @AvihuDkl
287 Followers 568 Following Deep Learning Researcher at IBM. Sharing works I find interesting. Might also write about: Food, Cello, Cute animals, Israel and...
العربیه فار... @AlArabiya_Fa
464K Followers 47 Following العربيه فارسى به عنوان بخشی از شبكه العربيه در سال 2008 راهاندازی شد.
Barak Ravid @BarakRavid
395K Followers 823 Following Global Affairs Correspondent for Axios. CNN analyst. Washington correspondent for Israel's channel 12. Author of Trump's Peace. link in Bio
Denny Zhou @denny_zhou
21K Followers 541 Following Founded & lead the Reasoning Team in Google Brain (now part of Google DeepMind). Build LLMs to reason. Opinions my own.
Joan Serrà @serrjoa
2K Followers 565 Following Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine.
Alexis Conneau @alex_conneau
35K Followers 189 Following Co-founder and CEO https://t.co/efv72CKpAG (@WaveFormsAI) - Ex @OpenAI GPT-4o/AVM Audio Research Lead - #Her #TARS - Ex @AIatMeta, @Polytechnique (X11)
Minje Kim @minje_research
412 Followers 246 Following Associate Professor at CS@UIUC; Visitic Academic at Amazon Lab126; Want to share my thoughts on audio & AI research, graduate studies, and life.
Piotr Żelasko @PiotrZelasko
1K Followers 693 Following AI + Speech @ Nvidia. PhD @ AGH-UST, ex-JHU. My interests: speech processing technologies; ML/AI software engineering. Building OSS for Speech AI.
yingzhi wang @yingzhi_wang
38 Followers 93 Following Research on Speech & Audio, collaborator @SpeechBrain1
Yusuf Aytar @yusufaytar
1K Followers 149 Following Research Scientist @ DeepMind. Making machines smarter. Views are my own.
حامد عاقل @aghel_ir
14K Followers 5K Following بنده هیچ خدا | وطن جان من است |#آقای_امام_حسین |هرگونه توهین بلاک
Xiaohua Zhai @XiaohuaZhai
11K Followers 311 Following Researcher at Meta (previously at OpenAI Zürich, Google DeepMind)
Convai_rg @convAI2024
245 Followers 1 Following
Jing Liu @JLiu_Compuling
365 Followers 1K Following 2nd year PhD student @CoML_ENS | Msc @LeuvenAi| ResMA @CLSRadboud| reverse engineer language acquisition using NN
Martijn Bartelds @BarteldsMartijn
533 Followers 374 Following Postdoctoral Scholar @stanfordnlp | Formerly @univgroningen, @tudelft and @Penn
Ding Li @dingzeyuli
1K Followers 331 Following 💻 Sr Research Scientist at Adobe Research Follow me on Threads! ➡️ https://t.co/mVnRM1cYGJ Fediverse ➡️ @[email protected]
Seyed Moosavi @smoosavid
546 Followers 138 Following AI/ML Researcher @Apple; Past @imperialeee @ETH_en @EPFL_en #AdversarialML #ReliableAI