Takuya Yoshioka @_ty274
Speech technology researcher/manager @AssemblyAI linkedin.com/in/ty274/ Bellevue, WA Joined November 2016-
Tweets917
-
Followers552
-
Following57
-
Likes3K
Want to hear a friend in a noisy café? We designed deep learning-based headphones that let you isolate the speech from a specific person just by *looking* at them for a few seconds. CHI'24 honorable mention award. Paper: arxiv.org/abs/2405.06289 Code: github.com/vb000/LookOnce…
I got an early demo of this when I visited @uwcse a couple months ago and the ability to isolate sounds in your environment was pretty great. Nice work, @b_veluri, Malek Itani, Tuochao Chen, Takuya Yoshioka, and @ShyamGollakota!
I got an early demo of this when I visited @uwcse a couple months ago and the ability to isolate sounds in your environment was pretty great. Nice work, @b_veluri, Malek Itani, Tuochao Chen, Takuya Yoshioka, and @ShyamGollakota!
Hi all, please let me know if you know large-scale speech data that can be used for training our Whisper reproduction (OWSM) model (arxiv.org/abs/2309.13876). We plan to move to OWSM v4.
Last Friday marked the end of my 7-year journey at Microsoft, filled with rewarding challenges, both in research & production, and incredible colleagues. I'll be starting something new very soon. マイクロソフトを退職しました。まだずっとシアトル界隈にいます。
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer paper page: huggingface.co/papers/2308.06… Recent advancements in generative speech models based on audio-text prompts have enabled remarkable innovations like high-quality zero-shot text-to-speech. However,…
SpeechX from our new paper is a single generative model that edits, enhances & creates speech, enabling zero-shot TTS, spoken content editing (while preserving ambience), speaker extraction & speech/noise removal. Demo: aka.ms/speechx Paper: arxiv.org/abs/2308.06873
To everyone booking their @IEEE_WASPAA trip: please consider attending #SANE2023, which will take place at NYU on Thursday October 26, the day after #WASPAA2023. Register at saneworkshop.org/sane2023/
To everyone booking their @IEEE_WASPAA trip: please consider attending #SANE2023, which will take place at NYU on Thursday October 26, the day after #WASPAA2023. Register at saneworkshop.org/sane2023/
@ieeeICASSP Are there poster printing facilities at/near the conference venue?
Real-time target sound extraction with waveformer (to appear in ICASSP). Joint work with UW researchers. Paper (updated): arxiv.org/abs/2211.02250 Demo: waveformer.cs.washington.edu Code (both causal and non-causal): github.com/vb000/Waveform…
WASPAA 2023 calls for papers! The traditional intimate Mohonk Mountain House with exciting changes: double-blind review, an unprecedented amount of travel grants, and more. More information: waspaa.com/call-for-paper… #waspaa2023
すごい! 世界最大1万9千時間の音声コーパスと高精度日本語音声認識モデルがオープンソースで公開 - 窓の杜 forest.watch.impress.co.jp/docs/news/1471… via @madonomori
The #ICASSP2023 paper submission site is now open! Submit your papers by 19 October 2022 to be considered. Learn more about the paper guidelines and submission requirements here: hubs.la/Q01nmxt_0
How can we do streaming multi-talker ASR by best combining speech separation and overlap-robust ASR? t-SOT-VA does that and works for real meeting audio with any # of mics, achieving the best published WERs of 13.7%/15.5% for AMI-MDM dev/eval. Paper: arxiv.org/abs/2209.04974
Please retweet, @Lam19Tk, a young MT researcher, soon-to-be-PhD needs your help. He is looking for a job in speech/text translation. A job he already had lined-up has been revoked due to the hiring freezes in the industry. Here's his linkedin profile: linkedin.com/in/tsz-kin-lam…
Our new work on speaker diarization: arxiv.org/abs/2208.13085 (1) TS-VAD with cross-speaker transformer achieves a new SOTA DER in VoxConverse. (2) Further EEND-EDA integration for one-step diarization brings down the DER in CALLHOME.
The challenge submission deadline is approaching (Sep 26). If you're interested in it, please do not hesitate to ask the CHiME Steering Group ([email protected]) or members (chimechallenge.org/current/steeri…) individually!
The challenge submission deadline is approaching (Sep 26). If you're interested in it, please do not hesitate to ask the CHiME Steering Group ([email protected]) or members (chimechallenge.org/current/steeri…) individually!
TTIC celebrates the life of Sadaoki Furui ttic.edu/news/#0822-3
Hi all, SLT'22 will organize a hackathon event. Please check slt2022.org/hackathon.php The application deadline is Sep. 30th! @ieee_slt

Shinji Watanabe @shinjiw_at_cmu
4K Followers 362 Following I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.
AK @_akhaliq
425K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
WAVLab | @CarnegieMel... @WavLab
2K Followers 142 Following Shinji Watanabe's Audio and Voice Lab | WAVLab @LTIatCMU @SCSatCMU | Speech Recognition, Speech Enhancement, Spoken Language Understanding, and more.
Desh Raj @rdesh26
3K Followers 2K Following Research Scientist @Meta GenAI | Previously: @jhuclsp, @IITGuwahati
Robin Scheibler @fakufakurevenge
858 Followers 924 Following Grower of cucumbers 🥒, tomatoes 🍅, and chilli peppers 🌶️. I ❤ audio, microphone arrays, IoT, Python, and data.
まっすー @ymas0315
2K Followers 1K Following
Yuma Koizumi @yuma_koizumi
3K Followers 500 Following Staff Research Scientist @GoogleDeepMind Tokyo 🇯🇵. Speech processing. Tweets are my own.
Hirofumi Inaguma @HirofumiInaguma
1K Followers 1K Following Multimodal, Speech at Fundamental AI Research (FAIR) @MetaAI
yamakatz @kyama0321
1K Followers 1K Following 🐻🐼👨🏻🎓🧑🏻💻🎧🦻🚘🟢 専門は聴覚や補聴技術など音響学全般。現在は人間の感覚を補助・拡張する数理・技術・装置・環境の未来に興味あり。
mat @ballforest
6K Followers 3K Following Pokémon GO / Outlier detection / Anomaly detection / Robust statistics / Functional Analysis / Statistical mechanics / Kernel methods / Dynamical system
shimoll @shimolle
249 Followers 230 Following
Siddharth Dalmia @siddalmia05
2K Followers 447 Following Audio LLMs @ Waveforms AI | #SpeechProc and #NLProc | Previously Research Scientist @GoogleDeepmind | PhD @LTIatCMU @SCSatCMU
Katsuhito Sudoh (ja) @katsuhitosudoh
4K Followers 2K Following 奈良女子大学 教授(所属は生活環境科学系). 機械翻訳の研究をしている気がします/ Keywords: Eマウント,平SFC,平JGC,DL Gold,万年筆,インク,お茶,ZFS / ポストの内容は当人個人の見解です English: @katsuhito_sudoh
Yusuke Kida @KID_A_Radiohead
626 Followers 517 Following ソフトバンク生成AI子会社Gen-AXのCTO一年生。最先端の生成AI技術を使って本当に使える画期的なプロダクトを作ることにトライしています。専門は音声認識・音声信号処理。https://t.co/vDh9zfkAZ2
Mirco Ravanelli @mirco_ravanelli
4K Followers 2K Following Deep learning for Conversational AI. Creator of SpeechBrain.
たいし @_tai_shi
539 Followers 740 Following Assistant Professor at Tokyo Metropolitan University, 東京都立大学 システムデザイン学部 助教, マルチチャンネルでDNN使わないリアルタイム音源分離の研究してます🎛️
Jessee Constantino @JesseeCons20223
0 Followers 23 Following Oracle Cloud – GPU/AI Infrastructure | U.S. Army Veteran
Jouwi @Jouwi014
10 Followers 1K Following
Angelwing @Angelwing19714
40 Followers 4K Following
Matt @Matt60289293
0 Followers 28 Following
Parker Jennifer Nina ... @EdwindeLeon12
870 Followers 3K Following The best motivation for any trader is to be better today than you were yesterday. Washington District of Columbia, USA https://t.co/4ATZlKkdNM
Dev Aggarwal @devxpy
495 Followers 827 Following cofounder / cto @ https://t.co/YWSFudL27I | hiring software engineers
ravinder syal @ravisyal
739 Followers 7K Following
Tauthyez @TauthyezHzYb
45 Followers 989 Following
BUT Speech @ButSpeech
675 Followers 295 Following We do impactful research and raise new leading scientific personalities in the field of speech processing.
Susannahoffs hoffs @Susannahof65590
57 Followers 2K Following American singer songwriter musician and actress 🌎❤️🇺🇲
Hirotaka Hiraki @h1raki
1K Followers 2K Following PhD student @rkmt Lab in U-Tokyo / HCI, Speech, Wearable Interface / ACT-X / AIST / IPA未踏’21 / eeic2018(elab) / Artwork(@YURAteam1) /Juggling /classical guitar
Young Scientist Award... @youngsc06963908
792 Followers 3K Following International Young Scientist Awards
nvm @iyoume___
53 Followers 311 Following UX Researcher / MS at @hcdeUW / Fulbrighter ←BS in CS←コミカレ←社会人←高校中退。HCI、UXリサーチとデザイン、A11yに興味があります。
Forever° @Forever1859911
62 Followers 4K Following
VernaBerkeley @k0O185ZyG71Cj
71 Followers 7K Following
Nebius @nebiusai
13K Followers 920 Following The ultimate cloud for AI innovators. For GenAI open-source model endpoints, check out @NebiusAIStudio.
Annabelle_US_ @AnnabelleU89654
74 Followers 5K Following
AJ @AJ__CB30
144 Followers 796 Following Interested in AI/ML and AI for scientific Discovery. Alumnus of @hinducollege_du CS Phd @RiceUniversity and currently at @MSFTResearch ex @GoogleIndia.
Takayuki Arakawa @ArakawaTakayuki
2 Followers 85 Following
Alkis Koudounas @AlkisKoudounas
248 Followers 625 Following 2x Research Intern @AmazonScience | PhD Student @PoliTOnews | ASR, Spoken Language & Multimodal Understanding | Responsible & Trustworthy AI | GPU very poor
JHU CLSP @jhuclsp
7K Followers 6K Following Center for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw @[email protected]
z jay @zjay951907
0 Followers 62 Following
yhpeng @peng_yanghui
0 Followers 65 Following
Ken Chatfield @kinacoken
96 Followers 177 Following
バイリンガルニ... @Bilingual_News
57K Followers 627 Following 毎週木曜更新の無料ポッドキャスト。独自の「バイリンガル会話方式」で、リアルな英会話を配信中!文字起こし・英語表現解説・宿題・単語帳などは公式アプリから。京都大学でリスニング教材として使われています。
Jeff Dean @JeffDean
365K Followers 6K Following Chief Scientist, Google DeepMind & Google Research. Gemini Lead. Opinions stated here are my own, not those of Google. TensorFlow, MapReduce, Bigtable, ...
Zixiong Su @zshawnsu
363 Followers 518 Following Silent Speech & Human-AI Interaction. UTokyo PhD Fellowship, JSPS DC2. Prev . Research intern @Meta @RealityLabs Visiting Researcher @UCLA, intern @SonyCSL
gbil @GOexle
61 Followers 487 Following
n0n4m399 @n0n4m399
0 Followers 77 Following
Bartosz Antosik @bartoszantosik
107 Followers 2K Following
Sathvik Udupa @SathvikUdupa
65 Followers 566 Following Graduate Student, BUT Speech@FIT. Previously, SPIRE Lab, IISc.
nakazawa kazushi(中�... @nkzwkzs
663 Followers 3K Following 博士(工学)音声認識系の仕事をしています DNNベースで音声の品質を評価する研究していました IEEE Sendai YPとASJ若手フォーラムで活動してます atcoder:茶色
bagofwords.ai @bagofwordsai
381 Followers 4K Following All About NLP and Its Applications #safenlp #NLProc #ai #ml
Melody @MQuashe59232
12 Followers 2K Following For good-looking clothes and worthy people, you have to work hard.
Awareness AI @AwarenessAI
51 Followers 262 Following 🤖 Leading AI awareness & ethical education. Bridging tech & society for a smarter future. #AIForAll #FutureOfEducation #TechEthics
Satvik Dixit @SatvikDixit9
128 Followers 911 Following MS student @CarnegieMellon | Prev @IITDelhi | Audio understanding and generation
Ilya @IlyaGurvich
8 Followers 94 Following
stonelazy @sudharsankp
31 Followers 523 Following Human almost. Active consumer. Passive producer. Developer. Like = Bookmark
Shinji Watanabe @shinjiw_at_cmu
4K Followers 362 Following I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.
AK @_akhaliq
425K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
WAVLab | @CarnegieMel... @WavLab
2K Followers 142 Following Shinji Watanabe's Audio and Voice Lab | WAVLab @LTIatCMU @SCSatCMU | Speech Recognition, Speech Enhancement, Spoken Language Understanding, and more.
Desh Raj @rdesh26
3K Followers 2K Following Research Scientist @Meta GenAI | Previously: @jhuclsp, @IITGuwahati
Robin Scheibler @fakufakurevenge
858 Followers 924 Following Grower of cucumbers 🥒, tomatoes 🍅, and chilli peppers 🌶️. I ❤ audio, microphone arrays, IoT, Python, and data.
Yuma Koizumi @yuma_koizumi
3K Followers 500 Following Staff Research Scientist @GoogleDeepMind Tokyo 🇯🇵. Speech processing. Tweets are my own.
Hirofumi Inaguma @HirofumiInaguma
1K Followers 1K Following Multimodal, Speech at Fundamental AI Research (FAIR) @MetaAI
Wei-Ning Hsu @mhnt1580
2K Followers 133 Following Research Scientist @ Meta FAIR / audio generation, self-supervised learning, speech processing
AI at Meta @AIatMeta
712K Followers 288 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
Alexis Conneau @alex_conneau
35K Followers 189 Following Co-founder and CEO https://t.co/efv72CKpAG (@WaveFormsAI) - Ex @OpenAI GPT-4o/AVM Audio Research Lead - #Her #TARS - Ex @AIatMeta, @Polytechnique (X11)
Andrew Ng @AndrewYNg
1.3M Followers 1K Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs
Hung-yi Lee (李宏�... @HungyiLee2
5K Followers 20 Following Hung-yi Lee is currently a professor at National Taiwan University. He owns a YouTube channel teaching deep learning in Mandarin.
Soumith Chintala @soumithchintala
250K Followers 1K Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.
Anurag Kumar @AcouIntel
2K Followers 286 Following Research Scientist, @GoogleDeepMind | Prev: @AIatMeta | CMU @SCSatCMU | @IITKanpur | Audio/Speech, Multimodal AI
Yuki Mitsufuji @mittu1204
4K Followers 92 Following PhD, Distinguished Engineer @Sony, Lead Research Scientist/VP of AI Research @SonyAI_global, Head of Creative AI Lab, Former Associate Prof. @tokyotech_jp
Alexandre Défossez @honualx
5K Followers 500 Following Chief exploration officer @kyutai_labs, with strong interests in stochastic optimization, audio generative models, and AI for science.
Somshubra Majumdar @HaseoX94
861 Followers 411 Following Sr. Deep Learning Research Engineer @NVIDIAAI. MSCS'18 @UICCS. Multi-domain Deep Learning researcher and library developer. All opinions are my own.
Hugging Face @huggingface
560K Followers 208 Following The AI community building the future. https://t.co/VkRPD0Vclr
Jong Wook Kim 💟 @_jongwook_kim
4K Followers 594 Following Member of Technical Staff @OpenAI; previously at @nyuMARL, @SpotifyResearch, @pandoramusic, @kakaocorpglobal, and @NCSOFT
Naoyuki Kanda @naoyukikandaslp
144 Followers 88 Following
DailyAudioPapers @mlsp4audio
790 Followers 637 Following Daily tweets on selected arXiv papers on audio (eess․AS/cs․SD) | Brief reviews of interesting papers | Machine learning | Signal processing
Alexandr Wang @alexandr_wang
327K Followers 833 Following chief ai officer @meta, founder @scale_ai. rational in the fullness of time
Akash Mahajan @akashmjn
619 Followers 648 Following Chatting with PDFs @ContextualAI | prev @Azure Speech; @Stanford @atherenergy @iitmadras
Sanchit Gandhi @sanchitgandhi99
5K Followers 40 Following Research @MistralAI. Previously speech @huggingface, Masters at @Cambridge_Uni.
Georgi Gerganov @ggerganov
52K Followers 289 Following 24th at the Electrica puzzle challenge | https://t.co/baTQS2bdia
Hervé "pyannote" Bre... @hbredin
2K Followers 701 Following Hervé Bredin /👨🏻💻 Creator of 🎹 pyannote / ⚒️ Co-founder and CSO @pyannoteAI /👨🏼🔬 Researcher @CNRS (on leave)
Miguel J 🇺🇦 �... @bonuelphotog
364 Followers 500 Following Head of AI at Circle Medical. Previously: https://t.co/V2CJc8dV6V, Temi, Voicebox, Nuance. 18+ years of exp in Speech Recognition, Translation, and language technologies.
Sriram Ganapathy @tweet4sri
370 Followers 158 Following Associate Professor, Indian Institute of Science, Bangalore. Google Research India, Bangalore.
Thomas Wolf @Thom_Wolf
95K Followers 6K Following Co-founder at @HuggingFace - open-source and open-science
Sharath Adavanne @adavanne
567 Followers 726 Following Applied AI/ML, PhD @TampereUni, Previously @facebook (@meta), @AdobeResearch, @RakutenRIT, @FreshworksInc, @Krutrim
Jung-Woo Ha @JungWooHa2
5K Followers 3K Following Sr. Secretary to the President for AI & Future Planning, #Korea 대한민국 대통령실 AI미래기획수석비서관 Full Member, #NAEK
IEEE ICASSP @ieeeICASSP
5K Followers 1 Following IEEE International Conference on Acoustics, Speech, and Signal Processing. #ICASSP2026 will be held 4-8 May 2026 in Barcelona, Spain.
Qiuqiang Kong @QiuqiangK
1K Followers 242 Following Assistant Professor at @CUHKofficial, previously at @ByteDanceTalk, Ph.D. at @UniOfSurrey
Gautham Mysore @GauthamMysore
795 Followers 291 Following Head of Audio and Video AI Research @AdobeResearch
Nicholas J. Bryan @NicholasJBryan
1K Followers 470 Following Head of Music AI @AdobeResearch (personal account)
Lukas Biewald @l2k
25K Followers 4K Following Cofounder/CEO of @weights_biases - tools for AI developers.
Yu Wang @yuwang_tw
1K Followers 429 Following Music x ML | Research Scientist @Spotify. PhD @nyuMARL. prev @AdobeResearch @GoogleMagenta 🎶🎸🇹🇼
Eduardo Fonseca @edfonseca_
1K Followers 549 Following Research Scientist @GoogleAI. Sound Understanding. Previously @mtg_upf. He/him.
Efthymios Tzinis @ETzinis
549 Followers 300 Following Senior Research Scientist @GoogleAI | Ph.D. from @IllinoisCS | Formerly @merl_news, @RealityLabs | My opinions do not represent my employer
Shang-Wen Li @ShangwenLi1
2K Followers 992 Following Research Scientist at FAIR; #AI, #NLProc & #speech processing; Past: PhD @MIT_CSAIL, ML scientist at AWS, Alexa & Siri; Views my own
gontani @gontani
1K Followers 1K Following 音声認識の周辺を徘徊するR&Dエンジニア。最近はLLM。博士(工学)→ポスドク中に1年間ドイツ滞在→2011年から企業勤め。電機→メガベンチャー→ ex- @PreferredNetJP → Gen-AX // LLM、音声信号処理、音声認識、機械学習 // 子育て、欧州等
Piotr Żelasko @PiotrZelasko
1K Followers 693 Following AI + Speech @ Nvidia. PhD @ AGH-UST, ex-JHU. My interests: speech processing technologies; ML/AI software engineering. Building OSS for Speech AI.
Takuma OKAMOTO @okamotocamera
330 Followers 86 Following Research Manager@NICT, Japan / Jogging / Drinking
Justin Salamon @justin_salamon
3K Followers 773 Following Head of Sound Design AI Research at Adobe. Machine learning and signal processing for audio & video. Musician. He/him.
Weights & Biases @weights_biases
46K Followers 1K Following The AI developer platform.🛠️ Track and evaluate your LLM applications in real-time with @weave_wb.
Jonathan Le Roux @JonathanLeRoux
2K Followers 311 Following Speech and audio research scientist at MERL. Opinions never really my own. 🦋https://t.co/6pSuhzw3fb