Amir H. Kargaran @amir_nlp
On job maket (Fall 2025) / 🤖 PhD student @CisLmu/ 🛠️ Multilingual NLP / Previous: Intern @huggingface. My views! kargaranamir.github.io Munich, Germany Joined August 2019-
Tweets382
-
Followers799
-
Following3K
-
Likes6K
> SmolLM3 > GLM-4.5 > NVIDIA-Nemotron-Nano These are just some of the recent OS releases relying on 🥂 FineWeb2 for their multilingual data Proud that the community trusts us for their data supply 🫡
Molecules speak in atoms and bonds. LLMs can learn that language. Even with SOTA #denovo design, our largest molecular LLM study finds a plot twist: early saturation, weak scaling, and proxy metrics that mislead on real tasks! Led by @KChitsaz and @roshan_msb 🧵 More in thread:
Crazy how much attention OCR is getting. I couldn't find any that work well with less common scripts. Stress-test it with Cuneiform, at least it's part of Unicode!
I'm at the @aclmeeting to present our papers on multilingual evaluation, programming languages, and translation! #ACL2025 Feel free to stop by to exchange ideas and discuss! I’m also on the job market. If you think there’s a potential fit, I’d love to hear from you.
FineWeb2 🥂 has been accepted to @COLM_conf See you in October 🇨🇦
FineWeb2 🥂 has been accepted to @COLM_conf See you in October 🇨🇦 https://t.co/RdmrOpksje
We have finally released the 📝paper for 🥂FineWeb2, our large multilingual pre-training dataset. Along with general (and exhaustive) multilingual work, we introduce a concept that can also improve English performance: deduplication-based upsampling, which we call rehydration.
Are you working on multilingual, multicultural #LLM? Interested in diverse & inclusive language modeling? 😎 Stay tuned at our MELT workshop collocated with #COLM2025 🔗 melt-workshop.github.io 🫶 We welcome 2p (EA), 4p (short), 8p (long) papers as well as talented reviewers!
Are you working on multilingual, multicultural #LLM? Interested in diverse & inclusive language modeling? 😎 Stay tuned at our MELT workshop collocated with #COLM2025 🔗 melt-workshop.github.io 🫶 We welcome 2p (EA), 4p (short), 8p (long) papers as well as talented reviewers!
Consider submitting your multilingual NLP work to the MELT workshop @ COLM 2025: melt-workshop.github.io Deadline: June 23
Consider submitting your multilingual NLP work to the MELT workshop @ COLM 2025: melt-workshop.github.io Deadline: June 23
Tracing Multilingual Factual Knowledge Acquisition in Pretraining. arxiv.org/abs/2505.14824
I really wanted to see the review details. It's clearly above the acceptance threshold of findings for me. When you fall into the cycle of rejection from ARR, it's hard to come out.
I'm embarassed to admit that I have just grokked how amazing Python coroutines and asyncio are. I want to rewrite every single piece of code with threads I have every written! But the learning curve is steep. This great blog opened my eyes: tenthousandmeters.com/blog/python-be…

ThatsEnough @ThatsEnough2022
507 Followers 604 Following سیاهی امروز ما یک واقعیت است اما سیاهی اینده یک احتمال است که ما میتوانیم در آن تاثیر گذار باشیم.
Nakhoda @Pr_1266
3K Followers 3K Following Sr. AlgroTrader, Ex Sr. ML Engineer, Founder of #NakhodaFinanceCo, $DJI, $NQ and #XAU Trader.
CIS, LMU Munich @CisLmu
1K Followers 132 Following Center for Information and Language Processing (CIS): #NLProc research group @LMU_Muenchen led by @HinrichSchuetze and @barbara_plank
Leonie Weissweiler @LAWeissweiler
1K Followers 321 Following postdoc @UT_Linguistics with @kmahowald | PhD @cislmu, prev. @Princeton @LTIatCMU @CambridgeLTL computational linguistics, construction grammar, morphology
اِل زِد آر @lzr7823
228 Followers 183 Following
Nathan Schneider @complingy
5K Followers 1K Following Computational Linguist and Professional Nerd at Georgetown University he/him pronouns, ALL the prepositions @[email protected] @complingy.bsky.social
Susan @barronsusan16
391 Followers 3K Following
Adeline @LuellaSchm85326
49 Followers 2K Following
Sue @n_sue23
253 Followers 3K Following
Ramiro Hilpert @HilpertRam32330
95 Followers 2K Following
Jan Hendrik Metzen @jan_metzen
175 Followers 598 Following Senior AI Researcher at IPAI Aleph Alpha Research @Aleph__Alpha.
Sarath Chandar @apsarathchandar
6K Followers 603 Following Associate Professor @polymtl and @Mila_Quebec; Canada CIFAR AI Chair; Machine Learning Researcher. Pro-bono office hours: https://t.co/tK69DKRf9N?amp=1
Tamau @Tamau93047
14 Followers 598 Following
straxico @straxico
475 Followers 782 Following ناشاد | خِرَدسوده | خرسِ بازار های گاوی | مستِ استمراری | چاینده در باد
Catherine Arnett @linguist_cat
797 Followers 572 Following NLP Researcher @AiEleuther. PhD @UCSanDiego Linguistics. Previously @pleiasfr @EdinburghUni. Interested in multilingual NLP, tokenizers, open science. She/her.
Rosendo Leannon @RLeannon63034
68 Followers 4K Following
Chunlan Ma @chunlan_ma712
7 Followers 27 Following
Freeman Lewin @Freeman_Lewin
743 Followers 1K Following Brick layer behind @TryBrickroad Building the future of data licensing.
Valentine @Valenti83196790
355 Followers 3K Following
Kiocmal @Kiocmal9060
20 Followers 1K Following
Zahra Sodagar @zarsodagar
6 Followers 162 Following CS Ph.D. Student @UofMaryland | Graduate of https://t.co/GFjFQTndLO. in EE from Sharif University of Technology
Gandalf @gandalferen
2 Followers 749 Following
AI Horizons @theaihorizons
582 Followers 4K Following AI tips, tools & insights in plain English. Led by NASA & Big Tech AI experts. Trusted by 10K+ pros & early adopters
Đorđe Klisura @Klisura_djordje
45 Followers 193 Following PhD Candidate in Information Technology at UT San Antonio @utsa
Zeeshan Memon @Zeesh_anM
8 Followers 537 Following
Elisa Bassignana @EliBassignana
773 Followers 366 Following Postdoc @NLPnorth @MilaNLProc | affiliated @AiCentreDK | ex @MaiNLPlab @AmazonScience
Moshe Binieli 🇮�... @MosheBinieli
652 Followers 458 Following AI Expert & Software Engineer | AI is far more dangerous despite its benefits | MSc in Computer Science | Tech Blogger | https://t.co/fbaWDIHq1T
Carley Dare @CarleyDare33350
109 Followers 4K Following
Advait Deshmukh @JustADwight
2 Followers 282 Following
Ivlodun @Ivlodun3632112
31 Followers 1K Following
Samy Ateia @puasdfjasdf
7 Followers 126 Following
Faeze Ghorbanpour @FaezeGhorbanpor
106 Followers 2K Following PhD at @TU_Muenchen NLP Researcher at @CisLmu Affiliated with @MunichCenterML
rahul x @rahulme74418504
58 Followers 2K Following
Irlaujoug @Irlaujoug00718
15 Followers 978 Following
Aurélien-Morgan @AurelienMorgan_
101 Followers 2K Following Building `pip install retrain-pipelines`, ML-Eng-centric OS DAG engine, WebConsole & transformers/diffusers retrain framework. Wandering around. Mind if I do.
˚♡⋆mimi ˚♡⋆... @mimi10v3
13K Followers 5K Following 🫶 aspiring AI ecologist e/acc & biophilia DC/VA/WV
Haeji Jung @haejiness_ai
36 Followers 284 Following Visiting Researcher, CMU LTI/ 💻Studying AI / 🏫M.S. in CSE, Korea University / ❤️Multiligual LM, Representation Learning, Computational Linguistics
Hussain Tech Explorer @TechHussainE
36 Followers 208 Following AI & ML Enthusiast @Cisco. Always exploring 🧠 and loving 🐕 life.
Mohsen Fayyaz @mohsen_fayyaz
196 Followers 418 Following PhD Student @ UCLA #NLProc #MachineLearning
Xiaoyan Bai @Elenal3ai
77 Followers 353 Following 1st-year PhD @ChicagoHAI @UChicagoCS /prev. BE in CS @UMich @michigan_AI
Sachit Malik @isachitmalik
167 Followers 4K Following Hola | Security Engineering at Apple | Alum: Carnegie Mellon; IIT Delhi
Hwaran Lee @hwaran_lee
415 Followers 274 Following Assistant Professor @sogang_univ. | ex-@NAVER_AI_Lab | PhD from @KAIST
MELT Workshop @MeltWorkshop
50 Followers 15 Following 🌍 Workshop on multilingual & culturally aware AI. Co-located with @COLM_conf 2025 in Montreal, Canada https://t.co/Z9KGqDSJRy
Majid Daliri @daliri__majid
445 Followers 826 Following Right-leaning liberal | PhD @NYUniversity | @Apple Scholar in AI/ML | Researching intelligence, human & machine
Benjamin Gögge-Feier... @bnggge
560 Followers 2K Following games + data + humanities - m.a. game dev + research, b.a. cultural sciences | DE: @bggg
JHU CLSP @jhuclsp
7K Followers 6K Following Center for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw @[email protected]
Dara @dara_tourt
13 Followers 8K Following
Haneul Yoo @HaneulYoo13
473 Followers 221 Following #NLProc PhD candidate at @kaistpr & Visiting scholar at @nyuniversity | Prev. @naver_ai_lab @upstageai @csiro @iamkepco
Hamed Esmaeilion @esmaeilion
705K Followers 90 Following Board member of 752AFV, Human Rights Activist
ThatsEnough @ThatsEnough2022
507 Followers 604 Following سیاهی امروز ما یک واقعیت است اما سیاهی اینده یک احتمال است که ما میتوانیم در آن تاثیر گذار باشیم.
علی شریفی ز�... @SharifiZarchi
139K Followers 438 Following عضو هیاتعلمی هوشمصنوعی و بیوانفورماتیک، دانشکدهی مهندسی کامپیوتر، دانشگاه صنعتی شریف. رییس کمیتهی علمی بینالمللی المپیاد جهانی هوشمصنوعی.
Nakhoda @Pr_1266
3K Followers 3K Following Sr. AlgroTrader, Ex Sr. ML Engineer, Founder of #NakhodaFinanceCo, $DJI, $NQ and #XAU Trader.
حافظه تاریخ... @hafezeh_tarikhi
295K Followers 1 Following Historical Memory / جهت ثبت و بازخوانی کانال تلگرام: https://t.co/0au5s7DgBX
Kianoosh @kianoosha76
7K Followers 849 Following
Yann LeCun @ylecun
949K Followers 764 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Barbara Plank @barbara_plank
9K Followers 969 Following I moved to a new sky, find me there. Account no longer active. https://t.co/wnGfbtVgZ3
CIS, LMU Munich @CisLmu
1K Followers 132 Following Center for Information and Language Processing (CIS): #NLProc research group @LMU_Muenchen led by @HinrichSchuetze and @barbara_plank
مطـــ @motiff__
552 Followers 193 Following پیرو مکتب گشادیسم، شاید به خاطر همین اخلاقمه، همیشه نگران
(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
زبونشناس @zabunshenas
29K Followers 785 Following مطالب مربوط به زبانشناسی، روانشناسی زبان، علوم شناختی با ویدئوهای جسته گریخته از موسیقی کلاسیک و ویولون. بعضا حرفهای نامربوط به مسائل روز
Behnam Neyshabur @bneyshabur
29K Followers 857 Following Research @AnthropicAI (Co-lead Discovery team) 💼 Past: Gemini @GoogleDeepMind (Co-led Blueshift team) 🧠 LLM Reasoning / AI Scientist 🎒Traveling & Backpacking
Leonie Weissweiler @LAWeissweiler
1K Followers 321 Following postdoc @UT_Linguistics with @kmahowald | PhD @cislmu, prev. @Princeton @LTIatCMU @CambridgeLTL computational linguistics, construction grammar, morphology
اِل زِد آر @lzr7823
228 Followers 183 Following
Hugo Larochelle @hugo_larochelle
121K Followers 639 Following Mila Scientific Director. Ex @Google DeepMind & Twitter Cortex. Father of 4. // Directeur scientifique à Mila. Ex @Google DeepMind & Twitter Cortex. Père de 4.
Sakana AI @SakanaAILabs
54K Followers 0 Following We are building a world class AI R&D company in Tokyo. We want to develop AI solutions for Japan’s needs, and democratize AI in Japan. https://t.co/1q07mb3TzE
Pieter Delobelle @pieterdelobelle
423 Followers 532 Following LLMs and tokenization - Prev: @apple, @aleph__alpha, PhD & postdoc @KU_Leuven
Per Engzell @pengzell
9K Followers 8K Following Associate Prof @UCLSocRes, ERC Starting Grant MaMo 2025-2030, Programme Lead MSc Sociology, Associate Ed RSSM & ESR. Also at @SOFI_su_se @NuffieldCollege 🟦☁️
Dirk Wulff @dirkuwulff
1K Followers 1K Following Senior scientist @mpib_berlin and @unibasel_en | language models, decision science, and sustainability (https://t.co/G46Ym2eo37).
Fred Sala @fredsala
1K Followers 647 Following Assistant Professor @WisconsinCS. Chief scientist @SnorkelAI. Working on machine learning & information theory.
Yejin Choi @YejinChoinka
25K Followers 402 Following professor at Stanford, researcher at NVIDIA, adventurer at heart
Real AI @ UCSB @ai_ucsb
287 Followers 160 Following
typedfemale @typedfemale
38K Followers 534 Following a really exciting new account "advanced pytorch user" - @cHHillee alt: @typedalt
Prime Intellect @PrimeIntellect
45K Followers 26 Following find compute. train models. contribute to open superintelligence. https://t.co/ZRZOsRRbwr
Elizabeth Salesky @esalesk
1K Followers 772 Following Research Scientist @GoogleDeepMind・PhD @jhuclsp・I like bubbles, bicycles, and language variation・https://t.co/x2ZlH1yuj6
Jan Hendrik Metzen @jan_metzen
175 Followers 598 Following Senior AI Researcher at IPAI Aleph Alpha Research @Aleph__Alpha.
Casper Hansen @casper_hansen_
10K Followers 457 Following NLP Scientist | AutoAWQ Creator | Open-Source Contributor
Badr M. Abdullah, PhD... @badr_nlp
968 Followers 2K Following Researcher @LSTSaar | Saarland University🦉 💬 Speech & Language Processing 💫 Machine Learning 🤖 Cognitive Science 🧠
Guardrails AI @guardrails_ai
3K Followers 4 Following Building the guardrails around large language models. Discord: https://t.co/PkSO3mMUvH
LM Studio @lmstudio
38K Followers 121 Following Download and run local LLMs on your computer 👾 https://t.co/e2E0DLMFJ5
Yawar Siddiqui @yawarnihal
1K Followers 568 Following Researcher in 3D Computer Vision at Meta. Views expressed are my own.
Clara Isabel Meister @clara__meister
2K Followers 55 Following Post-doc teaching a continuing studies program at ETH Zurich. Still figuring out how Twitter works... 🤦♀️
Catherine Arnett @linguist_cat
797 Followers 572 Following NLP Researcher @AiEleuther. PhD @UCSanDiego Linguistics. Previously @pleiasfr @EdinburghUni. Interested in multilingual NLP, tokenizers, open science. She/her.
Jack Morris @jxmnop
45K Followers 974 Following research @cornell @meta // language models, information theory, science of AI
Nathan C. Frey @nc_frey
4K Followers 1K Following In the arena | Ex-@PrescientDesign • @Genentech | Advisor @atomscale & @guidelabsai | @MIT, @Penn PhD, @BerkeleyLab
Social Computing Lab @DiesnerLab
273 Followers 323 Following Social Computing lab at the iSchool at UIUC. Human-Centered Data Science, Computational Social Science, NLP, Network Analysis, Responsible Computing.
Jana Diesner @janadiesner
529 Followers 341 Following Professor at University of Illinois Urbana Champaign. Social Computing, Network Science, Natural Language Processing, AI.
Jeremy Nguyen ✍🏼... @JeremyNguyenPhD
23K Followers 797 Following A.I. for writing, productivity, business | College Prof, A.I. Educator, A.I. Researcher | Writer on Disney+ show | Father to newborn, so sleepy
Runjin Chen @RunjinChen
503 Followers 52 Following Research Fellow @AnthropicAI | PH.D. student @UTAustin @VITAGroupUT | Previously BS/MS @sjtu1896
augustus odena @gstsdn
10K Followers 3K Following AI research at Meta. Previously cofounder at @AdeptAILabs. Invented Scratchpad / Chain-of-Thought.
Workshop on Large Lan... @l2m2_workshop
138 Followers 12 Following The First Workshop on Large Language Model Memorization.
Đorđe Klisura @Klisura_djordje
45 Followers 193 Following PhD Candidate in Information Technology at UT San Antonio @utsa
Longyue Wang @ACL2025 @wangly0229
2K Followers 475 Following Dr. | Senior Staff Engineer @AlibabaGroup | IEEE Senior Member | Previously @DCU, @TencentGlobal
Moshe Binieli 🇮�... @MosheBinieli
652 Followers 458 Following AI Expert & Software Engineer | AI is far more dangerous despite its benefits | MSc in Computer Science | Tech Blogger | https://t.co/fbaWDIHq1T
Nino Scherrer @ninoscherrer
993 Followers 2K Following Research Scientist at @Google | Rigorous evaluations, cognitive science & causality | Ex: {@PatronusAI, @VectorInst, @Mila_Quebec, @MPI_IS, @ETH_en}
Johannes Oswald @oswaldjoh
1K Followers 641 Following Research Scientist, Paradigms of Intelligence Team, Google Zurich
AtCoder @atcoder
36K Followers 3K Following プログラミングコンテスト運営サービス「AtCoder」 の公式アカウントです。コンテストの情報についてお知らせします。ADTの開催通知は@atcoder_adt リプライ/DMについては対応しておりません。お問い合わせはこちらから https://t.co/fWDVxKIucj
Psyho @FakePsyho
25K Followers 366 Following Game Designer; Problem Solver; past: OpenAI (Dota), Pro Competitive Programmer, Poker
Shital Shah @sytelus
13K Followers 11K Following Mostly research and code. If universe is an optimizer, what is its loss function? All opinions are my own.
Perplexity @perplexity_ai
336K Followers 63 Following Curiosity changes everything. Download our free app on iOS, Mac, Windows, and Android: https://t.co/BBZ1kG0TVG
MPIDR @MPIDRnews
8K Followers 850 Following The Max Planck Institute for Demographic Research (MPIDR) is one of the largest demographic research bodies in Europe and part of @maxplanckpress.
Aliakbar Akbaritabar,... @Akbaritabar
2K Followers 1K Following Computational Social Scientist with a background in Sociology | Research scientist at the Max Planck ins. for Demographic Research @MPIDRnews | prior @DZHW_info
Faeze Ghorbanpour @FaezeGhorbanpor
106 Followers 2K Following PhD at @TU_Muenchen NLP Researcher at @CisLmu Affiliated with @MunichCenterML
Sgt Sref @sergeantsref
18K Followers 1 Following -- Finding The Best Midjourney Style References -- So You Don't Have To