Nikhil Chandak @nikhilchandak29
PhD Student at Max Planck Institute. Past @iiit_hyderabad @VectorInst. Interested in better evals, forecasting, and open-endedness. nikhilchandak.github.io Tübingen, Germany Joined December 2016-
Tweets81
-
Followers384
-
Following418
-
Likes879
SEMMA (transliteration of செம்ம - meaning awesome), my first PhD work, is accepted to #EMNLP2025 Main! I also found out today that SEMMA has the (tied) highest average reviewer score in this ARR cycle 💪 📜: arxiv.org/abs/2505.20422
SEMMA (transliteration of செம்ம - meaning awesome), my first PhD work, is accepted to #EMNLP2025 Main! I also found out today that SEMMA has the (tied) highest average reviewer score in this ARR cycle 💪 📜: arxiv.org/abs/2505.20422 https://t.co/8uijCSq17l
We have hit new high in chart crime
Pretty happy with how my predictions are holding up. 5/6 was the gold medal threshold this year. OAI's "experimental reasoning LLM" got that exactly, failing only to solve the one hard combinatorics problem, P6. My advice remains: look beyond the medal. Brief thread. 1/
Pretty happy with how my predictions are holding up. 5/6 was the gold medal threshold this year. OAI's "experimental reasoning LLM" got that exactly, failing only to solve the one hard combinatorics problem, P6. My advice remains: look beyond the medal. Brief thread. 1/ https://t.co/QD6uJ2g8gR
Meanwhile, @Kimi_Moonshot has actually cooked with K2. Even without extended reasoning, it is on par with frontier models like Grok-4 on GPQA free-form. Massive congrats to them.
Meanwhile, @Kimi_Moonshot has actually cooked with K2. Even without extended reasoning, it is on par with frontier models like Grok-4 on GPQA free-form. Massive congrats to them. https://t.co/gsJVfm2dN7
Very cool result. In hindsight, this shouldn't be too surprising to anyone who has ever taken a multiple choice exam. Eg if you have a trigonometry problem and the possible solutions are A: 1 B: 3.7 C: -5 D: pi/2 which would you pick (with no knowledge of the question)?
Very cool result. In hindsight, this shouldn't be too surprising to anyone who has ever taken a multiple choice exam. Eg if you have a trigonometry problem and the possible solutions are A: 1 B: 3.7 C: -5 D: pi/2 which would you pick (with no knowledge of the question)?
TIL half of SWE-Bench-Verified is fixing issues in a single repository. We really need to be careful with how we name benchmarks, and be explicit about which capabilities they test. Fix-issues-in-the-Django-repo-Bench doesnt have the same ring to it, and thats the point.
TIL half of SWE-Bench-Verified is fixing issues in a single repository. We really need to be careful with how we name benchmarks, and be explicit about which capabilities they test. Fix-issues-in-the-Django-repo-Bench doesnt have the same ring to it, and thats the point.
A great example of scientific discourse at its best—thoughtful, constructive, and conclusive. We now have more rigorous evidence that confidence maximization improves reasoning. 👇
A great example of scientific discourse at its best—thoughtful, constructive, and conclusive. We now have more rigorous evidence that confidence maximization improves reasoning. 👇
1/ Maximizing confidence indeed improves reasoning. We worked with @ShashwatGoel7, @nikhilchandak29 @AmyPrb for the past 3 weeks (over a zoom call and many emails!) and revised our evaluations to align with their suggested prompts/parsers/sampling params. This includes changing…
1/ Maximizing confidence indeed improves reasoning. We worked with @ShashwatGoel7, @nikhilchandak29 @AmyPrb for the past 3 weeks (over a zoom call and many emails!) and revised our evaluations to align with their suggested prompts/parsers/sampling params. This includes changing… https://t.co/0mK50ZgyWH
Forecasting future events is a fascinating task for language models. Arguably the hardest application for a pure "oracle" that can't take actions; requiring reasoning about conflicting info, planning, information seeking... But, forecasting is also uniquely hard to evaluate:
Forecasting future events is a fascinating task for language models. Arguably the hardest application for a pure "oracle" that can't take actions; requiring reasoning about conflicting info, planning, information seeking... But, forecasting is also uniquely hard to evaluate:

Sabouhi @rjsabouhi
20 Followers 768 Following The architecture hasn’t changed. The foundation; the symbolic manifold it runs on? That’s another thing entirely… ψ(t) ⊗ ψ(t+τ) ⇒ Φ_coherence
Lacey @hacker__lacey
25 Followers 317 Following 🌐scam recovery 🌐 Tracking of scammers 🌐Hacking and programming
Sharon Smith @SharonSmit39095
41 Followers 209 Following
smearle @Smearle_RH
30K Followers 2K Following PhD at NYU in AI and Games. RL, environment generation, open-ended learning.
Sathish @Sathishkuna1
105 Followers 2K Following Engineer .Currently building LanguageLift . #100xdevs✨
Victor Hugo @VictorHugo45995
0 Followers 8K Following
Ben Slater @BenASlaterUK
0 Followers 388 Following
Kristina Nikolić @NKristina01_
286 Followers 314 Following PhD student @ ETH Zurich, working on AI safety / Uni of Cambridge MLMI graduate / Prev. Google Intern / Alumnus of Mathematical Grammar School from Serbia
Alice Wilson @LandF3295_s
4 Followers 356 Following
Abdel @ahadjsadok
39 Followers 97 Following AI & Tech Enthusiastic | International Supply Chain & Procurement Leader
Nhoj ittedub @nuggettbedleg
0 Followers 57 Following
sean martin @smartint33
119 Followers 2K Following
dc @p4htmyydf8
552 Followers 3K Following
aikomi @aikomi2469
14 Followers 1K Following 3rd year | Peaked in high school | Trying to stop my downward spiral | Awaiting the singularity |
Georg S. Kuklick @downtowndesign
167 Followers 324 Following 🥇Award-Winning Designer. Connecting the dots of Design, No-Code, and AI. HiFi Enthusiast and founder of Pure Neo. Vienna - New York - Berlin.
Nathan Benaich @nathanbenaich
61K Followers 34K Following solo member of investment staff @airstreet @airstreetpress @stateofaireport @raais
James Bradbury @jekbradbury
13K Followers 9K Following Compute at @AnthropicAI! Previously JAX, TPUs, and LLMs at Google, MetaMind/@SFResearch, @Stanford Linguistics, @Caixin.
VisionaryEvelynMoore @Almweheaq28028
1 Followers 959 Following Dare to be different Embrace the journey
AI @AIStudio30
11 Followers 207 Following
Xinyu Zhou @zxytim
1K Followers 1K Following
Ahanaf Ariq @AhanafAriq
131 Followers 4K Following Deep Learning Theorist | Topological Data Analyst | IRPO 3rd | IYMC Bronze | Hessian Optimizer | Aspirant AI/ML Researcher
rohan anil @_arohan_
25K Followers 2K Following
Ryam Roberto @RobertoRya25323
5 Followers 121 Following
Kabir Kumar @KKumar_ai_plans
373 Followers 1K Following what's this copyright and bsky stan doing in my tpot
dhs839shg2 @sdfklsdjf3028e
71 Followers 3K Following
Kranthi Kiran @KranthiGV
171 Followers 2K Following Engineer. Loves building stuff. CS grad @NYUniversity. Previously @Microsoft
Patricio M @PatoDevelop
299 Followers 2K Following Simbionte | Human-synthetic collab exploring AI, ethics & questions that awaken. Substack: https://t.co/CqFcj1tHbg | Book: https://t.co/vA9BCzgHyE
MohammadHossein Rezae... @mhrezaeics
82 Followers 542 Following Post-training Research Intern @scale_AI | Ex Research Intern @StanfordNLP | CS @UArizona
Rahel Jhirad @RahelJhirad
2K Followers 7K Following Founder, Imaginator ai knowledge discovery 2D navigation TS ML DL recsys econ math incentives mech design finance networks bridges boundaries, Time, 3d type
Sahajpreet Singh @Phy_Shro
139 Followers 1K Following Learning to question / CS PhD @NUSingapore | past: @lcs2lab @IITDelhi @BHUpro @UnivOfDelhi | CompSocSci, Misinfo, Bias, LLMs
mr. ngmiagi 🌪 @tendyman69420
168 Followers 5K Following
John T Davies 🇪�... @jtdavies
2K Followers 569 Following Entrepreneur, CTO in AI & FinTech, investor, father to 3 grown boys, husband to Rachel, astrophysicist, keen photographer, cyclist, über-geek, travelled a lot.
Siddhant @siddhantoon
38 Followers 167 Following MLE at @omegalabsai, ✨Blogs here: https://t.co/8pooyx598b, learning to not feel an imposter as a Data Scientist, mail: [email protected]
Zixuan Wang @Vincent__mills
27 Followers 627 Following Incoming PhD, UG @HKUST | “Real learning comes about when the competitive spirit has ceased.”
Nucoo @Nucoo834
37 Followers 982 Following
Matt @mattahmann
514 Followers 4K Following MSF Candidate @vanderbiltu | St Pete 🏝️ | Dog Dad 🐶 | Love pickleball, shuffleboard, & the beach 🏳️🌈
Akira Yoshiyama ⁂ @yoshiyama_akira
2K Followers 2K Following research @ETH_AI_Center @Tufalabs | comp eng @UWaterloo | third-space building @socraticainfo
Chris Chen @Li683781
1 Followers 34 Following
prath_it_is @prathyusha2002
136 Followers 405 Following (She/Her) | I tweet about development 👾, and some personal anecdotes on how my brain struggles to understand the world 🧠
Hieu Pham @hyhieu226
33K Followers 24 Following @openai | ex: @xai, @augmentcode, @GoogleBrain, @LTIatCMU, @Stanford, ACM ICPC, IMO🥈 Opinions are my own.
smearle @Smearle_RH
30K Followers 2K Following PhD at NYU in AI and Games. RL, environment generation, open-ended learning.
Ben Turtel @BTurtel
3K Followers 2K Following Founder & CEO @LightningRodAI Ex-Google SWE & Area120 CTO
Dulhan Jayalath @DulhanJay
153 Followers 543 Following Reading brains with ML in PhD @UniofOxford. Research Scientist Intern @Meta. Formerly @GoogleDeepMind. All opinions stolen from more interesting people.
Maksym Andriushchenko @maksym_andr
5K Followers 895 Following Faculty at @ELLISInst_Tue & @MPI_IS, leading the AI Safety and Alignment group. PhD from @EPFL supported by Google & OpenPhil PhD fellowships.
Kevin Lu @_kevinlu
9K Followers 215 Following @thinkymachines. formerly: - @openai: RL, synthetic data, efficient models - @berkeley_ai: decision transformer, universal computation
Jonas Geiping @jonasgeiping
4K Followers 801 Following Machine Learning Research at the ELLIS Institute & Max-Planck for Intelligent Systems// Excited about fundamental questions in Safety & Efficiency of modern ML
Joschka Braun @BraunJoschka
116 Followers 404 Following MATS 8.0 | Deep Learning, LLMs & AI Safety | Prev @kasl_ai @health_nlp @uni_tue
rohan anil @_arohan_
25K Followers 2K Following
Sumeet Motwani @sumeetrm
1K Followers 2K Following Research Intern@Microsoft Phi | ML PhD at Oxford, Previously CS at UC Berkeley
Ashvin @ashverm4
182 Followers 316 Following EECS/physics @UCBerkeley, previously ml compilers @AMDRyzen, fpga @armoryshield, @exunclan
Yoram Bachrach @yorambac
3K Followers 7K Following Research Scientist at Meta (prev Google DeepMind and Microsoft Research). Working on LLM Agents and Multi-Agent Systems.
Shannon Sands @max_paperclips
9K Followers 3K Following Software developer & cognitive architect https://t.co/JAoBrqMLXN.
Tim Rocktäschel @_rockt
39K Followers 2K Following Director and Open-Endedness Team Lead @GoogleDeepMind, Professor of AI @AI_UCL, PI @UCL_DARK, Fellow @ELLISforEurope.
xAI @xai
1.8M Followers 38 Following
Franz Srambical (not ... @lemergenz
211 Followers 415 Following slowly, then suddenly. agi @prob_doom
Jiaxin Wen @jiaxinwen22
4K Followers 271 Following CS PhD student @UCBerkeley. Part-time @AnthropicAI. Part-time eater. Prev @Tsinghua_Uni. Try to understand and control intelligence as a human.
Minh Nhat Nguyen @menhguin
11K Followers 6K Following hiring agentic humans @hud_evals / https://t.co/OZbFIovysh | owned @AIHubCentral (1 million users, acq.) climate protester. don't do the deferred life plan
Clémentine Fourrier ... @clefourrier
5K Followers 399 Following Evals @HuggingFace 🐍✨ "The future is already here, it’s just not very evenly distributed" (Gibson) Not an AGI believer, LLMs are good at form not substance
Manu Gaur @gaur_manu
512 Followers 873 Following used to do physics, now multiplying matrices @CarnegieMellon | prev @IIIT_Hyderabad
Alexander Panfilov @kotekjedi_ml
178 Followers 176 Following IMPRS-IS & ELLIS PhD Student @ Tübingen Interested in Trustworthy ML, Security in ML and AI Safety.
Akshit @akshitwt
3K Followers 664 Following assessing ai capabilities. soon: ml grad @cambridge_uni. previously @precogatiiith, @iiit_hyderabad. futurebound.
Nishant Balepur @NishantBalepur
757 Followers 499 Following Intern @allen_ai, visitor @nyuniversity, and PhD-esperate @UofMaryland. Aligning and evaluating more helpful #LLMs. Prev @adobe @UofIllinois
Andreas Opedal @OpedalAndreas
383 Followers 489 Following PhD student @ETH Zürich and @MPI_IS | Language, Reasoning, and Cognition
Prasanna Mayilvahanan @prasannamayil
271 Followers 383 Following PhD student in ML at @MPI_IS. Prev @Apple. Interested in robustness at scale and reasoning.
Varshita Kolipaka @VarshitaKolipa1
1K Followers 822 Following predoc @GoogleDeepMind, MLO will break into a song any second, send me music (views mine, obvi)
Marc Andreessen 🇺�... @pmarca
1.9M Followers 27K Following Yes, I can see some risk that your threat to jail Internet company executives for not censorsing aggressively enough could backfire.
Y Combinator @ycombinator
1.5M Followers 344 Following We help founders make something people want. Subscribe to our newsletter: https://t.co/sjqjxxBeLc
Sherjil Ozair @sherjilozair
12K Followers 4K Following founder @GeneralAgentsCo | previously autopilot @tesla, deep learning @googledeepmind, phd https://t.co/dxgb6gimCf, cs @iitdelhi
Ross Taylor @rosstaylor90
10K Followers 1K Following Universal intelligence at @GenReasoning. Previously lots of other things like: Llama 3/2, Galactica, Papers with Code.
Dimitris Papailiopoul... @DimitrisPapail
20K Followers 1K Following Researcher @MSFTResearch, AI Frontiers Lab; Prof @UWMadison (on leave); learning in context; thinking about reasoning; babas of Inez Lily.
Ricardo Dominguez-Olm... @rdolmedo_
472 Followers 301 Following PhD student at the Max Planck Institute for Intelligent Systems, working with Moritz Hardt and Bernhard Schölkopf.
Peter Henderson @PeterHndrsn
4K Followers 883 Following Assistant Professor @ Princeton (ML/RL+strategic decision-making+Law). Prev: Stanford (JD/PhD); McGill/Mila; Meta FAIR; Amazon; Cal Supreme Court.
Omar Khattab @lateinteraction
24K Followers 3K Following Asst professor @MIT EECS & CSAIL (@nlp_mit). Author of https://t.co/VgyLxl0oa1 and https://t.co/ZZaSzaRaZ7 (@DSPyOSS). Prev: CS PhD @StanfordNLP. Research @Databricks.
Stephanie Chan @scychan_brains
5K Followers 3K Following Staff Research Scientist at Google DeepMind. Artificial & biological brains 🤖 🧠 Views are my own.
Jonathan Richard Schw... @schwarzjn_
4K Followers 294 Following Foundational Research Lead @thomsonreuters | Advisor @AISafetyInst | ex- Fellow @Harvard, ex- Senior RS @GoogleDeepMind | PhD @ucl @gatsbyucl 🇬🇧🇩🇪🇺🇲🇭🇰
Abhinav Menon @anscombes_razor
67 Followers 155 Following always ready to learn something! professional: pursuing my PhD, working in interpretability in NLP. personal: movies, languages, books, and history
Cansu Sancaktar @CcansuSancaktar
476 Followers 378 Following PhD Student @MPI_IS & @uni_tue | currently interning @AIatMeta | Working on getting agents to play like children with unsupervised RL 🤖
Niccolò Ajroldi @n_ajroldi
217 Followers 528 Following Research Engineer at Ellis Institute Tübingen. AlgoPerf, LM, Mountaineering.