Come work with us! The Machine Learning Research (MLR) team at Apple is seeking a passionate AI researcher to work on Efficient ML algorithms:
jobs.apple.com/en-us/details/…
Healthy and unhealthy strategies for coping with the Apple paper:
- attack Apple for publishing it (which does nothing to address the underlying problems they pointed out)
or
- figure out its implications and develop a robust alternative (the healthier option)
Healthy and unhealthy strategies for coping with the Apple paper:
- attack Apple for publishing it (which does nothing to address the underlying problems they pointed out)
or
- figure out its implications and develop a robust alternative (the healthier option)
🧵 1/8 The Illusion of Thinking: Are reasoning models like o1/o3, DeepSeek-R1, and Claude 3.7 Sonnet really "thinking"? 🤔 Or are they just throwing more compute towards pattern matching?
The new Large Reasoning Models (LRMs) show promising gains on math and coding benchmarks,…
I will be attending #ICLR this week to present our GSM-Symbolic paper, and we also have a full-time opening on our team! Let me know if you're interested in discussing reasoning and/or joining us!
I will be attending #ICLR this week to present our GSM-Symbolic paper, and we also have a full-time opening on our team! Let me know if you're interested in discussing reasoning and/or joining us!
It was a pleasure joining @MLStreetTalk during the NeurIPS conference in December.
While it might seem that a lot has changed over the past 3 months (e.g., with new models like o3/R1), I still believe the current models are not capable of reasoning :)
youtube.com/watch?v=yQPdue…
Exactly! I wish that at least academic people understood this.
"All" models we have today are trained using cross-entropy to fit a distribution => By design, It is "impossible" for them to generate anything outside of that distribution.
Exactly! I wish that at least academic people understood this.
"All" models we have today are trained using cross-entropy to fit a distribution => By design, It is "impossible" for them to generate anything outside of that distribution.
Amazing analysis! This has been THE question I was thinking about every single day in the past month.
Although, I think if the model knows the algorithm (multiplication), we can only measure the accuracy of execution by the model and not necessarily their search/reasoning power.
Amazing analysis! This has been THE question I was thinking about every single day in the past month.
Although, I think if the model knows the algorithm (multiplication), we can only measure the accuracy of execution by the model and not necessarily their search/reasoning power.
🍏🍏🍏 Come work with us at Apple Machine Learning Research! 🍏🍏🍏
Our team focuses on curiosity-based, open research.
We work on several topics, including LLMs, optimization, optimal transport, uncertainty quantification, and generative modeling.
Infos 👇
We have open-sourced GSM-Symbolic templates and generated data! 🎉
- Github: github.com/apple/ml-gsm-s…
- Hugging Face: huggingface.co/datasets/apple…
I will be also attending #NeurIPS2024. If you are also attending and would like to discuss research ideas on reasoning, let's connect :)
1/🔔Excited to share my internship work, SALSA: Soup-based Alignment Learning for Stronger Adaptation, (NeurIPS workshop paper)! 🎉
Proximal Policy Optimization (PPO) often limits exploration by keeping models tethered to a single reference model. SALSA, however, breaks free…
1/ LLM inference is very expensive; and LLMs don't necessarily use their full capacity to respond to a specific prompt. That's why many researchers have been investigating adaptive computation methods such as early exiting, layer/expert pruning, speculative decoding, mixture of…
** Intern position on LLM reasoning **
@mchorton1991, @i_mirzadeh, @KeivanAlizadeh2
and I are co-hosting an intern position at #Apple to work on understanding and improving reasoning capabilities of LLMs. The ideal candidate:
- Has prior publications on LLM reasoning
- Is…
** Intern position on LLM reasoning **
@mchorton1991, @i_mirzadeh, @KeivanAlizadeh2
and I are co-hosting an intern position at #Apple to work on understanding and improving reasoning capabilities of LLMs. The ideal candidate:
- Has prior publications on LLM reasoning
- Is…
📢Internships at Apple ML Research🍏
We’re looking for a PhD research intern with interests in uncertainty quantification, LLMs, probabilistic ML and/or decision making under uncertainty!
See thread for more details 👇
[1/3]
1/ Can Large Language Models (LLMs) truly reason? Or are they just sophisticated pattern matchers? In our latest preprint, we explore this key question through a large-scale study of both open-source like Llama, Phi, Gemma, and Mistral and leading closed models, including the…
I was waiting and hoping the ML community on Twitter would move over to Mastodon so I wouldn't have to create an account here. But, well… here we are! :)
44 Followers 53 FollowingYoung Founder - CEO at “LuckyWeb” | Digital Business Consultant | Helping Businesses Scale with Web Design, Automation & Artificial intelligence
536 Followers 3K FollowingArchitect | interior designer | photographer
_______برای دیدن پروژههای معماری و عکاسیم قسمت media رو ببین. اگر هم کاملشونو خواستی بیا لینکدین و یا اینستاگرامم
2K Followers 900 FollowingHiring: resume to [email protected]
to love math is to see the face of God
Morgan Prize, Rhodes Scholar
Math PhD@Stanford; Neuro@Oxford; Math+Physics@MIT
46K Followers 2K FollowingSenior correspondent covering AI @WIRED • Subscribe to my newsletter https://t.co/jxLAFHz8UP • Robison (rah-beh-son) not Robinson • Send tips on Signal @ kylie.01
48K Followers 672 FollowingProfessor, Santa Fe Institute. Mostly posting on https://t.co/4NpA2IL5Va (at-melaniemitchell). More thoughts at https://t.co/nC43NHRozX.
488K Followers 146 FollowingNobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
2K Followers 185 FollowingI research fundamental physics and the brain. Distance is a creation of the mind. Intelligence is deterministic and causal, not probabilistic and correlational.
47K Followers 110 FollowingMy new LM book: https://t.co/YXNQUy7O3t
PhD in AI, author of 📖 The Hundred-Page Language Models Book and 📖 The Hundred-Page Machine Learning Book
186K Followers 105 FollowingWe're sharing/showcasing best of @github projects/repos. Follow to stay in loop. Promoting Open-Source Contributions. UNOFFICIAL, but followed by github
292 Followers 207 FollowingComputer Science PhD student in the University of Maryland, College Park
Former BSc student in the Sharif University of Technology
45K Followers 1K FollowingNeuroscientist interested in cognitive-emotional brain
Author of The Entangled Brain (MIT Press); The Cogitive-Emotional Brain
Neuroscience & Philosophy Salon
2K Followers 20 FollowingMIT Doctor of Philosophy, strategist, polymath, engineer, lifelong learner, problem solver, and communicator. Ally to All humanity. @MLStreetTalk pod.
35K Followers 628 FollowingMLST is by Dr. Tim Scarfe @ecsquendor w/ cameos from @DoctorDuggar https://t.co/5YCv2SdFwN (early access/priv.discord) - Sponsor us!
633 Followers 438 FollowingPhD Student at @spcl_eth, focused on High-Performance Computing and Large Scale Deep Learning | Prev. intern at @Apple, @Microsoft, and @MSFTResearch
151K Followers 37 FollowingKnown as Mad Max for my unorthodox ideas and passion for adventure, my scientific interests range from artificial intelligence to the ultimate nature of reality
104K Followers 775 FollowingI’ve dedicated my life to understand intelligence and consciousness, and to harness this knowledge to invent and create tools to empower people. @microsoftai
16K Followers 495 FollowingHarvard Professor.
Full stack ML and AI.
Co-director of the Kempner Institute for the Study of Artificial and Natural Intelligence.
455 Followers 247 FollowingHuie-Rogers Endowed Chair Professor of CS @WSUPullman; alumni @OregonState @IITKanpur; AI, ML, Computing Systems, AI for Science and Engineering
30K Followers 123 FollowingMechanistic Interpretability lead DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!
25K Followers 101 FollowingDirector, @PrincetonPLI and Professor @PrincetonCS. Seeks math/conceptual understanding of deep learning and large AI models.
Also on the "other" social network
1.2M Followers 279 FollowingWe’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
No recent Favorites. New Favorites will appear here.