Jason Alan Fries @ ICLR 2024 @jasonafries
Research scientist at Stanford University. Working on healthcare AI, foundation models, and data-centric AI. web.stanford.edu/~jfries/ California, USA Joined June 2010-
Tweets902
-
Followers1K
-
Following425
-
Likes2K
Come by #ICLR2024 Session 2 on Tuesday to see our work using representation editing to make foundation models robust! No fine-tuning, no additional data, no problem. arxiv.org/pdf/2309.04344
There is a really nice community of researchers developing transformer alternatives. Want to highlight these impressive folks. Simran Arora (@simran_s_arora), Chunting Zhou (@violet_zct), Dan Fu (@realDanFu), and Songlin Yang (@SonglinYang4)
Fun collaboration with @ChiaChunChiang discussing #LLMs !
Fun collaboration with @ChiaChunChiang discussing #LLMs !
A Mamba Primer (w/ Yair Schiff youtube.com/watch?v=dVH1dR… ) Mamba is a nice jumping off point to summarize foundational ideas in sequence modeling, parallel algorithms, continuous-time representations, and GPU aware algorithms. We try to put these together in the context of LMs.
Thrilled to share that my lab is looking for postdocs! In partnership with @ChanZuckerberg, we're focusing on developing massive biomedical foundation models to create an AI-powered virtual cell. Dream of harnessing the power of 1,000 H100s? Apply now at: snap.stanford.edu/apply/index-po…
Recommend following David Hall (@dlwh) and the Levanter project from @StanfordCRFM . Just no nonsense details about fixing the pain-points of scaling LLM training, one at a time.
I like to talk about Levanter’s performance, reproducibility, and scalability, but it’s also portable! So portable you can even switch from TPU to GPU in the middle of a run, and then switch back again! github.com/stanford-crfm/…
Open source LLMs need open training data. Today I release the largest dataset of English public domain books curated from the @internetarchive and the @openlibrary. It consists of more than 61 billion words and 650,000 OCR texts. Stay tuned for more! huggingface.co/datasets/story…
Deadline extended to March 8th for the Multimodal4Health workshop (at ICHI 2024). Submit your papers at luomancs.github.io/Multimodal4Hea…
A recent MIT study claimed open models can help create bioweapons. But it didn’t test if they’re more useful than just having internet access (and later studies found they aren’t). How can we assess the impact of open foundation models? New paper: crfm.stanford.edu/open-fms
Our clinical #NLP work just published in @NatureMedicine! We present a framework to adapt & evaluate #LLMs for summarization. Physicians 🩺 prefer #LLM summaries to those of #medical experts❗ Big step to reduce documentation 📚 and focus more on personalized care 🙌 A 🧵
Are #LLMs ready for deployment into the clinic? How can we tell if they are vs. are not? @jasonafries does a great job laying out the current state of affairs for evaluating medical LLMs and how our recent work, MedAlign (medalign.stanford.edu), fits into the bigger picture.
Really interesting 🧵 about evaluating large language models’ performance in a health care context!
Excellent thread! I always wondered why there is a rush by medical LLMs to show case MedQA in leaderboards than comparing their performances against clinicians using EHR data.
Superb tweet chain by @jasonafries reg the work on medalign.stanford.edu and why it matters. Check out the series 👇
Superb tweet chain by @jasonafries reg the work on medalign.stanford.edu and why it matters. Check out the series 👇
Very relevant work being presented at #AAAI. Great team effort led by @_scott_fleming_ @jasonafries and team!
Very relevant work being presented at #AAAI. Great team effort led by @_scott_fleming_ @jasonafries and team!
Alex Ratner @ajratner
5K Followers 553 Following @SnorkelAI @uwcse / prev @StanfordAILab – Interested in data management systems for machine learning, weak supervision, and impactful applications.Braden Hancock @bradenjhancock
1K Followers 157 Following Machine learning researcher, developer, and entrepreneur. Co-Founder & Head of Technology at @SnorkelAI.Tri Dao @tri_dao
19K Followers 366 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.Monica Agrawal @MonicaNAgrawal
3K Followers 309 Following Incoming asst prof at @DukeU (July 2024). Co-founder at @LayerHealth. ML and NLP for healthcare. PhD @mit_csail, BS/MS @stanford. She/her.vincent sunn chen @vincentsunnchen
977 Followers 388 Following building @SnorkelAI. previously, @StanfordAILab, @hazyresearch.Roxana Daneshjou MD/P.. @RoxanaDaneshjou
22K Followers 5K Following Assistant professor of Biomedical Data Science @StanfordDBDS and Dermatology @StanfordMed | AI/ML & precision health | @Rice_BioE alum | @pdsoros Fellow 2014Sara Hooker @sarahookr
39K Followers 8K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Karan Goel @krandiash
3K Followers 883 Following Founder @cartesia_ai, Machine Learning PhD at @StanfordAILab, CMU / IIT-Delhi alum.Stephen Bach @stevebach
2K Followers 425 Following Asst. prof. @BrownCSDept. Working on improving how humans teach computers. Weak supervision, zero-shot learning, few-shot learning, and high-level knowledge.Dan Fu @realDanFu
5K Followers 177 Following CS PhD Candidate at Stanford, systems for machine learning. Sometimes YouTuber/podcaster. Academic Partner, @togethercompute.Nicholas Roberts @nick11roberts
608 Followers 1K Following Ph.D. student @WisconsinCS. Working on data-centric automated machine learning. Previously at CMU @mldcmu, UCSD @ucsd_cse, FCC @fresnocity.Irene Chen @irenetrampoline
8K Followers 821 Following ML for equitable healthcare. Assistant Professor @UCBerkeley and @UCSF. Prev @Harvard, @MIT, @MSFTResearchAndrew Beam @AndrewLBeam
9K Followers 1K Following Machine Learning for Medicine. Assistant Prof: @Harvard | Founding Editor @NEJM_AI, Co-host AI Grand Rounds 🎙️, Co-founder @generate_biomedJuan M. Banda, Ph.D @drjmbanda
594 Followers 467 Following Senior Data Scientist at Stanford Health Care TDS. Helping bring AI and machine learning to the clinic. Opinions are my own and not endorsements.Fred Sala @fredsala
990 Followers 550 Following Assistant Professor @WisconsinCS. Chief scientist @SnorkelAI. Working on machine learning & information theory.Michael Moor @Michael_D_Moor
2K Followers 779 Following MD. PhD. Postdoc @Stanford CS w/ @jure. Working towards generalist medical AI, one gradient step at a time.Akshay Chaudhari @Dr_ASChaudhari
2K Followers 436 Following Assistant Professor at Stanford Radiology working on deep learning in medical imagingManojSharma @MNS_Manoj
37 Followers 284 Following Embracing impermanence bcoz everything is transient..!!Abdul Aziz @aziz_cuCSE
40 Followers 1K Following Prospective Graduate Student of CSE at @UChittagong. Research Interests: Large Language Models (LLMs), Multimodal AI, Multilingual NLP, and Lexical Semantics.Tara Henderson @TaraHender95191
18 Followers 1K FollowingAbdullah Hamdi @Eng_Hemdi
9K Followers 2K Following postdoc @Oxford_VGG 🇬🇧 | 3D Gen AI | PhD alum @KaustVision 🇸🇦@TU_Muenchen 🇩🇪 | @fihmai founder | my @tedX talk about AI inequality: https://t.co/Y24DtOsASJ𝐂𝐡𝐫𝐢𝐬�.. @christinexzhu
582 Followers 502 Following Product @ PointClickCare. Building products that maximize people's potential. Style = Substance.Thearth @ThearthcHWKqM2
2 Followers 337 FollowingZenobiaNeedham @8htg710mYHycxL
1 Followers 248 FollowingLeeWodehous @q07sDF3As3r44s0
37 Followers 2K FollowingChaitanya Shivade @ekshoonyame
243 Followers 460 Following NLP + ML for Health @amazon, ex @IBMResearch, @OhioStateMary-Anne Hartley (An.. @anniehartley_
153 Followers 202 Following LiGHT : Laboratory for Intelligent Global Health Technology @Yale @EPFL -- (I don't really use this platform anymore -- moved to LinkedIn)Jonathan Gortat @J_G1
26 Followers 36 FollowingAtul tiwari @atultiwari1
33 Followers 304 Following Technology. IT. Spirituality. Occult. PhysiologySneha Shah Jain, MD, .. @SnehaShahJain
851 Followers 958 Following Chief Cardiology Fellow at Stanford. Hopkins MD, Harvard MBA, Columbia IM residency. Formerly at Moderna Therapeutics and Flare Capital.PURPOSE: the Pain Res.. @PainResearchers
512 Followers 3K Following NIH-funded program uniting #pain #researchers across the continuum of #painresearch, from all disciplines and all career levels. https://t.co/nkmStorLSZSimon Lee @SimonLee79475
44 Followers 109 Following @CompMedUCLA Ph.D Student | LLMs, Ai in Healthcare | Overthinker | Cat Enthusiast | Prev: @celsiustx @epflBipin Singh @bipin_alld
137 Followers 2K Following Assistant Professor, Centre for Life Sciences, @MahindraUni | Computational Drug Discovery | Protein Engineering | AI/ML for Health & MedicineArjun Balaji MD @arjbalaji
77 Followers 421 Following ml, medicine, biotech // https://t.co/9wTQx6IoJk // VC @ https://t.co/O5nsxDqm2lDanyal Z Khan @dzkhan94
1K Followers 1K Following Neurosurgery Trainee @QSNeurosurgery | NIHR Academic Clinical Fellow @BRAIN_UCL @UCLIoN | #AI #CollaborativeResearch #GlobalSurgeryavnikothari @avnikothari1
0 Followers 16 FollowingYvette @cedalight1
7 Followers 892 FollowingResearch to the Peopl.. @Research2People
493 Followers 2K Following Open Patient-Partnered Research For #Oncology and #RareDisease.Raheel Sayeed, MD @rsayeed
302 Followers 758 Following biomedical informatics / Digital Health @HarvardDBMI @bos_chip also: @MedicalGear RT ≠ endorsemanluo @manluo12
93 Followers 213 Following Current Research Fellow at Mayo Clinic, PhD from ASU, former inter @Google @Meta @Salesforce, working on natural language processing and vision+language.ubaid @ubaidraj_83
1 Followers 89 Following모도리 @modori518
3 Followers 116 FollowingUIowa Computer Scienc.. @UIowaCS
375 Followers 51 Following Department of Computer Science The University of Iowa https://t.co/cO00Bc3kvdhabout632 @tulin632
80 Followers 709 FollowingZaiqiao Meng @mengzaiqiao
398 Followers 516 Following Lecturer at @ir_glasgow @GlasgowCS @UofGlasgow Affiliated Lecturer at @CambridgeLTL Working on ML, IR, NLP, KG, GNN, RecSys and AI4Science Opinions are my own.Jordi Clive @JordiClive
115 Followers 376 Following Lead Deep Learning Engineer @ChattermillAI • ML Researcher @laion_ai • SFT Team OpenAssistant • @huggingface contributor • NLG Research @imperialcollegeDuttonΦ @duttonphi
126 Followers 538 Following ..aagen (double (2x) agent).. ..previously Wołfram|Ałpha.. ..baeksu.ai.. ..jajangmyeon all day/all night..Sam Parker 🇺🇸�.. @BasedSam_Parker
80 Followers 168 Following The Official Personal Account For @SamParkerSenateWeidi Xie @WeidiXie
2K Followers 577 Following Computer Vision Researcher. Associate Professor at SJTU, Previously @Oxford_VGG. 中文名:谢伟迪 Personal Webpage: https://t.co/sZoZ0AfKrXTalles Viana @otalviana
21 Followers 224 Following swDev/música/arte/natureza https://t.co/mVXjZoZswOAlexander Makhaev @mankms
10K Followers 9K Following I work hard on https://t.co/Xx3Q0DzVeL and have fun building https://t.co/oAbu1LbRlw (or vice versa). PHP/JavaScript developer (Laravel, Symfony, React, Vue, Tailwind CSS).Vivek Ponnaiyan @viveksworld
800 Followers 638 Following Founder & Angel investor. AI & Fintech junkie. Past: Chime, BMW Self driving, Bloomberg, health-tech founder. Tweets about AI, startups, learning, & football.Minghui Chen @chenmh43
42 Followers 659 Following PhD student @UBC. Interested in federated learning, trustworthy AI, and deep-phenomena.emanon @JianSuji
81 Followers 1K FollowingTom Kerr @TomKerr53184324
5 Followers 54 FollowingAbdulaziz @abdulaziz_asz
1 Followers 653 FollowingOnno Kampman @KampmanOnno
175 Followers 1K Following Mental health care transformation at MOHT Singapore | Cognitive neuroscientist at the University of Cambridge | ML & NLP in SingaporeDickson Neoh 🚀 @dicksonneoh7
977 Followers 1K Following 🚀 I share bite-size practical machine learning deployment tips | 💡 Current Projects👉 https://t.co/ClHoj7uDia | 🎉 My best Tweets👉 https://t.co/2YzTSSRucvAndrej Karpathy @karpathy
983K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Yann LeCun @ylecun
715K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Percy Liang @percyliang
50K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistAlex Ratner @ajratner
5K Followers 553 Following @SnorkelAI @uwcse / prev @StanfordAILab – Interested in data management systems for machine learning, weak supervision, and impactful applications.François Chollet @fchollet
471K Followers 771 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.Pranav Rajpurkar @pranavrajpurkar
21K Followers 717 Following Professor at Harvard | Medical Artificial Intelligence | https://t.co/Z6tBGoluEGBraden Hancock @bradenjhancock
1K Followers 157 Following Machine learning researcher, developer, and entrepreneur. Co-Founder & Head of Technology at @SnorkelAI.Tri Dao @tri_dao
19K Followers 366 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.Henry Kiss Ehrenberg @henryehrenberg
208 Followers 136 Following co-founder + engineering @SnorkelAI | hint water enthusiast | merch shop in linkedin bioAndrew Ng @AndrewYNg
1.0M Followers 916 Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCsZachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingRichard Socher @RichardSocher
101K Followers 970 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindMonica Agrawal @MonicaNAgrawal
3K Followers 309 Following Incoming asst prof at @DukeU (July 2024). Co-founder at @LayerHealth. ML and NLP for healthcare. PhD @mit_csail, BS/MS @stanford. She/her.vincent sunn chen @vincentsunnchen
977 Followers 388 Following building @SnorkelAI. previously, @StanfordAILab, @hazyresearch.Roxana Daneshjou MD/P.. @RoxanaDaneshjou
22K Followers 5K Following Assistant professor of Biomedical Data Science @StanfordDBDS and Dermatology @StanfordMed | AI/ML & precision health | @Rice_BioE alum | @pdsoros Fellow 2014James Zou @james_y_zou
10K Followers 59 Following @Stanford professor. Chan-Zuckerberg investigator. Sloan Fellow. AI for biotech + health. Making AI more trustworthy, reliable and human compatible.Karan Goel @krandiash
3K Followers 883 Following Founder @cartesia_ai, Machine Learning PhD at @StanfordAILab, CMU / IIT-Delhi alum.Sami Nas 👨⚕�.. @digitalhealthxx
8K Followers 9K Following Senior functional/technical consultant to bring added value via #digitalhealth #ai and #datascience based solutions #MedTwitterResearch to the Peopl.. @Research2People
493 Followers 2K Following Open Patient-Partnered Research For #Oncology and #RareDisease.manluo @manluo12
93 Followers 213 Following Current Research Fellow at Mayo Clinic, PhD from ASU, former inter @Google @Meta @Salesforce, working on natural language processing and vision+language.Eric Nguyen @exnx
2K Followers 331 Following PhD in BioEngineering & AI @stanford @HazyResearch @StanfordAILab @arcinstituteUIowa Computer Scienc.. @UIowaCS
375 Followers 51 Following Department of Computer Science The University of Iowa https://t.co/cO00Bc3kvdArc Institute @arcinstitute
22K Followers 24 Following A new scientific institution for curiosity-driven biomedical science and technology.swyx @ICLR_conf @swyx
92K Followers 3K Following Anti-ego ideas for anti-ergodic life. Founder, @smolmodels ▹ Listen: @latentspacepod ▹ Read: @coding_career ▹ Join: @aiDotengineerNeel Das @Nilzkool
6K Followers 5K Following Gen AI expert in healthcare | Building https://t.co/Erkxvoh2hm to help tutors conduct CS & coding quizzes effortlessly | Opinions are mineMayee Chen @MayeeChen
1K Followers 425 Following CS PhD student @StanfordAILab @HazyResearch, undergrad @princeton. she/her 🎃Karandeep Singh @kdpsinghlab
10K Followers 3K Following Jacobs Chancellor’s Endowed Chair @UCSanDiego. Chief Health AI Officer @UCSDHealth. Creator of Tidier.jl #JuliaLang. #GoBlue. Views own.Minh Nguyen @minhnsf
107 Followers 114 Following PhD candidate @StanfordBMI @StanfordDBDS, DARE fellow @StanfordVPGE, DS Scholar @StanfordData, M.A @UCBerkeley Biostats, RN @UCSFHealthTristan Naumann @TristanNaumann
760 Followers 103 FollowingNathan Lambert @natolambert
25K Followers 693 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentialsNeil Gaiman @neilhimself
3.0M Followers 984 Following News about Neil Gaiman. Not posted by Neil. You can find him replying & posting in person at neil-gaiman on Tumblr, or @neilhimself.neilgaiman.com on BlueskyCory Doctorow NONCONS.. @doctorow
498K Followers 3K Following Author/activist/journalist. New novel: THE BEZZLE, a thriller of hi-tech fraud and the Shitty Tech Adoption Curve https://t.co/4ZExCQHv6q @[email protected]Jacob Steinhardt @JacobSteinhardt
7K Followers 67 Following Assistant Professor of Statistics, UC BerkeleySimran Arora @simran_s_arora
2K Followers 212 Following CS PhD student at @StanfordAILab @hazyresearchMMitchell @mmitchell_ai
80K Followers 1K Following Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics. Same content in the Sky, Threads, & the Prehistoric ElephantAllen Institute @AllenInstitute
64K Followers 3K Following The Allen Institute is committed to solving mysteries of bioscience — researching the unknown of human biology, in the brain, the cell and the immune system.Nomic AI @nomic_ai
14K Followers 50 Following Building explainable and accessible AI https://t.co/bbYqCdL8vQDaniel Uribe @dabluemx
84 Followers 250 Following Fundador obsesivo de BlueCamp - Growth Agency, entusiasta del Machine Learning. Construyendo Morflow.Mark Dredze @mdredze
4K Followers 788 Following John C Malone Professor at @JohnsHopkins @JHUCompSci @jhuclsp @jhumceh; Part time @techatbloomberg (tweets my own) Mastodon @[email protected]andrewmccallum @andrewmccallum
5K Followers 238 Following Professor, Computer Science, Machine Learning, Natural Language Processing, Knowledge Bases, Mining Scientific Literature, https://t.co/UGe9VmkaCdJuan Manuel Zambrano .. @JMZambranoC
131 Followers 148 Following Medical AI PhD candidate @StanfordDBDS. Ex @MediUniandes & @UniandesIBIO. Tweets in English y en español 🇨🇴Michael Moor @Michael_D_Moor
2K Followers 779 Following MD. PhD. Postdoc @Stanford CS w/ @jure. Working towards generalist medical AI, one gradient step at a time.Mononito Goswami @MononitoGoswami
337 Followers 421 Following Robotics Ph.D. Student @CarnegieMellon | Applied Scientist Intern @AmazonScience | Machine Learning | Foundation Models | Healthcare Co-organizing @clinicalfmsAkash Chaurasia @_akashrc
94 Followers 135 Following CS @Stanford, building clinical foundation models @StanfordMed. prev ML @Tesla Autopilot, biomed. eng/CS @JohnsHopkinsMonica M Reddy @monicamreddy
270 Followers 469 Following PhD student at @KhouryCollege. Masters in CS from @umasscs. Working in Machine Learning for Healthcare.Michael Oberst @MichaelOberst
2K Followers 946 Following Incoming Assistant Professor of Computer Science at @JohnsHopkins, postdoc at @CarnegieMellon. PhD from @MIT_CSAIL. Reliable ML & Causality for Healthcare.Parisa Rashidi @Parisa__Rashidi
2K Followers 2K Following Co-director of @UF_IC3 Center and director of @UFiHealLab. Medical AI. #AI4Health #ML4health. Opinions my own. She/her.Pascale Fung @pascalefung
2K Followers 45 Following Chair Professor of ECE, Director of the Centre for AI Research (CAiRE), Hong Kong University of Science & Technology. Fellow of AAAI, ACL, IEEE, ISCA.Mihaela van der Schaa.. @MihaelaVDS
8K Followers 420 Following John Humphrey Plummer Professor of Machine Learning, Artificial Intelligence and Medicine @Cambridge_Uni, Fellow @TuringInstJudea Pearl @yudapearl
76K Followers 188 Following Student of causal inference, human reasoning, and history of ideas, all viewed through the sharp lens of artificial intelligence.@timnitGebru@dair-com.. @timnitGebru
169K Followers 3K Following she/her I am at @[email protected] via the #TwitterMigration. DAIR's Mastodon account is at [email protected]Bart de Witte @OpenMedFuture
7K Followers 2K Following #opensource #medical #AI #digitalhealth #openknowledge #datacommons #regenerative father ex-IBM ex-SAP @hippoai https://t.co/iLvrA3hySk medAI /accHussein Mozannar @HsseinMzannar
846 Followers 920 Following PhD @mitidss working on Human-AI Interaction 🇱🇧DBMI at Harvard Med @HarvardDBMI
7K Followers 682 Following Department of Biomedical Informatics at @Harvard/@harvardmed: #datascience- & #AI/#ML-powered models of clinical care to advance human health. Chair @zakkohane.Tanishq Mathew Abraha.. @iScienceLuvr
55K Followers 1K Following PhD at 19 | Founder and CEO at @MedARC_AI | Research Director at @StabilityAI | @kaggle Notebooks GM | Biomed. engineer @ 14 | TEDx talk➡https://t.co/xPxwKTq6QbML4H @SymposiumML4H
2K Followers 59 Following Machine Learning for Health (ML4H) • New Orleans 2023• #ml4h2023 • Contact: [email protected]NEJM AI @NEJM_AI
11K Followers 313 Following NEJM AI is a new journal on medical artificial intelligence and machine learning from NEJM Group, the publisher of @NEJM.Jason H. Moore, PhD @moorejh
36K Followers 19K Following Chair, Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles. Informatician. Data scientist. AI. Atari enthusiast. FSU & UMich grad.Qiao Jin, MD @DrQiaoJin
1K Followers 868 Following Postdoc @NCBI @NLM_NIH. Tsinghua MD. JMIR AE. Democratizing medical knowledge. AgentMD, MedRAG, TrialGPT, GeneGPT, MedCPT, PMC-Patients, PubMedQA. Views my own.Alejandro Lozano @Ale9806_
39 Followers 52 Following Biomedical Data Science Ph.D. Student @ SAIL: Stanford UniversityJulian Genkins @julian_genkins
463 Followers 383 Following PCP + Informatician @VUMChealth | From @VUmedicine to @UCSFDOM to @StanfordCIF | Cognitive informatics, PKM, telemedicine, #meded | Find me outside 🏃🏻♂️🌲Ahmed Alaa @_ahmedmalaa
1K Followers 1K Following Assistant Professor @UCBerkeley + @UCSF || Ex-@broadinstitute of MIT/Harvard, @MIT, @UCLA, @Cambridge_Uni, @UniofOxford || Machine Learning & AI for Medicine.nikeshk @nikeshk
162 Followers 261 FollowingBo Wang @BoWang87
8K Followers 2K Following Assistant Prof. CS,LMP @UofT; CIFAR AI Chair @VectorInst; Chief AI Scientist, @UHN; former PHD, CS @Stanford; opinions my own. #AI #healthcare #combioCome by #ICLR2024 Session 2 on Tuesday to see our work using representation editing to make foundation models robust! No fine-tuning, no additional data, no problem. arxiv.org/pdf/2309.04344
Data contamination is a huge problem for LLM evals right now. At Scale, we created a new test set for GSM8k *from scratch* to measure overfitting and found evidence that some models (most notably Mistral and Phi) do substantially worse on this new test set compared to GSM8k.
@jxmnop reminds me of this one: nonint.com/2023/06/10/the…
So happy to defend my PhD thesis and couldn't have done it without a 1 of 1 advisor @david_sontag and an incredible committee @erichorvitz @arvindsatya1 @roboticwrestler
Congratulations to Dr. Hussein Mozannar. @HsseinMzannar Strong dissertation research exploring multiple dimensions of human-AI collaboration. @david_sontag @roboticwrestler @arvindsatya1 @MIT
There is a really nice community of researchers developing transformer alternatives. Want to highlight these impressive folks. Simran Arora (@simran_s_arora), Chunting Zhou (@violet_zct), Dan Fu (@realDanFu), and Songlin Yang (@SonglinYang4)
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Ask ChatGPT to pick a number between 1 and 100 - which does it pick? (by @Leniolabs_)
I will be defending my PhD thesis, Training Human-AI Teams, on April 25th at MIT and on Zoom! Please DM me or email for a link! -- (image generated after an hour of prompting DALL·E)
A Mamba Primer (w/ Yair Schiff youtube.com/watch?v=dVH1dR… ) Mamba is a nice jumping off point to summarize foundational ideas in sequence modeling, parallel algorithms, continuous-time representations, and GPU aware algorithms. We try to put these together in the context of LMs.
I highly recommend this tutorial on Mamba and related models. Full of insights on model design and hardware-aware implementation!
A Mamba Primer (w/ Yair Schiff youtube.com/watch?v=dVH1dR… ) Mamba is a nice jumping off point to summarize foundational ideas in sequence modeling, parallel algorithms, continuous-time representations, and GPU aware algorithms. We try to put these together in the context of LMs.
Today is my last day @huggingface 🤗 It's been an incredible experience supporting open biomedical AI, working with seriously brilliant and kind colleagues, and collaborating with truly amazing scientists, engineers, and clinicians across the world. I've learned and grown so…
Wow this is a big deal, first large-scale Mamba-based model! Mamba layers brings much longer context and higher inference throughput. Having 4 attention layers seem to be the sweet spot to get the best of Transformer & Mamba architectures.
Introducing Jamba, our groundbreaking SSM-Transformer open model! As the first production-grade model based on Mamba architecture, Jamba achieves an unprecedented 3X throughput and fits 140K context on a single GPU. 🥂Meet Jamba ai21.com/jamba 🔨Build on @huggingface
Thrilled to share that my lab is looking for postdocs! In partnership with @ChanZuckerberg, we're focusing on developing massive biomedical foundation models to create an AI-powered virtual cell. Dream of harnessing the power of 1,000 H100s? Apply now at: snap.stanford.edu/apply/index-po…
📢New research on mechanistic architecture design and scaling laws. - We perform the largest scaling laws analysis (500+ models, up to 7B) of beyond Transformer architectures to date - For the first time, we show that architecture performance on a set of isolated token…
Recommend following David Hall (@dlwh) and the Levanter project from @StanfordCRFM . Just no nonsense details about fixing the pain-points of scaling LLM training, one at a time.
I successfully defended my PhD today (exciting!). I've struggled today balancing feelings of accomplishment+gratitude and anti-climatic disappointment. It's been more than worth it to hear my youngest daughter toddle around saying "Doc-tor Dad!" 🥰
I like to talk about Levanter’s performance, reproducibility, and scalability, but it’s also portable! So portable you can even switch from TPU to GPU in the middle of a run, and then switch back again! github.com/stanford-crfm/…
Needle In A Haystack tests are flawed. Did you know that the Long-Context Attention in Gemini and GPT-4 is based on inserting the sentence “The best thing to do in San Francisco is eat a sandwich and sit in Dolores Park on a sunny day” at a random location in a text? We’ve seen…
Our recent experiments @NormalComputing demonstrate that all attention is not equal! XL-attention (large context windows) struggles with retrieval tasks well within context. Top-k attention (used in our Extended Mind approach) is cheap and effective.
couldn’t agree more about the need for more computational biologists. More importantly, we must ensure they receive the recognition and support they deserve. Their work should never be dismissed as merely “just service”. Additionally, I’d like to remind everyone of the excellent…
WE NEED COMPUTATIONAL PEOPLE IN BIOLOGY. WE NEED THEM IN DRUG DISCOVERY AND MULTI OMICS. WE NEED HARDWARE PEOPLE TO BUILD SPECIFIC AND PORTABLE DEVICES. BUT MOST IMPORTANTLY WE NEED DEVS. WE NEED TO MAKE OPEN ACCESS DATA AS ACCESSIBLE AS POSSIBLE. WE NEED COMPUTE GRANTS. GIVE