machinelearning.sg @ml_dot_sg
A machine learning community in Singapore. mastodon: @[email protected] machinelearning.sg Singapore Joined December 2020-
Tweets829
-
Followers236
-
Following761
-
Likes4K
Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion Develops a cascading latent diffusion approach that can generate multiple minutes of high-quality stereo music at 48kHz from textual descriptions. abs: arxiv.org/abs/2301.11757 repo: github.com/archinetai/aud…
Excited to announce our Deep Learning Tuning Playbook, a writeup of tips & tricks we employ when designing DL experiments. We use these techniques to deploy numerous large-scale model improvements and hope formalizing them helps the community do the same! github.com/google-researc…
Yay! nbviewer from @ProjectJupyter now lets you render any notebook stored on the HuggingFace Hub (works with any repo type) nbviewer.org Thanks @SylvainCorlay and team 💗
Learn to use a Document Image Transformer(DiT) to classify the category of the document with just a picture of it. Automatically differentiate presentations from scientific papers from forms with high accuracy. #DocumentAI #PracticalML news.machinelearning.sg/posts/document…
Upscale faces in photos to 4 or 8 times their original size with the Real ESRGAN model. In @GoogleColab with the github.com/ai-forever/Rea… model on @huggingface. news.machinelearning.sg/posts/face_sup… news.machinelearning.sg/posts/face_sup…
Automatically generate subtitles for videos with @OpenAI Whisper and @GoogleColab. news.machinelearning.sg/posts/video_su… news.machinelearning.sg/posts/video_su…
SPACEx: Speech-driven Portrait Animation with Controllable Expression abs: arxiv.org/abs/2211.09809 project page: deepimagination.cc/SPACEx/
Object detection through GLIP seems impressive 👀 hf.co/spaces/haotiz/…
"Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference Cost" So let’s say you’re pruning a neural net and want the best model you can get at the end. More precisely,… [1/9]
Who here is using Netron, the awesome model Visualizer from @lutzroeder? You can now use it to visualize any checkpoint file on the @huggingface Hub, by appending the file's URL to netron.app like so: netron.app/?url=https://h… Let me know if this is useful! 🔥
Do people use HF dark mode? we are probably going to switch to the system setting as default (instead of forcing a light theme)
We are super excited to launch @photoroom_app AI background generator. This way creators, sellers and entrepreneurs can have unique pro background to showcase their product. Try it here producthunt.com/posts/ai-backg… 🙏 stable diffusion & @huggingface
A @Gradio Demo for GLM-130B: An Open Bilingual Pre-Trained Model on @huggingface Spaces by huggingface.co/hanyullai demo: huggingface.co/spaces/hanyull… Get started with Gradio: gradio.app/getting_starte…
We have recently tested the excellent TorchDynamo prototype from @PyTorch team and benchmarked it vs @onnxruntime and TensorRT. TL;DR: big boost in inference perf + ease of use without major drawback. 👏 @jansel0 & team!
We have recently tested the excellent TorchDynamo prototype from @PyTorch team and benchmarked it vs @onnxruntime and TensorRT. TL;DR: big boost in inference perf + ease of use without major drawback. 👏 @jansel0 & team!
Working on a walkthrough of transformer code with side-by-side comparison to the computation graph. Felt the default image everyone uses is hard to interpret. Also, I like to show the matrices in my computation graphs. § refers to sections in a colab notebook (coming soon)
You've always wanted to use our platform but couldn't because of enterprise constraints? The wait is over with our new Private Hub, running completely in your own private & compliant environment, with all the security requirements for the enterprise. 👇 huggingface.co/blog/introduci…
We have now released the rendered English Wikipedia and BookCorpus datasets used to train our PIXEL model under huggingface.co/Team-PIXEL and they can be loaded via the datasets library. Thanks to @huggingface for hosting our >100GB of image data completely for free!
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2seq Model In zero-shot setting, AlexaTM 20B outperforms GPT3 (175B) on SuperGLUE and SQuADv2 datasets and provides SOTA on multilingual tasks such as XNLI, XCOPA, Paws-X, and XWinograd. arxiv.org/abs/2208.01448
Contrary to popular belief, many of the most capable AI organizations training large language models are already bottlenecked by Dataset Size, not just compute. "Chinchilla's Wild Implications" lesswrong.com/posts/6Fpvch8R…
Mathieu Ravaut @MatRavox
371 Followers 2K Following PhD candidate in NLP at @ntunlpsg w @JotyShafiq and @astarhq. Ex @layer6ai | @uoftcompsci | @centralesupelecRoy Lee @SRoyLee
1K Followers 1K Following Assistant Professor @sutdsg, working on online trust & safety, computational social science, and social NLP. Currently leading the Social AI Studio.CoralTracy @6tmW553c9Jlm3
1 Followers 127 FollowingMISS M Ξ�.. @missmetaverse
6K Followers 4K Following futurist and cyberpunk nerd #ai #llms #ml #vr #ar #mx #machinelearning #artificialintelligence #neuralnet inquiries: [email protected]NiBC you DLROM love W.. @omapzwve_ctsdpl
109 Followers 2K Following cafbc rrcb2ni1fapdl tsmcdpl https://t.co/upU1taWtucPhronesis Analytics @PhronesisZA
176 Followers 462 Following We're an African startup that uses AI to help society understand the government better.Compare the Cloud @comparethecloud
34K Followers 30K Following #Cloud Computing #Bigdata #iot #ai. Enquiries at [email protected]Applied AI @AppliedAIconf
801 Followers 1K Following The machine learning and artificial intelligence conference for developers, data scientists and analysts.Robert Scoble @Scobleizer
504K Followers 68K Following Follow me on my new podcast with AI startups, Unaligned. Tech industry color commentator since 1993. Author/Blogger. Former strategist @Microsoft.AI Online Course @aionlinecourse
1K Followers 3K Following Welcome to https://t.co/SgcrnmhQf1, an online platform that provides comprehensive and high-quality AI courses for learners of all levels.Annisa Computer Solok @AnnisaSolok
2 Followers 22 Followingike 🇦🇺 🇺🇸 @ikeisdumb
3K Followers 2K Following https://t.co/q4CxOfmN0h co-creator and art fella 🎨, student of science (physics, space, robotics, ai) ⚗️, can crack an egg with 1 hand (sometimes) 🥚MoogleLabs @mooglelabs
644 Followers 2K Following MoogleLabs Produce quality outputs by leveraging #AI #ML, #Blockchain, #DevOps, and #Metaverse. Gain intelligent experience with our delivery pipeline.Joshua Ampofo @dev_ampofo
1K Followers 2K Following Molecular Biologist and Biotechnologist | Genomics, and Bioinformatics | AI/ML EngineerOyeniyi Noah @yomtexontop
74 Followers 356 Following I am an emerging financial technologist with strong passion for technological innovations. #AI, #Data_science, #ML, #Openbanking, #OpenfinanceLiau Jian Jie @liaujianjie
687 Followers 441 Following CTO & co-founder @mobbindesign. @nuscomputing alum. Join me to shape the future of design 👉🏻 https://t.co/Hjb22L1uBhLin Shao @linshaonju
2K Followers 3K Following Assistant Professor in Robotics @NUS | Ph.D. @Stanford | Opinions are my ownNageshwar M @MitkarNk
10 Followers 101 Following curious student. - Python -Learning AI, ML & open source.Crescendo.ai @CrescendoAi
53 Followers 321 Following Data Science and AI R&D Firm Helping public and private companies, make smarter decisions, with AI.🤖Damien Benveniste @DamiBenveniste
2K Followers 460 Following The ML Guy - Follow me to learn about Machine Learning applications, Machine Learning System Design, MLOps, and the latest techniques and news about the field.Learning in Public - .. @motherofdata
159 Followers 3K Following #notetaking account. Retweets educational threads on Python, ML & Data analysis; post videos of my learnings & revisions. #learninginpublic #publicnotesGurpreet Kaur @GurupreetJethra
83 Followers 875 Following Gen AI Data Scientist👩🏻🎓| Former AA: IIM-A | Former Analyst: Central Govt. Of India | AI Researcher| Machine Learning| LLM| NLP| DL| CV| Data Science| MLOpsRAGHU KNOWS DATA😇 @kpdoo7
16 Followers 323 Following Exploring the realms of data science through daily discoveries and insights. Join me on the journey of continuous learning and exploration! 😇Ashish Patel @imashish2604
109 Followers 292 Following AI Researcher Scientist & Chief Data Scientist at IBM | Author of Hands-on Time Series Analytics with Python | Keras Contributor | IBM QuantumVirtana @VirtanaCorp
10K Followers 11K Following Achieve faster mean-time-to-resolution (#MTTR), proactive #CloudCostManagement, and accurate #CapacityPlanning and forecasting. #AccelerateHybridInnovationTekne | We empower yo.. @teknedatalabs
19 Followers 249 Following 🚀 We empower your data 👩🏻🚀 Big Data Engineering, Analytics & Data Visualization, IA & Machine Learning #WeAreTekne 🌎 Working worldwide in #IT & #BIScience & Tech News @TamilTechNews
78K Followers 4K Following Latest Science and Technology News, run by the author of "Emerging Technologies for Profit" https://t.co/3jl6BKNU85 and "Dream Big" https://t.co/3tkN26mVt6hoffer7 @hoffer7
713 Followers 1K Following Communicating about innovation, technology and user experience through stories and pictures, when I'm not in my garden bending nature to my will.Jennifer Vernon @JenVernonBM
943 Followers 5K Following I'm a #BrandMarketing specialist. I like to post videos about #CorporateBranding, #BrandingTips and #OnlineBranding.Tehreem Ali @tehreemali894
201 Followers 421 Following Looking for a PhD position || #Bioinformatics || #Researcher || #SystemsBiology || Plant-pathogen interactions || #Machinelearning || #DataScience ||#FreelancerSarah H Kasina @SarahKasina
127 Followers 335 Following Like to explore and know more about AI and ML #ArtificialIntelligence #MachineLearning #BigData #DataVisualization #DataAnalysisPatricio Denzer @pdenzer
42 Followers 417 FollowingNicole Barton @NicoleBartonES
507 Followers 4K Following I post mainly about #Entrepreneurship #EntrepreneurshipCourse and#EntrepreneurTips.AlgoveraAI @AlgoveraAI
3K Followers 1K Following Research Organization and Agency for Decentralized AIDivyashree S @dssingh2506
12 Followers 271 FollowingJiafei Duan @DJiafei
1K Followers 762 Following Robotics and AI PhD Student @uwcse @uw_robotics | AI Research @ASTARsg | BEng from @ntueee. Research in robot learning and embodied AIJovena Seah @jovenaseah
35 Followers 49 Following Data Analytics | Business Intelligence | Data EngineeringNur-amin @Nur_amin02
40 Followers 341 Following I want to make the world a better place before I die !Yann LeCun @ylecun
711K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Kayo Yin @kayo_yin
8K Followers 556 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵Bill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscKirk Borne @KirkDBorne
447K Followers 6K Following Advisor to startups. Freelancer. Global Speaker. Founder @LeadershipData. Top influencer in #BigData #DataScience #AI #IoT #ML #B2B. PhD Astrophysics @CaltechPengfei Liu @stefan_fee
2K Followers 616 Following Associate Prof. at SJTU, leading GAIR Lab (https://t.co/Nfd8KmZx3B) Co-founder of Inspired Cognition, Postdoc at @LTIatCMU, Previously FNLP, @MILAMontreal,Yoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaCSean (Xiang) Ren @xiangrenNLP
6K Followers 562 Following Building @SaharaLabsAI | @USCViterbi Early Career Chair, Professor @nlp_usc | @MIT TR 35 , @ForbesUnder30 | Prev: @allen_ai, @Snapchat, @Stanford, @UofIllinoisGoogle DeepMind @GoogleDeepMind
944K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.William Wang @WilliamWangNLP
14K Followers 718 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Andrej Karpathy @karpathy
979K Followers 904 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Sebastian Raschka @rasbt
267K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.Shruti Rijhwani @shrutirij
4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball playerAK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxYu Su @ysu_nlp
6K Followers 857 Following Dist. Assist. Prof.@OhioState, Director @osunlp, 20% Researcher@Microsoft. I like to think about intelligence, artificial or biologicalRoy Lee @SRoyLee
1K Followers 1K Following Assistant Professor @sutdsg, working on online trust & safety, computational social science, and social NLP. Currently leading the Social AI Studio.Graham Neubig @gneubig
31K Followers 586 Following Associate professor at CMU, studying natural language processing and machine learning.Steven Hoi @stevenhoi
3K Followers 411 Following Founder & CEO @HyperGAI. Ex-MD of Salesforce Research Asia, Ex-VP @Salesforce AI; Researcher in #AI; Professor at SMU & Ex-AP at NTU; IEEE Fellow.AI at Meta @AIatMeta
532K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.Carrd @carrd
39K Followers 1 Following Simple, free, fully responsive one-page sites for pretty much anything.PaddlePaddle @PaddlePaddle
8K Followers 128 Following PaddlePaddle is an Open-Source Deep Learning Platform Originated from Industrial Practice.Liau Jian Jie @liaujianjie
687 Followers 441 Following CTO & co-founder @mobbindesign. @nuscomputing alum. Join me to shape the future of design 👉🏻 https://t.co/Hjb22L1uBhAIGuys @RealAIGuys
100 Followers 95 Following Author: Ultimate Neural Network Programming with Python. Editor https://t.co/7V9dyo21AR. Educator and AI blogger.Exa (prev. Metaphor) @ExaAILabs
9K Followers 9 Following supercharge your LLM with the web's knowledge API → https://t.co/M5QuIA55d2 search engine → https://t.co/iqim6Mz5S3 discord → https://t.co/tzBhQZ0Jfc We're hiring | DM us!Emm @emmanuel_2m
32K Followers 6K Following Co-founder & CEO at https://t.co/7ElrGjg10n 🚀 | Craft unique and style-consistent game assets with custom-trained AI models 👾 | #GenAI #Gaming @scenario_ggColin Raffel @colinraffel
30K Followers 654 Following nonbayesian parameterics, sweet lessons, and random birds. Friend of @srush_nlpKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Omar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Sergey Karayev @sergeykarayev
11K Followers 3K FollowingSurge AI @HelloSurgeAI
4K Followers 146 Following Love language? So do we. Surge AI is the world's most powerful data labeling and RLHF platform, designed from the ground up for stunning AI.pharmapsychotic @pharmapsychotic
18K Followers 7K Following Ai generative artist. code @StabilityAI fan of tacos and cats. #aiart #generativeart Ai tools: https://t.co/uyr9NjvLBafly51fly @fly51fly
5K Followers 2K Following BUPT prof | Sharing latest AI papers & insights | Join me in embracing the AI revolution! #MachineLearning #AI #InnovationMaurice Weiler @maurice_weiler
3K Followers 989 Following AI researcher with a focus on geometric DL and equivariant CNNs. PhD with Max Welling. Master's degree in physics.Open Data Science @_odsc
112K Followers 26K Following Bringing together the global data science community to help foster the exchange of innovative ideas and encourage the growth of open source software.Jan-Willem van de Mee.. @jwvdm
2K Followers 1K Following Associate Professor (UHD) at the University of Amsterdam; Probabilistic programming and its applications.apolinario (multimoda.. @multimodalart
10K Followers 378 Following ML for Art and Creativity, working @HuggingFace ([email protected])Philipp Hennig @PhilippHennig5
6K Followers 320 Following Professor for the Methods of Machine Learning at the University of Tübingen.Davis Blalock @davisblalock
12K Followers 165 Following Research scientist + first hire @MosaicML. @MIT PhD. I write + retweet technical machine learning content. If you write a thread about your paper, tag me for RTnear @nearcyan
45K Followers 883 Following https://t.co/IdaJwZJCXm partner @ https://t.co/9g1MIgjiqc dms openAnkit Goyal @imankitgoyal
1K Followers 511 Following Research Scientist @Nvidia | Developing Foundation Models for Robotics | Ph.D. @PrincetonCS | Previously: @UMich, @IITKanpurJulian Michael @_julianmichael_
1K Followers 122 Following Researching stuff @NYUDataScience. he/himTransactions on Machi.. @TmlrOrg
5K Followers 3 Following Transactions on Machine Learning Research (TMLR) is a new venue for dissemination of machine learning researchParlAI @parlai_parley
2K Followers 169 Following We're building a unified platform for sharing, training and evaluating dialogue models across many tasks. @FacebookAIMargaret Li @margs_li
792 Followers 120 Following 👩💻 PhD student @UWCSE / @UWNLP & @MetaAI. Formerly RE @FacebookAI Research, @Penn CS | 🏂💃🧋🥯 certified bi-coastal bb ♥️ IAH/PEK/PHL/NYC/SFO/SEAMidjourney @midjourney
338K Followers 0 Following New research lab. Exploring new mediums of thought. Expanding the imaginative powers of the human species. Join our beta: https://t.co/yAUpCWJRziLearnk8s @learnk8s
78K Followers 32 Following Broaden your Kubernetes expertise with a curated feed of news, articles and best practices. Mastodon: [email protected]Dmytro Mishkin 🇺�.. @ducha_aiki
18K Followers 591 Following Marrying classical CV and Deep Learning. I do things, which work, rather than being novel, but not working.Horace He @cHHillee
23K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleRoss Wightman @wightmanr
18K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.Stas Bekman @StasBekman
7K Followers 268 Following Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at @ContextualAI Training LLM/RAG/Generative AI/Machine Learning/ScalabilityIlya Sutskever @ilyasut
370K Followers 2 Following towards a plurality of humanity loving AGIs @openaiJerome Pesenti @an_open_mind
17K Followers 287 Following Founder @Sizzle_AI. Ex @MetaAI, @Benevolent_AI, @IBMWatson, co-founder Vivisimo (acq. by IBM).Kubernetes @kubernetesio
304K Followers 94 Following #Kubernetes: open source production-grade container orchestration management. #CNCF #K8sAdrien Carreira @XciD_
518 Followers 699 Following Machine Learning / Software Engineer @huggingfaceAlex Nichol @unixpickle
8K Followers 388 Following Code, AI, and 3D printing. Opinions are my own, not my computer's...for now. Husband of @thesamnichol. Co-creator of DALL-E 2. Researcher @openai.Runway @runwayml
185K Followers 300 Following An applied AI research company building for the next era of art, entertainment and human creativity. We're hiring: https://t.co/Aj11xyhxOgms. curio (i can fina.. @reachartwork
36K Followers 99 Following Tumblr: https://t.co/9Txy6Ekhnh Made Simple Stable/Looking Glass AWAY founder DM for business inquiries 💞@anne_tifah💞Leo Boytsov @srchvrs
7K Followers 2K Following Sr. Research Scientist @AWS Labs (ph-D @LTIatCMU) working on unnatural language processing, speaking πtorch & C++. Opinions sampled from MY OWN 100T param LM.Jia-Bin Huang @jbhuang0604
51K Followers 285 Following Associate Professor @umdcs; Part-time Research Scientist @Meta. I like pixels.Rivers Have Wings @RiversHaveWings
31K Followers 224 Following AI/generative artist. Writes her own code. Absolute power is a door into dreaming.NVIDIA GTC @NVIDIAGTC
15K Followers 189 Following The Conference for the Era of AI. Need assistance? Explore our FAQ at https://t.co/r9hWthU7xd or DM us.Augmenting LLMs with Databases - combines an LLM with a set of SQL databases, enabling a symbolic memory framework - completes tasks via LLM generating SQL instructions that manipulate the DB autonomously (supports selection, insertion, update, delete) arxiv.org/abs/2306.03901
Recognize Anything: A Strong Image Tagging Model paper page: huggingface.co/papers/2306.03… demo: huggingface.co/spaces/xinyu12… present the Recognize Anything Model (RAM): a strong foundation model for image tagging. RAM can recognize any common category with high accuracy. RAM introduces a…
@srchvrs @huggingface @MosaicML has some BERT models trained with ALiBi and FlashAttention. The *should* generalize to longer context lengths without retraining. huggingface.co/mosaicml/mosai…
Now that FlashAttention is flashing and we can have very long now input windows, is anybody training a replacement for BERT? Or do we first need to wait till FlashAttention is fully integrated into @huggingface ?
I wrote a #Python #geostatistics package, #GeostatsPy, because there was no #opensource, complete, reliable alternative to support my students in the #Python ecosystem (at the time). I wrote just-in-time for lectures in the mornings! I also wrote many well-documented…
Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs abs: arxiv.org/abs/2304.10532 project page: ethanweber.me/nerfbusters/ github: github.com/ethanweber/ner…
Scaling up instruction tuning corpora with templatization, this seems very similar to the kind of data augmentation we see in computer vision, e.g. skew / crop / rotate etc.
Super interesting! Instruction tuned models appear to produce less toxic content, without explicitly including fine tuning records targeting the reduction of toxic content. Neat!
'Waves' of instruction tuning datasets, the most recent characterized by synthetic, multi-lingual alignment datasets.
Anything-3D: Towards Single-view Anything Reconstruction in the Wild abs: arxiv.org/abs/2304.10261 github: github.com/Anything-of-an…
#3 Gumroad ( @gumroad ) Gumroad is a powerful, but simple, e-commerce platform.
Nearly all recently-proposed large language models (LLMs) are based upon the decoder-only transformer architecture. But, is this always the best architecture to use? It depends… 🧵 [1/8]
@DrJimFan @johnschulman2 On solving hallucination: didn’t DeepQA solve this 10 years ago when it beat world Jeopardy! world-champ Ken Jennings? The key is evidence retrieval and deep evidence scoring. The approach: apply neural analogs to the DeepQA framework, e.g., LLMs replace the synthesis step.
I think John's talk raises more questions than it answers. NLP research is far from done - lots of new important open problems are born from GPT. For example, how to make the model express uncertainty better? 4/
Every week we cover key papers in RLHF LLMs. Last week we covered InstructGPT, and it got a lot of interest. We continue this week with DeepMind’s GopherCite paper. Here’s what you need to know in 5 tweets:
⚡️Introducing WebGPT⚡️ Just this month, Chrome announced WebGPU's release. What does this mean? Near-native GPU speeds, from the web! I took the opportunity to build WebGPT: a package to run GPT models entirely on the browser. Here's why this is a big deal:
Can Large Language Models (#ChatGPT) transform Computational Social Science? Our recent work shows how they might (in partnership w/ experts). We evaluate on 24 #CSS tasks + draw a roadmap 🚗🗺️ to guide #LLM-augmented social science 🚀 Paper: calebziems.com/assets/pdf/pre… 🧵 thread
Jenni will open a text editor for you. Type in the title of your project. As you type, Jenni will give you suggestions about how good your title is — in real time.
Fine tune a 20B Language Model with RLHF using a 24GB consumer GPU? 🤯 It is now possible using TRL + PEFT! Check out the blogpost that explains how we achieve this step by step! Blogpost: huggingface.co/blog/trl-peft