Sangjun Park @cosmoquester

Artificial Intelligence Researcher cosmoquester.github.io Republic of Korea Joined July 2022

Tweets

24
Followers

16
Following

85
Likes

38

Sangjun Park @cosmoquester

8 months ago

For the year-end and New Year holidays, I've summarized a little bit of what I've been thinking about. You can watch it for fun, empathy and objection are all welcome! Reflection on the Conceptual Essence of Language Models, the Path to AGI cosmoquester.github.io/reflection-on-…

0 0 0 50 0

Kevin Patrick Murphy @sirbayes

9 months ago

I am happy to announce that the first draft of my RL tutorial is now available. arxiv.org/abs/2412.05265

74 751 4K 317K 4K

Download Image

Andrej Karpathy @karpathy

a year ago

# RLHF is just barely RL Reinforcement Learning from Human Feedback (RLHF) is the third (and last) major stage of training an LLM, after pretraining and supervised finetuning (SFT). My rant on RLHF is that it is just barely RL, in a way that I think is not too widely…

407 1K 9K 1.2M 6K

Download Image

JinYeong Bak @NoSyu

a year ago

A first in-person poster spotlight poster presentation at ICML 2024 by @cosmoquester

0 1 10 465 0

Download Image

Aran Nayebi @aran_nayebi

2 years ago

1/ How do humans and animals form models of their world? We find that Foundation Models for Embodied AI may provide a framework towards understanding our own “mental simulations”. 🧵👇 arxiv.org/abs/2305.11772 with awesome collaborators: @rishi_raj @mjaz_jazlab @GuangyuRobert

5 76 312 108K 178

Download Image

Stephanie Chan @scychan_brains

a year ago

Our paper comparing human and LM reasoning -- now published (open source)!

Andrew Lampinen @AndrewLampinen

a year ago

Our paper comparing human and LM reasoning -- now published (open source)!

1 13 72 12K 20

0 8 32 5K 8

Yann LeCun @ylecun

a year ago

Not only can't LLMs plan, they can't even generate specifications of a problem (in PDDL) that a standard planner could solve.

Max Zuo @max_zuo

a year ago

Not only can't LLMs plan, they can't even generate specifications of a problem (in PDDL) that a standard planner could solve.

9 41 195 189K 194

Download Image

58 107 682 182K 378

Yann LeCun @ylecun

a year ago

@alex_peys The problem isn't that it is a transformer. The problem is that it is an auto-regressive LLM. Auto-regressive LLMs that compute each token with a fixed number of computational steps can't reason, regardless of the details of the architecture.

100 137 2K 135K 458

JinYeong Bak @NoSyu

a year ago

Our paper has been accepted as a Spotlight Poster! Congratulations @cosmoquester Please visit our poster session and leave your comments and questions. icml.cc/virtual/2024/p… #ICML

JinYeong Bak @NoSyu

a year ago

Our paper has been accepted as a Spotlight Poster! Congratulations @cosmoquester Please visit our poster session and leave your comments and questions. icml.cc/virtual/2024/p… #ICML https://t.co/YEwaCSQ9zg

0 7 45 5K 16

1 2 33 2K 1

Download Image

Yann LeCun @ylecun

a year ago

@jeffclune openreview.net/forum?id=BZ5a1…

9 21 264 15K 288

Yann LeCun @ylecun

a year ago

Nice article in Financial Time where I explain that Auto-Regressive LLM are insufficient to reach human-level intelligence (or even cat-level intelligence). But alternative architectures that I call "objective driven" may reach human-level intelligence one day. They use world…

Financial Times @FT

a year ago

12 60 224 549K 99

Download Image

148 450 3K 645K 1K

Yann LeCun @ylecun

a year ago

If you are a student interested in building the next generation of AI systems, don't work on LLMs

Viva Technology @VivaTech

a year ago

If you are a student interested in building the next generation of AI systems, don't work on LLMs

45 252 1K 1.2M 444

Download Image

351 1K 7K 1.7M 2K

Sangjun Park @cosmoquester

a year ago

I am very pleased to announce my first paper "Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture" at #ICML2024! Memoria processes long sequence information inspired by human memory systems. More Details: arxiv.org/abs/2310.03052

0 2 12 1K 2

Download Image

JinYeong Bak @NoSyu

a year ago

Excited to share our paper "Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture" presented at ICML 2024! Congratulations to Sangjun Park! Preprint: arxiv.org/abs/2310.03052 #ICML2024

0 7 45 5K 16

Christopher Manning @chrmanning

a year ago

I do not believe human-level AI (artificial superintelligence, or the commonest sense of #AGI) is close at hand. AI has made breakthroughs, but the claim of AGI by 2030 is as laughable as claims of AGI by 1980 are in retrospect. Look how similar the rhetoric was in @LIFE in 1970!

113 360 2K 386K 544

Download Image

Kyunghyun Cho @kchonyc

2 years ago

a random thought on RAG, inspired by the (successful) phd defense of @mrdrozdov (the committee consists of @andrewmccallum, @MohitIyyer, @JonathanBerant, @HamedZamani and me) kyunghyuncho.me/a-random-thoug…

16 18 145 28K 66

Download Image

Ida Momennejad @criticalneuro

2 years ago

📢Are you (or do you know) a PhD student with experience in transformers, LLMs, & cognitive neuroscience (esp PFC, hippocampus)? Still interviewing interns for a project following up on our earlier work: arxiv.org/abs/2310.00194 nips.cc/virtual/2023/p… job: jobs.careers.microsoft.com/global/en/job/…