Siddharth Goyal @siddharth22dev

Code @RubrikInc, prev @Innovaccer @owasp linkedin.com/in/siddharth22/ Jaipur-Bangalore Joined March 2022

Tweets

270
Followers

46
Following

348
Likes

414

Kiran Mazumdar-Shaw @kiranshaw

a week ago

Namma Bengaluru has the best talent and the best weather but the worst infrastructure - if we fix garbage debris and roads, we can be among the best cities in the world. GBA has a great opportunity to do this. Let’s use collective will to do this @DKShivakumar @BBMPCOMM

284 421 2K 243K 50

Pedro Domingos @pmddomingos

a week ago

Einstein wasted the second half of his life on a fruitless quest. In the second half of his life, von Neumann invented game theory, computer architecture, implosion nuclear weapons, cellular automata and weather prediction, among other things.

257 360 7K 963K 2K

Ben Dicken @BenjDicken

3 weeks ago

Daniel clearly hasn't seen the language balls.

Daniel Lockyer @DanielLockyer

3 weeks ago

Daniel clearly hasn't seen the language balls. https://t.co/eMqrNI7Khg

204 23 660 6.2M 154

784 2K 24K 8.6M 9K

Download Video

Hyung Won Chung @hwchung27

2 months ago

This is my lecture from 2 months ago at @Cornell “How do I increase my output?” One natural answer is "I will just work a few more hours." Working longer can help, but eventually you hit a physical limit. A better question is, “How do I increase my output without increasing…

44 779 6K 459K 8K

Download Video

Sahil Bloom @SahilBloom

2 months ago

Dopamine from information gathering is a dangerous drug. Your entire life will change the moment you stop looking for more information and start acting on the information you already have. Always get your dopamine from action.

263 2K 14K 642K 5K

Quanquan Gu @QuanquanGu

2 months ago

This explains why LLaMA 4 failed. The tokens per parameter (TPP) is way off. You can’t defy scaling laws and expect miracles. === Llama 4 Maverick was 400B(17B active) and >30T tokens, TPP = 1764 Llama 4 Behemoth was 2T(288B active) and > 30T tokens, TPP = 104 DeepSeek v3 is…

wh @nrehiew_

2 months ago

11 14 226 106K 80

28 66 541 93K 341

Greg Kamradt @GregKamradt

2 months ago

We got a call from @xai 24 hours ago “We want to test Grok 4 on ARC-AGI” We heard the rumors. We knew it would be good. We didn’t know it would become the #1 public model on ARC-AGI Here’s the testing story and what the results mean: Yesterday, we chatted with Jimmy from the…

ARC Prize @arcprize

2 months ago

245 743 5K 7.3M 728

Download Image

306 862 7K 14.7M 1K

Aaron Gokaslan @SkyLi0n

2 months ago

Just opened a PR yesterday that will reduce the binary size PyTorch by 40% by adding 1 flag to NVCC With ~50M monthly of downloads of Pytorch, this one change will reduce global internet traffic by ~20PB. High impact changes like this is why I love OSS. github.com/pytorch/pytorc…

30 105 2K 115K 253

Yuchen Jin @Yuchenj_UW

2 months ago

The best open-source reasoning model will be dropped next Thursday if everything goes well. OpenAI hasn't open-sourced an LLM since GPT-2 in 2019, so I'm excited. We’re hosting it on Hyperbolic. Buckle up.

54 104 2K 173K 294

Download Image

Summer Yue @summeryue0

3 months ago

🔍 SEAL and Red Team at @scale_AI present a position paper outlining what we’ve learned from red teaming LLMs so far—what matters, what’s missing, and how model safety fits into broader system safety and monitoring. 🔗 scale.com/research/red_t… 📝 scale.com/blog/rethink-r…

Zifan (Sail) Wang @_zifan_wang

3 months ago

4 22 82 77K 56

Download Image

3 22 109 66K 70

Jack Morris @jxmnop

2 months ago

there’s a palpable tension in the air as hundreds of AI researchers (including me!) quietly work nights and weekends trying to figure out the “right way” to scale RL math & code are not the universe we will not rest until post-training is as clean and elegant as pre-training

36 35 840 64K 211

Rohan Pandey @khoomeik

2 months ago

my favorite version of the finetuning argument is Smolin’s theory that universes “reproduce” via black holes, and the conditions that are optimal for black hole production also happen to be near optimal for creating life unclear whether true but it’s a fun idea

Ross @rpoo

2 months ago

127 373 3K 374K 877

Download Image

5 8 82 10K 31

Download Image

Naval @naval

2 months ago

@rpoo Good treatment of the Anthropic Principle / fine tuning here: bretthall.org/an-anthropic-u… @ToKTeacher

5 9 143 26K 86

Ross @rpoo

2 months ago

our universe is pretty rare in configuration space - if strong nuclear force were: * 1% weaker - stars wouldnt make much carbon preventing carbon based life & less heavy element production would delay planet formation & take longer for evolution to occur before stars die * 1%…

127 373 3K 374K 877

Download Image

Kirubakaran Rajendran @kirubaakaran

2 months ago

What if I told you that Jane Street made ₹36,500 crores from Indian markets in just 2 years, and ₹4,800 crores of that was allegedly through market manipulation? They turned India's stock market into their personal ATM using a strategy so clever. Here's the complete details 🧵

318 2K 12K 1.6M 7K

Ziteng Sun @SZiteng

7 months ago

Inference-time procedures (e.g. Best-of-N, CoT) have been instrumental to recent development of LLMs. The standard RLHF framework focuses only on improving the trained model. This creates a train/inference mismatch. Can we align our model to better suit a given inference-time…

5 52 257 66K 240

Download Image

Michael Luo @AzianMike

3 months ago

I got a cease and desist from DocuSign for my free SaaS. A couple of months ago, I saw a tweet from @awilkinson: “I just found out how much we pay for DocuSign and my jaw dropped. What's the best alternative?” Me being naive, I thought “how hard could would it actually be to…

625 1K 19K 1.8M 7K

Download Image

Reso ☕️ @Resorcinolworks

3 months ago

Codeforces may ban Indians after what happened today.

104 92 2K 201K 447

Download Image

Nicholas Tomlin @NickATomlin

4 months ago

The long-term goal of AI is to build models that can handle arbitrary tasks, not just ones they’ve been trained on. We hope our new *benchmark generator* can help measure progress toward this vision

Vivek Verma @vcubingx

4 months ago

3 25 149 37K 93

Download Image

4 30 182 25K 124

Download Image

Uday Kotak @udaykotak

4 months ago

India 10 year bond yield at 6.20% pa. US 4.60%. Gap of 1.60% is probably lowest I recollect. Will we 1 day see Indian yields lower than the US? Depends mainly on relative inflation, risk premium, trust, and liquidity, for global and domestic investors in these 2 countries!