Rohit Malhotra @rohit_malh5

OpenHands Maintainer | Ex-CTO @sitewizai | NLP @ CMU | Primarily interested in Agents | Secondary interests in creative design

malhotra5.github.io

Joined July 2018

Tweets

146
Followers

92
Following

67
Likes

3K

Graham Neubig @gneubig

4 days ago

Which LM is better at agentic coding? We have a bunch of useful academic benchmarks like SWE-Bench, but we don't have a good comparison of agentic coding LMs *in the wild*. To solve this, we released PR Arena: github.com/neulab/pr-arena

7 20 122 15K 58

Jiseung Hong @jiseungh99

4 days ago

Introducing ⚔️PR Arena⚔️ - free AI coding agents to fix real GitHub issues. Claude Sonnet 4 vs Gemini 2.5 Pro… Who writes better pull requests? 👉 Install here: github.com/apps/openhands… Powered by @allhands_ai

4 12 76 28K 20

All Hands AI @allhands_ai

2 weeks ago

Having appropriate tests makes a world of difference for agent-driven development. If your agent can write a test to localize a bug or exercise a new feature, the following implementation is much more solid. OpenHands+GPT-5 is now 🥇 on the SWT-Bench testing leaderboard!

6 18 102 22K 32

All Hands AI @allhands_ai

2 weeks ago

We built OpenHands in the open (~60K ⭐️ on GitHub). Now we’re giving back to the OSS ecosystem. Announcing the OpenHands Cloud OSS Credit Program → $100–$500 credits for maintainers. 👉 Learn how to apply!

1 7 77 7K 22

Robert Brennan @rbren_dev

2 months ago

Nothing more frustrating than seeing "private scaffold" on public benchmark results. I love that model providers like Qwen and Mistral are now reporting their results specifically using OpenHands as the scaffold. Feels like we're becoming a standard here. x.com/Alibaba_Qwen/s…

2 7 94 10K 19

Qwen @Alibaba_Qwen

2 months ago

>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…

316 1K 9K 2.0M 4K

Rohit Malhotra @rohit_malh5

2 months ago

OpenHands is so general-purpose that I now think of leveraging it with workflow-driven prompting. Also stating constraints works well for me. Examples: • Examine the existing architecture, read docs for Y, plan how to implement X, then do it → Instead of: "Implement feature…

0 0 3 161 2

Mistral AI @MistralAI

2 months ago

Introducing Devstral Small and Medium 2507! This latest update offers improved performance and cost efficiency, perfectly suited for coding agents and software engineering tasks.

93 332 2K 395K 504

Graham Neubig @gneubig

2 months ago

What will software development look like in 2026? With coding agents rapidly improving, dev roles may look quite different. My current workflow has changed a lot:
- Work in github, not IDEs
- Agents in parallel
- Write English, not code
- More code review
Thoughts + a video👇

3 16 120 17K 90

Rohit Malhotra @rohit_malh5

3 months ago

PSA for engineering leadership exploring software agent solutions 🚨 This post nails the difference between agentic and agentless approaches — and why it actually matters for real software tasks, beyond SWE-Bench scores!

All Hands AI @allhands_ai

3 months ago

6 22 230 33K 117

0 1 5 515 0

Rohit Malhotra @rohit_malh5

3 months ago

Some users click with code agents. Others struggle. Why? Agents are flexible and creative - just like their users! It's confusing! Agents should understand, educate, and adapt to users. Even personalize. If the agent isn’t willing to grow, the user likely won’t either.

Rohit Malhotra @rohit_malh5

3 months ago

0 1 2 1K 1

0 0 1 216 0

All Hands AI @allhands_ai

3 months ago

What if we could have *trustworthy* agents that don't just write code, but also do research, understand multimodal content, and perform many practically useful tasks? Today at OpenHands, we released a new agent that gets SOTA or competitive performance on 8 diverse tasks.

5 27 174 19K 102

Rohit Malhotra @rohit_malh5

3 months ago

I believe “execution” and “evaluation” are two major challenges to adoption of code agents from a user perspective. Users must learn how to leverage the agent effectively, and how to evaluate its work (asap). This could determine whether good agents also deliver great experiences.