Joshua Ong @joshuaongg21

Visiting Researcher @EdinburghNLP | PhD Student @imperialcollege LLM Reasoning | Autoformalisation | Neurosymbolic AI joshuaongg21.github.io England, United Kingdom Joined February 2024

Tweets

174
Followers

78
Following

164
Likes

524

Joshua Ong @joshuaongg21

5 days ago

We introduce PiCSAR (Probabilistic Confidence Selection And Ranking)💡: A simple training-free method for scoring samples based on probabilistic confidence, selecting a reasoning chain with the highest confidence from multiple sampled responses. ✏️PiCSAR is generalisable across…

2 29 93 13K 61

Download Image

Wenda Li @WendaLi8

5 days ago

Picking reasoning chains by confidence works!

Joshua Ong @joshuaongg21

5 days ago

Picking reasoning chains by confidence works!

2 29 93 13K 61

Download Image

0 2 16 1K 4

Eleonora Giunchiglia @e_giunchiglia

5 days ago

🚀 Excited to see our work on PiCSAR out! Thrilled to have Joshua as a co-author — and even more thrilled that he’ll be joining my group this academic year. Big things ahead!

Joshua Ong @joshuaongg21

5 days ago

🚀 Excited to see our work on PiCSAR out! Thrilled to have Joshua as a co-author — and even more thrilled that he’ll be joining my group this academic year. Big things ahead!

2 29 93 13K 61

Download Image

0 3 11 2K 5

Lei Yu @LeiYu63

a month ago

Building on what @lmthang shared, here's another fun fact from our final push: we finalized the model checkpoint selection just 5 hours before the IMO problems were released! It's incredible to see the model we were babysitting over that final weekend now demonstrating…

Thang Luong @lmthang

a month ago

18 26 438 74K 69

Download Image

0 3 35 3K 3

Pasquale Minervini @PMinervini

a month ago

Massive massive massive congrats @GiwonHong413849 🚀🚀🚀🚀🚀🚀

0 3 45 3K 3

Download Image

Sohee Yang @ ACL 2025 @soheeyang_

a month ago

Our paper "Do Large Language Models Perform Latent Multi-Hop Reasoning without exploiting shortcuts?" will be presented at #ACL2025 today. 📍 Mon 18:00-19:30 Findings Posters (Hall X4 X5) Please visit our poster if you are interested! ✨

Sohee Yang @ ACL 2025 @soheeyang_

9 months ago

7 47 199 42K 135

Download Gif

0 10 72 4K 12

Pasquale Minervini @PMinervini

a month ago

Notable mention -- Joshua Ong (@joshuaongg21), main author of "Theorem Prover as a Judge for Synthetic Data Generation" (arxiv.org/abs/2502.13137), just finished his BSc in Maths at Edinburgh, and he is now starting a PhD with @e_giunchiglia at Imperial! Keep him on your radar 🚀

Pasquale Minervini @PMinervini

a month ago

1 17 72 13K 4

Download Image

4 2 18 3K 1

Kiril Gashteovski @kgashteo

a month ago

A NeurIPS review.

Yiping Lu @2prime_PKU

a month ago

A NeurIPS review.

271 465 5K 615K 505

Download Image

0 1 1 846 0

Denny Zhou @denny_zhou

a month ago

Slides for my lecture “LLM Reasoning” at Stanford CS 25: dennyzhou.github.io/LLM-Reasoning-… Key points: 1. Reasoning in LLMs simply means generating a sequence of intermediate tokens before producing the final answer. Whether this resembles human reasoning is irrelevant. The crucial…

41 459 3K 266K 4K

Wenda Li @WendaLi8

2 months ago

Lovely to see the impressive performance of the Seed Prover developed by the ByteDance Seed team at IMO 2025 — achieving a silver-level score (30 out of 42) within three days, and reaching (35 out of 42) with extended compute time. leanprover.zulipchat.com/#narrow/channe…