Senior Researcher @TencentGlobal, working on LLMs.
Ph.D. at @UniMelb; Ex @BytedanceTalk, @MSFTResearch
timhuang1.github.io
Melbourne, Australia
Joined June 2016
🌺GPT-4o’s image generation is stunning — but how well does it handle complex scenarios? 🤔
We introduce 🚀CIGEVAL🚀, a novel method to evaluate models' capabilities in Conditional Image Generation 🖼️➕🖼️🟰🖼️. Find out how top models perform when conditions get truly…
These findings resonate with my impressions. AFAIC, structured prompting outperforms CoT & ICL by steering LLMs through workflows.
Great to see this ‘rebuttal’ backed by such rigorous analysis — reminds me of the insights in LLMs Cannot Self-Correct. We need more like this!
To Code, or Not To Code?
Exploring Impact of Code in Pre-training
discuss: huggingface.co/papers/2408.10…
Including code in the pre-training data mixture, even for models not specifically designed for code, has become a common practice in LLMs pre-training. While there has been…
🚀 A game-changer benchmark: LLM-Uncertainty-Bench 🌟
📚 We introduce "Benchmarking LLMs via Uncertainty Quantification", which challenges the status quo in LLM evaluation.
💡 Uncertainty matters too: we propose a novel uncertainty-aware metric, which tests 8 LLMs across 5…
FuseChat
Knowledge Fusion of Chat Models
While training large language models (LLMs) from scratch can indeed lead to models with distinct capabilities and strengths, this approach incurs substantial costs and may lead to potential redundancy in competencies. An alternative…
15K Followers 6K Following
I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
316 Followers 516 Following
Researcher at the Alibaba DAMO Academy, Singapore R&D Center | Former Visiting Postdoc Researcher at UIUC @uiuc_nlp | NLP PhD from CUHK @CUHKofficial
2K Followers 2K Following
Ph.D. Student @PrincetonCS. Prev @Stanford @UW @pika_labs @MSFTResearch @UofIllinois @ZJU_China. I used to work on computer vision, but it's not all I do.
2K Followers 475 Following
Ph.D. student @HKUSTGuangZhou, Researcher @MetaGPT_, Cofounder of OpenManus, previously at RUC, Lenovo Research AI Lab, Zhipu AI.
225 Followers 569 Following
Second year PhD @UW | Post-Training, LLM reasoning and synthetic dataset.
https://t.co/cYAkbnCsCp
Open to chat and collaborate!
12K Followers 3K Following
PhD-ing @MIT_CSAIL. Working on scalable and principled algorithms in #LLM and #MLSys. In open-sourcing I trust 🐳. she/her/hers
168 Followers 1K Following
Journal of Contemporary Eastern Asia (ISSN 2383-9449) is a refereed biannual journal that takes a lead on a new scholarship in Asia. Tweet by @zhang_dechun
92K Followers 207 Following
LMArena: Open Platform for Community-driven AI Benchmarking. Graduated from UC Berkeley / @lmsysorg. We’re hiring: https://t.co/1OkfLq2n0I
618 Followers 205 Following
🎓 PhD at Tsinghua University. Focus on RL, Embodied AI, and MLLM. 📖 Author of limit-of-RLVR, phyworld, DeeR-VLA. 💼 Currently seeking a visiting position.
579 Followers 551 Following
CS Ph.D. at National University of Singapore (🇸🇬NUS-PhDing)
CS & STAT B.S. at University of Illinois Urbana-Champaign (🇺🇸UIUC-BS)
793 Followers 961 Following
Multidisciplinary artist. Pushing creative boundaries with AI & photography. Exploring a wide range of topics. Capturing stunning visuals & creating magic