Our new work on test-time adaptation has been accepted at #CIKM2025 (@cikm2025)! I would like to thank my Ph.D. advisor, Prof. Chae and collaborators Amit and @Hitesh_LPatel for their contributions!
Sad to miss ACL in Vienna but so many of our members of SEACrowd are going to be there to present this work🔥
Reach out or find us in our merch 😉
Learn about our ongoing cool initiatives and how to participate or get our merch 😎
Sad to miss ACL in Vienna but so many of our members of SEACrowd are going to be there to present this work🔥
Reach out or find us in our merch 😉
Learn about our ongoing cool initiatives and how to participate or get our merch 😎
🚀 Excited to share that four of our papers have been accepted to #ACL2025!
Grateful to see our work across multimodal learning, retrieval, and speech recognized this year.
While I won’t be attending in person, our team will be there to present and connect on-site.
Accepted…
Excited to announce @ArionDas, CS undergrad at IIIT Ranchi and co-author of "SweEval: Do LLMs Really Swear?", will be speaking at our AI safety and alignment group.
He'll discuss LLM safety, handling sensitive language, and insights from the SweEval-Bench dataset.
Based on simple yet effective semantic selection criterion:
1. Negatives closer to the query than positives;
2. Yet far enough from the positive to avoid noise;
Use clustering and dimensionality reduction to do negative sampling at scale.
Based on simple yet effective semantic selection criterion:
1. Negatives closer to the query than positives;
2. Yet far enough from the positive to avoid noise;
Use clustering and dimensionality reduction to do negative sampling at scale.
Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems
Oracle presents a scalable hard-negative mining framework for domain-specific enterprise data, dynamically selecting challenging negatives to enhance re-ranking models.
📝arxiv.org/abs/2505.18366
We have finally made the dataset public for everyone to use: [link in the comments]
SweEval got accepted in NAACL '25 industry track.
It already has 120+ downloads, try out our dataset to know how LLMs fare with swear words!
#nlpoli#research#acl2025#naacl#LLMs#claude#gpt
We have finally made the dataset public for everyone to use: [link in the comments]
SweEval got accepted in NAACL '25 industry track.
It already has 120+ downloads, try out our dataset to know how LLMs fare with swear words!
#nlpoli#research#acl2025#naacl#LLMs#claude#gpt
Big news!
Three of our papers have been accepted to ACL 2025 in Vienna!
Grateful to all collaborators and reviewers. Excited to present our work and learn from the global NLP community.
See you in Austria this August! #ACL2025#NLP#AI
🌏 Join SEA-VL Phase 2 – Help Build a Vision-Language Model for Southeast Asia!
We’re thrilled to announce the launch of SEA-VL Phase 2, a global community initiative to build VLMs that truly understands SEA—its languages, cultures, and visual richness.🧵(1/x)
#nlproc#seacrowd
📢 Calling all SEA-passionate individuals!
SEACrowd is excited to launch our contributor call for SEA-VL Phase 2: Building Visual Language Models for Southeast Asia! 🌏
After the success of Phase 1 where we created a culturally grounded image dataset and benchmarking study we're…
80 Followers 135 FollowingPhD student interested in AI in Protein Design.
My research is focused on multi-state proteins and intrinsically disordered protein systems.
26 Followers 270 FollowingFounder, techire ai.
AI - Generative, Conversational, Agentic.
Top & trusted AI recruiting firm, 6+ years hiring n Applied AI & Research. 10+ years in tech.
563 Followers 319 FollowingPostdoc@UIUC, advised by Prof. Heng Ji @hengjinlp and Prof. Chengxiang Zhai. Robust and trustworthy LLMs. LLM hallucination. LLM knowledge. LLM reasoning.
83K Followers 8K FollowingCompiling in real-time, the race towards AGI.
🗞️ Don't miss my daily top 1% AI analysis newsletter directly to your inbox 👉 https://t.co/6LBxO8215l
62K Followers 27K Followingfounder, cyphr vc and previously k2 global. 240M+ prev invested in startups. former lots of things, including journalist, teacher, lawyer, pm, and more.
610 Followers 1K FollowingProfessor at Texas A&M University; ML/AI researcher; optimization for ML/AI; large reasoning models, developing LibAUC library for training deep neural nets.
50 Followers 153 FollowingResearch Scientist at #Upstage;
I've got Ph.D in #KoreaUniversity;
My research focuses on solving real-world medical NLP problems
115 Followers 187 FollowingResearch Associate Professor at The Artificial Intelligence Institute of UofSC (AIISC), UoSC
+
Advisory Scientist at Wipro AI.
800 Followers 5K FollowingAI explorer Interpretability, Alignment, Optimization, Safety & More at AryaXAI | AI for Social Good | AAAI UC 23 Scholar | Prev. @ Mila,Bosch,Manipal.
14K Followers 6K FollowingAI/ML engineer. Previously at Google: Product Manager for Keras and TensorFlow and developer advocate on TPUs. Passionate about democratizing Machine Learning.
80 Followers 135 FollowingPhD student interested in AI in Protein Design.
My research is focused on multi-state proteins and intrinsically disordered protein systems.
26 Followers 270 FollowingFounder, techire ai.
AI - Generative, Conversational, Agentic.
Top & trusted AI recruiting firm, 6+ years hiring n Applied AI & Research. 10+ years in tech.
563 Followers 319 FollowingPostdoc@UIUC, advised by Prof. Heng Ji @hengjinlp and Prof. Chengxiang Zhai. Robust and trustworthy LLMs. LLM hallucination. LLM knowledge. LLM reasoning.
83K Followers 8K FollowingCompiling in real-time, the race towards AGI.
🗞️ Don't miss my daily top 1% AI analysis newsletter directly to your inbox 👉 https://t.co/6LBxO8215l
62K Followers 27K Followingfounder, cyphr vc and previously k2 global. 240M+ prev invested in startups. former lots of things, including journalist, teacher, lawyer, pm, and more.
5K Followers 3 FollowingTweeting interesting papers submitted at https://t.co/rXX8x0HzXV.
Submit your own at https://t.co/QhbJKXBd4Q, and link models/datasets/demos to it!
610 Followers 1K FollowingProfessor at Texas A&M University; ML/AI researcher; optimization for ML/AI; large reasoning models, developing LibAUC library for training deep neural nets.