BUPT prof | Sharing latest AI papers & insights | Join me in embracing the AI revolution! #MachineLearning #AI #Innovation
github.com/fly51fly
Joined February 2009
[LG] AdaQAT: Adaptive Bit-Width Quantization-Aware Training
C Gernigon, S Filip, O Sentieys… [Inria] (2024)
arxiv.org/abs/2404.16876
- AdaQAT is an optimization-based method for mixed-precision uniform quantization of both weights and activations in DNNs.
- It uses relaxed…
[CL] Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks
M Ailem, K Marazopoulou, C Siska, J Bono [Microsoft] (2024)
arxiv.org/abs/2404.16966
- The standard approach for evaluating LLMs on benchmarks assumes test prompts represent a random…
[CL] Player-Driven Emergence in LLM-Driven Game Narrative
X Peng, J Quaye, W Xu, C Brockett… [Microsoft Research] (2024)
arxiv.org/abs/2404.17027
- Players interact with non-deterministic NPCs generated by GPT-4 in a text-adventure game where they try to solve a mystery and…
[CL] Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
arxiv.org/abs/2404.14367
- This paper aims to understand the behaviors of various procedures for fine-tuning language models with preference data, including RL, maximum likelihood, and…
[LG] Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
arxiv.org/abs/2403.14608
- Large models have achieved remarkable performance but require substantial computational resources for fine-tuning. Parameter-Efficient Fine-Tuning (PEFT) provides a…
[LG] A Survey on the Memory Mechanism of Large Language Model based Agents
arxiv.org/abs/2404.13501
- The memory module is a key component that differentiates agents from original large language models (LLMs), enabling agent-environment interactions.
- Memory serves…
[CV] How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
arxiv.org/abs/2404.16821
- InternVL 1.5 is an open-source multimodal large language model that aims to bridge the capability gap between open-source and proprietary…
[CV] Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
A Sabour, S Fidler, K Kreis [NVIDIA] (2024)
arxiv.org/abs/2404.14507
- Diffusion models have become state-of-the-art generative models, but are slow to sample from due to their reliance on sequential…
[CV] NeRF-XL: Scaling NeRFs with Multiple GPUs
R Li, S Fidler, A Kanazawa, F Williams [NVIDIA & UC Berkeley] (2024)
arxiv.org/abs/2404.16221
- The paper introduces NeRF-XL, a method for distributing Neural Radiance Fields (NeRFs) across multiple GPUs to enable training and…
[AS] Music Consistency Models
Z Fei, M Fan, J Huang [Kunlun Inc] (2024)
arxiv.org/abs/2404.13358
- Proposes Music Consistency Models (MusicCM), which leverage consistency models for efficient, high-quality music synthesis with minimal sampling steps.
- Builds on existing…
[CL] LLM Evaluators Recognize and Favor Their Own Generations
A Panickssery, S R. Bowman, S Feng [MATS & New York University] (2024)
arxiv.org/abs/2404.13076
- Frontier LLMs like GPT-3.5, GPT-4, and Llama 2 exhibit self-preference when evaluating their own text summaries versus…
[CL] FlowMind: Automatic Workflow Generation with LLMs
Z Zeng, W Watson, N Cho, S Rahimi… [J. P. Morgan AI Research] (2024)
arxiv.org/abs/2404.13050
- FlowMind is a novel approach that uses LLMs to automatically generate workflows, addressing issues like hallucination and data…
[CV] Editable Image Elements for Controllable Synthesis
arxiv.org/abs/2404.16029
- The paper proposes a new image representation called "image elements" that supports spatial editing of input images and faithful reconstruction.
- Image elements are obtained by dividing…
[CV] MoDE: CLIP Data Experts via Clustering
arxiv.org/abs/2404.16030
- The noisy pairing in web-crawled image-caption data hurts the learning of CLIP via contrastive pretraining. Semantically different captions for similar images introduce false negatives, which hurt…
[CV] CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data
arxiv.org/abs/2404.15653
- CatLIP reframes pre-training on image-text data as an image classification task instead of contrastive learning. This removes the need for…
[CL] How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study
W Huang, X Ma, H Qin, X Zheng… [The University of Hong Kong & Beihang University & ETH Zurich] (2024)
arxiv.org/abs/2404.14047
- LLaMA3 models recently released by Meta have achieved state-of-the-art…
[CV] RadRotator: 3D Rotation of Radiographs with Diffusion Models
P Rouzrokh, B Khosravi, S Faghani, K L. Mulford, M J. Taunton, B J. Erickson, C C. Wyles [Mayo Clinic] (2024)
arxiv.org/abs/2404.13000
- The paper introduces a diffusion model-based method to rotate the anatomical…
[LG] Graph Machine Learning in the Era of Large Language Models (LLMs)
W Fan, S Wang, J Huang, Z Chen… [The Hong Kong Polytechnic University] (2024)
arxiv.org/abs/2404.14928
- GNNs have become a cornerstone in Graph ML for modeling complex relationships in graph data across…
[CL] LongEmbed: Extending Embedding Models for Long Context Retrieval
D Zhu, L Wang, N Yang, Y Song, W Wu, F Wei, S Li [Microsoft & Peking University] (2024)
arxiv.org/abs/2404.12096
- Long context embedding models have been confined to a narrow context window not exceeding 8k…
[CL] SpaceByte: Towards Deleting Tokenization from Large Language Modeling
K Slagle [Rice University] (2024)
arxiv.org/abs/2404.14408
- Tokenization improves performance of large LMs but has disadvantages like biases, adversarial vulnerability, worse character modeling, and…