Yihao Wang @Against_Entropy

Joined August 2023

Tweets

21
Followers

15
Following

121
Likes

92

NEXA AI @nexa_ai

a week ago

🚀 From GPU → NPU: on-device AI gets real - here’s what we launched recently ⬇️ OmniNeural-4B - world’s first NPU-aware multimodal model (text, image, audio). 🎙 9× faster audio (vs Whisper) 🖼 3.5× faster image (vs SigLIP) nexaML - run SOTA models locally in 1 line, dead-easy…

1 3 9 255 1

NEXA AI @nexa_ai

a week ago

🚀 Nexa SDK is first to support SOTA VLM, @googleaidevs's Gemma 3n (single-image understanding), with 1-line setup, fully local on any Windows GPU. So your app can 📰 read image-based news & threads 🛍️ auto-label products in marketplaces 🔎 make screenshots & charts searchable

1 2 11 276 0

Download Video

NEXA AI @nexa_ai

2 weeks ago

AI has always been GPU-first. But on-device AI should be NPU-first. Today we’re launching OmniNeural-4B — the world’s first NPU-aware multimodal model, natively understanding text, images, and audio. And introducing nexaML — a generative AI inference engine that runs models on…

11 34 107 33K 50

Download Video

NEXA AI @nexa_ai

4 weeks ago

Run @OpenAI's gpt-oss model on your Mac with MLX & GGUF, side-by-side Nexa-sdk now supports GPT-OSS in MLX (Apple-optimized) and GGUF formats, right from your command line with a single line of code. On an M4 Max, MLX hit ~103 tok/s, about 25% faster than GGUF in our tests.…

2 3 11 5K 2

Download Video

Zack Li-Nexa AI @zacklearner

a month ago

🧠 Every year I lose hundreds of hours digging through local folders for that one PDF, slide, or doc. But now there’s Hyperlink — a fully offline AI agent that instantly searches my files, with citations. No cloud. No spying. Just my files, my device, my AI. Public beta 🔗👇…

NEXA AI @nexa_ai

a month ago

10 33 167 131K 31

Download Video

0 3 3 297 0

NEXA AI @nexa_ai

2 months ago

👀 Sneak peek: new Nexa SDK Beta Ship on‑device AI in one command: • Run MLX models locally • One‑line GGUF quant selection from HF • Multimodal input right in your CLI This is just the warm‑up—bigger updates for developers coming soon. 🔥

Zack Li-Nexa AI @zacklearner

2 months ago

1 0 5 898 0

Download Video

0 4 14 795 2

NEXA AI @nexa_ai

2 months ago

We've been quiet—but busy building. Previously, we created Nexa-SDK and on-device AI models like Octopus-v2, OmniVLM, and OmniAudio, sparking excitement in the community. 🔥 Indeed, on device extends what's possible with AI. We are dedicated to make on-device AI friction‑free…

2 5 37 67K 2

Download Image

merve @mervenoyann

10 months ago

OmniVision-968M: a new local VLM for edge devices, fast & small but performant 👏 it's based on SigLIP-so-400M and Qwen-2.5-0.5B 💨 9x less image tokens, super efficient 📖 aligned with SFT and DPO for reducing hallucinations 🔥 Apache 2.0 license

23 131 931 97K 659

Download Image

AK @_akhaliq

a year ago

Dolphin discuss: huggingface.co/papers/2408.15… Long Context as a New Modality for Energy-Efficient On-Device Language Models This paper presents Dolphin, a novel decoder-decoder architecture for energy-efficient processing of long contexts in language models. Our approach addresses…

2 21 90 17K 46

Download Image

NEXA AI @nexa_ai

a year ago

🚀 Don't miss out on this event! 🌟 🗓️ Date: Thursday, August 1, 2024 🕒 Time: 8:00 PM - 9:00 PM PST 🔗 Register now: lu.ma/nl91404h Join us for the "Getting Started with Octopus V2" workshop, featuring our talented engineer Ethan Wang. Whether you're new to the…

0 1 4 209 0

Download Image

NEXA AI @nexa_ai

a year ago

Check out how our Coral SDK performs on the @SamsungMobile S23! Watch the video to see the impressive speed and token generation capabilities. @SamsungUS @SamsungMobileUS @samsung_dev #AI #SDK #EdgeComputing #MachineLearning #AIDevelopment #CoralSDK

1 3 7 864 0

Download Video

NEXA AI @nexa_ai

a year ago

Our Coral SDK achieves impressive inference speeds, faster than Llama-cpp and MLC-LLM for inference on CPU, and we run the benchmark on Phi3 on @Google Pixel 6. Coral SDK supports @Android, iOS, MacOS, and @Windows with multi-processor and flexible quantization options. Discover…

3 10 95 14K 6

Download Video

MIT CSAIL @MIT_CSAIL

a year ago

On-device AI planning agents can be more powerful than previously thought. While virtual assistants like Siri struggle w/complex tasks, MIT & @NEXA4AI from Stanford show that a 3B on-device model can achieve a 97% success rate in planning. Introducing Octo-planner, a language…

3 23 91 17K 50

Download Video

NEXA AI @nexa_ai

a year ago

🚀Join us for the Super AI Agent Hackathon with @huggingface! 🤗 Build innovative AI Agents applications across domains, pushing the boundaries of what AI Agents can do with the latest Octopus models. 📍 Location: Stanford University/Virtual 🏆 Prize：$7,000 Register now and…

4 32 97 10K 11

Download Image

NEXA AI @nexa_ai

a year ago

🚀 Introducing our Coral SDK! 🌟 Run LLMs and build AI agents on edge devices with ease. 🔹 CPU & GPU support 🔹 Multiple compression options 🔹 Runs on Raspberry Pi 🔹 Available for Android, iOS, macOS, and Windows Build powerful AI on the edge! 🌐✨ Find us:…

1 8 101 6K 8

Download Image

NEXA AI @nexa_ai

a year ago

🌟 In 2007, the introduction of the iPhone revolutionized the traditional keypad phone. Now, NEXA AI is taking this evolution of human-computer interaction further, moving from complex UI workflows 🖥️ to a more intuitive approach: natural language 🗣️ Just tell our AI agent…

4 60 551 39K 24

Download Video

NEXA AI @nexa_ai

a year ago

Octoverse got 🥇#1 on Product Hunt (May 21) Our AI Agent models, surpassing GPT-4o's function-calling abilities, revolutionize app interactions📱 and workflows ✨ Learn more: nexa4ai.com/octoverse

3 7 17 424K 2

Yihao Wang @Against_Entropy

2 years ago

👋Glad to introduce an interesting project of mine - Kanji-Streaming🌈an interesting dialogue system powered by StreamDiffusionIO 💫Chatbot will stream out diffusion-generated fake Kanji, instead of original text 🚀 See GitHub repo for all the details: github.com/AgainstEntropy…