Former Senior AI researcher @Aleph__Alpha
EVE Online player since 2013
Co-Founder Pageshift Entertainment - Building the worst best story telling AI
pageshift.ai
Joined April 2024
I feel like it is such an achievement that people are starting to appreciate what we are doing here, considering how many times I was told to just simply build a ChatGPT wrapper instead of our own models... https://t.co/iTr82oEAvx
POV: Silicon Valley networking events.
Founder 1: So what are you working on?
Founder 2: We are building this AI Agent B2B SaaS business. What about you?
Founder 1: Oh that’s so cool, we are also building an AI Agent B2B SaaS product.
Founder 2: Awesome, we should connect on…
how the heck is Gemini (via the official chat interface) so unbelievably bad at using web search to look up documentation for a library it clearly does not know how to use
After playing with gpt-oss for a bit, I sadly have to say that it gives me major Microsoft Phi vibes. Heavily overtrained on synthetic data and quite fragile in real-world settings.
I can’t sleep right now so I started to read the source code of chatterbox from @resembleai and I really have to say, their audio tokenizer is damn smart, and the reason why their model sounds this good. They are basically doing diffusion inference steps to clean up their audio.…
just to quickly explain what I am working on.
so I need a dynamic sparse Mixture of Experts (MoE) kernel that allows for highly uneven, batch-based routing behavior.
In a normal MoE training setting we assume / force an even usage of all experts across the full batch. Which is…
This is now the third time in a row that I was already lying in bed and got back up because I had a new idea for a parallel algorithm that would turn a really expensive dense multiplication into a really efficient sparse one.
It would be so easy if TPUs would allow Vector…
We are re-writing our code base from Torch to JAX right now.
And oh boy, it is a good feeling to finally use an XLA-based framework again.
This is like waking up from a really long and bad dream.
I was in the audience, and one key point that wasn’t mentioned here was their argument around the Jevons Paradox: the idea that people will consume more content simply because it’s easier and cheaper to access.
At pageshift, we strongly believe this will be the case. People…
69 Followers 2K Following
I'm a young thinker, a survivor, a visionary - AI is not a god or a gadget - but a mirror of our values, our future, our fears, and our hopes.
800 Followers 5K Following
AI explorer Interpretability, Alignment, Optimization, Safety & More at AryaXAI | AI for Social Good | AAAI UC 23 Scholar | Prev. @ Mila, Bosch, Manipal.
615 Followers 612 Following
r&d @spellbrush @nijijourney @midjourney | built the strongest team of otaku researchers on earth | in charge of Japanese-English operations, product, and community | I rely on LLMs for my Japanese | Umamusume 80% wr
2K Followers 681 Following
enjoying the late pre-agi; making llms go brrr @Aleph__Alpha; yapping about economics of AI systems at https://t.co/tbsybxOMHz
12K Followers 239 Following
In the golden age of machine learning we're bringing hackathon life back to Silicon Valley! Shaping the future of AI, one line of code at a time.
15K Followers 252 Following
A new way to conference, presented by @a16z. Event submissions now OPEN! up next:
SF: Oct 6-12
LA: Oct 13-19
#SFTechWeek #LATechWeek
125K Followers 972 Following
Partner @a16z AI 🤖 and twin to @omooretweets | Investor in @elevenlabsio, @krea_ai, @bfl_ml, @hedra_labs, @WaveFormsAI, @ViggleAI, & more
8K Followers 2K Following
Based on declassified docs & extensive historical research, #EOTS is an exploration into the origins of the UFO phenomenon.
@blacktie_labs
638K Followers 35 Following
We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.
92K Followers 207 Following
LMArena: Open Platform for Community-driven AI Benchmarking. Graduated from UC Berkeley / @lmsysorg. We’re hiring: https://t.co/1OkfLq2n0I
96 Followers 1 Following
Accurate, research-backed summaries from thousands of sources. Powered by an agentic, web-enabled Gemini with auxiliary reasoning.
163K Followers 0 Following
Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
26K Followers 173 Following
A North Star for open AGI. Co-founders: @fchollet @mikeknoop. President: @gregkamradt. Help support the mission - make a donation today.