News
RLVR helps repetition, not generalization AI researcher Nathan Lambert describes the findings as consistent with expectations. "This isn’t a new intuition," he writes, "but a nice new set of results." ...
Customers are expressing frustration with Amazon Web Services over constraints in its AI platform Bedrock, according to a report from The Information. Despite AWS investing in Anthropic, the company ...
Scientists at Alibaba Group have introduced VACE, a general-purpose AI model designed to handle a broad range of video generation and editing tasks within a single system. The model’s backbone is an ...
OpenAI’s ChatGPT Search recorded approximately 41.3 million monthly users in the European Union over the six-month period ending in March 2025, according to the company’s own data. The figure ...
New analysis from Ahrefs shows Google’s "AI Overviews" are driving down clicks to top-ranked websites by over 34%, directly contradicting Google’s own claims. A recent analysis from Ahrefs suggests ...
AI researcher Sebastian Raschka has published a new analysis that looks at how reinforcement learning is used to improve reasoning in large language models (LRMs). In a blog post, he describes how ...
xAI is making a push on efficient AI with the release of Grok 3 Mini, its newest language model. Both Grok 3 and its Mini sibling are available through the xAI API. The Grok 3 family currently ...
Richard S. Sutton's "Bitter Lesson" lays out a hard truth at the heart of modern AI: Not the clever injection of human knowledge, but scalable learning and search algorithms are what deliver lasting ...
A new study from Anthropic examines how university students are using its language model Claude in daily academic work. The analysis reveals discipline-specific usage patterns and raises concerns ...
With support for up to 200,000 tokens, o3 is the first model to achieve a perfect 100 percent on the Fiction.live benchmark using 128,000 tokens—that’s roughly 96,000 words. For any language model ...
BitNet b1.58 2B4T is a new language model from Microsoft designed to operate with minimal energy and memory usage. Unlike conventional language models that rely on 16- or 32-bit floating point numbers ...
Despite recent progress in image generation quality, the empirical analysis reveals notable weaknesses in how GPT-4o handles complex prompts. Researchers evaluated the model across three categories: ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results