News
Customers are expressing frustration with Amazon Web Services over constraints in its AI platform Bedrock, according to a report from The Information. Despite AWS investing in Anthropic, the company ...
RLVR helps repetition, not generalization AI researcher Nathan Lambert describes the findings as consistent with expectations. "This isn’t a new intuition," he writes, "but a nice new set of results." ...
Scientists at Alibaba Group have introduced VACE, a general-purpose AI model designed to handle a broad range of video generation and editing tasks within a single system. The model’s backbone is an ...
OpenAI’s ChatGPT Search recorded approximately 41.3 million monthly users in the European Union over the six-month period ending in March 2025, according to the company’s own data. The figure ...
New analysis from Ahrefs shows Google’s "AI Overviews" are driving down clicks to top-ranked websites by over 34%, directly contradicting Google’s own claims. A recent analysis from Ahrefs suggests ...
xAI is making a push on efficient AI with the release of Grok 3 Mini, its newest language model. Both Grok 3 and its Mini sibling are available through the xAI API. The Grok 3 family currently ...
Richard S. Sutton's "Bitter Lesson" lays out a hard truth at the heart of modern AI: Not the clever injection of human knowledge, but scalable learning and search algorithms are what deliver lasting ...
With support for up to 200,000 tokens, o3 is the first model to achieve a perfect 100 percent on the Fiction.live benchmark using 128,000 tokens—that’s roughly 96,000 words. For any language model ...
A new study from Anthropic examines how university students are using its language model Claude in daily academic work. The analysis reveals discipline-specific usage patterns and raises concerns ...
BitNet b1.58 2B4T is a new language model from Microsoft designed to operate with minimal energy and memory usage. Unlike conventional language models that rely on 16- or 32-bit floating point numbers ...
Despite recent progress in image generation quality, the empirical analysis reveals notable weaknesses in how GPT-4o handles complex prompts. Researchers evaluated the model across three categories: ...
According to the researchers, the results reveal fundamental weaknesses in color perception—even among the largest models currently available. Color plays a central role in human visual cognition and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results