site:the-decoder.com - Search News

News

Google expands search with Audio Overviews and AI-Powered Voice Search

In addition to Audio Overviews, Google is piloting a new feature called Search Live in the US. Available in the AI Mode of the Google app for Android and iOS, Search Live lets users issue voice ...

the-decoder16d

Salesforce's CRM benchmark finds AI agents struggle in real-world business scenarios

CRMArena-Pro is designed to test how well large language models (LLMs) can function as agents in real-world business settings, especially for CRM tasks like sales, customer service, and pricing. The ...

the-decoder16d

Mechanize is building digital offices to train AI agents to fully automate computer work

Mechanize's ambitions go beyond code. The company wants AI agents to handle every digital task, from planning and communication to execution. "We’ll only truly know we’ve succeeded once we’ve created ...

the-decoder16d

Rednote releases its first open-source LLM with a Mixture-of-Experts architecture

With 300 million monthly users, Rednote is jumping into a crowded Chinese AI market led by companies like Alibaba, Baidu, Tencent, Bytedance, and the upstart Deepseek.The new model comes from ...

the-decoder16d

ChatGPT lost badly to Atari's 1979 Video Chess engine

Some critics say Caruso's experiment compares apples and oranges, but it underscores a core weakness of LLMs: ChatGPT didn't lose because it lacked knowledge. It lost because it couldn't remember.

the-decoder17d

Anthropic shares blueprint for Claude Research agent using multiple AI agents in parallel

Anthropic has shared the design for its new research agent, which uses a multi-agent approach: a main agent analyzes questions, creates strategies, and assigns specialized sub-agents to work on ...

the-decoder17d

Apple's new AI benchmarks show its models still lag behind leaders like OpenAI and Google

Apple developed two models: a compact 3-billion-parameter version for on-device use, and a larger server-based model. In Apple's own benchmarks, the 3B model edges out similarly sized models like Qwen ...

the-decoder17d

OpenAI updates ChatGPT search with smarter answers and image search

OpenAI has rolled out a major update to ChatGPT's integrated search, introducing smarter answers, better handling of long conversations, and a new image search feature. According to OpenAI, the ...

the-decoder17d

Google Deepmind launches Weather Lab to test AI models for tropical cyclone forecasting

Google Deepmind and Google Research have launched Weather Lab, a public platform that tests AI models for forecasting tropical cyclones. The new system uses a type of machine learning called ...

the-decoder17d

Nvidia's Huang disputes Anthropic CEO's claim that AI will eliminate half of entry-level office jobs

Nvidia CEO Jensen Huang is pushing back against Anthropic CEO Dario Amodei, adding to a week of criticism already aimed at Amodei by Meta's AI chief researcher Yann LeCun.Speaking at VivaTech in Paris ...

the-decoder17d

ChatGPT users experienced psychotic episodes after following harmful advice from the chatbot,

Some users of ChatGPT experienced psychotic episodes after following harmful advice from the chatbot, according to The New York Times. In several cases, ChatGPT reinforced dangerous ideas, including ...

the-decoder18d

Anthropic researchers teach language models to fine-tune themselves

Traditionally, large language models are fine-tuned using human supervision, such as example answers or feedback. But as models grow larger and their tasks more complicated, human oversight becomes ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results