site:the-decoder.com - Search News

News

Blackmail becomes go-to strategy for AI models facing shutdown in new Anthropic tests

This wasn't a one-off. In a text-only version of the same test, Claude Opus 4 chose blackmail 96 percent of the time. Google's Gemini 2.5 Flash nearly matched that rate. OpenAI's GPT-4.1 and xAI's ...

the-decoder · 17d

Elon Musk wants to rewrite "the entire corpus of human knowledge" with Grok

Elon Musk has announced plans to retrain Grok with statements he calls "politically incorrect, but nonetheless factually true," claiming this will correct and expand all human knowledge. Previously, ...

the-decoder · 15d

Elevenlabs launches 11ai, a voice assistant that uses MCP to integrate with digital workflow tools

11ai also supports custom MCP servers. Teams can connect internal tools or specialized software to 11ai through their own MCP servers, extending the assistant’s functionality to fit their workflows.

the-decoder · 18d

Apple executives have held internal discussions about potentially bidding for AI startup Perplexity

Apple executives have been talking internally about potentially buying AI startup Perplexity AI, according to a Bloomberg report. The idea is to grab both the technology and talent for Apple's own ...

the-decoder · 15d

Google hands off Agent2Agent protocol to Linux Foundation for open AI agent standard

Google has handed over the Agent2Agent (A2A) protocol to a new open source project led by the Linux Foundation with the aim of creating a uniform communication standard for AI agents from different ...

the-decoder · 15d

Microsoft has introduced an AI agent to the Windows Settings menu

The assistant is powered by a new language model called "Mu," which Microsoft developed specifically for the task. Mu has 330 million parameters and uses an encoder-decoder architecture that, ...

the-decoder · 17d

AI learns math reasoning by playing Snake and Tetris-like games rather than using math datasets

Snake training outperforms math datasets in some areas: Training on Snake and rotation problems nudged the base model slightly ahead of MM-Eureka-Qwen-7B, a model specifically trained on math data, ...

the-decoder · 20d

MiniMax's Hailuo 02 tops Google Veo 3 in user benchmarks at much lower video costs

MiniMax says it's working to improve generation speed, stability, and add new features beyond the current text-to-video and image-to-video options. Competing platforms like Runway already offer more ...

the-decoder · 29d

Zuckerberg forms elite AI team to catch up with competitors

Mark Zuckerberg is personally setting up a new team of 50 experts, known as the 'Superintelligence Group', to close Meta's gap in AI development. He is conducting the personnel interviews ...

the-decoder · 24d

Salesforce's CRM benchmark finds AI agents struggle in real-world business scenarios

Salesforce has launched CRMArena-Pro, a benchmark designed to evaluate AI agents in practical business situations, including multi-step conversations and data protection checks within CRM systems.

the-decoder · 19d

LAION and Intel introduce tools that help AI gauge the intensity of 40 distinct emotions

LAION and Intel have released Empathic-Insight, a suite of models and datasets that can analyze facial images and audio files across 40 emotion categories, covering not only emotional but also ...

the-decoder · 23d

New study supports Apple's doubts about AI reasoning, but sees no dead end

The RELIC test works by giving an AI a formal grammar - essentially a precise rule set that defines an artificial language - along with a string of symbols. The model must then decide whether the ...
