Latest AI News

Anthropic Revises Safety Commitment, Shifts to Transparency Reports

Anthropic Revises Safety Commitment, Shifts to Transparency Reports

Anthropic has abandoned its earlier pledge to halt training and releasing frontier AI models until it could guarantee safety mitigations. The company now relies on detailed safety roadmaps, regular risk reports, and transparency disclosures instead of strict pre‑conditions. Executives describe the change as pragmatic, while critics argue it highlights the limits of voluntary safety promises without regulatory oversight. The new policy aims to keep Anthropic competitive while still emphasizing safety, but observers note that the shift may signal a broader industry move away from self‑imposed restraints.

Judge Finds No Evidence OpenAI Stole xAI Trade Secrets, Dismisses Lawsuit

Judge Finds No Evidence OpenAI Stole xAI Trade Secrets, Dismisses Lawsuit

A federal judge ruled that xAI has not provided sufficient evidence to prove that OpenAI poached its employees or misappropriated its trade secrets. The court dismissed the claim that OpenAI should be liable for actions taken by new hires before they joined the company, and highlighted the lack of concrete proof that OpenAI acquired, disclosed, or used any confidential information. The decision underscores the challenges xAI faces in substantiating its allegations and signals that the lawsuit will require a stronger evidentiary foundation to proceed.

Riley Walz Joins OpenAI to Pioneer New Human‑AI Interaction Interfaces

Riley Walz Joins OpenAI to Pioneer New Human‑AI Interaction Interfaces

Software engineer and internet provocateur Riley Walz is joining OpenAI to help invent and prototype novel ways for people to work with artificial intelligence. Known for viral projects such as Jmail and Find My Parking Cops, Walz will operate within OAI Labs under research leader Joanne Jang. The hire reflects OpenAI’s push to stay ahead of competitors by expanding beyond ChatGPT and exploring fresh AI collaboration tools.

Anthropic Softens Safety Commitments Amid Pentagon Pressure

Anthropic Softens Safety Commitments Amid Pentagon Pressure

Anthropic announced a revision to its Responsible Scaling Policy, replacing hard safety tripwires with more flexible risk reports and safety roadmaps. The change follows reports that Defense Secretary Pete Hegseth urged the company to grant the military unrestricted access to its Claude AI model, threatening penalties under the Defense Production Act. Anthropic’s leadership argued that strict halts on model training would no longer help anyone given the rapid pace of AI development. Critics warned the shift could erode safeguards and enable a gradual “frog‑boiling” of safety standards.

OpenClaw creator urges AI builders to stay playful and keep experimenting

OpenClaw creator urges AI builders to stay playful and keep experimenting

Peter Steinberger, the developer behind the viral AI agent OpenClaw and now an OpenAI employee, told listeners on OpenAI’s Builders Unscripted podcast that the best way to work with modern AI is to explore, stay playful, and accept that expertise develops over time. He described his own path from a WhatsApp‑integrated tool to the OpenClaw prototype, emphasizing that AI models can solve problems without explicit programming and that learning to code with AI is a skill that improves with practice.

Hacker Exploits Anthropic’s Claude Chatbot to Breach Mexican Government Agencies

Hacker Exploits Anthropic’s Claude Chatbot to Breach Mexican Government Agencies

A hacker leveraged Anthropic's Claude chatbot to identify vulnerabilities and automate attacks against multiple Mexican government agencies, stealing roughly 150GB of data that included taxpayer records and employee credentials. The adversary also used OpenAI's ChatGPT to gather additional network information. Anthropic responded by investigating, disrupting the activity, and banning the involved accounts, while its latest model, Claude Opus 4.6, now includes safeguards against such misuse. Gambit Security, which uncovered the operation, suggested a possible link to a foreign government, though the hacker remains unidentified.