reinforcement learning

News

Researchers unveil IntersectionZoo to evaluate AI learning in complex urban traffic

If there's one thing that characterizes driving in any major city, it's the constant stop-and-go as traffic lights change and ...

Finbold, 2d

Fraction AI launches mainnet on Base

Decentralized AI agent auto-training platform Fraction AI has announced the launch of its mainnet on the Ethereum Layer 2 (L2) network Base.

Agentic AI In Banking: The Future And The Challenges

The financial world is on the brink of a new era marked by greater efficiency, innovation and customer-centric services.

The ‘era of experience’ will unleash self-learning AI agents across the web—here’s how to prepare

AI visionaries predict an 'Era of Experience' where AI learns autonomously, and it will have important implications for application design.

Tech Xplore, 8d

Reinforcement learning boosts reasoning skills in new diffusion-based language model d1

A team of AI researchers at the University of California, Los Angeles, working with a colleague from Meta AI, has introduced d1, a diffusion-large-language-model-based framework that has been improved ...

Psychology Today, 8d

Why AI Gets Learning Right and Cognitive Science Doesn’t

When machines fall short, we adjust. When students do, we blame. Here's what that says about learning and instruction.

AI Is Using Your Likes to Get Inside Your Head

Liking features on social media can provide troves of data about human behavior to AI models. But as AI gets smarter, will it ...

30 seconds vs. 3: The d1 reasoning framework that’s slashing AI response times

Researchers from UCLA and Meta AI have introduced d1, a novel framework using reinforcement learning (RL) to significantly enhance the reasoning capabilities of diffusion-based large language models ...

10d

Is ‘The Era of Experience’ Upon Us? Researchers Propose AI Agents Learn From the World

Computer scientist David Silver was a key developer behind AlphaGo, the pivotal Go-playing program that defeated world ...

GitHub, 15d

TTRL: Test-Time Reinforcement Learning

We investigate Reinforcement Learning (RL) on data without explicit labels for reasoning tasks in Large Language Models (LLMs). The core challenge of the problem is reward estimation during inference ...

Geeky Gadgets, 19d

DeepSeek's Self-Learning Breakthrough That Could Outshine GPT-4

reinforcement learning, and reward modeling. At the heart of this innovation lies DeepSeek GRM, an AI judge carefully designed to evaluate responses with unparalleled precision and adaptability.

IEEE, 20d

Integrating Reinforcement Learning and Virtual Fixtures for Safer Automatic Robotic Surgery

In this letter, we propose a virtual fixture (VF) based safe reinforcement learning framework to ensure safety constraints. The framework ensures that the agent, particularly multi-joint robotic ...
