Abstract: This paper proposes a new approach to code vulnerability detection that uses large language models (LLMs) and incorporates adversarial training techniques. It aims to enhance the robustness ...
Abstract: Autonomous navigation for autonomous ground vehicles (AGVs) is currently a highly active research area. With the advancement of Internet of Things (IoT) technologies, AGVs ...
MolAct is an agentic RL framework that trains LLMs to design molecules through a multi-turn "Think-Tool-Observation" loop. By leveraging GRPO and a two-stage training paradigm: mastering basic editing ...
On February 2nd, 2025, computer scientist and OpenAI co-founder Andrej Karpathy made a flippant tweet that launched a new phrase into the internet’s collective consciousness. He posted that he’d ...
ActiveVLN is a Vision-and-Language Navigation (VLN) framework designed to enable active exploration through multi-turn reinforcement learning. Unlike traditional VLN methods, which rely on imitation ...