Showing 241–260 of 2,256 results
Date          Name
Apr 22, 2026  Enhancing Research Idea Generation through Combinatorial Innovation and Multi-Agent Iterative Search Strategies
Apr 22, 2026  Evian: Towards Explainable Visual Instruction-tuning Data Auditing
Apr 22, 2026  CHASM: Unveiling Covert Advertisements on Chinese Social Media
Apr 22, 2026  Mythos and the Unverified Cage: Z3-Based Pre-Deployment Verification for Frontier-Model Sandbox Infrastructure
Apr 22, 2026  Knowledge Capsules: Structured Nonparametric Memory Units for LLMs
Apr 22, 2026  MOMO: A framework for seamless physical, verbal, and graphical robot skill learning and adaptation
Apr 22, 2026  Adaptive Defense Orchestration for RAG: A Sentinel-Strategist Architecture against Multi-Vector Attacks
Apr 22, 2026  DialToM: A Theory of Mind Benchmark for Forecasting State-Driven Dialogue Trajectories
Apr 22, 2026  Onyx: Cost-Efficient Disk-Oblivious ANN Search
Apr 22, 2026  SafeRedirect: Defeating Internal Safety Collapse via Task-Completion Redirection in Frontier LLMs
Apr 22, 2026  CyberCertBench: Evaluating LLMs in Cybersecurity Certification Knowledge
Apr 22, 2026  Building a Precise Video Language with Human-AI Oversight
Apr 22, 2026  Surrogate modeling for interpreting black-box LLMs in medical predictions
Apr 22, 2026  ActuBench: A Multi-Agent LLM Pipeline for Generation and Evaluation of Actuarial Reasoning Tasks
Apr 22, 2026  Text Steganography with Dynamic Codebook and Multimodal Large Language Model
Apr 22, 2026  Hybrid Policy Distillation for LLMs
Apr 22, 2026  Towards Secure Logging: Characterizing and Benchmarking Logging Code Security Issues with LLMs
Apr 22, 2026  Taint-Style Vulnerability Detection and Confirmation for Node.js Packages Using LLM Agent Reasoning
Apr 22, 2026  AgentSOC: A Multi-Layer Agentic AI Framework for Security Operations Automation
Apr 22, 2026  EvoAgent: An Evolvable Agent Framework with Skill Learning and Multi-Agent Delegation