Showing 201–220 of 2,256 results
/ Date/ Name
Apr 23, 2026Doubly Saturated Ramsey Graphs: A Case Study in Computer-Assisted Mathematical DiscoveryApr 23, 2026Feedback Over Form: Why Execution Feedback Matters More Than Pipeline Topology in 1-3B Code GenerationApr 22, 2026Adaptive Instruction Composition for Automated LLM Red-TeamingApr 22, 2026Dialect vs Demographics: Quantifying LLM Bias from Implicit Linguistic Signals vs. Explicit User ProfilesApr 22, 2026Using Machine Mental Imagery for Representing Common Ground in Situated DialogueApr 22, 2026Enhancing Science Classroom Discourse Analysis through Joint Multi-Task Learning for Reasoning-Component ClassificationApr 22, 2026Cross-Session Threats in AI Agents: Benchmark, Evaluation, and AlgorithmsApr 22, 2026Materialistic RIR: Material Conditioned Realistic RIR GenerationApr 22, 2026Leveraging Multimodal LLMs for Built Environment and Housing Attribute Assessment from Street-View ImageryApr 22, 2026Propensity Inference: Environmental Contributors to LLM BehaviourApr 22, 2026Behavioral Consistency and Transparency Analysis on Large Language Model API GatewaysApr 22, 2026Serialisation Strategy Matters: How FHIR Data Format Affects LLM Medication ReconciliationApr 22, 2026StyleVAR: Controllable Image Style Transfer via Visual Autoregressive ModelingApr 22, 2026A Deep U-Net Framework for Flood Hazard Mapping Using Hydraulic Simulations of the Wupper CatchmentApr 22, 2026Value-Conflict Diagnostics Reveal Widespread Alignment Faking in Language ModelsApr 22, 2026Breaking MCP with Function Hijacking Attacks: Novel Threats for Function Calling and Agentic ModelsApr 22, 2026Differentially Private Model MergingApr 22, 2026Thinking Like a Botanist: Challenging Multimodal Language Models with Intent-Driven Chain-of-InquiryApr 22, 2026SpeechParaling-Bench: A Comprehensive Benchmark for Paralinguistic-Aware Speech GenerationApr 22, 2026AVISE: Framework for Evaluating the Security of AI Systems