Showing 1–18 of 18 results
/ Date/ Name
May 30, 2025MUSE: Model-Agnostic Tabular Watermarking via Multi-Sample SelectionFeb 23, 2025TabGen-ICL: Residual-Aware In-Context Example Selection for Tabular Data GenerationApr 1, 2026Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language ModelsJun 11, 2025A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI AutonomyOct 13, 2025Deep Research with Open-Domain Evaluation and Multi-Stage Guardrails for SafetyFeb 26, 2025TestNUC: Enhancing Test-Time Computing Approaches and Scaling through Neighboring Unlabeled Data ConsistencyJul 26, 2024Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient RecommendationAug 12, 2025A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language ModelsMay 1, 2025LLM-Based Human-Agent Collaboration and Interaction Systems: A SurveyApr 24, 2024ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value ExtractionOct 7, 2025RECODE-H: A Benchmark for Research Code Development with Interactive Human FeedbackOct 28, 2024Diffusion-nested Auto-Regressive Synthesis of Heterogeneous Tabular DataApr 1, 2026When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web NavigationMay 6, 2026Towards Robust LLM Post-Training: Automatic Failure Management for Reinforcement Fine-TuningOct 4, 2024Can Watermarked LLMs be Identified by Users via Crafted Prompts?Feb 24, 2025Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent AdvancesMay 31, 2024DiffPuter: Empowering Diffusion Models for Missing Data ImputationDec 10, 2025d-TreeRPO: Towards More Reliable Policy Optimization for Diffusion Language Models