Showing 1–20 of 45 results
Aug 12, 2018  Sample Mixed-Based Data Augmentation for Domestic Audio Tagging
Sep 14, 2024  Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models
Jan 11, 2026  Data-driven active learning approaches for accelerating materials discovery
Aug 24, 2022  Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Jan 11, 2024  Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Dec 30, 2023  Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles
May 21, 2022  Nuclear Norm Maximization Based Curiosity-Driven Learning
Jul 16, 2020  Audio Tagging by Cross Filtering Noisy Labels
Jan 22, 2019  Unsupervised Learning-based Depth Estimation aided Visual SLAM Approach
Feb 22, 2020  Multi-Representation Knowledge Distillation For Audio Classification
May 28, 2025  Joint$λ$: Orchestrating Serverless Workflows on Jointcloud FaaS Systems
Mar 11, 2020  Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Nov 26, 2021  Who, What, Why and How? Towards the Monetary Incentive in Crowd Collaboration: A Case Study of Github's Sponsor Mechanism
May 25, 2021  KnowSR: Knowledge Sharing among Homogeneous Agents in Multi-agent Reinforcement Learning
Jan 27, 2021  FedH2L: Federated Learning with Model and Statistical Heterogeneity
Jan 13, 2022  Multi-task Pre-training Language Model for Semantic Network Completion
Feb 17, 2022  The Development and Prospect of Code Clone
Oct 16, 2018  Collaborative Deep Learning Across Multiple Data Centers
Oct 5, 2019   Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems
Jul 12, 2022  Trusted Multi-Scale Classification Framework for Whole Slide Image