"au:"Huaimin Wang"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Huaimin Wang"" — arXiv2 Search

Showing 1–20 of 45 results

/ Date/ Name

Aug 12, 2018Sample Mixed-Based Data Augmentation for Domestic Audio Tagging Sep 14, 2024Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models Jan 11, 2026Data-driven active learning approaches for accelerating materials discovery Aug 24, 2022Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration Jan 11, 2024Optimistic Model Rollouts for Pessimistic Offline Policy Optimization Dec 30, 2023Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles May 21, 2022Nuclear Norm Maximization Based Curiosity-Driven Learning Jul 16, 2020Audio Tagging by Cross Filtering Noisy Labels Jan 22, 2019Unsupervised Learning-based Depth Estimation aided Visual SLAM Approach Feb 22, 2020Multi-Representation Knowledge Distillation For Audio Classification May 28, 2025Joint$λ$: Orchestrating Serverless Workflows on Jointcloud FaaS Systems Mar 11, 2020Online Meta-Critic Learning for Off-Policy Actor-Critic Methods Nov 26, 2021Who, What, Why and How? Towards the Monetary Incentive in Crowd Collaboration: A Case Study of Github's Sponsor Mechanism May 25, 2021KnowSR: Knowledge Sharing among Homogeneous Agents in Multi-agent Reinforcement Learning Jan 27, 2021FedH2L: Federated Learning with Model and Statistical Heterogeneity Jan 13, 2022Multi-task Pre-training Language Model for Semantic Network Completion Feb 17, 2022The Development and Prospect of Code Clone Oct 16, 2018Collaborative Deep Learning Across Multiple Data Centers Oct 5, 2019Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems Jul 12, 2022Trusted Multi-Scale Classification Framework for Whole Slide Image