Showing 1–20 of 36 results
/ Date/ Name
Oct 10, 2024Gap-Dependent Bounds for Q-Learning using Reference-Advantage DecompositionOct 8, 2025Q-Learning with Fine-Grained Gap-Dependent RegretFeb 23, 2026Gap-Dependent Bounds for Nearly Minimax Optimal Reinforcement Learning with Linear Function ApproximationFeb 4, 2026Legendre Memory Unit with A Multi-Slice Compensation Model for Short-Term Wind Speed Forecasting Based on Wind Farm Cluster DataJun 5, 2025Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement LearningDec 4, 2023Jellyfish: A Large Language Model for Data PreprocessingAug 30, 2023Large Language Models as Data PreprocessorsSep 18, 2023Adaptive Liquidity Provision in Uniswap V3 with Deep Reinforcement LearningApr 25, 2025SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language ModelsFeb 17, 2026FAST-EQA: Efficient Embodied Question Answering with Global and Local Region RelevancyOct 2, 2025Support Basis: Fast Attention Beyond Bounded EntriesMar 20, 2025IRef-VLA: A Benchmark for Interactive Referential Grounding with Imperfect Language in 3D ScenesMay 29, 2024Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication CostJan 28, 2026Non-Markov Multi-Round Conversational Image Generation with History-Conditioned MLLMsFeb 5, 2025Gap-Dependent Bounds for Federated $Q$-learningNov 5, 2024VLA-3D: A Dataset for 3D Semantic Scene Understanding and NavigationFeb 9, 2025Breaking the Frozen Subspace: Importance Sampling for Low-Rank Optimization in LLM PretrainingApr 20, 2022Characterization of GaN-based HEMTs Down to 4.2 K for Cryogenic ApplicationsAug 6, 2023Spin Coherence and Spin Relaxation in Hybrid Organic-Inorganic Lead and Mixed Lead-Tin PerovskitesSep 29, 2025High-Precision Temperature Estimation Based on Magnetic Nanoparticles Dominated by Brownian Relaxation under Combined AC and DC Magnetic Fields