Showing 1–20 of 21 results
/ Date/ Name
Feb 10, 2026Self-Evolving Recommendation System: End-To-End Autonomous Model Optimization With LLM AgentsOct 21, 2025Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMsJul 25, 2025Object-centric Video Question Answering with Visual Grounding and ReferringJul 10, 2025Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and MethodologyJun 10, 2025Mitigating antenna gain errors with HyFoReS in CHIME simulationsJun 10, 2025Measurement of the Dispersion$\unicode{x2013}$Galaxy Cross-Power Spectrum with the Second CHIME/FRB CatalogApr 23, 2025Emergent Kagome lattice and non-Abelian lattice gauge field of biexcitons in t-MoTe$_2$Apr 1, 2025Transfer learning empowers material Z classification with muon tomographyMar 20, 2025DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question AnsweringOct 9, 2024Towards higher electro-optic response in AlScNAug 21, 2024DocTabQA: Answering Questions from Long Documents Using TablesAug 16, 2024Demonstration of hybrid foreground removal on CHIME dataApr 2, 2024Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance FieldsDec 21, 2023Bootstrap Masked Visual Modeling via Hard Patches MiningSep 7, 2023DropPos: Pre-Training Vision Transformers by Reconstructing Dropped PositionsMay 23, 2023Pulling Target to Source: A New Perspective on Domain Adaptive Semantic SegmentationApr 4, 2023Towards Open-Vocabulary Video Instance SegmentationApr 20, 2022NFormer: Robust Person Re-identification with Neighbor TransformerMar 14, 2022Removing systematics-induced 21-cm foreground residuals by cross-correlating filtered dataFeb 9, 2021SwiftNet: Real-time Video Object Segmentation