Showing 1–12 of 12 results
/ Date/ Name
Jul 2, 2018Semantic Segmentation with Scarce DataJul 19, 2020ASAP-NMS: Accelerating Non-Maximum Suppression Using Spatially Aware PriorsMar 17, 2026MolmoB0T: Large-Scale Simulation Enables Zero-Shot ManipulationMay 12, 2020RSO: A Gradient Free Sampling Based Approach For Training Deep Neural NetworksSep 25, 2024Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language ModelsSep 11, 2017Recurrent neural networks based Indic word-wise script identification using character-wise trainingNov 19, 2025HinTel-AlignBench: A Framework and Benchmark for Hindi-Telugu with English-Aligned SamplesDec 15, 2025SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement LearningMar 18, 2026Unified Spatio-Temporal Token Scoring for Efficient Video VLMsJan 15, 2026Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and GroundingMar 30, 2026MolmoPoint: Better Pointing for VLMs with Grounding TokensMar 21, 2023ModEFormer: Modality-Preserving Embedding for Audio-Video Synchronization using Transformers