Date / Name
Nov 2, 2023 / Self-Influence Guided Data Reweighting for Language Model Pre-training
Jul 7, 2024 / WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Nov 11, 2024 / Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMs
Jun 7, 2024 / A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques
Dec 6, 2024 / The BrowserGym Ecosystem for Web Agent Research
Jul 4, 2024 / ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Mar 12, 2022 / Chart-to-Text: A Large-Scale Benchmark for Chart Summarization
Nov 2, 2025 / ColMate: Contrastive Late Interaction and Masked Text for Multimodal Document Retrieval
Aug 13, 2025 / BigCharts-R1: Enhanced Chart Reasoning with Visual Reinforcement Finetuning
Nov 16, 2022 / Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations
Oct 3, 2025 / FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents
Apr 7, 2025 / ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering
Mar 12, 2024 / WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Dec 13, 2024 / Too Big to Fool: Resisting Deception in Language Models
May 11, 2023 / Randomized Smoothing with Masked Inference for Adversarially Robust Text Classifications
Jun 30, 2025 / LineRetriever: Planning-Aware Observation Reduction for Web Agents
Jul 5, 2025 / How to Train Your LLM Web Agent: A Statistical Diagnosis