Showing 1–20 of 59 results
/ Date/ Name
Sep 13, 2018On the Strength of Character Language Models for Multilingual Named Entity RecognitionJan 31, 2022SZx: an Ultra-fast Error-bounded Lossy Compressor for Scientific DatasetsOct 24, 2020Pairwise Representation Learning for Event CoreferenceAug 9, 2023Building Interpretable and Reliable Open Information Retriever for New Domains OvernightDec 15, 2021Event Linking: Grounding Event Mentions to WikipediaJun 14, 2021Scalable and accurate multi-GPU based image reconstruction of large-scale ptychography dataOct 19, 2023ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial AttacksOct 24, 2024ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical ReasoningFeb 21, 2025Self-Taught Agentic Long Context UnderstandingJan 22, 2022Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUsNov 1, 2022SOLAR: A Highly Optimized Data Loading Framework for Distributed Training of CNN-based Scientific SurrogatesApr 14, 2023HEAT: A Highly Efficient and Affordable Training System for Collaborative Filtering Based Recommendation on CPUsSep 29, 2023Benchmarking and In-depth Performance Study of Large Language Models on Habana Gaudi ProcessorsJun 5, 2024Zeroth-Order Fine-Tuning of LLMs with Extreme SparsitySep 16, 2024Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention SteeringDec 19, 2024GFormer: Accelerating Large Language Models with Optimized Transformers on Gaudi ProcessorsJun 10, 2025OAT-Rephrase: Optimization-Aware Training Data Rephrasing for Zeroth-Order LLM Fine-TuningJan 5, 2026CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language ModelsOct 16, 2025Directional Reasoning Injection for Fine-Tuning MLLMsNov 13, 2025Instella: Fully Open Language Models with Stellar Performance