Showing 1–20 of 25 results
/ Date/ Name
Dec 11, 2025VL-JEPA: Joint Embedding Predictive Architecture for Vision-languageSep 2, 2025Planning with Reasoning using Vision Language World ModelJun 4, 2025WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural PlanningApr 24, 2025HalluLens: LLM Hallucination BenchmarkMar 18, 2025Calibrating Verbal Uncertainty as a Linear Feature to Reduce HallucinationsApr 11, 2024High-Dimension Human Value Representation in Large Language ModelsMar 27, 2024Measuring Political Bias in Large Language Models: What Is Said and How It Is SaidNov 3, 2023Mitigating Framing Bias with Polarity Minimization LossApr 21, 2023Learn What NOT to Learn: Towards Generative Safety in ChatbotsFeb 8, 2023A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and InteractivityNov 10, 2022Casual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustnessNov 9, 2022BLOOM: A 176B-Parameter Open-Access Multilingual Language ModelOct 14, 2022Enabling Classifiers to Make Judgements Explicitly Aligned with Human ValuesMay 12, 2022Towards Answering Open-ended Ethical Quandary QuestionsApr 11, 2022NeuS: Neutral Multi-News Summarization for Mitigating Framing BiasFeb 8, 2022Survey of Hallucination in Natural Language GenerationJun 11, 2021Assessing Political Prudence of Open-domain ChatbotsMay 31, 2021Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel DataApr 18, 2021Dynamically Addressing Unseen Rumor via Continual LearningMar 17, 2021Towards Few-Shot Fact-Checking via Perplexity