Date / Name
Nov 25, 2024 / Self-Generated Critiques Boost Reward Modeling for Language Models
Oct 21, 2024 / Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following
Sep 30, 2024 / Law of the Weakest Link: Cross Capabilities of Large Language Models
May 23, 2023 / i-Code Studio: A Configurable and Composable Framework for Integrative AI
May 21, 2023 / i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
Aug 21, 2022 / Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
May 22, 2022 / Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
May 3, 2022 / i-Code: An Integrative and Composable Multimodal Learning Framework
Mar 16, 2022 / Training Data is More Valuable than You Think: A Simple and Effective Method by Retrieving from Training Data
Dec 6, 2021 / Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Oct 16, 2021 / Leveraging Knowledge in Multilingual Commonsense Reasoning
Aug 30, 2021 / Want To Reduce Labeling Cost? GPT-3 Can Help
Oct 18, 2020 / Mixed-Lingual Pre-training for Cross-lingual Summarization
Sep 10, 2020 / Accelerating Real-Time Question Answering via Question Generation
Jun 27, 2020 / Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization
Apr 4, 2020 / A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining
Jan 3, 2020 / TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising
Dec 25, 2019 / Leveraging Lead Bias for Zero-shot Abstractive News Summarization
Sep 26, 2019 / SIM: A Slot-Independent Neural Model for Dialogue State Tracking
Dec 10, 2018 / SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering