Date / Name
Oct 9, 2020 / Recursive Top-Down Production for Sentence Generation with Latent Trees
Mar 13, 2024 / Scattered Mixture-of-Experts Implementation
Oct 11, 2023 / Sparse Universal Transformer
Jun 23, 2019 / Investigating Biases in Textual Entailment Datasets
Oct 23, 2024 / Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study
Jun 10, 2025 / The Cell Ontology in the age of single-cell omics
Jul 5, 2022 / Ontology Development Kit: a toolkit for building, maintaining, and standardising biomedical ontologies
May 8, 2024 / Digital Evolution: Novo Nordisk's Shift to Ontology-Based Data Management
Jun 18, 2024 / A framework for developing a knowledge management platform
Feb 18, 2026 / PREFER: An Ontology for the PREcision FERmentation Community
Oct 21, 2020 / Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle
Jun 7, 2023 / ModuleFormer: Modularity Emerges from Mixture-of-Experts
Dec 23, 2025 / Distilling to Hybrid Attention Models via KL-Guided Layer Selection
Jan 8, 2025 / A Partition Cover Approach to Tokenization
Mar 7, 2018 / Generating Contradictory, Neutral, and Entailing Sentences
Oct 22, 2018 / Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Oct 29, 2019 / Ordered Memory
Aug 23, 2024 / Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler
Apr 4, 2025 / Do Larger Language Models Generalize Better? A Scaling Law for Implicit Reasoning at Pretraining Time
May 22, 2025 / PaTH Attention: Position Encoding via Accumulating Householder Transformations