"au:"Kaj Bostrom"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Kaj Bostrom"" — arXiv2 Search

Showing 1–11 of 11 results

/ Date/ Name

Apr 18, 2021Flexible Generation of Natural Language Deductions Jan 16, 2022Natural Language Deduction through Search over Statement Compositions Apr 7, 2020Byte Pair Encoding is Suboptimal for Language Model Pretraining Jul 5, 2023Deductive Additivity for Planning of Natural Language Proofs Feb 6, 2026Generating Data-Driven Reasoning Rubrics for Domain-Adaptive Reward Modeling Jan 9, 2026MaxCode: A Max-Reward Reinforcement Learning Framework for Automated Code Optimization Oct 24, 2023MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning Nov 1, 2022Natural Language Deduction with Incomplete Information Nov 6, 2025VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks Feb 23, 2026ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models Oct 26, 2023Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways