Showing 1–20 of 24 results
/ Date/ Name
Nov 7, 2024KnowCoder-X: Boosting Multilingual Information Extraction via CodeNov 18, 2023DSCom: A Data-Driven Self-Adaptive Community-Based Framework for Influence Maximization in Social NetworksMay 28, 2025The Entropy Mechanism of Reinforcement Learning for Reasoning Language ModelsSep 4, 2025Towards a Unified View of Large Language Model Post-TrainingApr 9, 2026Towards Knowledgeable Deep Research: Framework and BenchmarkMar 12, 2024KnowCoder: Coding Structured Knowledge into LLMs for Universal Information ExtractionNov 6, 2024Automating Exploratory Proteomics Research via Language ModelsJun 9, 2025Automating Exploratory Multiomics Research via Language ModelsMar 4, 2025Towards Event Extraction with Massive Types: LLM-based Collaborative Annotation and Partitioning ExtractionSep 10, 2025A Survey of Reinforcement Learning for Large Reasoning ModelsSep 18, 2025FlowRL: Matching Reward Distributions for LLM ReasoningMar 9, 2026How Far Can Unsupervised RLVR Scale LLM Training?Oct 26, 2023Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering PairsFeb 17, 2025Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation CapabilitiesFeb 10, 2026P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics OlympiadsNov 17, 2025P1: Mastering Physics Olympiads with Reinforcement LearningApr 22, 2025TTRL: Test-Time Reinforcement LearningJan 30, 2025MedXpertQA: Benchmarking Expert-Level Medical Reasoning and UnderstandingSep 11, 2025SimpleVLA-RL: Scaling VLA Training via Reinforcement LearningDec 18, 2025JustRL: Scaling a 1.5B LLM with a Simple RL Recipe