"au:"Jin Peng Zhou"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Jin Peng Zhou"" — arXiv2 Search

Showing 1–20 of 27 results

/ Date/ Name

Feb 26, 2024REFACTOR: Learning to Extract Theorems from Proofs Feb 27, 2025$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training Feb 20, 2023Unsupervised Out-of-Distribution Detection with Diffusion Inpainting Aug 3, 2020Noise Contrastive Estimation for Autoencoding-based One-Class Collaborative Filtering Jul 4, 2024Orchestrating LLMs with Different Personalizations Mar 26, 2024Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization Jul 8, 2024On Speeding Up Language Model Evaluation Feb 25, 2022Does Label Differential Privacy Prevent Label Inference Attacks?Oct 24, 2023Correction with Backtracking Reduces Hallucination in Summarization Dec 21, 2024Towards More Robust Retrieval-Augmented Generation: Evaluating RAG Under Adversarial Poisoning Attacks Aug 18, 2025Cognitive Structure Generation: From Educational Priors to Policy Optimization Aug 20, 2020On Attribution of Deepfakes Mar 8, 2023Magnushammer: A Transformer-Based Approach to Premise Selection Feb 16, 2025Graders should cheat: privileged information enables expert-level automated evaluations Feb 26, 2025Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond Mar 17, 2025INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations Apr 23, 2025Learning to decode logical circuits May 19, 2024Attention to Quantum Complexity Jul 31, 2024Gemma 2: Improving Open Language Models at a Practical Size May 21, 2025Pre-training Limited Memory Language Models with Internal and External Knowledge