Showing 1–20 of 54 results
Date          Name
Feb 16, 2022  EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation
Jul 13, 2023  In-context Autoencoder for Context Compression in a Large Language Model
Oct 21, 2022  DL-Corrector-Remapper: A grid-free bias-correction deep learning methodology for data-driven high-resolution global weather forecasting
May 10, 2025  Climate in a Bottle: Towards a Generative Foundation Model for the Kilometer-Scale Global Atmosphere
Feb 1, 2023   MB-DECTNet: A Model-Based Unrolled Network for Accurate 3D DECT Reconstruction
Jun 28, 2024  Scaling Synthetic Data Creation with 1,000,000,000 Personas
Oct 7, 2020   Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction
Dec 3, 2019   Proximal Newton Methods for X-Ray Imaging with Non-Smooth Regularization
Mar 2, 2023   Semiparametric Language Models Are Scalable Continual Learners
Jan 31, 2022  A Metal Artifact Reduction Scheme For Accurate Iterative Dual-Energy CT Algorithms
May 20, 2022  Lossless Acceleration for Seq2seq Generation with Aggressive Decoding
Nov 26, 2024  Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Jul 3, 2018   Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study
Mar 30, 2022  Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation
Dec 1, 2022   Extensible Prompts for Language Models on Zero-shot Language Style Customization
Jun 9, 2021   Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding
Jul 30, 2021  A Machine-learning Based Initialization for Joint Statistical Iterative Dual-energy CT with Application to Proton Therapy
Sep 27, 2016  Aligning Coordinated Text Streams through Burst Information Network Construction and Decipherment
Jan 31, 2020  Pseudo-Bidirectional Decoding for Local Sequence Transduction
Jun 7, 2020   BERT Loses Patience: Fast and Robust Inference with Early Exit