Showing 1–17 of 17 results
/ Date/ Name
May 28, 2024XL3M: A Training-free Framework for LLM Length Extension Based on Segment-wise InferenceNov 27, 2020Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training SpeedupNov 27, 2020CoRe: An Efficient Coarse-refined Training Framework for BERTFeb 25, 2015Prediction of new thermodynamically stable aluminum oxidesDec 22, 2023Digital twin-assisted three-dimensional electrical capacitance tomography for multiphase flow imagingDec 8, 2021Digital Twin of Electrical Tomography for Quantitative Multiphase Flow ImagingJun 3, 2025HATA: Trainable and Hardware-Efficient Hash-Aware Top-k Attention for Scalable Large Model InferenceSep 10, 2025Triaxial rotor modes in finite-N boson systemsApr 20, 2026AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video GenerationFeb 24, 2025BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and InferenceDec 23, 2015Superconductivity of novel tin hydrides (Sn$_n$H$_m$) under pressureOct 16, 2024SIFM: A Foundation Model for Multi-granularity Arctic Sea Ice ForecastingJan 31, 2026IceBench-S2S: A Benchmark of Deep Learning for Challenging Subseasonal-to-Seasonal Daily Arctic Sea Ice Forecasting in Deep Latent SpaceNov 18, 2025LiteCache: A Query Similarity-Driven, GPU-Centric KVCache Subsystem for Efficient LLM InferenceNov 21, 2019High-Temperature Superconductivity in the Ti--H System at High PressuresJun 13, 2025Efficient Long-Context LLM Inference via KV Cache ClusteringAug 30, 2017Complete graphs: the space of simplicial cones, and their path tree representation