Showing 161–180 of 577 results
/ Date/ Name
Oct 17, 2024Malleus: Straggler-Resilient Hybrid Parallel Training of Large-scale Models via Malleable Data and Model ParallelizationOct 11, 2024Observation of $D^+\toη^\primeμ^+ν_μ$ and First Study of $D^+\to η^\prime \ell^+ν_\ell$ Decay DynamicsOct 4, 2024PRF: Parallel Resonate and Fire Neuron for Long Sequence Learning in Spiking Neural NetworksOct 3, 2024Search for lepton number violating decays of $D_s^+\to h^-h^0e^+e^+$Oct 2, 2024Limits on the Low-Energy Electron Antineutrino Flux from the Brightest GRB of All TimeSep 29, 2024Magnetic field of the roAp star KIC~10685175: observations versus theorySep 26, 2024Optimal Quantum Purity AmplificationSep 19, 2024RAD-Bench: Evaluating Large Language Models Capabilities in Retrieval Augmented DialoguesSep 9, 2024Retrofitting Temporal Graph Neural Networks with TransformerSep 6, 2024Study of the decay $D^0\rightarrow ρ(770)^-e^+ν_e$Sep 5, 2024Spindle: Efficient Distributed Training of Multi-Task Large Models via Wavefront SchedulingSep 3, 2024Blocks as Probes: Dissecting Categorization Ability of Large Multimodal ModelsAug 16, 2024A Survey on Benchmarks of Multimodal Large Language ModelsAug 2, 2024Five New Heartbeat Star Systems with Tidally Excited Oscillations Discovered Based on TESS DataJul 31, 2024The Llama 3 Herd of ModelsJul 29, 2024Apple Intelligence Foundation Language ModelsJul 26, 2024Twenty-three New Heartbeat Star Systems Discovered Based on TESS DataJul 17, 2024Observation of $Λ_c^+ \to Λa_0(980)^+$ and Evidence for $Σ(1380)^+$ in $Λ_c^+ \to Λπ^+ η$Jul 16, 2024MEMO: Fine-grained Tensor Management For Ultra-long Context LLM TrainingJul 3, 2024Measurement of the branching fraction of the decay $J/ψ\to p \bar{p} η$