Showing 1–20 of 28 results
/ Date/ Name
Jul 25, 2025Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective DecodingNov 21, 2022VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation LearningMay 17, 2023Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor WatermarkSep 29, 2025Random Policy Valuation is Enough for LLM Reasoning with Verifiable RewardsAug 26, 2025GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository LeveragingApr 24, 2025Step1X-Edit: A Practical Framework for General Image EditingAug 14, 2025NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at ScaleAug 4, 2025SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based AgentsFeb 17, 2025Step-Audio: Unified Understanding and Generation in Intelligent Speech InteractionDec 23, 2025Step-DeepResearch Technical ReportNov 21, 2025Learning to Compress: Unlocking the Potential of Large Language Models for Text RepresentationDec 7, 2022Text Embeddings by Weakly-Supervised Contrastive Pre-trainingAug 29, 2022LED: Lexicon-Enlightened Dense Retriever for Large-Scale RetrievalJun 16, 2022Towards Robust Ranker for Text RetrievalJul 22, 2025Step-Audio 2 Technical ReportJun 10, 2025Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language ModelDec 17, 2025Step-GUI Technical ReportApr 10, 2023Inference with Reference: Lossless Acceleration of Large Language ModelsJun 1, 2022Task-Specific Expert Pruning for Sparse Mixture-of-ExpertsAug 20, 2021Smart Bird: Learnable Sparse Attention for Efficient and Effective Transformer