au:"Mehrdad Farajtabar" (arXiv2 Search)
Showing 1–2 of 2 results
Jul 17, 2025
Apple Intelligence Foundation Language Models: Tech Report 2025
Dec 14, 2023
Weight subcloning: direct initialization of transformers using larger pretrained ones