Showing 1–20 of 26 results
/ Date/ Name
Oct 28, 2022MiCRO: Multi-interest Candidate Retrieval OnlineJan 27, 2022Learning Stance Embeddings from Signed Social GraphsNov 10, 2019CCAligned: A Massive Collection of Cross-Lingual Web-Document PairsOct 15, 2019Facebook AI's WAT19 Myanmar-English Translation Task SubmissionApr 17, 2021XLEnt: Mining a Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word AlignmentFeb 11, 2022TwHIN: Embedding the Twitter Heterogeneous Information Network for Personalized RecommendationFeb 3, 2025Competitive Programming with Large Reasoning ModelsOct 21, 2020Beyond English-Centric Multilingual Machine TranslationSep 15, 2022TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations at TwitterMay 12, 2022kNN-Embed: Locally Smoothed Embedding Mixtures For Multi-interest Candidate RetrievalAug 18, 2019Parsimonious Morpheme Segmentation with an Application to Enriching Word EmbeddingsNov 14, 2019Mining News Events from Comparable News Corpora: A Multi-Attribute Proximity Network Modeling ApproachJan 31, 2020Massively Multilingual Document Alignment with Cross-lingual Sentence-Mover's DistanceNov 3, 2019Leveraging Pretrained Image Classifiers for Language-Based SegmentationSep 17, 2021Classification-based Quality Estimation: Small and Efficient Models for Real-world ApplicationsFeb 8, 2021Quality Estimation without Human-labeled DataMay 31, 2021Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel DataJun 24, 2014Scalable Topical Phrase Mining from Text CorporaJul 18, 2021As Easy as 1, 2, 3: Behavioural Testing of NMT Systems for Numerical TranslationDec 21, 2024OpenAI o1 System Card