Showing 1–18 of 18 results
/ Date/ Name
Jun 13, 2018OpenEDGAR: Open Source Software for SEC EDGAR AnalysisFeb 19, 2021An Empirical Analysis of the R Package EcosystemNov 9, 2009Properties of the United States Code Citation NetworkMar 21, 2025KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing ApplicationsDec 29, 2016Measuring the temperature and diversity of the U.S. regulatory ecosystemJul 23, 2014Predicting the Behavior of the Supreme Court of the United States: A General ApproachAug 24, 2015Law on the Market? Abnormal Stock Returns and Supreme Court Decision-MakingDec 11, 2016A General Approach for Predicting the Behavior of the Supreme Court of the United StatesMar 22, 2010A Mathematical Approach to the Study of the United States CodeJun 10, 2018LexNLP: Natural language processing and information extraction for legal and regulatory textsApr 10, 2025The KL3M Data Project: Copyright-Clean Training Resources for Large Language ModelsNov 14, 2025Binary BPE: A Family of Cross-Platform Tokenizers for Binary AnalysisFeb 23, 2023Natural Language Processing in the Legal DomainSep 23, 2019Sensitivity of collective outcomes identifies pivotal componentsNov 27, 2025Binary-30K: A Heterogeneous Dataset for Deep Learning in Binary Analysis and Malware DetectionSep 9, 2009Distance Measures for Dynamic Citation NetworksApr 5, 2025Precise Legal Sentence Boundary Detection for Retrieval at Scale: NUPunkt and CharBoundaryNov 23, 2025OpenGloss: A Synthetic Encyclopedic Dictionary and Semantic Knowledge Graph