Regulation, Evaluation, and Governance Lab
Methods
Methods, Adjudication
Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models
arXiv
August 2024
Methods
Statistical Uncertainty in Word Embeddings: GloVe-V
Empirical Methods in Natural Language Processing
June 2024
Methods, Adjudication
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools
arXiv
May 2024
Methods, Adjudication
Corpus Enigmas and Contradictory Linguistics: Tensions Between Empirical Semantic Meaning and Judicial Interpretation
Minnesota Journal of Law, Science & Technology
May 2024
Methods, Adjudication
MultiLegalPile: A 689GB Multilingual Legal Corpus
ACL 2024
May 2024
Methods, Policy
Not (Officially) in My Backyard: Characterizing Informal Accessory Dwelling Units and Informing Housing Policy With Remote Sensing
Journal of the American Planning Association
June 2024
Methods
Quantifying the Uncertainty of Imputed Demographic Disparity Estimates: The Dual-Bootstrap
March 2024
Methods, Policy
On the Societal Impact of Open Foundation Models
February 2024
Methods
Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models
Journal of Legal Analysis
January 2024
Methods
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
NeurIPS D&B
August 2023