Open-source tools and packages built at the intersection of economics, machine learning, and software engineering.
StataAgent
A specialized agent model that integrates with Stata for natural-language data analysis. Ask questions like "What is the average income of individuals in 2022?" and the agent queries Stata and returns a structured answer.
Explain — LLM Package for Stata
A Stata package that integrates directly with large language models to assist with code debugging, explanation, and improvements. Supports both hosted APIs and locally-run models, letting researchers get inline AI assistance without leaving their Stata workflow.
Large-scale Text Classification with ANNOY + SLM
Demonstrates how to combine lightweight SLM embeddings with the ANNOY (Approximate Nearest Neighbors) library for efficient multi-class text classification at scale — a practical alternative to fine-tuning large models for classification tasks.