Open Source

We contribute to the broader machine learning community through open source projects.

models-eval-playbook

Status: active

A CLI-based playbook for evaluating LLMs with MLflow logging. It supports multiple providers, including Azure OpenAI, Ollama, OpenRouter, and Alibaba Cloud, and tests a model under evaluation against a configurable judge model.

Team: Applied AI Research
License:
Last Release: 2026-02
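
The judge-model setup described for models-eval-playbook can be pictured with a small Python sketch. This is not the project's code: `evaluate`, `parse_score`, `EvalResult`, the 0 to 10 scale, and the judge prompt wording are all hypothetical names and choices made for illustration. Both models are plain callables, so any provider client could be wrapped to fit, and `log_metric` is an injectable hook (MLflow's `mlflow.log_metric` takes a key and a value in the same order, which is assumed to be a suitable drop-in).

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

# Hypothetical type: any function that maps a prompt to a completion.
Model = Callable[[str], str]


@dataclass
class EvalResult:
    question: str
    answer: str
    score: float


def parse_score(verdict: str) -> float:
    """Read the judge's reply as a number, clamped to the range 0 to 10."""
    try:
        value = float(verdict.strip())
    except ValueError:
        return 0.0  # Assumption: unparseable verdicts count as the lowest score.
    return max(0.0, min(10.0, value))


def evaluate(
    candidate: Model,
    judge: Model,
    questions: Sequence[str],
    log_metric: Callable[[str, float], None] = lambda name, value: None,
) -> List[EvalResult]:
    """Ask the candidate each question, then have the judge score the answer."""
    results = []
    for question in questions:
        answer = candidate(question)
        verdict = judge(
            f"Question: {question}\nAnswer: {answer}\n"
            "Rate the answer from 0 to 10. Reply with the number only."
        )
        results.append(EvalResult(question, answer, parse_score(verdict)))

    # Log one aggregate metric; 0.0 is reported when there are no questions.
    mean = sum(r.score for r in results) / len(results) if results else 0.0
    log_metric("mean_judge_score", mean)
    return results
```

With stub models such as `lambda q: "4"` for the candidate and `lambda p: "8"` for the judge, `evaluate` returns one `EvalResult` per question and reports the mean score through the hook. Keeping the judge behind the same callable interface as the candidate is what makes it swappable.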

financialqa

Status: active

Question answering over multi-structured financial documents using retrieval-augmented generation (RAG). It extracts text, tables, and figures from financial report PDFs via Azure Document Intelligence, with LangChain-based chunking and Azure OpenAI embeddings. Associated with the ECIR 2026 paper "Understanding Multi-Structured Documents via LLMs."

Team: University of Waterloo Partnership
License:
Last Release: 2026-01
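
The chunking step mentioned for financialqa can be illustrated with a minimal Python sketch. It is a plain-Python stand-in for fixed-size chunking with overlap, not LangChain's splitter and not the project's implementation; the function name `chunk_text` and the default sizes are assumptions chosen for the example.

```python
from typing import List


def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> List[str]:
    """Split text into character windows that share `overlap` characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in the range [0, chunk_size)")

    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        # Stop once a window reaches the end, so no trailing chunk is
        # fully contained in the previous one.
        if start + chunk_size >= len(text):
            break
        start += step
    return chunks
```

The overlap keeps a sentence that straddles a boundary visible in both neighbouring chunks, at the cost of embedding some text twice. Each chunk would then be embedded and indexed for retrieval.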