Open Source

We contribute to the broader machine learning community through open source projects.

financialqa

active

Multi-structured financial document question answering using RAG. Extracts text, tables, and figures from financial report PDFs via Azure Document Intelligence, with LangChain-based chunking and Azure OpenAI embeddings. Associated with the ECIR 2026 paper "Understanding Multi-Structured Documents via LLMs."

Team: University of Waterloo Partnership
License:
Last Release: 2026-01