Building trustworthy, resource-efficient AI for the next generation of software.

The AURA Research LabAI for Understanding, Reliable Automation — investigates how foundation models can be made measurably more efficient, interpretable, and trustworthy for software engineering. We work across model-centric and output-centric perspectives, from quantization and parameter-efficient fine-tuning to neurosymbolic reasoning and rigorous reliability evaluation.

Our mission.

// ABSTRACT

AI systems that write our software should be cheap enough to deploy, honest about what they do not know, and accountable when they fail.

Foundation models are reshaping software engineering at every layer of the stack — from autocomplete to autonomous code review. But the same models are also expensive, opaque, and frequently wrong in ways that matter. Our lab is built to address that gap.

The work spans top-tier venues — ICSE, FSE, ASE, TSE, TOSEM, EMSE — and is anchored by an NSF CRII award and ongoing collaborations across William & Mary, USI Lugano, William & Mary's Charles Center program, and partner institutions.

// OBJECTIVES · 03
  • O.01
    Resource-efficient AI for code
    Shrink foundation models through quantization, distillation, and parameter-efficient fine-tuning so they run inside developer tools — without a data-center bill.
  • O.02
    Trustworthy by construction
    Rigorous reliability evaluation, neurosymbolic reasoning, and causal analysis to tell when a model is actually right versus merely fluent.
  • O.03
    Open by default
    Every paper, dataset, and replication package shipped publicly. Reproducibility is a baseline, not a feature.

Research themes.

AI for code · Understanding what models do · Reliable evaluation of their behavior · Automation that practitioners can trust. The threads cross-pollinate — efficiency feeds reliability, neurosymbolic reasoning feeds interpretability, and so on.

THREAD 01/04 · A — AI
A

Resource-Efficient Foundation Models for Code

Quantization, parameter-efficient fine-tuning, and distillation that make code-intelligence models cheap enough to deploy at developer scale — without sacrificing functional or non-functional code quality.

QuantizationPEFTLoRA / QLoRADistillationPruning
THREAD 02/04 · U — UNDERSTANDING
U

Neurosymbolic Program Reasoning & Interpretability

Combining neural language models with grammars, type systems, and program analysis — plus feature-level explanations that move beyond opaque token-by-token predictions toward something developers can actually read.

Program analysisASTConstrained decodingFeature attributionType systems
THREAD 03/04 · R — RELIABLE
R

Reliability, Causality & Honest Evaluation

Causal reasoning and counterfactual analysis to understand cause-and-effect in software systems — from flaky test debugging to LLM-as-a-Judge evaluation methodologies that measure what matters, not what's easy.

Causal inferenceCounterfactualsLLM-as-JudgeRobustnessReproducibility
THREAD 04/04 · A — AUTOMATION
A

AI Agents for the Software Lifecycle

Designing autonomous agents that plan, reason, and execute multi-step software workflows — from issue triage to code review and test generation — with measurable end-to-end behavior on benchmarks like SWE-bench.

ReActTool useMulti-agentSWE-benchCode review automation

Active projects & funding.

Funded research lines and ongoing student-led projects.

AMOUNT / PROGRAM
TITLE & DESCRIPTION
METADATA
$175,000
CISE Research Initiation Initiative
● ACTIVE · Sole PI

CRII: SHF: Trustworthy & Resource-Efficient Foundation Models for Software Engineering

A two-year initiation award supporting research on making foundation models for code measurably more trustworthy and substantially less resource-intensive — through quantization, parameter-efficient fine-tuning, and rigorous reliability evaluation tailored to software engineering tasks.

AWARD #
2451058
AGENCY
NSF / CISE
PERIOD
06/2025 — 05/2027
ROLE
Sole PI
$8,000
Charles Center · 2 undergraduate awards
● AWARDED · Charles Center Summer Research Grants

Undergraduate research on AI optimization & resource-efficient code intelligence

Two summers of W&M Charles Center funding supporting undergraduate research on neural compression, LoRA-style fine-tuning, and code-quality evaluation under model-topology changes. Leon Huang's 2026 work also received a Monroe Scholars Conference Travel Grant.

RECIPIENTS
B. Tremblay · L. Huang
YEARS
2025 · 2026
EACH AWARD
$4,000
TOTAL
$8,000

Lab members.

A growing group of PhD students and undergraduates working at the intersection of AI and software engineering.

Antonio Mastropaolo
ID / DIR-00
// PERSONNEL RECORD · LAB DIRECTOR

Antonio Mastropaolo

Lab Director · Assistant Professor of Computer Science, W&M

PhD from USI Lugano (2024). Research at the intersection of AI, NLP, and Software Engineering. Editorial board member of IEEE Computer, NSF CRII PI, and serves on 14+ program committees including ICSE, FSE, and ASE. Two ACM SIGSOFT Distinguished Reviewer awards.

05.A

PhD students

N = Six
05.B

Undergraduate researchers

N = Two
ID / UG-01
Benjamin Tremblay
Benjamin (Ben) Tremblay
Neural Compression · LoRA · Code Quality
// Charles Center Grant ($4K) · 2025
ID / UG-02
Leon Huang
Leon Huang
Resource-Efficient AI for Code
// Charles Center Grant ($4K) · 2026
// Monroe Scholars Travel Grant · 2026