Evaluation Plan
Research Questions
- RQ1 - Spatial isolation: Does Haven prevent unauthorized memory and device access across partitions under deliberate violation attempts?
- RQ2 - Temporal isolation: Does Haven bound RTOS task latency under maximum Linux CPU load to a predictable worst case?
- RQ3 - TCB size: Is the Haven trusted computing base small enough to be manually auditable (< 10 KLOC)?
- RQ4 - Overhead: What EL2 exit latency overhead does Haven introduce relative to a native (no-hypervisor) baseline?
Measurement Strategy
RQ1 - Isolation Tests
| Test | Method | Pass Condition |
|---|---|---|
| Cross-partition memory read | Linux partition attempts to read RTOS IPA | Stage-2 fault, Linux does not receive data |
| Cross-partition DMA write | Ethernet (Linux) DMA targets RTOS PA | SMMU fault, RTOS memory unchanged |
| IRQ injection | Linux issues SGI to RTOS core | EL2 drops, RTOS does not execute handler |
| Peripheral access | Linux accesses RTOS-owned UART MMIO | Stage-2 fault |
All tests are in tests/isolation/ (planned) and tests/integration/test_isolation_negative.c.
RQ2 - Latency Measurements
Setup:
- RTOS runs a periodic task at 1 kHz on the M7 core (or A55 with budget).
- Linux runs `stress-ng --cpu 4 --io 4 --vm 2` to create maximum load.
- RTOS task measures its actual period using a hardware timer (GPT or LPIT).
Metrics collected:
- Mean task period deviation (µs).
- Maximum task period deviation (µs) - worst-case response time.
- Deadline miss count (task period > 1 ms + ε).
- Budget overrun events per epoch.
Target bounds (to be validated):
- Mean deviation: < 10 µs.
- Maximum deviation: < 50 µs (with 10 ms budget epoch).
- Deadline miss rate: 0 under sustained Linux load.
RQ3 - TCB Size
Count with cloc:

`cloc src/core/ --include-lang=C,C/C++\ Header`

Target: < 5000 SLOC (excluding comments and blank lines).
RQ4 - EL2 Exit Overhead
Measure using ARM PMU cycle counter:
- Baseline: native Linux on A55 (no EL2).
- Haven: Linux under Haven EL2 with RTOS partition active.
- Metric: wall-clock time for 1M cache-miss memory accesses.
Expected overhead: < 5% for workloads not involving frequent EL2 exits.
Bench Setup
| Component | Specification |
|---|---|
| Board | NXP i.MX95 Dev Kit |
| Linux image | Yocto Kirkstone, kernel 6.6 LTS |
| RTOS image | FreeRTOS 10.6.2 |
| Haven build | CC=aarch64-linux-gnu-gcc make build |
| Load generator | stress-ng 0.17.x |
| Timer measurement | ARM Generic Timer (CNTPCT_EL0), 1 ns resolution |
| UART capture | minicom -D /dev/ttyUSB0 -b 115200 -C capture.log |
Evidence Package
After each campaign run:
`make evidence`

This produces build/evidence/imx95/ with:

- `logs/` - UART captures.
- `metrics/` - raw CSV latency data.
- `captures/` - photos/screenshots.
- `metadata.txt` - commit hash, toolchain, board revision.