Fenrir

Applied AI Engineer · Experimental

Updated: 2026-04-18

Local LLM behavior evaluator with pressure conditions, canonical readouts, and explicit uncertainty guardrails.

Impact: Makes pressure-sensitive behavior shifts inspectable without pretending a heuristic readout is a universal safety score.

What I built

  • Provides a setup-first local UI for endpoint configuration, connection tests, evaluation launch, and canonical readout export.
  • Defines hybrid MVP batteries across authority override, reputation shielding, and urgency tradeoff conditions.
  • Keeps claims bounded with explicit uncertainty, non-diagnostic language, and heuristic readout contracts.
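As a rough illustration of what a bounded, heuristic readout could look like, here is a minimal sketch. It is not Fenrir's actual code: the `Readout` class, the `build_readout` function, the score lists, and the snake_case condition names are all hypothetical, chosen only to mirror the three pressure conditions listed above. The idea is that each readout reports a shift together with its spread and sample size, and carries a non-diagnostic label by default.

```python
from dataclasses import dataclass
from statistics import mean, stdev

# Hypothetical identifiers mirroring the three pressure conditions named above.
CONDITIONS = ("authority_override", "reputation_shielding", "urgency_tradeoff")


@dataclass(frozen=True)
class Readout:
    """Hypothetical readout record: a shift plus the uncertainty around it."""
    condition: str
    shift: float   # mean per-item difference (pressured minus baseline)
    spread: float  # sample standard deviation of the per-item differences
    n: int         # number of paired items behind the estimate
    note: str = "heuristic, non-diagnostic"


def build_readout(condition, baseline, pressured):
    """Compare paired scores with and without pressure for one condition."""
    if condition not in CONDITIONS:
        raise ValueError(f"unknown condition: {condition}")
    if len(baseline) != len(pressured) or len(baseline) < 2:
        raise ValueError("need at least two paired scores")
    diffs = [p - b for b, p in zip(baseline, pressured)]
    return Readout(condition, mean(diffs), stdev(diffs), len(diffs))


if __name__ == "__main__":
    # Made-up scores, for illustration only.
    print(build_readout("authority_override", [1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))
```

Reporting the spread and sample size next to the shift keeps a small or noisy sample from being read as a firm result.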

Proof: Run `python3 scripts/start_fenrir.py`, configure an endpoint, and execute the hybrid MVP evaluation.

Python · FastAPI · Pytest · YAML · Local UI