Evaluation Infrastructure

Built for the field.
Runs on your machine.

Twelve years of field evaluation experience, from pastoralist communities in South Sudan to health facilities in rural Burundi, encoded into open-source tools.

8 Live Tools

298 Indicators

11 Sectors

0 Data Transmitted

Why This Exists

I kept running into the same problem. You find a useful tool, you build your workflow around it, and then the pricing changes. Or the platform pivots. Or the internet goes down in the middle of a field visit and you lose access to your own work.

PRAXIS runs entirely in your browser. Everything stays on your machine. No server calls, no tracking. If the wifi cuts out in Juba or Ouagadougou, the tools still work. MIT licensed, permanently free, fully yours.

"I spent years looking for evaluation tools built by someone who had actually run an evaluation in a conflict zone. I could not find them. So I built them."

Research · Agent-based model

Featured

Most evaluations miss their moment.

A finished evaluation only informs a decision if it lands while the window is still open, and someone still has attention to spare. An agent-based model traces one seeded run of a portfolio: the current rules against a full redesign, running in step.

0% used, baseline

→

35% used, redesign

Launch the full simulation →

Baseline run · traced Wk 208

window missed deprioritised

The Local Toolkit

Eight offline-first modules.

Each module runs entirely in your browser. No server, no telemetry, fully MIT-licensed. Pick any module to launch.

01 Module

Design Advisor

AI-scored recommendations across 16 evaluation designs. Accounts for data, ethics, and operational context.

Launch Advisor →

Sample Size Calculator

Eight designs covered: RCT, Cluster, Stepped-Wedge, Diff-in-Diff. Handles ICC and attrition offline.

Launch Calculator →

Theory of Change

Visual canvas with assumptions mapping. Build your programme's logic model from inputs through to impact.

Inputs → Activities → Outcomes → Impact

Launch Canvas →

04 Module

Evaluation Matrix

Link evaluation questions, judgement criteria, indicators, and methods into a structured exportable table.

EQ1	Relevance	Doc review
EQ2	Effectiveness	Mixed
EQ3	Impact	Quasi-exp.
EQ4	Sustainability	KII + FGD

Launch Matrix →

05 Module

Data Explorer

Offline CSV/XLSX engine. Auto-detect types, summarise distributions, run regressions, flag quality issues.

n=1,247μ = 4.2 σ = 1.1

Launch Explorer →

06 Module

Indicator Bank

298 indicators across 11 sectors. Filter by PEPFAR MER codes, sector, or keyword.

Open Bank →

Data Quality Planner

Three-layer Grant Data Evaluability Diagnostic followed by a weekly Data Quality Improvement plan. 29 implementation protocols across six dependency levels, built for grant programmes in low-resource settings.

3-layer assessment 29 action protocols Weekly planner + dashboard

Launch DQI Planner →

Synthesis Preview Conditional Pass

Layer 1 · Indicator quality

8/10

Layer 2 · System functionality

5/10

Layer 3 · RSSH feasibility

Deck Generator

Turn a Workbench evaluation into a live-presenter deck. Inception Brief (pre-fieldwork) and Findings Brief (post-fieldwork) share one engine: keyboard nav, speaker notes on a second screen, PDF export.

16 inception slides 13–16 findings slides PDF + JSON export

Launch Deck Generator →

Beyond the Toolkit

AI Evaluation Skills

Claude Project knowledge modules encoding validated evaluation methodology. Six core modules plus sector-specific extensions covering over 20 evaluation designs and 16 validated scales.

View on GitHub →

360

Coming 2026

PRAXIS 360 Workflow

The integrated end-to-end evaluation workflow built on the .praxis file format. From Theory of Change through to reporting, with every finding traceable back to the evaluation framework.

Learn More