Evaluation Infrastructure

Built for the field.
Runs on your machine.

Twelve years of field evaluation experience, from pastoralist communities in South Sudan to health facilities in rural Burundi, encoded into open-source tools.

8 Live Tools
298 Indicators
11 Sectors
0 Data Transmitted

Why This Exists

I kept running into the same problem. You find a useful tool, you build your workflow around it, and then the pricing changes. Or the platform pivots. Or the internet goes down in the middle of a field visit and you lose access to your own work.

PRAXIS runs entirely in your browser. Everything stays on your machine. No server calls, no tracking. If the wifi cuts out in Juba or Ouagadougou, the tools still work. MIT licensed, permanently free, fully yours.

"I spent years looking for evaluation tools built by someone who had actually run an evaluation in a conflict zone. I could not find them. So I built them."

The Local Toolkit

Eight offline-first modules.

Each module runs entirely in your browser. No server, no telemetry, fully MIT-licensed. Pick any module to launch.

01 Module

Design Advisor

AI-scored recommendations across 16 evaluation designs. Accounts for data, ethics, and operational context.

Cluster-RCT
92%
Stepped-Wedge
85%
Diff-in-Diff
40%
Launch Advisor
02 Module

Sample Size Calculator

Eight designs covered: RCT, Cluster, Stepped-Wedge, Diff-in-Diff. Handles ICC and attrition offline.

24
Clusters
×
30
Per Cluster
=
720
Total N
Launch Calculator
03 Module

Theory of Change

Visual canvas with assumptions mapping. Build your programme's logic model from inputs through to impact.

Inputs Activities Outcomes Impact
Launch Canvas
04 Module

Evaluation Matrix

Link evaluation questions, judgement criteria, indicators, and methods into a structured exportable table.

EQ1RelevanceDoc review
EQ2EffectivenessMixed
EQ3ImpactQuasi-exp.
EQ4SustainabilityKII + FGD
Launch Matrix
05 Module

Data Explorer

Offline CSV/XLSX engine. Auto-detect types, summarise distributions, run regressions, flag quality issues.

n=1,247μ = 4.2 σ = 1.1
Launch Explorer
06 Module

Indicator Bank

298 indicators across 11 sectors. Filter by PEPFAR MER codes, sector, or keyword.

Health47
Education38
Protection31
WASH28
+ 7 more sectors154
Open Bank
07 Featured

Data Quality Planner

Three-layer Grant Data Evaluability Diagnostic followed by a weekly Data Quality Improvement plan. 29 implementation protocols across six dependency levels, built for grant programmes in low-resource settings.

3-layer assessment 29 action protocols Weekly planner + dashboard
Launch DQI Planner
Synthesis Preview Conditional Pass
Layer 1 · Indicator quality
8/10
Layer 2 · System functionality
5/10
Layer 3 · RSSH feasibility
3/10
12
Actions
6
This week
3
Overdue
08 Featured

Deck Generator

Turn a Workbench evaluation into a live-presenter deck. Inception Brief (pre-fieldwork) and Findings Brief (post-fieldwork) share one engine: keyboard nav, speaker notes on a second screen, PDF export.

16 inception slides 13–16 findings slides PDF + JSON export
Launch Deck Generator
Programme
Maternal Health
07Methodology
11Evaluability

Beyond the Toolkit

AI Evaluation Skills

Claude Project knowledge modules encoding validated evaluation methodology. Six core modules plus sector-specific extensions covering over 20 evaluation designs and 16 validated scales.

View on GitHub →
360
Coming 2026

PRAXIS 360 Workflow

The integrated end-to-end evaluation workflow built on the .praxis file format. From Theory of Change through to reporting, with every finding traceable back to the evaluation framework.

Learn More