90-Day Pilot Proposal
Prove the value on your data. One course, one semester, measurable results.
Scope
What the Pilot Covers
The pilot targets one course or assessment product within your platform. We work with one semester of student interaction data to measure the impact of QLM's question selection against your current approach.
This is not a proof-of-concept on synthetic data. It is a controlled evaluation on your production item bank with your real learners, producing results that your psychometrics and product teams can independently verify.
What We Measure
Timeline
Success Criteria
We define success in advance so there is no ambiguity about whether the pilot delivered value. These thresholds are negotiable before the pilot starts, but fixed once it begins.
| Metric | Threshold | How Measured |
|---|---|---|
| Item Reduction | ≥ 20% fewer items at equivalent SE | Mean items administered in treatment vs. control, at matched standard error threshold |
| Calibration Improvement | Parameter SE reduction ≥ 15% | Average standard error of item parameters at pilot end vs. pilot start |
| DIF Detection | ≥ 1 Category B/C flag not in prior review | Mantel-Haenszel DIF analysis vs. your most recent annual study results |
| Predictive Validity | Correlation ≥ 0.80 with outcome measures | Pearson r between QLM ability estimates and your designated outcome variable |
Pilot Pricing
The pilot is designed to be zero-risk. You pay nothing until the pilot is complete and you have independently verified the results. Production pricing is based on assessment volume and is detailed in the ROI Model.
What We Need From You
- 1 Item Bank — Your question bank in the format described in the Integration Guide. JSON, CSV, or JSONL. Minimum 50 items in the target domain; recommended 200+ for robust calibration.
- 2 Student Response Data Feed — Real-time submission of responses (item_id, correct/incorrect, response time) via the API as learners complete assessments during the pilot period.
- 3 One Technical Contact — A developer or technical lead who can implement the 3 API calls and troubleshoot during the integration phase. Expected time commitment: 2-4 hours total.
- 4 Outcome Data (for predictive validity) — End-of-semester grades, certification pass/fail, or other outcome measures for learners in the pilot cohort. Provided after the pilot period ends.
Start Your Pilot
Request a sandbox key and we will schedule a 30-minute kickoff to scope your pilot.
Request Sandbox Access