What is SafeInsights?
SafeInsights is a national large-scale education research hub that enables researchers to analyze learning data from multiple education data organizations (DOs, e.g., edtech platforms, educational institutions) in a secure, privacy-preserving environment. Your research code runs within protected data enclaves, and you receive only approved, aggregated results—you never see raw student data.
Our mission is to advance research that drives better learning for everyone.
SafeInsights is housed at Rice University and funded by a $90 million investment from the U.S. National Science Foundation. The project is a multi-institutional collaboration including:
- Leading edtech platforms, educational institutions, and other Data Organizations
- Privacy and security experts
- Learning scientists, education, and AI/ML for education researchers
- Legal and ethics scholars
- Community engagement leaders
- Technology infrastructure specialists
- and many more
How It's Different
SafeInsights inverts the traditional research model: instead of you receiving a data export, you send your research queries to the where the data is collected.
| Traditional Approach | SafeInsights Approach |
|---|---|
| Data exported to you | Your analyses code goes to the data |
| You see individual records | You never see raw data |
| Privacy risks if leaked | Data never leaves secure environment |
| Each edtech/Data Organization(DO) has different research request process | Standardized process across edtech/DO |
| Limited sample sizes | Access to millions of learners |
The Core Innovation
SafeInsights uses a novel approach: "We open the data while releasing none of it."
Instead of giving you data:
- You write analysis code (in R, Python, SQL) on the SafeInsights Integrated Development Environment (IDE)
- Submit it to the DO of your choice via the SafeInsights Research Portal
- Personnel at the DO run your code in the secure enclave
- Only approved, aggregated outputs are released to you
What You Can Do with SafeInsights (Version 1)
SafeInsights currently supports post-hoc or secondary analysis of existing data on SafeInsights member DOs. This means, depending on the DO choose to work with, you can:
Analyze Engagement Patterns
Study how learners interact with educational content over time
- Time on task
- Resource usage patterns
- Navigation behaviors
- Completion rates
Examine Performance Relationships
Investigate connections between learning behaviors and outcomes
- Practice frequency and test scores
- Resource use and achievement
- Learning strategies and retention
Compare Across Groups
Explore differences in how different populations engage and succeed
- First-generation vs. continuing-generation students
- Different prior preparation levels
- Various demographic groups (where data available)
Build Predictive Models
Create models to forecast outcomes from historical patterns
- Early warning systems
- Success prediction
- At-risk identification
Track Longitudinal Trends
Follow patterns over time within courses or across terms
- Engagement over semester
- Learning curve analysis
- Retention patterns
Learn more about post-hoc research patterns →
What's Not Available Yet
SafeInsights Version 1 focuses exclusively on post-hoc analysis. We do not currently support:
❌ Interventions or A/B tests - You cannot modify the student experience or randomly assign conditions
❌ Cross-platform studies - You cannot combine data from multiple education platforms in a single study (called "fusion")
❌ Direct data access - You cannot download or directly view individual student records
These capabilities are planned for future releases, stay tuned!
Who Uses SafeInsights
SafeInsights is designed for:
Education Researchers studying learning at scale
- Graduate students conducting dissertation research
- Postdocs investigating learning patterns
- Faculty researchers with questions about digital learning
Data Scientists exploring educational data
- Methodologists testing new analytic approaches
- Learning scientists building models
- Quantitative researchers needing large samples
Edtech Researchers at partner organizations
- Internal research teams at education platforms
- Product researchers validating features
- Data analysts supporting evidence-based design
Scale and Reach
SafeInsights provides access to data from:
- Millions of learners across K-12 and higher education
- Multiple subject areas including STEM, humanities, social sciences
- Diverse contexts from homework systems to full course platforms
- Longitudinal data spanning months to years
Built on Strong Principles
SafeInsights operates according to principles of:
Privacy First
- Student data never leaves protected enclaves
- Researchers never see individual-level information
- Multiple layers of protection and review
Learn about privacy protections →
Research Quality
- Rigorous proposal review process
- IRB oversight required
- Open science practices encouraged
- Reproducibility supported
Typical Research Timeline
From initial idea to receiving results:
- Initial research request review: X-X days
- Code review: X-X days/weeks
- Output review: X-X days/weeks
Total: Average ETA for full research cycle between X-X weeks
The timeline varies based on study complexity and DO review processes. Early feasibility checks help avoid delays later.
Common Questions
Q: Will I ever see individual student records?
A: No. You receive only aggregated results that meet privacy thresholds (typically minimum cell sizes of 10-20). Individual records never leave the secure enclave. You will have access to synthetic data samples to draft your analyses based on.
Q: What programming languages can I use?
A: At the moment, we are supporting Python, R, and SQL. Over time, we will be able to offer support for other analytical tools of choice. You write your analysis using simulated data, then your code runs on real data within the enclave.
Q: Does this cost money?
A: SafeInsights access is free. Some studies may incur computational costs charged by the data organization (at cost, not profit). Most small-to-medium studies have minimal or no costs.
Q: Do I need IRB approval?
A: Yes, in most cases. Even though you don't directly access identifiable data, you're still conducting human subjects research that requires institutional ethics review.
Next Steps
Understand the approach
- What is post-hoc or secondary analysis? - Learn about the scope and limits
- How SafeInsights works - See the technical architecture
Explore what's available
- Data Catalog Overview - Browse participating platforms
- OpenStax data - Explore one member's offerings
Design your study
- Research patterns guide - Find examples for different research questions
- Study lifecycle - Understand the complete process
- Proposal Guide - Write your proposal
Get support
- FAQ - Quick answers to common questions
- Get help - Contact options for specific questions
- Glossary - Key terms defined
This page last updated: December 2025