| 📋 Resources | 📊 Directory |
|---|---|
| 🚀 Onboarding | 👥 People |
| 📅 Presentation schedule | 📁 Active projects / Submit new project |
| 📊 Lab Dashboard | 🗓️ Upcoming events |
| 🏢 IMES Room Reservations |
This repository is the operational database for MIT Critical Data. It tracks our people, projects, events, funding, and outputs as plain-text YAML files, organized in Git for auditability and transparency.
This database is only as useful as the data in it. Keeping records accurate and up to date is a shared responsibility — everyone in the lab is expected to contribute updates when things change.
- people/ — lab members, alumni, and collaborators
- projects/ — active and past research projects
- events/ — conferences, workshops, datathons, and talks
- funding/ — grants, sponsored research, and other funding sources
- outputs/ — papers, datasets, software, and other lab products
- talks/ — lab presentation schedule, one YAML file per talk
- scripts/ — automation and validation utilities
- docs/ — schema documentation and contribution guides
The most common task in this repo is adding notes from meetings, calls, or project check-ins. To do this:
- Navigate to the relevant record folder (e.g.
data/projects/mimic-iv/updates/) - Create a new file prefixed with today's date:
YYYY-MM-DD-description.ext
Examples:
2026-01-15-meeting-notes.md— written notes2026-01-15-fathom-transcript.pdf— raw transcript export2026-01-15-decisions.md— key decisions recorded
- Commit and push using your preferred Git tool (e.g. GitHub Desktop, command line, or the GitHub web editor for quick edits)
The date prefix is the only required convention — it keeps updates sorted chronologically and findable. No reformatting of raw transcript exports is needed; upload them as-is.
To update a project's core details (status, team, funding links), edit project.yaml directly in the same folder.
Lab meetings are held every Tuesday. Each talk is a single YAML file in data/talks/.
File naming: YYYY-MM-DD-speaker-slug.yaml (e.g. 2026-06-10-jane-smith.yaml)
Minimal example:
title: My Research Talk
speaker: Jane Smith
speaker_affiliation: Stanford Medicine
date: 2026-06-10
time: "11:00 ET"
type: invited
status: scheduledFull fields:
| Field | Required | Notes |
|---|---|---|
title |
✅ | Talk title |
speaker |
✅ | Freeform name — guest or lab member |
speaker_affiliation |
Institution or company | |
speaker_link |
Folder name in data/people/ if the speaker is a lab member |
|
date |
✅ | YYYY-MM-DD |
time |
e.g. "11:00 ET" |
|
type |
lab-meeting, journal-club, external, invited, other |
|
status |
✅ | scheduled, completed, cancelled |
abstract |
Free text | |
slides_url |
Link to slides — title becomes clickable | |
recording_url |
Link to recording — title becomes clickable | |
notes |
Post-talk discussion summary | |
tags |
Lowercase kebab-case list |
Generate the schedule:
py scripts/generate_schedule.pyThis prints a Markdown table of every Tuesday for the next 4 months, with booked slots filled in and empty slots shown as blank rows.
Each data type lives in its own subdirectory as a YAML file. To add or update a record:
- Create a folder named in lowercase kebab-case (e.g.
data/people/alice-smith/) - Add a
person.yaml(orproject.yaml, etc.) using the schema inCLAUDE.md - Commit and open a pull request for review
If your project doesn't exist in this repository yet, submit it here and it will be added.
Day-to-day tasks are tracked in GitHub Issues. Labels are automatically generated from project and funding records.
See CLAUDE.md for full schema documentation, or reach out to Ken at kepaik@mit.edu.