Design and run the cloud data architecture behind the world’s largest basketball database. Lead schema design, multi‑source ingestion (scrapers, video logging, manual uploads), and a distributed data team while upholding speed, quality, and governance across prod, staging, dev, and beta environments.
Architect and maintain multi‑environment MySQL / Cloud SQL schemas—adding new fields, enforcing referential integrity, and tuning performance.
Run end‑to‑end data operations: coordinate scraping, video logging, manual uploads, and API feeds; automate ETLs to deliver clean, analytics‑ready tables.
Lead and mentor a 5–10‑person data team, setting priorities and unblocking issues across all collection verticals.
Monitor pipeline health and database performance, optimize queries, and uphold rigorous standards for data quality, security, and compliance.
5–7+ years in data architecture/engineering on cloud production systems
Expert in SQL, Python, and ETL orchestration tools (Airflow/Composer)
Deep experience with GCP services (Cloud SQL, Pub/Sub, Storage)
Proven success managing multi‑environment schemas and ingestion from scraping, video, APIs, and manual uploads
Strong project‑management and communication skills
Direct experience with sports data; basketball familiarity strongly preferred
Hands‑on work with college or professional basketball statistics
Experience with streaming platforms (Kafka/Dataflow) or columnar warehouses (BigQuery/Snowflake)
Familiarity with dbt, Terraform, or Looker
Background in startups or sports‑tech environments