Insights & Analytics Pipeline sub-session

How does the Analytics Pipeline work?

How can we make the Analytics Pipeline near-realtime?

  • Smaller (e.g. hourly) partitions, so the pipeline can run more frequently
  • Streaming data using Apache Spark (instead of Hadoop, Hive, Sqoop, ...). The edX Analytics team is in the process of doing this right now!
  • Use a lighter-weight solution if direct MySQL queries are sufficient
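
As a rough illustration of the first bullet, here is a minimal Python sketch of bucketing events into hourly partitions so that each run only has to process one small bucket. The event shape (a dict with an ISO 8601 `time` field) and the function names `hourly_partition_key` and `partition_events` are assumptions made for this example; they are not taken from the actual edX pipeline code.

```python
from collections import defaultdict
from datetime import datetime, timezone


def hourly_partition_key(timestamp):
    """Return an hourly partition key such as '2016-03-01T14'.

    Assumes `timestamp` is an ISO 8601 string. Timestamps carrying a UTC
    offset are normalised to UTC; naive timestamps are used as given.
    """
    parsed = datetime.fromisoformat(timestamp)
    if parsed.tzinfo is not None:
        parsed = parsed.astimezone(timezone.utc)
    return parsed.strftime("%Y-%m-%dT%H")


def partition_events(events):
    """Group events (hypothetical dicts with a 'time' field) by hour."""
    partitions = defaultdict(list)
    for event in events:
        partitions[hourly_partition_key(event["time"])].append(event)
    return dict(partitions)


# Hypothetical sample events, for illustration only.
sample_events = [
    {"time": "2016-03-01T14:05:00+00:00", "event_type": "play_video"},
    {"time": "2016-03-01T14:55:10+00:00", "event_type": "problem_check"},
    {"time": "2016-03-01T16:01:00+01:00", "event_type": "play_video"},
]
partitions = partition_events(sample_events)
```

With partitions keyed by hour, a scheduled job could process only the most recently completed hour instead of reprocessing a whole day of data.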

What reporting do we want?

  • Dashboard for blended learning use case (small classes, many copies of the "same" course)
    • Data per learner
    • Divide by class, subject, geography, organisation
  • Timeline to show learning rates/engagement/enrollment as course progresses
    • Tag significant events (course start/end, advertising events, assignment deadlines...)
  • Survey data integration, e.g. to measure learner satisfaction

How has reporting improved?
