Administration

Typical data warehouse management issues:

  • Unscheduled system downtime
  • Tedious management of existing data warehouse
  • Constantly increasing temporary storage space and/or memory

Causes:

  • Scaling is hard
    • Designing and setting up new hardware takes too long
    • Configuring new hardware to work with older generations is error-prone or impossible
    • Common DBA tasks for installing new nodes are manual: repartitioning, re-indexing, or re-creating constraints
  • System is not designed for fault-tolerance
    • Integrating high availability and recovering from failures is hard
    • Restoring the replicas/backups when hardware component fails is complex and time-consuming
    • Detecting and recovering from transient software failures is difficult or impossible
    • System management is difficult
    • No single-system view of the data warehouse
    • Queries cannot be monitored
    • Run-away queries cannot be cancelled
    • Adding/removing hardware to the cluster is time consuming and error-prone
    • Constantly tuning the system is time consuming and difficult

Remedies:

Top Picks
Whitepaper: New MapReduce Whitepaper
Webcast: Bringing Big Data Analytics to the Enterprise - 11/12, with Merv Adrian
Webinar: Service Oriented 'Analytics' - 11/19, with James Kobelius