Difference between revisions of "HvABigDataVisualisation"

From PDP/Grid Wiki
  
 
This data is used not only here, but by over 20000 researchers worldwide, who transfer the data back and forth between the data curation centres, the more than 300 compute centres, and thousands of small analysis workstations and desktops. Data flows globally: even just at Nikhef, the compute and disk clusters are interconnected by 240 gigabit-per-second links, and international connectivity exceeds 100 gigabits per second.
 
= More questions than answers? =
The Phase-II visualisation challenge leaves you with plenty of things to try out. Use your creativity to visualise, explain and analyse the data: big data lives by propaganda (and agitation)!
* How can global data flows be presented?
* Can one conceive visualisations for the general public, for users, or for both?
* Can troublesome inter-peer links (and local disk servers) be identified via analytics techniques?

* Which systems (or groups of systems) use the most bandwidth (in and out separately)?
* Which systems (or groups of systems) generate the most connections?

* What does the time distribution of transfers look like (both bandwidth and number of transfers)?
* What "funny behavior" is there (machine learning anomaly detection)?

But there is surely more to do with the data you have!
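As a starting point for the bandwidth, connection-count and time-distribution questions above, here is a minimal Python sketch. It assumes each transfer is available as a tuple of (time in seconds, source host, destination host, bytes); that record layout, the host names and the sample values are purely hypothetical and do not reflect the actual data set. The outlier check is a simple median-based score, a stand-in for real machine learning anomaly detection rather than an instance of it.

```python
from collections import defaultdict
from statistics import median

# Hypothetical transfer records: (time_seconds, source_host, destination_host, bytes).
# The real data set will have its own format; adapt the unpacking accordingly.
transfers = [
    (0,    "disk-01",  "wn-17",   4_000_000_000),
    (120,  "disk-01",  "wn-18",   2_000_000_000),
    (3700, "wn-17",    "disk-02",   500_000_000),
    (3800, "disk-01",  "wn-17",   1_000_000_000),
    (7300, "remote-a", "disk-02", 90_000_000_000),
]

def traffic_per_host(records):
    """Return (bytes_out, bytes_in, connections) dictionaries keyed by host."""
    bytes_out = defaultdict(int)
    bytes_in = defaultdict(int)
    connections = defaultdict(int)
    for _time, src, dst, size in records:
        bytes_out[src] += size
        bytes_in[dst] += size
        # Assumption: each record is one connection, counted against its source.
        connections[src] += 1
    return dict(bytes_out), dict(bytes_in), dict(connections)

def bytes_per_bin(records, bin_seconds=3600):
    """Sum transferred bytes per time bin (one hour by default)."""
    bins = defaultdict(int)
    for time, _src, _dst, size in records:
        bins[time // bin_seconds] += size
    return dict(bins)

def outliers(values, threshold=3.5):
    """Return keys whose value lies far from the median (modified z-score)."""
    data = list(values.values())
    med = median(data)
    mad = median([abs(v - med) for v in data])  # median absolute deviation
    if mad == 0:
        return []  # no spread, so nothing can be called unusual
    return [k for k, v in values.items()
            if 0.6745 * abs(v - med) / mad > threshold]

bytes_out, bytes_in, connections = traffic_per_host(transfers)
hourly = bytes_per_bin(transfers)
print("top sender:", max(bytes_out, key=bytes_out.get))
print("unusual hours:", outliers(hourly))
```

With the sample records, the third hourly bin (index 2) is flagged because a single 90 GB transfer dwarfs the other hours. On real data, the same per-host and per-bin dictionaries can feed directly into whatever plotting or analytics tool you choose.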
  
 
= About the data analytics cluster =
 

Revision as of 11:00, 3 February 2016