  1. Linked Statistical Data Analysis

    Samos Summit, Share PSI 2.0, Samos, 2014-07-01

    #LinkedData #samos2014

    Sarven Capadisli http://csarven.ca/#i @csarven

  2. Statistical Data on the Web (Characteristics)

    Hypercube
    • Decentralized
    • Heterogeneous
    • Structured
    • High volume
    • Formats (e.g., CSV, Excel, PC-Axis, SDMX-ML, XML)
    • Distribution and Access

    Clean? Synchronised? Comparable? Provenance? Trustable? Analyses?

  3. Statistical Linked Dataspaces

    ... from government, IGO, and NGO data

  4. Statistical Linked Dataspaces (2010-2011)

  5. Statistical Linked Dataspaces (2012)

  6. Statistical Linked Dataspaces (2013)

    Source format? SDMX-ML

  7. Statistical Linked Dataspaces (2014)

    Source format? SDMX-ML

  8. Statistical Linked Dataspaces (2014 Procrastination Plan)

  9. Linked SDMX adoption

    • Swiss Federal Statistics Office / Bern University of Applied Sciences (pilot)
    • Italian National Institute of Statistics / SpazioDati (pilot)
    • LOD2 Statistical Workbench (part of EU FP7 project)
    • FAO?
    • ?

  10. 270a Cloud (Statistical Linked Dataspaces)

  11. Triples count

  12. Linked Statistical Artefacts

    • Dataset: http://worldbank.270a.info/dataset/world-bank-finances
    • Structure: http://uis.270a.info/structure/1.0/CUL_DS
    • Observation: http://ecb.270a.info/dataset/SEE/A/AT/WBR0/EXT/X/E/2011
    • Dimension: http://oecd.270a.info/dimension/1.0/TIME
    • Measure: http://frb.270a.info/component/Z1/measure/1.0/OBS_VALUE
    • Attribute: http://transparency.270a.info/classification/attribute/matching-percentiles
    • Concept: http://imf.270a.info/concept/1.0/PGI/REF_AREA
    • Code list: http://fao.270a.info/code/0.1/CL_UN_COUNTRY
    • Hierarchical code list: http://bfs.270a.info/code/1.0/HR_HGDE_HIST
    • Regression Analysis: http://stats.270a.info/analysis/worldbank:GC.DOD.TOTL.GD.ZS/transparency:CPI2009/year:2009

    Cool URIs? 1, 5, 100, 10000 years? Ha!
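
    The Regression Analysis URI in the list above appears to encode its inputs in the path. A minimal sketch of reading it back, assuming (this is an interpretation, not a documented contract) that each segment after /analysis/ has the form prefix:identifier, that the first two segments name datasets, and that the third names a reference period. The function name parse_analysis_uri is hypothetical.

```python
from urllib.parse import urlparse


def parse_analysis_uri(uri):
    """Split an analysis URI into its 'prefix:identifier' path segments.

    Assumption: the path is /analysis/<a>/<b>/<c>, where each of the
    three trailing segments is written as 'prefix:identifier'.
    """
    segments = urlparse(uri).path.strip("/").split("/")
    if len(segments) != 4 or segments[0] != "analysis":
        raise ValueError(f"not an analysis URI: {uri}")

    pairs = []
    for segment in segments[1:]:
        prefix, sep, identifier = segment.partition(":")
        if not sep:
            raise ValueError(f"segment without a prefix: {segment}")
        pairs.append((prefix, identifier))

    # The first two pairs are treated as datasets; the last pair
    # (e.g. year:2009) is treated as the reference period.
    ref_key, ref_value = pairs[2]
    return {"datasets": pairs[:2], ref_key: ref_value}


result = parse_analysis_uri(
    "http://stats.270a.info/analysis/"
    "worldbank:GC.DOD.TOTL.GD.ZS/transparency:CPI2009/year:2009"
)
print(result)
```

    Keeping the inputs visible in the path is what makes such an analysis citable and repeatable: the URI itself states what was compared.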

  13. Provenance

    PROV-O Key Concepts

  14. Interesting queries?

    • Number of people born in Bern before 1900
    • Inflation rate in Italy when the prime minister was ...
    • Development projects in low-middle income countries situated above the equator

  15. How about interesting analysis?

    • Statistically significant analysis of GDP and mortality rate
    • Strong correlations
    • Predicting or forecasting
    • Investigating the WHYs
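
    The kinds of analysis listed above can be sketched with two textbook calculations: the Pearson correlation coefficient and an ordinary least-squares line. This is not necessarily how stats.270a.info computes its results; the function names are hypothetical and the numbers are made-up placeholders, not real statistics.

```python
from math import sqrt


def _sums(xs, ys):
    """Return the means and the centred sums of squares and products."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long series with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return mean_x, mean_y, sxy, sxx, syy


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two series."""
    _, _, sxy, sxx, syy = _sums(xs, ys)
    return sxy / sqrt(sxx * syy)


def least_squares(xs, ys):
    """Slope and intercept of the ordinary least-squares line y = a*x + b."""
    mean_x, mean_y, sxy, sxx, _ = _sums(xs, ys)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Placeholder values only, chosen so the result is easy to check by hand.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 6.0, 8.0, 10.0]
print(pearson_r(xs, ys))
print(least_squares(xs, ys))
```

    A strong coefficient only says that two series move together; investigating the WHYs still needs context about the data.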
  16. stats.270a.info

    Citizen-centric interfaces for statistical stuff.

    Intended for data journalists, researchers, non-developers!

    ... and Linked Data friendly.
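
    "Linked Data friendly" usually means a URI can be dereferenced for machine-readable RDF as well as for an HTML page. A minimal sketch of preparing such a request with HTTP content negotiation, assuming the server offers Turtle (text/turtle) for the given URI; that is an assumption, not something stated in these slides, and the helper name rdf_request is hypothetical.

```python
from urllib.request import Request


def rdf_request(uri, media_type="text/turtle"):
    """Build (but do not send) a request asking for an RDF serialisation.

    Assumption: the server honours the Accept header for this URI.
    """
    return Request(uri, headers={"Accept": media_type})


req = rdf_request(
    "http://stats.270a.info/analysis/"
    "worldbank:SP.DYN.IMRT.IN/transparency:CPI2009/year:2009"
)
print(req.full_url)
print(req.get_header("Accept"))
```

    Passing req to urllib.request.urlopen would perform the actual fetch; it is left out here so the sketch runs without network access.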

  17. Analysis user-interface (Plot) 1/3

    http://stats.270a.info/analysis/worldbank:SP.DYN.IMRT.IN/transparency:CPI2009/year:2009

  18. Analysis user-interface (Summary) 2/3

    http://stats.270a.info/analysis/worldbank:SP.DYN.IMRT.IN/transparency:CPI2009/year:2009

  19. Oh yeah?

    Provenance user-interface

  20. Analysis user-interface (Provenance) 3/3

    http://stats.270a.info/provenance/fa698e46868fe348865678884e89ef84b0be6c64

  21. Make sense of the data

    Pirates and global warming

  22. Adding some context?

    Time Series Example

  23. ... but what is really interesting here?

  24. Linked Statistical Data Analysis

    Sarven Capadisli

    http://csarven.ca/#i

    @csarven

  25. Credits