A normalized, well-structured, relational database that you could query like:
Give me all pH - values where we also have turbidity measurements and discharge is < 20 m3/s.
And plot them. In blue. With yellow dots. On a logarithmic scale.
A motivating example:
On Tue, Aug 25, 2015 at 6:27 PM, Matthew MacLeod <Matthew.MacLeod@aces.su.se>
wrote:
Hey HvW…
Recep and I have been working a bit on some ideas related to your
Remoteness Index.
It would help us a lot to have the raw data (as NetCDF files, for
example) that underlie your Remoteness Index maps that you
published in ES&T. We are most interested in the map for the
night-light emission scenario, but would be happy to have the raw
data for the CROP scenario also.
Do you still have those files backed up somewhere??
Thanks! -matt
Data Mangement Rule at the time:
My private data management strategy at the time:
rsync -av /home/hvwaldow hvw@climstor.unibe.ch:/.../private/archive/phd
recovery ...
recovery ...
On Wed, Sep 16, 2015 at 2:30 AM, Harald von Waldow <harald@vonwaldow.ch> wrote:
Hi Matt & Recep,
sorry for the delay again. You know how it is ..
Please find attached an archive with 4 files:
.....
....
Feb 1, 2016:
Happy End
What data to upload?
E.g.
Rule of thumb: Everything that is consistent and complete enough and worth checking and documenting to an extent that it stands for itself and could be used by others without additional information.
Details are going to be figured out individually.
A collection
of
"Data
Packages"
The system doesn't look into the files.
no selection/search/analysis based directly on file content.
technically, everything can be dumped into a package, without any pre-processing.
Some file-formats should be avoided, though!
Variables: ...
Systems: ...
package = files + meta-data
"resources" = files
Organizations are, ideally, comparatively homogeneous with respect to data-management needs.
"Data Managers"
CKAN has a remote API that exposes all of CKAN’s core features.
Access via http:
http://eaw-ckan-dev1/api/3/action/package_search?fq=tags:fish
Used to programmatically read, create and modify packages.
# sh
curl 'http://eaw-ckan-dev1.eawag.wroot.emp-eaw.ch/api/3/action/organization_list'
{"help": ... "result": ["aquatic-ecology", "aquatic-entomology",
"environmental-chemistry", "environmental-microbiology",
"fish-ecology-and-evolution", "gis-services", "it-services", ... ] }
# Python
from ckanapi import *
ckanremote = RemoteCKAN("http://eaw-ckan-dev1.eawag.wroot.emp-eaw.ch",
apikey=os.environ["CKAN_DEV1_APIKEY_HVW_ADM"])
pkg = ckanremote.action.package_show(id="reform")
print(pkg)
## R
> library('ckanr')
> ckanr_setup(url="http://eaw-ckan-dev1",key=Sys.getenv("CKAN_DEV1_APIKEY_HVW_ADM"))
> package_list(as="table")
[1] "data-from-population..."
[2] "record"
...
Support for writing such client code expected as major task during pilot-phase.
March 21 — Dec 31: Pilot-Phase
New features until 2017