[Esip-documentation] cdm_data_type: summary and minimalist proposal

Nan Galbraith via Esip-documentation esip-documentation at lists.esipfed.org
Tue Dec 9 13:06:16 EST 2014


Do you want input via email, on the 'talk' page, or on the Active Issues 
page?

I just noticed something else on the main (draft) page that I think 
should be
changed:

    Several attributes explicitly allow the entry of multiple entities
    as comma-separated
    values. The entities in such lists which contain a comma must be
    enclosed in straight
    double quotation marks ("), which will not be considered part of the
    entity.

I'm not sure if this is correct, but it's certainly confusing; netcdf 
inserts double quotes
(e.g. in ncdump) for non-numeric, and we seem to be advising people to 
add a pair
on their own.  I think we're just getting too far into the weeds with 
this level of detail.

I recommend we drip the second sentence, or the whole paragraph on 
comma-separated
lists.

Nan




On 12/9/14 12:32 PM, John Graybeal via Esip-documentation wrote:
> As previously promised, this email summarizes the status of the 
> cdm_data_type attribute.
>
> To start a discussion, I propose the removal of specific references 
> and code lists, to produce the following definition:
>
> "The organization of the data, as derived from the Common Data Model's 
> Scientific Data layer and understood by THREDDS. (This is a THREDDS 
> "dataType", and is different from the CF NetCDF attribute 
> 'featureType', which indicates a Discrete Sampling Geometry file in CF.)"
>
> Below is background material, if you want to understand the problem 
> and the details.
>
> John
>
> *The Problem*
>
> At a minimum, the current 1.3 definition isn't perfect. Its list of 
> valid values (point, /profile/, /section/, station, /station_profile/, 
> trajectory, grid, image, or swath) doesn't agree with the referenced 
> list (point, station, trajectory, grid, image, swath, /radial/). And 
> the NODC guidance [3] is different as well, saying "The current 
> choices are: Grid, Image, Station, Swath, and Trajectory."
>
> Additional concerns are expressed in Bob Simons' email of 10/16, as 
> follows:
>   1) For cdm_data_type, it is unfortunate that previous versions of 
> ACDD included a link to a specific list. Surely the intention is to 
> evolve as Unidata/THREDDS/the common data model evolves, even if that 
> particular list doesn't evolve. Can we please remove that link and add 
> the values from the CF DSG chapter that aren't in the current list 
> here: timeSeries, timeSeriesProfile, trajectoryProfile?
>   2) And please remove the NODC guidance link. That is NODC guidance 
> about the DSG variants that NODC prefers and is not strictly relevant 
> to a list of cdm data types.
>
> *History/References*
>
> In version 1.1, the definition for this attribute was "The *THREDDS 
> data type* appropriate for this data set.", with the bolded text 
> referencing 
> http://www.unidata.ucar.edu/projects/THREDDS/tech/catalog/InvCatalogSpec.html#dataType
>
> In making version 1.3, we removed cdm_data_type, then re-added it for 
> backward compatibility. The following definition was proposed and yet 
> survives: "The organization of the data, as derived from the Common 
> Data Model's Scientific Data layer and understood by THREDDS (this is 
> a *THREDDS "dataType"*). One of point, profile, section, station, 
> station_profile, trajectory, grid, image, or swath. Please note that 
> this is different from the CF NetCDF attribute 'featureType' that 
> indicates a Discrete Sampling Geometry file---for guidance on those 
> terms, please see the *NODC guidance*." The first bold text points to 
> the same address as in v1.1; the second points to 
> http://www.nodc.noaa.gov/data/formats/netcdf/ [3].
>
> A detailed review of the discussion through Oct 6 starts at line 15 of 
> this Active Issues page 
> <https://docs.google.com/spreadsheets/d/19fl5AgGkckG03yTchUjYUp4YnR09Fn1Nqps2KHenkC4/edit#gid=0> [5]. 
> That latest proposal was to keep v1.3 cdm_data_type as is, and not 
> deprecate it, as there are some examples of its utility.
>
> A nice review of the current NetCDF-Java code usage of cdm_data_type 
> is here <https://github.com/Unidata/thredds/issues/72> [1]. The upshot 
> is that if featureType is present, it is sufficient for that library; 
> but older applications may still require cdm_data_type. This thread 
> also points out an issue in the NODC guidance page at 
> http://www.nodc.noaa.gov/data/formats/netcdf/v1.1/ [4]. (Reference [3] 
> resolves to this). Another more recent, and arguably more relevant, 
> code citation is at Unidata's THREDDS code: 
> https://github.com/Unidata/thredds/blob/target-4.3.22/cdm/src/main/java/ucar/nc2/constants/FeatureType.java [6]; 
> it has a still longer list of items.
>
> *Options*
>
> These options are mix-and-match.
>
> The default option is doing nothing. Other conceivable options are:
> - Revert to previous wording from 1.1.
> - For the NODC issue: Just remove the NODC guidance link phrase 
> ("---for guidance on those terms, please see the NODC guidance").
> - For the inconsistency in the list of acceptable terms:
>    A) Remove the link to *THREDDS "dataType"* (and let users figure 
> things out for themselves, or add more terms such as listed under (1) 
> above).
>    B) Change the list of terms to match the current link to *THREDDS 
> "dataType"*.
>    C) Change the link to *THREDDS "dataType"* and update the list of 
> terms to match whatever we point to.
> - For the general confusion about what this term is for, add something 
> like this sentence for context: "This attribute is maintained for 
> compliance with older files and applications, and is neither needed 
> nor recommended for most purposes (use featureType instead)."
>
>
> *References*
> *
> *
> [1] THREDDS issue discussion of cdm_data_type: 
> https://github.com/Unidata/thredds/issues/72
> [2] THREDDS data type reference in 1.1 and 1.3: 
> http://www.unidata.ucar.edu/projects/THREDDS/tech/catalog/InvCatalogSpec.html#dataType
> [3] NODC guidance reference in 1.3: 
> http://www.nodc.noaa.gov/data/formats/netcdf/  (forwards to [4])
> [4] NODC guidance referenced by @shane-axiom: 
> http://www.nodc.noaa.gov/data/formats/netcdf/v1.1/
> [5] ACDD 1.3 Reconciliation Pages: Active Issues: 
> https://docs.google.com/spreadsheets/d/19fl5AgGkckG03yTchUjYUp4YnR09Fn1Nqps2KHenkC4/edit#gid=0
> [6] Unidata THREDDS code with a list of feature types: 
> https://github.com/Unidata/thredds/blob/target-4.3.22/cdm/src/main/java/ucar/nc2/constants/FeatureType.java
>
>
> _______________________________________________
> Esip-documentation mailing list
> Esip-documentation at lists.esipfed.org
> http://www.lists.esipfed.org/mailman/listinfo/esip-documentation


-- 
*******************************************************
* Nan Galbraith        Information Systems Specialist *
* Upper Ocean Processes Group            Mail Stop 29 *
* Woods Hole Oceanographic Institution                *
* Woods Hole, MA 02543                 (508) 289-2444 *
*******************************************************


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.lists.esipfed.org/pipermail/esip-documentation/attachments/20141209/e240fd06/attachment-0001.html>


More information about the Esip-documentation mailing list