Dataset ID: nesdis_blendedsic_shem_daily

General Information

  • Link to Data Landing Page
    • Name : Blended Sea Ice Concentration from AMSR2/VIIRS Daily 1km, Antarctic
    • Version : 1.0
    • Contact : Richard Dworak
    • Date Published : 12/18/2023
  • Is this raw data or a derived/processed data product? Derived
  • Is this observational data, simulation/model output, or synthetic data? Observational
  • Is the data single-source or aggregated from several sources? Aggregated

Data Quality

  • Timeliness:
    • Will the dataset be updated? Yes, it will be updated.
      • Updated when new data are added daily
      • Will there be different stages of the update? No
  • Data completeness
    • Is there any documentation about the completeness of the dataset? Yes
    • How complete is the dataset compared to the expected spatial coverage? Complete
    • How complete is the dataset compared to the expected temporal coverage? Complete
  • Data consistency
    • Is this dataset self-consistent in that its units, data types, and parameter names do not change over time and space? Yes
    • Are this dataset’s units, data types, and parameter names consistent with similar data collections? Yes
    • Are there processes to monitor for units, data types, and parameter consistency? No
      • If yes, what measures are taken? Manual review [TODO]
  • Note
    • Check for pole hole, flag values?
    • Data consistency: create data management plan and add review process
  • Data bias
    • Is there known bias in the dataset? No
    • Have measures been taken to examine bias? No
    • Is there reported bias in the data? No known bias
    • Is there quantitative information about data resolution in space and time? Yes
    • Are there published data quality procedures or reports? No
    • Is the provenance of the dataset tracked and documented? No
    • Are there checksums / other checks for data integrity? No
    • What is the size of the dataset? Depending on the resource, this might be total data volume, dimensionality, number of images, data files, table rows, image size, etc. Approximately 15 to 540 MB per daily file
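
Because no checksums are published for this dataset, a user who needs integrity checks has to record their own after download. The following is a minimal sketch of one way to do that with SHA-256; the function name `sha256_of_file` and the chunk size are illustrative assumptions, not part of any NOAA tooling. The file is read in chunks so that a daily file of several hundred MB does not have to fit in memory.

```python
import hashlib


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

Storing the digest next to each downloaded file makes it possible to detect later corruption or a silently replaced file by recomputing and comparing.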

Data Documentation

  • Does the dataset metadata follow a community/domain standard or convention? Yes
    • If the metadata follows a community/domain standard, which standard is it? CF-1.6, ACDD-1.3, NOAA CDR v1.0, GDS v2.0, COARDS
    • Is the dataset metadata machine-readable? Yes
    • Does it include details on the spatial and temporal extent? Yes
  • Is there a comprehensive data dictionary/codebook that describes what each element (parameter) of the dataset means? Yes
    • Is the data dictionary standardized? Yes
    • Is the data dictionary machine-readable? Yes
    • Do the parameters follow a defined standard? Yes
      • If the parameters follow a defined standard, which standard is it? CF-1.6
    • Are parameters crosswalked in an ontology or common vocabulary (e.g. NIEM)? Not applicable
  • Does the dataset have a unique persistent identifier, e.g. DOI? Yes (CHECK DOI)
  • Is there contact information for subject-matter experts? Yes
  • Is there a mechanism for user feedback and suggestions? Yes
  • Are there example codes/notebooks/toolkits available showing how the data can be used? Yes
  • What is the license for the data?
    • NCEI Data Licensing: These data may be redistributed and used without restriction.
    • Is the license standardized and machine-readable (e.g. Creative Commons)? Yes
  • Has this dataset already been used in AI or ML activities? No
  • Are there recommendations on the intended use of the data, and uses that are not recommended? Yes [TODO: link to documentation]

Data Access

  • What is/are the major file formats? NetCDF
    • Is this format machine-readable? Yes
    • Is the data available in at least one open, non-proprietary format? Yes
    • Are there tools/services to support data format conversion? Yes
      • If so, provide the link to the tools/services
  • Data delivery: ERDDAP
    • Does data access require authentication (e.g., a registered user account)? No
    • Can the file be accessed via direct file downloading or ordering? Yes
    • Is there an Application Programming Interface (API) or web service to access the data? Yes
    • If there is an API, does the API follow an open standard protocol (e.g., OGC)? Yes
    • If there is an API, is there documentation for the API? Yes
      • If “Yes”, please provide a URL to the documentation.
    • Is the data available publicly via cloud services? Yes
  • For restricted data, have measures been taken to provide some access while still applying appropriate protection for privacy and security? Yes
    • Has the data been aggregated to reduce granularity? Yes
    • Has the data been anonymized / de-identified? Not applicable
    • Is there secure access to the full dataset for authorized users? No
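
Since the data are delivered through ERDDAP without authentication, a subset can be requested with a plain URL. The sketch below builds such a request, assuming the dataset is served through ERDDAP's griddap protocol on the PolarWatch server. The helper name `griddap_url` is hypothetical, and the variable name and axis ranges passed to it must be taken from the dataset's metadata page; none are specified in this checklist.

```python
from urllib.parse import quote

ERDDAP_BASE = "https://polarwatch.noaa.gov/erddap"
DATASET_ID = "nesdis_blendedsic_shem_daily"


def griddap_url(dataset_id, variable, axis_ranges, fmt="nc", base=ERDDAP_BASE):
    """Build a griddap subset URL.

    axis_ranges is a list of (start, stop) axis values, one pair per
    dimension, in the dataset's own axis order (assumed known to the caller).
    """
    # Each dimension becomes [(start):stride:(stop)]; a stride of 1 keeps every point.
    hyperslab = "".join(f"[({start}):1:({stop})]" for start, stop in axis_ranges)
    # Percent-encode the square brackets; parentheses and colons are left readable.
    query = quote(variable + hyperslab, safe="():")
    return f"{base}/griddap/{dataset_id}.{fmt}?{query}"
```

Changing `fmt` to another file type supported by the server (for example `csv`) requests the same subset in a different format, which is one way the format conversion mentioned above can be done.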

Data Preparation

  • Have null values/gaps been filled? Yes
  • Have outliers been identified? No
  • Is the data gridded (regularly sampled in time and space)? Yes
    • Regularly gridded in space and constant time-frequency
    • If the data is gridded, was it transformed from a different original sampling? Yes, from a different regular sampling
    • If the data is resampled from the original sampling, is the data also available at the original sampling? Yes
  • Are there associated targets or labels for supervised learning techniques (i.e., can this be used as a training dataset for supervised learning techniques)? Yes

Additional Metadata

  • PolarWatch Metadata: https://polarwatch.noaa.gov/erddap/info/nesdis_blendedsic_shem_daily/index.html
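
ERDDAP serves the metadata page above in machine-readable formats as well, by changing the file extension on the `info` URL. A minimal sketch, assuming the PolarWatch server follows the standard ERDDAP URL layout; the helper name `info_url` and the restricted set of formats are illustrative choices.

```python
ERDDAP_BASE = "https://polarwatch.noaa.gov/erddap"
DATASET_ID = "nesdis_blendedsic_shem_daily"


def info_url(dataset_id, fmt="html", base=ERDDAP_BASE):
    """Return the ERDDAP metadata URL for a dataset in the requested format."""
    allowed = {"html", "csv", "json"}  # a subset of the formats ERDDAP offers
    if fmt not in allowed:
        raise ValueError(f"unsupported format: {fmt!r}")
    return f"{base}/info/{dataset_id}/index.{fmt}"
```

For example, `info_url(DATASET_ID, "json")` gives a URL whose response lists the dataset's global attributes, dimensions, and variables in a form that scripts can parse.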