seamore README file
seamore is a software to cmorize simulation data to a given CMIP6 data request. It is being used to create publication ready data from CMIP6 simulations using the Sea-ice Ocean Model (FESOM) of the Alfred Wegener Institute (AWI). Any feedback is greatly appreciated, please mail to: Jan Hegewald jan.hegewald@awi.de
Variables, Data Request and Tables
Datarequests
github.com/PCMDI/cmip6-cmor-tables
[github.com/PRIMAVERA-H2020/cmip6-cmor-tables PRIMAVERA_Data_Request_v1_0_3](github.com/PRIMAVERA-H2020/cmip6-cmor-tables PRIMAVERA_Data_Request_v1_0_3)
Controlled Vocabularies
github.com/WCRP-CMIP/CMIP6_CVs
Global Attributes, CMIP6 File Naming, Directory Structure
docs.google.com/document/d/1h0r8RZr_f3-8egBMMh7aqLwy3snpD6_MrDz1q8n5XUk/edit
Basic Usage
From the command line type seamore help
to show basic instructions and a list of commands. Type seamore help <COMMAND>
to get more detailed help for a specific command.
Configuration File
Process a configuration file to create CMOR ready Fesom outout using the process
command:
seamore process exampleconfig.seamore
The configuration file consists of 4 parts of ruby-DSL code for seamore.
Part 1, the CMOR Data Request and other global information:
cmip6_cmor_tables "01.00.27", "/path/cmip6-cmor-tables/Tables"
cmip6_cvs_dir "/path/CMIP6_CVs"
merge_years_step 1
version_date 2018, 12, 18 # YYYY, MM, DD
source_id "AWI-CM-1-1-MR"
grid_description_file "/pool/glob/griddes.nc"
Note, the merge_years_step
joins multiple years to a single file. Each new file will start at year xxx1, i.e. if files for the years 1950,1951,1952 are merged with merge_years_step 2
, the resulting files will range from 1950–1950 and 1951–1952.
Part 2, one or multiple blocks for each experiment id, i.e. historical, piControl, abrupt-4xCO2
experiment_id "piControl" do # name from controlled vocabularies, multiple expreiment_id blocks are allowed in a single file
indir "/path/fesom_output", 2401, 2450 # optionally limit input to a range of years
outdir "/path/seamore_generated"
variant_label "r1i1p1f1" # see external Global Attributes document
parent_variant_label "r1i1p1f1" # must be omitted if there is "no parent" in the controlled vocabularies
parent_first_year 1901 # must be omitted if there is "no parent" in the controlled vocabularies
branch_year_in_parent 2401
end
Part 3, steps of the processing pipeline
Remove or comment individual lines to skip a specific step.
cmorize_defaults do
mergefiles
time_seconds_to_days
auto_insert_time_bounds
auto_downsample_frequency
auto_convert_unit
apply_grid
set_global_attributes
set_local_attributes
fix_cf_names
compress
apply_cmor_filename
end
Part 4, the variables to create during cmorization
The following lines can be generated from the match_available
command, i.e.
seamore match_available /path/cmip6-cmor-tables/Tables /path/fesom_output
If variables can be found in the data request tables but have a different unit, match_available
will list them as a comment and the mapping has to be entered manually. If seamore is able to automatically to the unit conversion, it can still generate the according cmorized file.
The format of the cmorize
lines is:
<actual Fesom variable name>_<actual Fesom variable frequency> => [<cmore variable name>_<target table name>]
The frequencies and table names are part of the CMOR Tables and Controlled Vocabularies. If multiple output tables (frequencies) are requred for the same input variable, use a comma separated list for the output variables: cmorize volo_mon => [volo_Omon, volo_Odec]
Current automatic unit conversion includes:
psu to 0.001
psu2 to 1e-06
W/m^2 to W m-2
1.0 to 1
1 to %
1.0 to %
K to degC
Code fore these conversions belongs to the AUTO_CONVERT_UNIT
class, currently in the step.rb
file. github.com/FESOM/seamore/blob/01e321231cf976b3459c61107853134dcc8412a7/lib/step.rb#L207
cmorize evs_mon => [evs_Omon]
cmorize fsitherm_mon => [fsitherm_Omon]
cmorize hfds_mon => [hfds_Omon, hfds_Odec]
cmorize mlotst_day => [mlotst_Eday, mlotst_Omon]
cmorize omldamax_day => [omldamax_Oday]
cmorize opottemptend_mon => [opottemptend_Emon, opottemptend_Oyr]
cmorize pbo_mon => [pbo_Omon]
cmorize prsn_mon => [prsn_Omon]
cmorize rsdo_mon => [rsdo_Omon] # NO match: rsdo 'W/m^2' mon (!! rsdo 'W m-2' exists in datarequest)
cmorize siarean_mon => [siarean_SImon]
cmorize siareas_mon => [siareas_SImon]
cmorize sidmassevapsubl_mon => [sidmassevapsubl_SImon]
cmorize sidmasssi_mon => [sidmasssi_SImon]
cmorize sidmassth_mon => [sidmassth_SImon]
cmorize siextentn_mon => [siextentn_SImon]
cmorize siextents_mon => [siextents_SImon]
cmorize sifllatstop_mon => [sifllatstop_SImon]
cmorize sisnconc_mon => [sisnconc_SImon] # NO match: sisnconc '1' mon (!! sisnconc '%' exists in datarequest)
cmorize sisnmass_mon => [sisnmass_SImon]
cmorize sisnthick_mon => [sisnthick_SImon]
cmorize sispeed_mon => [sispeed_SImon]
cmorize sistrxdtop_day => [sistrxdtop_SImon]
cmorize sistrxubot_day => [sistrxubot_SImon]
cmorize sistrydtop_day => [sistrydtop_SImon]
cmorize sistryubot_day => [sistryubot_SImon]
cmorize sithick_day => [sithick_SIday, sithick_SImon]
cmorize sitimefrac_day => [sitimefrac_SIday, sitimefrac_SImon] # NO match: sitimefrac '1.0' day (!! sitimefrac '1' exists in datarequest)
cmorize siu_day => [siu_SIday, siu_SImon]
cmorize siv_day => [siv_SIday, siv_SImon]
cmorize sivol_mon => [sivol_SImon]
cmorize sivoln_mon => [sivoln_SImon]
cmorize sivols_mon => [sivols_SImon]
cmorize so_day => [so_Omon, so_Odec] # NO match: so 'psu' day (!! so '0.001' exists in datarequest)
cmorize soga_mon => [soga_Omon, soga_Odec] # NO match: soga 'psu' mon (!! soga '0.001' exists in datarequest)
cmorize sos_day => [sos_Oday, sos_Omon, sos_Odec] # NO match: sos 'psu' day (!! sos '0.001' exists in datarequest)
cmorize tauuo_day => [tauuo_Omon, tauuo_Odec]
cmorize tauvo_day => [tauvo_Omon, tauvo_Odec]
cmorize thetao_day => [thetao_Omon, thetao_Odec]
cmorize thetaoga_mon => [thetaoga_Omon, thetaoga_Odec]
cmorize tos_day => [tos_Oday, tos_Omon, tos_Odec]
cmorize tso_3hrPt => [tos_3hr] # NO match: tso 'K' 3hrPt
cmorize uo_mon => [uo_Omon, uo_Odec]
cmorize vo_mon => [vo_Omon, vo_Odec]
cmorize volo_mon => [volo_Omon, volo_Odec]
cmorize wfo_mon => [wfo_Omon, wfo_Odec]
cmorize wo_mon => [wo_Omon, wo_Odec]
cmorize zos_day => [zos_Omon]
cmorize zossq_mon => [zossq_Omon]
Misc
Currently seamore utilizes the following commands to do the file conversion. See github.com/FESOM/seamore/blob/01e321231cf976b3459c61107853134dcc8412a7/lib/file_command.rb#L5C11-L5C11
cdo
ncn
ncks
nccopy
ncatted
ncrename
Aborted Jobs
If a seamore process
job has been killed or ctrl-c
ed, it can be resumed! Manually delete all *.inprogress
from the output directoy and re-run the original seamore process
command to resume cmorization.
Tab Auto Completion
There is an autocompletion file for bash and zsh which can besourced to get autocomplete: env/complete.sh
Install Notes
Automatic installation
If the environment scripts are already available for the machine you are using you'll be able to run the following to automatically install seamore and build its dependencies:
./configure.sh
Before starting using bin/seamore
make sure you execute:
source env.sh
Machines that support this type of installation - Albedo
In case there is no ruby available
cd /path
wget https://cache.ruby-lang.org/pub/ruby/2.6/ruby-2.6.4.tar.gz
tar -xf ruby-2.6.4.tar.gz
mkdir ruby-2.6.4_bin
cd ruby-2.6.4
./configure --disable-install-doc --prefix=/path/ruby-2.6.4_bin
make -j `nproc --all`
make install
Manual installation of seamore
cd /path/seamore && bundle install
# if bundle install fails, we only need gli
gem install --install-dir /path/rubygems/ gli
export GEM_PATH=$GEM_PATH:/path/rubygems/
One can add the seamore module file to .zshrc, .bashrc like export MODULEPATH=/path/modules:$MODULEPATH
Then to use seamore:
module load seamore
Unit Tests
seamore contains a set of carefully written unit tests. To run:
cd src
ruby test/test.rb