Length of Datasets #147

Open
alexborowiak opened this issue Apr 12, 2021 · 1 comment

Comments

@alexborowiak

Hi,
It would be nice to be able to get the length of the model run (the time period covered) for each dataset. For the project I am working on now, I need only extended-length runs (e.g. models that have been run to 2300).

@paolap
Member

paolap commented Apr 12, 2021

Hi Alex,

I'm tagging this as an enhancement; I assume you mean having an extra query option at the command line.
If you use clef by importing the modules in your code (https://clef.readthedocs.io/en/stable/code.html#examples), you can get three extra fields in your query results:
fdate, tdate, and a True/False flag called time_complete, which indicates whether the files constitute an "unbroken" time series.

df
                                                    project  institute   model  ...     fdate     tdate  time_complete
path                                                                            ...
/g/data/al33/replicas/CMIP5/combined/MIROC/MIRO...    CMIP5      MIROC  MIROC5  ...  20060101  21001231           True

You could use the results to perform another query based on tdate >= 23000101 or something similar in your case.
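
As a rough sketch of that second step, the filter could look like the snippet below. It does not call clef at all: the DataFrame is built by hand as a stand-in for the query results, and the paths, the second row, and the helper name extended_runs are all invented for illustration. It also assumes fdate and tdate come back as YYYYMMDD strings (or integers), which may not hold for every dataset.

```python
import pandas as pd

# Hypothetical stand-in for the DataFrame returned by a clef query.
# Column names follow the output shown above; the paths and the second
# row are made up for illustration only.
df = pd.DataFrame(
    {
        "project": ["CMIP5", "CMIP5"],
        "institute": ["MIROC", "EXAMPLE"],
        "model": ["MIROC5", "EXAMPLE-MODEL"],
        "fdate": ["20060101", "20060101"],
        "tdate": ["21001231", "23001231"],
        "time_complete": [True, True],
    },
    index=pd.Index(["/hypothetical/path/a", "/hypothetical/path/b"], name="path"),
)


def extended_runs(results, end_date=23000101):
    """Keep rows whose time series is unbroken and reaches end_date.

    Assumes tdate is a YYYYMMDD string or integer (an assumption, not
    something the clef documentation is quoted on here).
    """
    tdate = results["tdate"].astype(int)
    complete = results["time_complete"].astype(bool)
    return results[(tdate >= end_date) & complete]


long_runs = extended_runs(df)
print(long_runs[["model", "fdate", "tdate"]])
```

Comparing the dates as integers works because YYYYMMDD values sort the same way numerically as chronologically. Requiring time_complete as well avoids picking up runs that reach 2300 but have gaps.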

I will look at ways to offer this on the command line. We discussed this previously, but the CMIP5 data is so irregular that it introduced quite a few exceptions.
Thanks for your suggestion.
