"Integration" with Open Science Data Federation #120

Open
yarikoptic opened this issue Nov 22, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@yarikoptic
Member

yarikoptic commented Nov 22, 2023

Proposed change

Yet to be figured out, but a user asked:

Please clarify whether data saved in DANDI can be reached and used through OSDF.
OSDF: https://osg-htc.org/services/osdf

From https://osg-htc.org/docs/data/stashcache/overview/: The OSG operates the Open Science Data Federation (OSDF), which provides organizations with a method to distribute their data in a scalable manner to thousands of jobs without needing to pre-stage data at each site.

Some caching solutions are part of the framework: https://osg-htc.org/docs/data/stashcache/install-cache/

Overall, it sounds like a solution for making local data resources (public or private) available to remote compute resources through a unified interface, with some caching to make it efficient. So, if someone tackles it, I guess OSDF users of DANDI would benefit from:

  • uniform access to DANDI data via OSDF interfaces
  • caching
  • maybe some kind of resource discovery

I am not yet sure where to seek clarification, or what it would entail for a DANDI resource to expose itself as part of OSDF. An existing example of a public resource exposed to OSDF could have provided a template to follow. Searching GitHub led to https://github.com/PerilousApricot/docs/blob/de67fc125efb334ca67b011bde179689f035ffda/docs/data/stashcache/vo-data.md?plain=1#L3 so I will email help@opensciencegrid.org to ask whether such an integration makes sense and whether there are examples.

Alternative options

Regular means of access: directly from S3, via the DANDI API, via datalad/git-annex, or via dandi-cli.
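For reference, a minimal sketch of what "via DANDI API" could look like with only the Python standard library. The endpoint layout and the `results`/`path` field names reflect my understanding of the public DANDI REST API and should be checked against its documentation; `assets_url`, `asset_paths` and `fetch_page` are hypothetical helper names, not part of any DANDI tool.

```python
import json
import urllib.request

# Assumption: root of the public DANDI REST API.
API_ROOT = "https://api.dandiarchive.org/api"


def assets_url(dandiset_id: str, version: str = "draft") -> str:
    """Build the URL of the (paginated) asset listing for one dandiset version."""
    return f"{API_ROOT}/dandisets/{dandiset_id}/versions/{version}/assets/"


def asset_paths(page: dict) -> list:
    """Extract asset paths from one decoded page of the listing.

    Assumes a paginated JSON body with a "results" list whose items
    carry a "path" key.
    """
    return [asset["path"] for asset in page.get("results", [])]


def fetch_page(url: str) -> dict:
    """Fetch and decode one page of JSON (performs a network request)."""
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


# Usage (needs network access):
#   page = fetch_page(assets_url("000027"))
#   print(asset_paths(page))
```

Each of the listed options needs its own client-side logic like this, which is part of why a single uniform interface might be attractive.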

Who would use this feature?

Users of OSDF

(Optional): Suggest a solution

yet to be figured out if there is a problem to be solved ;)

@yarikoptic yarikoptic added the enhancement New feature or request label Nov 22, 2023
@yarikoptic
Member Author

Current approach: develop an efficient WebDAV interface, which could then serve as an already supported "standard" protocol on top of which to implement the caching etc. that OSDF wants. The current WebDAV "frontend" is developed at https://github.com/dandi/dandidav and deployed at https://dandi.centerforopenneuroscience.org
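To illustrate what any generic WebDAV client (or a cache sitting in front of it) would do against such a frontend, here is a minimal sketch using only the Python standard library. It builds a `PROPFIND` request with a `Depth: 1` header and extracts the `href` entries from a `207 Multi-Status` body. The helper names are hypothetical, and the exact collection layout served by dandidav is not assumed here.

```python
import urllib.request
import xml.etree.ElementTree as ET

# WebDAV elements live in the "DAV:" XML namespace.
DAV_NS = {"d": "DAV:"}


def build_propfind(url: str, depth: int = 1) -> urllib.request.Request:
    """Build (but do not send) a PROPFIND request for a collection URL."""
    return urllib.request.Request(
        url,
        method="PROPFIND",
        headers={"Depth": str(depth)},
    )


def parse_multistatus(xml_body) -> list:
    """Return the href of every resource in a 207 Multi-Status body."""
    root = ET.fromstring(xml_body)
    return [
        href.text
        for href in root.findall("d:response/d:href", DAV_NS)
        if href.text
    ]


def list_collection(url: str) -> list:
    """List a WebDAV collection (performs a network request)."""
    with urllib.request.urlopen(build_propfind(url)) as resp:
        return parse_multistatus(resp.read())


# Usage (needs network access):
#   print(list_collection("https://dandi.centerforopenneuroscience.org/"))
```

Since only standard WebDAV requests are involved, nothing on the client side would need to know about DANDI specifically.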
