Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add functionality to load/save distisets to/from disk (#673)
* Add functionality to load/save distisets to/from disk * Add tests for saving/loading distiset from disk * Add functionality to load/save distisets to/from disk * Update docs * Include code blocks from Examples in docstrings * Add tests for the dataset card * Fix call to yaml.safe_load found in code review * Copy path movements from hugging face load_from_disk definition * Add universal_pathlib dependency to better deal with remote paths when calling Distiset.load_from_disk * Fix download of distiset and add option to write the data to a user specified dir * Remove parameter in test as it isn't really tested with a remote filesystem * Remove unnecessary markdown extension and fix type from variables * Update src/distilabel/distiset.py Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> * Update src/distilabel/distiset.py Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> * Update src/distilabel/distiset.py Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> * Update src/distilabel/distiset.py Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com> * Cast Path to str --------- Co-authored-by: Gabriel Martín Blázquez <gmartinbdev@gmail.com>
- Loading branch information