iSamples Data Import using Flat Files and a GitHub Repository

iSamples Flat File Data Format

The iSamples flat file data format is backed by a frictionlessdata.io schema. The schema is human-readable and is located in the iSamples GitHub. From an end-user perspective, the workflow for adding records to iSamples has four steps:

Clone the template repository from the iSamples GitHub.
Add rows to the simple_isamples.csv file. (If you are using one of the other file formats, you'll commit that file instead.)
Push the changes to GitHub. The push triggers a workflow that generates a harvestable sitemap from the repository.
Once the GitHub workflow is complete, you can look at the gh-pages branch of the repository to see the sitemap.
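
The "add rows" step can also be scripted. The following is a minimal sketch using only the Python standard library; the helper name append_record is hypothetical and is not part of the iSamples tooling. It reads the column order from the file's existing header instead of assuming any particular column names.

```python
import csv
import io
from pathlib import Path


def append_record(csv_path, record):
    """Append one record to a CSV file, ordering values by the file's existing header.

    Hypothetical helper for illustration; not part of the iSamples tooling.
    """
    path = Path(csv_path)
    text = path.read_text(encoding="utf-8")
    header = next(csv.reader(io.StringIO(text, newline="")), None)
    if not header:
        raise ValueError("file has no header row")
    # Refuse keys that the header does not define, rather than silently dropping them.
    unknown = sorted(set(record) - set(header))
    if unknown:
        raise ValueError(f"columns not in header: {unknown}")
    with path.open("a", encoding="utf-8", newline="") as handle:
        # Avoid gluing the new row onto an unterminated last line.
        if not text.endswith("\n"):
            handle.write("\n")
        writer = csv.DictWriter(
            handle, fieldnames=header, restval="", lineterminator="\n"
        )
        writer.writerow(record)
```

A call might look like append_record("simple_isamples.csv", {"id": "B"}), where "id" is a placeholder column name; the real column names come from the schema. Columns missing from the record are written as empty cells.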

Supported file formats

Since we are built on top of frictionlessdata, we technically support any file format that it supports. However, the formats that we have tested in the iSamples repository are:

csv
tsv
xlsx
xls

Adding a new format should be relatively straightforward, so just ask if the one you need isn't already covered.
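
As a quick local check before committing, a file's extension can be compared against the tested formats listed above. This is only a sketch: the names TESTED_FORMATS and is_tested_format are hypothetical, and the check looks at the extension alone, not the file contents.

```python
from pathlib import Path

# The four formats this README lists as tested.
TESTED_FORMATS = {"csv", "tsv", "xlsx", "xls"}


def is_tested_format(filename):
    """Return True if the file extension is one of the tested formats (case-insensitive)."""
    suffix = Path(filename).suffix.lower().lstrip(".")
    return suffix in TESTED_FORMATS
```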

Implementation Details

The template repository has a GitHub workflow that:

Checks out the iSamples in a Box repository
Installs the Python environment
Runs the validate command on isb_things.py
If the file passes validation, runs the sitemap command on isb_things.py and publishes the output to a new sitemaps directory.
Publishes that sitemaps directory to the gh-pages branch of the target repository.
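
The control flow of these steps, where validation gates sitemap generation and publishing, can be sketched as below. This is not the actual workflow definition: run_import and its three callable parameters are hypothetical stand-ins for the validate command, the sitemap command, and the gh-pages publish step.

```python
def run_import(data_file, validate, build_sitemap, publish):
    """Validate first; build and publish the sitemap only when there are no errors.

    validate, build_sitemap and publish are hypothetical callables standing in
    for the real workflow steps.
    """
    errors = validate(data_file)
    if errors:
        # Abort: no sitemap is generated or published for an invalid file.
        return {"published": False, "errors": errors}
    sitemap_dir = build_sitemap(data_file)
    publish(sitemap_dir)
    return {"published": True, "errors": []}
```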

Validation Errors

If a pushed file doesn't pass validation, the workflow is aborted, the sitemap isn't generated, and the errors are available on GitHub for inspection. Each error has a code and a message that locates the problem, for example:

code             message
---------------  ----------------------------------------------------------------------
missing-label    There is a missing label in the header's field "label" at position "2"
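
To illustrate what an error like this reports, the sketch below compares a file's header row against a list of expected field names and emits a missing-label entry for every expected field the header is too short to cover. It is an approximation for illustration only, not the validator's actual logic; the function name missing_label_errors and the field name "id" are assumptions, while "label" comes from the example above.

```python
def missing_label_errors(header, expected_fields):
    """Return (code, message) pairs for expected fields that have no header cell.

    Illustrative approximation only; positions are 1-based to match the example.
    """
    errors = []
    for position, name in enumerate(expected_fields, start=1):
        if position > len(header):
            message = (
                "There is a missing label in the header's field "
                f'"{name}" at position "{position}"'
            )
            errors.append(("missing-label", message))
    return errors
```

With a header of ["id"] and expected fields ["id", "label"], the single error returned matches the message in the example.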

isamplesorg/csv_import_test

Folders and files

Latest commit

History

Repository files navigation

iSamples Data Import using Flat Files and a GitHub Repository

iSamples Flat File Data Format

Supported file formats

Implementation Details

Validation Errors

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages