Skip to content

Commit

Permalink
Update files.md
Browse files Browse the repository at this point in the history
  • Loading branch information
ddooley authored Nov 13, 2024
1 parent 8dff95a commit c4ab6fa
Showing 1 changed file with 8 additions and 0 deletions.
8 changes: 8 additions & 0 deletions docs/Data_Standardization/files.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,6 +14,11 @@ nav_order: 7
1. TOC
{:toc}

### File standards ###

#### File Character sets ###: Data "serialized" into a text file will be encoded as strings characters from a character set which may include accents etc. A popular [UTF-8](https://en.wikipedia.org/wiki/UTF-8) standard (used to encode most web pages) includes character encodings that cover many languages and [dingbats](https://en.wikipedia.org/wiki/Dingbat) to boot! Sadly software often has to guess what encoding an input file has, and some versions of programs like [MS Excel](https://support.guidebook.com/hc/en-us/articles/360016372414) have their own coding, leading to confusion in translation.


## Why standardize file names?

Standardizing file naming conventions helps researchers better organize their own work and collaborate with others.
Expand All @@ -27,6 +32,9 @@ Benefits include:

All research projects for the Genome Canada's Climate-Smart Agriculture and Food Systems Initiative have created a [Data Management Plan (DMP)](../datamanagementplan.md) using the [DMP Assistant of Portage](https://dmp-pgd.ca/). This DMP typically includes recommended file naming protocols for each research project.




## Recommendations

Briney, Kristin A. 2020. “File Naming Convention Worksheet”. June 2. [https://doi.org/10.7907/894q-zr22](https://doi.org/10.7907/894q-zr22).
Expand Down

0 comments on commit c4ab6fa

Please sign in to comment.