I say to you….slow down, smell the agar plates, take the time to document your data. Your future self will thank you, profusely.
Proper documentation provides the context that your data needs to persist through time, to integrate into new systems and to give you credit for your contributions in the form of data citations. Where possible, you should consider contributing the following information along with your dataset.
- Complete Metadata for description and reuse– Descriptive metadata is information such as title of work, creator name, data submitted and description. This information can be searched to increase discoverability of your content. If you are following a suggested schema for your discipline, indicate that in the description or in the next document – the README file.
- README.txt File – This is a text document that provides relevant information such as purpose of the project and the organizational structure or relationship of the files. It explains terms that are unique to the dataset, keywords, omissions and errors. If you are using a file naming convention, you can explain it in the readme file. It is also the place to put additional details that may not be included in the descriptive metadata, such as additional information about external storage of the data, metadata schema followed or researcher contact information.
A good example can be found in the Data ab Initio blog post README.txt by Kristin Briney (http://dataabinitio.com/?p=378)
- Data Dictionary/Codebook – This document explains all the variables and abbreviations associated with the dataset.
- Methodology/Protocol – This document is the details steps taken to collect the data. If you are submitted modified data instead of a raw data set, you can explain those steps in this document.
More tips at the Love Your Data website to help keep you write that love note to the future. Follow the event on Twitter at #LYD16.
Amy Koshoffer is the science informationist for UC Libraries based in the Geology Math and Physics library .
- ICPSR: http://www.icpsr.umich.edu/icpsrweb/content/deposit/guide/index.html
- DataOne https://www.dataone.org/best-practices
- Mantra http://datalib.edina.ac.uk/mantra/
- DataDryad FAQ: http://datadryad.org/pages/faq
- Digital Curation Centre website: http://www.dcc.ac.uk/resources/how-guides
- Dataabinitio Blog by Kristin Briney