The library offers a range of helpful services. All of our appointments are free of charge and confidential.
This guide was designed to benefit U of G researchers and outlines the best practices needed for organizing research data. Whether you are a faculty member, staff, or student, by the end of this guide you should be able to:
Before sharing and preserving your project's data, you need to decide what content will be included in your data files. Consider whether to include raw data, analysis data, or a subset of your data. If there is sensitive or personally identifying information, ensure you have Research Ethics Board (REB) approval and informed consent from participants to share/deposit these data and anonymize the data if necessary. If using secondary data, ensure you have the right to share, redistribute, and deposit it.
Consider data structure throughout the research life cycle to enhance readability, accessibility, and compatibility with different software, promoting long-term usability.
When working with spreadsheet data, it is important to use rectangular format with only columns and rows, excluding any dynamic or linked content such as charts, formulas, and inter-worksheet links.
The example below depicts an unstructured data table with structural issues including:
The example below depicts a structured data table presented in rectangular format. The data table includes:
Files/folders should be named consistently and descriptively to promote effective browsing and retrieval. Names should reveal file and folder content and provide context.
When naming your folders and files:
An example of a filename convention:
Where ‘YYYYMMDD’ is a standard date format, ‘Content Description’ is a file description, ‘Version’ is versioning information, and ‘ext’ is the file extension.
Your folder directory structure should maximize clarity and discoverability of different types of files. Keep it simple, limit the directory structure to no more than four levels and ten or less sub-folders within each level.
To promote future readability, save files in open-source, well-documented, software-agnostic formats that are likely to remain usable over the long term (e.g., TXT, CSV, JPEG, MP3, MP4, etc.). Additionally, save and store files in uncompressed and unencrypted format.
Please refer to the U of G Research Data Repositories Data Deposit guide for recommended file formats that promote long-term access and preservation of research data.
If your Excel files have important formatting needed to support interpretation and reuse of the data, keep the original Excel file, but save a copy in a preservation friendly format.
If you are depositing your Excel files in a repository, it is recommended that multi-sheet workbooks be broken down into separate worksheet files before depositing.
Robust metadata is essential for understanding and reusing data. It covers all aspects of the data including:
When depositing data to a repository, if required, file restrictions can limit access to designated users or upon request. Alternatively, embargoes can be used to temporarily restrict access, such as before manuscript publishing or for patent/commercialization purposes.
Clearly define the terms of use for the data so that potential users may understand how they can utilize the data.
When depositing data, you can select the terms based on its copyright status, like a CC0 Public Domain Dedication waiver, a Creative Commons license, or a customized data use agreement.
Check the Licensing your dataset in the U of G Research Data Repositories guide for details on choosing terms.
Supplemental documentation provides all necessary information for comprehending, evaluating, analyzing research data, and for reproducing research results without contacting the data creator. It can include a readme file, a codebook, a data dictionary, user guide, commented script file and a data management plan (DMP). You can find a Readme template in the How to deposit research data in the University of Guelph Research Data Repositories guide.
Supplemental documentation should:
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.