Creating a new volume
What stage is my project
New Project
Excellent! BEFORE collecting data is the best time to start planning for curation of your project. In this case, you would utilize the upload as you go method of data curation.
Complete Steps 1-5, 7, and 8 Now.
After each data collection session, follow the directions for Enter Data Individually in Step 6.
Ongoing
Completed
The following items need to be considered to prepare for upload:
- Do you have (at least some of) participants’ permission to share their data with other authorized Databrary investigators? If not, who is allowed access to the videos?
- Do you have IRB permission to share the data on Databrary?
- All video tapes need to be digitized
- How are the data organized? Will these need to be re-organized to utilize the Databrary Spreadsheet structure?
Step 1: Login/Go to Profile
Click the pink Login
button in the middle of the Databrary homepage to sign into your Databrary account.
Click on Your Profile
Step 2: Begin Create Volume
Click on Create Volume
to the right of Data Owner Volumes
There are multiple tabs on the left side of the screen that need to be completed to set up a new volume. These tabs need to be completed in order.
Step 3: Enter Dataset Overview
Volume Overview is created by choosing if the volume is related to a published resource and answering the related questions (Title, Description, Internal Short Name and additional citation information if necessary).
Please describe the context of data collection and the types of data that will be uploaded (e.g. video, pictures, coding files, survey data, etc.). The recommended template is provided. Please be sure to fill in all of the brackets with the relevant information for your volume.
This dataset focuses on [Provide provide a description of behavor, location of study, and age range]. Dataset includes [Enter description of videos, questionnaires, materials, etc.]
Videos were [Provide provide a general description of what the videos were coded for. (Some examples may include: transcribed for mother speech and child vocalizations; coded for mother and child object interaction; child locomotion and emotion).] (see Materials folder).
This volume contains [Enter list the types of files along with approximate numbers for each. Some examples are provided below]:
- XX videos (MP4/MOV files)
- XX Datavyu transcription files (OPF files)
- XX Datavyu coding files (OPF files)
- Questionnaire data (.xls file)
- Several video exemplars of child behaviors (.mp4 files)
- Transcription manual (.docx file)
- Coding manual (.docx file)
- Scripts for processing and exporting data from Datayu (Ruby)
- Materials folder containing stimuli, copies of questionnaires used in study, additional participant demographic information, and other materials utilized for the study.
- Exemplar study intake video of experimenter with participant
If you need this data to be associated with a DOI, please answer YES
to the question Does this volume correspond to a published paper or resource
When all items are entered, select Save
You may have the option to enter Keywords associated with the Volume. See below for description.
Step 4: Manage Access
Next, go to Manage Access to choose volume sharing options and add collaborators (faculty, staff, students) that already have an account on Databrary that is Sponsored by an Institution or an Authorized Investigator.
Click Save
when done.
Please be sure to pick the appropriate sharing setting, Private, Share Volume Overview Only or Share Entire Volume. The default sharing setting is Private.
Private: keeps the volume private to you and your chosen affiliates and collaborators
Share Volume Overview Only: This will only share the Title and description of the volume exactly as it is entered in the Overview Tab
Shared:
Please note: Only the Authorized Investigator who created this volume can ‘Share Entire Volume’
Step 5: Design Spreadsheet
Now, go to Design Spreadsheet and choose all the categories and sub-categories you would like to include in your spreadsheet for participant/session metadata.
Definitions for all Spreadsheet Options
participant: An individual human subject whose data are used or represented
ID
A unique, anonymized, primary identifier, such as a participant ID.
info
Other information or alternative identifier
birthdate
Date of birth (used with session date to calculate age)
description
A longer explanation or description (‘description’ is NOT sortable through the Databrary volume filters)
gender
“Male”, “Female”, or any other relevent gender
race
As classified by NIH, or user-defined classification
ethnicity
As classified by NIH (Hispanic/Non-Hispanic), or user-defined classification
gestational age
Pregnancy age in weeks between last menstrual period and birth (or pre-natal observation)
pregnancy term
“Full term”, “Preterm”, or other gestational term (assumed “Full term” by default)
birth weight
Weight at birth (in grams, e.g., 3250)
disability
Any developmental, physical, or mental disability or disabilities (assumed “Typical” by default)
language
Primary language(s) spoken by and to participant (assumed “English” by default)
country
Country where participant was born (assumed “US” by default)
state
State/territory where particpiant was born
setting
The physical context of the participant (please do not use for new data: see the context category instead)
pilot: Indicates that the methods used were not finalized or were non-standard
name
A label or identifier referring to the pilot method (This is the only field in the “pilot” section that is sortable through the Databrary volume filters)
description
A longer explanation or description of the pilot method
exclusion: Indicates that deta were not usable
name
A label or identifier referring to the exclusion criterion (This is the only field in the “exclusion” section that is sortable through the Databrary volume filters)
reason
The reason for excluding these data
description
A longer explanation or description of the reason for excluding data
condition: An experimenter-determined manipulation (within or between sessions)
name
A label or identifier referring to the exclusion criterion (This is the only field in the “label” section that is sortable through the Databrary volume filters)
reason
The reason for excluding these data
description
A longer explanation or description of the reason for excluding data
group: A grouping determined by an aspect of the data (participant ability, age, grade level, experience, longitudinal visit, measurements used/available)
name
A label or identifier referring to the exclusion criterion (This is the only field in the “group” section that is sortable through the Databrary volume filters)
reason
The reason for excluding these data
description
A longer explanation or description of the reason for excluding data
task: A particular task, activity, or phase of the session or study
name
A label or identifier referring to the exclusion criterion (This is the only field in the “task” section that is sortable through the Databrary volume filters)
reason
The reason for excluding these data
description
A longer explanation or description of the reason for excluding data
context: A particular setting or other aspect of where/when/how data were collected
name
A label or identifier referring to the exclusion criterion
setting
The physical context
language
Language used in this context (assumed “English” by default)
country
Country of data collection (assumed “US” by default)
state
State/territory of data collection
Step 6: Enter Data
Now it is time to Enter Data. There are two options:
Upload a participant metadata file.
See video instructions or follow the step by step instructions below.
What is an appropriately formatted .csv file?
- ensure that the Header row is exactly the same as the Spreadsheet Preview in the
Design Spreadsheet
tab.
so this .csv file would look like:
Now it is time to upload the participant .csv file.
Click the participant
tab. Then Choose File
to navigate to the appropriate file. Finally click Upload
.
You will see a window that summarizes the data uploaded and will point out any potential errors (e.g. wrong header names, missing data)
Please review these data
Then click Yes
.
Now the uploaded data will show like this.
After the participant metadata is uploaded, the data regarding the testing session and any relevant filed get uploaded individually.
See video instructions or follow the step by step instructions below.
Click on the folder
tab.
Click on add folder
and choose to add a session or materials folder.
Session folders are for participant sessions and materials folders are for thing related to the study such as surveys used, sample consent forms, and stimuli.
Session folders need to be added individually. Once the Participant ID is entered, the related metadata will be filled in. Then the rest of the session information will need to be added individually.
Enter Data Individually (the upload as you go method)
- Click inside the ID box to add a participant.
- Complete each metadata item selected to create the participant, then
- Add files related to that participant
Step 7: Add Links
Please share more information in your Databrary Volume using Add Links. Here you will want to add links to any outside resources related to this dataset (e.g. published paper, github repository)
Step 8: List Funding
Finally, please share your funding source for your volume in List Funding. If your funding source is not listed here, please email Databrary Staff at contact@databrary.org to have it added.
Volume Tags vs. Keywords
What is the difference between a Tag and a Keyword?
Tag - Anyone can tag a volume Keyword - Only Volume Owners/Authors/Investigators can enter keywords
How do Tags and Keywords affect the search function?
Tags can be searched for on the Databrary Search Page
Keywords are for??? Maybe these are the same keywords that would have been used in the associated Journal article.
Can I tag a Session?
This appears to be possible, but the search function cannot present results by session. Therefore only Volume Tags are currently searchable.