Biodiversity data: From data collection to publication
Data lifecycle, FAIR/CARE principles, Metadata, Data Management Plan, Data paper
The objective of this five-day course, co-organized with the PNDB (Pôle National de Données de Biodiversité) and GBIF France (Global Biodiversity Information Facility for France), is to 1) contextualize the issues surrounding the understanding, sharing and (re)use of biodiversity data and metadata, and 2) enhance the skills of communities involved in one or more stages of the data cycle.
This course is delivered in French and takes place in November at CESAB’s premises in Montpellier. The fee is 250 € for the week, including lunch. Travel, accommodation, and evening meal costs are the responsibility of the participants.
Proficiency in R software is required.
List of speakers:
- Anne-Sophie ARCHAMBEAU (GBIF France)
- Nicolas CASAJUS (FRB-CESAB)
- Camille COUX (FRB-CESAB)
- Yvan LE BRAS (PNDB)
- Olivier NORVEZ (Data Terra)
- Sophie PAMERLON (GBIF France)
The training will focus on lessons and exercises during the first four days, with small-group projects scheduled for the final day.
General context
- Current challenges in biodiversity data
- The data ecosystem
- Reproducibility concepts
- What is data/metadata?
- Major types of biodiversity data
- Framework and good practices (data lifecycle, FAIR, etc.)
- Data management plan
Data acquisition
- Best practices for data collection
- Major biodiversity databases
- Major environmental databases
- Data acquisition: Web portals, API & web scraping
Data management
- Structure, formats & files
- Processing & cleansing data
- Introduction to OpenRefine
- Virtual Research Environments
DataSHARE projects
- FRB-CESAB DataSHARE projects
- Zoom on two groups
Legal aspects
- Sharing, licensing, etc.
Opening data
- (Meta)data: standards
- Storage and archiving
- Dissemination and sharing
- Data & software paper