Review
Quiz yourself on the concepts learned in this section. |
-
Why is it best to clean your data?
-
to make them as fit for use as possible
-
to achieve your data quality goals
-
data should be cleaned by the users, not the providers
-
-
How should you organize your data cleaning workflow?
-
work alone, you know your data best
-
ask your colleagues for expertise
-
work at an institutional level to harmonize data quality workflows
-
-
Which is best:
-
prevent errors from occurring
-
correct errors as soon as you find them in your database or spreadsheet
-
not cleaning errors but documenting them as you go, so people who reuse your data know where they are
-
-
Whose responsibility is data quality?
-
The person(s) who record data on the field
-
The data transcribers
-
The database manager
-
Everyone involved in the management of data
-
The people who use your data
-
GBIF
-
-
Which tools can be used to clean your data ?
-
Excel & other spreadsheets management tools
-
OpenRefine
-
Your database software
-
Online tools such as Scientific Names Resolver or Google Maps
-
← Exercise tips | ↑ Data management | ⌂ Home | Data publishing →