Exercise 3

Data management

The data has now been compiled into a spreadsheet by volunteers from the Copenhagen Ornithological Society. Taking the role of the Ornithology Curator in the Bird Department, you have been assigned responsibility for the quality of the dataset.

Through retrospective georeferencing, coordinates have been added to the dataset along with the locality, but no other higher geography. Since all the observations were made in Denmark, continent and country can easily be added. In addition, only the scientific name was provided; higher taxonomy can be derived using software tools such as OpenRefine. You are also aware that the digitizers introduced typographical errors.

  1. Download UC3-DL-3-ForCleaning.zip. (45 KB)

  2. Identify and correct any invalid years.

  3. Verify and correct taxonomy.

  4. Verify coordinates are correct for the two given localities. Correct any that are not. Coordinates should be in decimal format.

  5. Add any data for missing elements that can be derived using the available data.

  6. Remember to keep the original information provided and document your changes and assumptions as part of the individual records and the metadata.

  7. Use the exercise sheet to provide your answers and submit the spreadsheet.
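
For step 4, a coordinate recorded in degrees, minutes and seconds can be converted to decimal format and checked for plausibility. The following is only a minimal sketch: the function names, the example coordinate and the rough bounding box for Denmark (latitude 54 to 58, longitude 8 to 16) are assumptions made for illustration and are not taken from the dataset.

```python
def dms_to_decimal(degrees, minutes, seconds, hemisphere):
    """Convert degrees/minutes/seconds plus a hemisphere letter to decimal degrees."""
    sign = -1 if hemisphere.upper() in ("S", "W") else 1
    return sign * (degrees + minutes / 60 + seconds / 3600)


def plausible_for_denmark(latitude, longitude):
    """Rough plausibility check; the bounding box is an assumed approximation."""
    return 54.0 <= latitude <= 58.0 and 8.0 <= longitude <= 16.0


# Hypothetical example values, not taken from the exercise data.
lat = dms_to_decimal(55, 40, 34, "N")
lon = dms_to_decimal(12, 34, 6, "E")
print(round(lat, 6), round(lon, 6), plausible_for_denmark(lat, lon))

# Swapped latitude and longitude fail the plausibility check.
print(plausible_for_denmark(lon, lat))
```

A check like this only flags suspicious values. Whether a flagged coordinate should be corrected, and how, still has to be decided against the locality and documented in the record.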

The dataset should contain only the years 1883-1939.
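
One way to find invalid years for step 2 is to test each value against the expected range. The sketch below assumes that the range 1883-1939 is inclusive and that years are stored as four-digit values. The function name and the sample values are hypothetical. It returns a note alongside the result so that the original value can be kept and the reason for any change documented.

```python
import re

FIRST_YEAR = 1883
LAST_YEAR = 1939  # assumed to be inclusive


def check_year(raw):
    """Return (year, note): year is an int if valid, otherwise None with a reason."""
    text = str(raw).strip()
    if not re.fullmatch(r"\d{4}", text):
        return None, f"'{raw}' is not a four-digit year"
    year = int(text)
    if not FIRST_YEAR <= year <= LAST_YEAR:
        return None, f"{year} is outside {FIRST_YEAR}-{LAST_YEAR}"
    return year, ""


# Hypothetical sample values, including a transposed year and a letter O typed for zero.
for raw in ["1901", "1839", "19O5", 1939]:
    print(repr(raw), check_year(raw))
```

The sketch only reports problems. A value such as 1839 may be a transposition of 1893, but that is an assumption that should be recorded with the corrected record rather than applied silently.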