Dataset comparison

The 'dataset comparison' allows for comparison of two datasets in ChecklistBank on a scientific name by scientific name basis.

All datasets in ChecklistBank can be compared including the COL Checklist releases. The tool provides an overview of the differences in scientific names between datasets. The set of names to be compared can be filtered by various parameters, and a root taxon can also be selected to include only a certain group. Synonyms and names below a certain rank can be excluded on demand, i.e. you can just compare accepted genera or families.

Please make sure you are logged in to ChecklistBank. If you don’t know how to do that, follow the steps here.

You can find the tool in the menu on the left. Expand the Tools menu by clicking on the arrow. Then click ‘Dataset comparison’.

CLB tools compare

Example: Lysiloma (genus in Fabaceae)

We use the Fabaceae checklist, which was also used in the ‘name match tool’ tutorial. We compare this list to the 2021 COL Checklist for which the data on the genus Lysiloma are provided by the ILDIS World Database of Legumes checklist (Roskov et al. 2014).

When you open the tool, the start screen of the dataset comparison allows you to choose two datasets to make the comparison on the basis of scientific names.

Step 1. Select dataset 1

  • Start typing WCVP-Fabaceae and this dataset will appear in the menu. Select it.

  • In the box 'Find taxon', type 'Lysiloma' and select it.

Step 2. Select dataset 2

  • Type COL 2021, and select the COL21 Annual Checklist 2021.

  • Click on 'Go to Lysiloma' to select the same genus as in dataset 1.

  • Click on ‘Show diff’

CLB compare selection

Step 3. Check you selection

The next view shows you which datasets and roots you have selected.
You can choose whether the scientific name comparisons should include authorship and synonyms, for this example select both options.
Additionally, you can select whether to show the parent, and at which level. For this example, also select the box 'Show parent'.

Step 4. Get results*

Once you have made your selection, click ‘Get diff’.

CLB diff Lysiloma

The results of the comparison can be explored in the webpage.

You can see two columns, one for each dataset. Scientific name differences are shown in each column where a '-' or '+' sign indicates in which dataset the difference occurs. The results of the comparisons can also be downloaded in a CSV file for further investigation.

CLB compare Lysiloma result