Find Duplicate References

When searching for duplicates in entire Project, folders owned Folders are included in the search, but folders shared by others are omitted. 
Search within each shared folder to Find Duplicates in that Folder.

Find Duplicates is available in shared folders when the user has “Can modify” access. When searching in shared folder, duplicates can be deleted and the folder will then be updated with changes for all users with access.

In search results, up to 100 original references plus all associated duplicates are displayed.

Click on Find duplicates under Tools on References Tools and Actions menu. This will allow for finding duplicates for all references in current Project.

Find duplicates in Tools menu

OR
Click on Duplicates tab and click on Find duplicates in the Reference Organization sidebar. This will allow for finding duplicates for all references in current Project.

Find duplicates under Duplicates tab on Reference Organization sidebar

OR
Under My Folders, click More actions... and click Find duplicates. This will initiate Find duplicates for current Folder.

Find duplicates for a Folder

Alternatively, select option to search in current Project, which will search for duplicated for all references.

Find duplicates location search options


Select Find duplicates options in the modal.

Find duplicate references modal


Select how to prioritize finding duplicate references by choosing Primary reference determined by:

Completeness - Reference that has the most information populated.
Newest - Reference that was most recently added.
Oldest - Reference that was added first.

Find duplicates primary reference options

Select Matching Settings

References must match exactly
In this method, the selected fields must be identical. Casing, special characters, and the order of Author names are ignored.
If Author, Year, ISBN, ISSN, DOI or Page Number field(s) is/are chosen in conjunction with Title, empty fields are included. 
For example, selecting Title and Year where Title is found to be a match, but Year is blank in one record and populated in another, will result in a match.

References that are similar
In this method, data is weighted for similarity for selected fields. If enough items are similar, a match will be suggested.
If Year, ISBN, ISSN, DOI, or Page Number are selected, close matches will appear for those references where an absolute match is found in the selected fields.
References that are similar match is based on the Levenshtein Distance Algorithm.

Select the fields to be considered in search.

Title, Author (Full name) and Year fields are pre-selected, but not required. 
At least one field must be selected to proceed.

Find duplicates fields

Click Find Duplicates. 

Deduplication request starts. A modal is displayed and Finding duplicates progress message is displayed under Duplicates tab, in Reference Organization sidebar.

It is not required to stay on this page, or be logged in, while the process is running. Find duplicates processing time varies based on the number of records and match criteria, and will run in its entirety until it is complete.
 

Deduplication request started modal


Initiating a search for duplicates in a shared folder will prompt a confirmation message. While the deduplication process itself does not alter folder contents, any changes made within the shared folder will apply to all users who have access.

Finding duplicates in shared folder modal

After the deduplication request is completed, click on the Process completed/See results link under Duplicates tab, in Reference Organization sidebar.
 

Find duplicates process completed message


Duplicate references search results page (shown in Table View) display: 

  • Location and search criteria for duplicate search results
  • Total, Primary and Duplicates counts
  • Duplicates marked with red bar and pre-selected to make them easy to identify
Duplicate references in Table View


Any time after performing a Find Duplicates search, click on See last results under Duplicates tab, in Reference Organization sidebar to view the most recent search results. 

See last results link

Duplicate references on the current search results page can be removed from Folder (if in Folder), selected references can be moved to Trash, or All Duplicates in current search results can be moved Trash.

Delete duplicate references options



 

Was this article helpful?
0 out of 0 found this helpful