In 2020 we released our first public data file, something we’ve turned into an annual affair supporting our commitment to the Principles of Open Scholarly Infrastructure (POSI). We’ve just posted the 2022 file, which can now be downloaded via torrent like in years past.
We aim to publish these in the first quarter of each year, though as you may notice, we’re a little behind our intended schedule. The reason for this delay was that we wanted to make critical new metadata fields available, including resource URLs and titles with markup.
Unfortunately, Bryan Vickery has moved onto pastures new. I would like to thank him for his many contributions at Crossref and we all wish him well.
I’m now pleased to announce that Rachael Lammey will be Crossref’s new Director of Product starting on Monday, May 16th.
Rachael’s skills and experience are perfectly suited for this role. She has been at Crossref since 2012 and has deep knowledge and experience of all things Crossref: our mission; our members; our culture; and our services.
Since we announced last September the launch of a new version of iThenticate, a number of you have upgraded and become familiar with iThenticate v2 and its new and improved features which include:
A faster, more user-friendly and responsive interface A preprint exclusion filter, giving users the ability to identify content on preprint servers more easily A new “red flag” feature that signals the detection of hidden text such as text/quotation marks in white font, or suspicious character replacement A private repository available for browser users, allowing them to compare against their previous submissions to identify duplicate submissions within your organisation A content portal, helping users check how much of their own published content has been successfully indexed, self-diagnose and fix the content that has failed to be indexed in iThenticate.
A re-cap We kicked off our Ambassador Program in 2018 after consultation with our members, who told us they wanted greater support and representation in their local regions, time zones, and languages.
We also recognized that our membership has grown and changed dramatically over recent years and that it is likely to continue to do so. We now have over 16,000 members across 140 countries. As we work to understand what’s to come and ensure that we are meeting the needs of such an expansive community, having trusted local contacts we can work closely with is key to ensuring we are more proactive in engaging with new audiences and supporting existing members.
There are now two versions of iThenticate available. Most subscribers are on v1, and the instruction on this website explain how to set up and use v1. Some new subscribers can start using v2 from September 2021 - we’ll let new subscribers know if v2 is appropriate for them when they apply. You can find out more about iThenticate v2 on our blog.
To work out which version you’re on, take a look at the website address that you use to access iThenticate. If you go to ithenticate.com then you are using v1. If you a bespoke URL, https://crossref-[your member ID].turnitin.com/ then you are using v2.
v1 Creating and finding your Similarity Report, keep reading:
For each document you submit for checking, the Similarity Report provides an overall similarity breakdown. This is displayed in the form of percentage of similarity between the document and existing published content in the iThenticate database. iThenticate’s repositories include the published content provided by Crossref members, plus billions of web pages (both current and archived content), work that has previously been submitted to Turnitin, and a collection of works including thousands of periodicals, journals, publications.
Matches are highlighted, and the best matches are listed in the report sidebar. Other matches are called underlying sources, and these are listed in the content tracking mode. Learn more about the different viewing modes (Similarity Report mode, Content tracking mode, Summary report mode, Largest matches mode).
If two sources have exactly the same amount of matching text, the best match depends on which content repository contains the source of the match. For example, for two identical internet source matches, the most recently crawled internet source would be the best match. If an identical match is found to an internet source and a publication source, the publication source would be the best match.
Accessing the Similarity Report (v1)
To access the Similarity Report through iThenticate, start from the folder that contains the submission, and go to the Documents tab. In the Report column, you will see a button - click this Similarity Score to open the document in the Document Viewer.
The Document Viewer (v1)
The Document Viewer screen opens in the last used viewing mode. There are three sections:
Along the top of the screen, the document information bar shows details about the submitted document. This includes the document title, the date the report was processed, the word count and the number of matching sources found in the selected databases.
The left panel is the document text. This shows the full text of the submitted document, highlighting areas of overlap with existing published content.
The colors correspond to the matching sources, listed in the sources panel on the right.
The layout will depend on your chosen report mode:
Match Overview (show highest matches together) shows the best matches between the submitted document and content from the selected search repositories. Matches are color-coded and listed from highest to lowest percentage of matching word area. Only the top or best matches are shown - you can see all other matches in the Match Breakdown and All Sources modes.
All Sources shows matches between the submission and a specifically selected source from the content repositories. This is the full list of all matches found, not just the top matches per area of similarity, including those not seen in the Match Overview because they are the same or similar to other areas which are better matches.
Match Breakdown shows all matches, including those that are hidden by a top source and therefore don’t appear in Match Overview. To see the underlying sources, hover over a match, and click the arrow icon. Select a source to highlight the matching text in the submitted document. Click the back arrow next to Match Breakdown to return to Match Overview mode.
Side-By-Side Comparison is an in-depth view that shows a document’s match compared side-by-side with the original source from the content repositories. From the All Sources view, choose a source from the sources panel, and a source box highlights on the submitted document similar content within a snippet of the text from the repository source. In Match Overview, select the colored number at the start of the highlighted text to open this source box. To see the entire repository source, click Full Source View, which opens the full-text of the repository source in the sources panel and all the matching instances. The sidebar shows the source’s full text with each match to the document highlighted in red. Click the X icon in the top right corner of the full source text panel to close it.
Use the view mode icons to switch between the Match Overview (default, left icon) and All Sources Similarity Report viewing modes. Click the right icon to change the Similarity Report view mode to All Sources.
Viewing live web pages for a source (v1)
You may access web-based sources by clicking on the source title/URL. If there are multiple matches to this source, use the arrow icons to quickly navigate through them.
If a source is restricted or paywalled (for example, subscription-based academic resources), you won’t be able to view the full-text of the source, but you’ll still see the source box snippet for context. Some internet sources may no longer be live.
From Match Overview, click the colored number at the start of a piece of highlighted text on the submitted document. A source box will appear on the document text showing the similar content highlighted within a snippet of the text from the repository source. The source website will be in blue above the source snippet - click the link to access it.
From Match Breakdown or All Sources, select the source for which you want to view the website, and a diagonal icon will appear to the right of the source. Click this to access it.
Page owner: Kathleen Luschek | Last updated 2020-May-19