...
Journal Article Tag Suite
The Journal Article Tag Suite is an XML format used to describe scientific literature published online. It is a technical standard developed by the National Information Standards Organization and approved by the American National Standards Institute with the code Z39.96-2012.
The NISO project was a continuation of the work done by NLM/NCBI, and popularized by the NLM's PubMed Central as a de facto standard for archiving and interchange of scientific open-access journals and its contents with XML.
With the NISO standardization the NLM initiative has gained a wider reach, and several other repositories, such as SciELO and Redalyc, adopted the XML formatting for scientific articles.
The JATS provides a set of XML elements and attributes for describing the textual and graphical content of journal articles
as well as some non-article material such as letters, editorials, and book and product reviews.
JATS allows for descriptions of the full article content or just the article header metadata;
and allows other kinds of contents, including research and non-research articles, letters, editorials, and book and product reviews.
History
Since its introduction, NCBI's NLM Archiving and Interchange DTD suite has become the de facto standard for journal article markup in scholarly publishing. With the introduction of NISO JATS, it has been elevated to a true standard.Even without public data interchange, the advantages of NISO JATS adoption affords publishers in terms of streamlining production workflows and optimizing system interoperability.
Timeline
; NLM JATS; NISO JATS
Technical scope
By design, this is a model for journal articles, such as the typical research article found in an STM journal, and not a model for complete journals.Tag sets
There are three tag sets:; Journal Archiving and Interchange
; Journal Publishing
; Article Authoring
Document type definitions define each set and incorporate other standards such as MathML and XHTML Tables.
Document structure
JATS Publishing set defines a document that is a top-level component of a journal such as an article, a book or product review, or a letter to the editor. Each such document is composed of front matter and up to three optional parts. These must appear in the following order:; Front matter
; Body
; Back matter
; Floating material
Following the front, body, back, and floating material, there may be either one or more responses to the article or one or more subordinate articles.
Example
This is the minimal article's structure,PUBLIC "-//NLM//DTD JATS Journal Publishing DTD v1.0 20120330//EN"
"JATS-journalpublishing1.dtd"
...
The
DOCTYPE
header is optional, a legacy from SGML and DTD-oriented validators. The dtd-version
attribute can be used even without a DTD header.The root element
article
is common for any version of JATS or "JATS family", as NLM DTDs. The rules for front
, body
and back
tags validation, depends on the JATS version, but all versions have similar structure, with good compatibility in a range of years. The evolution of the schema preserves an overall stability.Less common, "only
front
", "only front
and back
" variations are also used for other finalities than full-content representation. The general article composition isTools
There are a variety of tools for create, edit, convert and transform JATS.They range from simple forms to complete conversion automation:
Conversion
;To JATS: Take as input a scientific document, and, with some human support, produce a JATS output.- OpenOffice and MS Word documents to JATS:
- * Typeset: provides automated set of converters for .
- *OxGarage: can convert documents from various formats into "National Library of Medicine DTD 3.0".
- *meTypeset: meTypeset "is a fork of the OxGarage stack" "to convert from Microsoft Word.docx format to NLM/JATS-XML".
- *eXtyles: automates time-consuming aspects of document editing in Microsoft Word and exports to JATS XML.
- Markdown to JATS: pandoc's "pandoc-jats" plugin. Pandoc 2.0 can convert a number of input formats to JATS.
- PDF to JATS: this is a very difficult problem to solve. Success depends on how well structured your PDFs are and, for batch conversion, how consistently structured your PDFs are.
- * Shabash Merops
- *Typeset's
- * The Public Knowledge Project is developing a pipeline for converting PDF to JATS. It will include use of pdfx.
- * CERMINE Content ExtRactor and MINEr
- from JATS to HTML
- * JATS Preview Stylesheets, see classical conversor.
- * eLife Lens converts NLM XML to JSON for displaying using HTML and Javascript.
- from JATS to PDF: some JATS Preview Stylesheets, XSLT + XSL-FO conversion.
- from JATS to EPUB.
- Generic : DtdAnalyzer — compare JATS with other DTDs and helps into create a XML representation, XSLT and Schematron generation, and other tools.
Editors
- provides a WYSIWYM editor for scholarly articles. Supports XML exports in NISO JATS and NLM JATS standards. It is mostly used by Journals and Publishers looking to convert author submitted MS-Word files to XML, PDF, HTML and ePuB.
- JATS Framework for oXygen XML Editor: users of oXygen XML Editor and oXygen XML Author can now install support for current versions of NISO JATS. Based on an identifier given in a DOCTYPE declaration, oXygen will detect that you are editing a JATS document and provide stylesheets and utilities.
- FontoXML for JATS: WYSIWYS editor for editing and reviewing JATS content:
- PubRef "Pipeline": Browser-based realtime-preview JATS editor:
- Annotum: a WordPress theme that contains WYSIWYG authoring in JATS, peer-review and editorial management, and publishing.
- JATS edition for web-based XML editor Xeditor.
- Texture Editor of the Substance Consortium. The first online "born to JATS" editor.
Preview
- JATS Preview Stylesheets: the JATS Preview Stylesheets are a series of.xsl,.xpl,.css, and.sch files that will create.html or.pdf versions of valid NISO Z39.96-2012 JATS 1.0 files. It is primarily intended for internal use by publishers and a basis for customization.
- Typeset - Allows to generate HTML from JATS XML within a click. Also, offers capacity to generate custom HTML based on the requirements of the journal.
- PubReader – "The PubReader view is an alternative web presentation... Designed particularly for enhancing readability on tablet and other small screen devices, PubReader can also be used on desktops and laptops and from multiple web browsers".
Customization
JATS central repositories
As NISO JATS began the de facto and de juri standard for open access journals, the scientific community has adopted the JATS repositories as a kind of legal deposit, more valuable than the traditional digital libraries where only a PDF version is stored. Open knowledge need richer and structured formats as JATS: PDF and JATS must be certified as "same content", and the set "PDF+JATS" forming the unit of legal deposit.List of JATS repositories and its contained:
- PubMed Central:
- * US PubMed Central: in 2016 ~3.8 million articles
- * Europe PubMed Central: in 2016 ~3,7 million articles
- * PubMed Central Canada: in 2013 ~2.6 million articles.
- SciELO: in 2016 ~0.6 million articles
Alternatives and semantic
There are some effort and experiments using RDF conversion in the 2012, with no impact in the JATS community.Later, in ~2016, for Semantic Web context, with SchemaOrg initiative, the class was defined, receiving better reception. It is an initial "JATS-like standardization" for RDF contexts of use.