21. How can I port SAPIENT to work with other XML schemas?
There are two aspects to this question:
a) How can SAPIENT recognise papers written in other XML schemas?
b) How can SAPIENT annotate papers according to annotation schemas other than CISP?
In answer to a), SAPIENT requires the presence of <PAPER>, <TITLE>, <ABSTRACT>, <BODY> and <P> elements.
You need to be able to map your XML or txt documents to the above elements. We plan to extend SAPIENT to do this mapping for you. The .xsl which converts the .xml to .html so that SAPIENT can display the document also requires the <BODY> to contain at least one <DIV> element.
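As a rough illustration of such a mapping, the following Python sketch wraps plain text in the elements listed above. The function name text_to_sapient_xml and the exact nesting (<TITLE>, <ABSTRACT> and <BODY> directly under <PAPER>, with <P> inside a single <DIV>) are assumptions made for this example and are not part of SAPIENT; the sample documents distributed with SAPIENT remain the authoritative reference for the expected structure.

```python
import xml.etree.ElementTree as ET


def text_to_sapient_xml(title, abstract, paragraphs):
    """Wrap plain text in the elements SAPIENT looks for.

    Hypothetical helper: the nesting used here is an assumption,
    so compare the output with the sample documents before relying on it.
    """
    paper = ET.Element("PAPER")
    ET.SubElement(paper, "TITLE").text = title
    ET.SubElement(paper, "ABSTRACT").text = abstract
    body = ET.SubElement(paper, "BODY")
    # The display .xsl needs at least one <DIV> inside <BODY>.
    div = ET.SubElement(body, "DIV")
    for para in paragraphs:
        ET.SubElement(div, "P").text = para
    return ET.tostring(paper, encoding="unicode")


if __name__ == "__main__":
    print(text_to_sapient_xml(
        "A title",
        "An abstract.",
        ["First paragraph.", "Second paragraph."],
    ))
```

ElementTree escapes characters such as & and < in the text for you, which is the main reason to build the document this way rather than by concatenating strings.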
Two documents are available from http://www.aber.ac.uk/compsci/Research/bio/art/sapient/ to give you an indication of how SAPIENT works. The first, test.xml, is a full paper in SciXML; the second, testsmall.xml, is a minimal version of a document that will be accepted by SAPIENT.
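Before loading your own document, it may help to check that the required elements are present. The sketch below is one possible way to do this in Python; the function name missing_elements is hypothetical, and the check assumes that <PAPER> is the root element, which the text above does not state explicitly.

```python
import xml.etree.ElementTree as ET

# Elements that must appear somewhere below the root.
REQUIRED = ("TITLE", "ABSTRACT", "BODY", "P")


def missing_elements(xml_text):
    """Return the names of required elements that are absent.

    Assumption: <PAPER> is the root element of the document.
    """
    root = ET.fromstring(xml_text)
    missing = []
    if root.tag != "PAPER":
        missing.append("PAPER")
    for tag in REQUIRED:
        if root.find(f".//{tag}") is None:
            missing.append(tag)
    # The display .xsl also needs at least one <DIV> inside <BODY>;
    # this is only checked when a <BODY> exists.
    body = root.find(".//BODY")
    if body is not None and body.find(".//DIV") is None:
        missing.append("DIV")
    return missing
```

An empty list means the document has all of the elements named above; it does not guarantee that SAPIENT will accept the document in every other respect.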
In answer to b), to write your own sentence-based schemas for use with SAPIENT, you need to obtain the source code for SAPIENT. You then need to add a new .xsl file to the uk.ac.aber.art_tool.art_tool_web.xsl package and replace every reference to mode2.xsl in ARTSciXMLDocument.java with a reference to this new file.
The latter class is found in the uk.ac.aber.ar_tool package. As an example of an alternative annotation schema, we have included the dummy fruit.xsl. Instructions are available as comments in fruit.xsl, mode2.xsl and art-tool.js.
Recompile SAPIENT by running ant and you will have a new version of SAPIENT that works with your particular sentence-based annotation schema.