Questions
- Where do I go to learn about XSLT
- Asking questions about Xalan-C++
- What is Xerces-C++?
- Which version of Xerces should I be using?
- Should I be using the Xerces DOM or Xalan DOM?
- Problems with samples in Windows
- Building on Windows
- Building on UNIX
- Make errors on UNIX platforms
- What is ICU
- A tar checksum error on Solaris
- Xalan-C++ in Apache
- Is Xalan-C++ thread-safe?
- What can I do to speed up transformations?
- Stylesheet validation
- What does the XalanDOMException HIERARCHY_REQUEST_ERR mean?
- Submitting Patches
- Transformation Output Methods
- Problems Using Sun's Forte/Workshop Compiler with code containing std::istrstream
- Modifying an instance of XalanDocument
- Changing Where Error Output is Sent
- Programmatic Error Information
- String Transcoding
- Error Code/Exception Summary
- Extension Functions
- Outputting results to a file on Windows 95/98
- Using format-number and ICU
- Perl wrapper for Xalan-C++?
- Missing LocalMsgIndex.hpp file
Answers
1. Where do I go to learn about XSLT
The definitive sources are the W3C XSLT and XPath recommendations: W3C Recommendation 16 November 1999 XSL Transformations (XSLT) Version 1.0 and XML Path Language (XPath) Version 1.0.
For a brief listing of tutorials, discussion forums, and other materials, see Getting up to speed with XSLT.
2. Asking questions about Xalan-C++
For specific questions on Xalan-C++, see list archives: xalan-c-users and xalan-dev. You must subscribe to these Apache mailing lists before posting your questions.
The Apache Software Foundation has information on how you can subscribe to the mailing lists.
You can post messages to the lists by sending mail to:
Post message to xalan-c-users. (User's list)
Post message to xalan-dev list. (Developer's list)
Again, please review the archives before posting a new question.
3. What is Xerces-C++?
Xerces-C++ is a validating XML parser written in a portable subset of C++. Xerces-C++ makes it easy to give your application the ability to read and write XML data. Like Xalan-C++, Xerces-C++ is available from the Apache XML site: http://xerces.apache.org
4. Which version of Xerces should I be using?
The Xalan-C++ release notes includes information about the Xerces-C++ release with which the Xalan-C++ release has been coordinated and tested. See Status
5. Should I be using the Xerces DOM or Xalan DOM?
The Xalan DOM implementation is highly optimised for transformations. However, whilst you can build documents in the Xalan DOM, subsequent modification will not work. The Xalan DOM is designed to be either an input or an output from a transformation, not as a general DOM implementation.
So in cases where you want to simply transform documents using Xalan, using the internal DOM implementation is the best approach.
In cases where you want to modify the DOM document on the fly, you should use the Xerces DOM as the base document. You can wrap the Xerces DOM in a wrapper (see passing in a Xerces DOM) to then use as an input to a Xalan transformation. Alternatively you can output the result of a transformation to a Xerces DOM document (see working with DOM input and output). In either case, the Xerces document can be freely modified. However, after you modify the document, you need to re-build the wrapper so that any changes are replicated in the Xalan wrappers.
6. Problems with samples in Windows
You may be mixing debug and release versions of executables and libraries. In other words, if you are compiling the sample for debug, then you should link with the debug version of the Xalan-C++ and Xerces-C++ libraries and run with the debug version of the dynamic link libraries.
You must also make sure your application is linking with the Debug multithreaded DLL run-time library or the Multithreaded DLL run-time library. To check this setting do the following in Visual C++:
- Select Settings from the Project menu.
- Click the C/C++ tab.
- In the Category drop-down list, select Code Generation.
- In the Use run-time library drop-down list, select Multithreaded DLL for the Win32 Release configuration, or select Debug Multithreaded DLL for the Win32 Debug configuration.
Once you have changed this setting, you must rebuild your project.
7. Building on Windows
In order to build Xalan-C++ on Windows, you will need the following:
- The Xalan-C/C++ source distribution package.
- The Xerces-C/C++ source distribution package or a compatible binary distribution package.
- A compatible Microsoft Visual Studio .NET (2003, 2005, 2008, 2010) software development platform.
The Xalan-C/C++ Version 1.11 (pre-release) is available from the Apache Subversion repository at http://svn.apache.org/repos/asf/xalan/c/trunk/.
After Xalan-C/C++ Version 1.11 is released, it can be downloaded from: Xalan Distributions.
The Xerces-C/C++ Version 3.1.1 is can be downloaded from: Xerces Distributions.
If you are building with the IBM-ICU International Components for Unicode library, you will need to rebuild both the Xerces and Xalan libraries.
For more details, see Steps for doing a Windows build.
8. Building on UNIX
To build Xalan-C++ on supported UNIX platforms, you need Xerces-C++, the GNU make utility, and a supported C++ compiler. For more details see: Steps for doing a UNIX build.
9. Make errors on UNIX platforms
You must use the GNU make utility. Other make utilities may not work with the Xalan Makefile
10. What is ICU
The IBM-ICU International Components for Unicode(ICU) is a C and C++ library that provides robust and full-featured Unicode support on a wide variety of platforms. Xalan-C++ uses the ICU to extend support for encoding, number formatting, and sorting.
The ICU is available for download from http://oss.software.ibm.com/icu/index.html.
Xalan release 1.10 was tested with International Components for Unicode(ICU) version 3.2. The curent Xalan release 1.11 is not fully tested with IBM-ICU.
For more details see: Using the International Components for Unicode (ICU).
11. A tar checksum error on Solaris
The Solaris tar utility you are using does not properly handle files with long pathnames. You must use GNU tar (gtar), which handles arbitrarily long pathnames and is freely available on every platform on which Xalan-C++ is supported. If you don't already have GNU tar installed on your system, you can obtain it from the Free Software Foundation http://www.gnu.org/software/tar/tar.html. For additional background information on this problem, see the online manual GNU tar and POSIX tar for the utility.
12. Xalan-C++ in Apache
A simple Apache module called ApacheModuleXSLT is provided as a sample. It demonstrates how to integrate Xalan-C++ with Apache.
13. Is Xalan-C++ thread-safe?
Instances of XalanTransformer are not thread-safe; each thread should use its own instance.
In order to support very efficient use in multi-threaded applications, Xalan-C++ is designed to avoid synchronization as much as possible. Each thread of execution is required to have its own set of "support" objects that contain the state of the transformation. Accordingly, no synchronization is required when multiple threads are executing.
Parsed ("compiled") stylesheets (see Compiling stylesheets) and parsed source documents may be freely shared by multiple threads of execution without worrying about providing synchronized access to them. The only exception to this rule: You use XercesParserLiaison to parse a document after calling XercesParserLiaison::setBuildBridgeNodes(false) or XercesParserLiaison::setThreadSafe(false). In this case, the document cannot be shared by multiple threads of execution. For reasons of performance, we do not recommend the use of XercesParserLiaison, so this should not be an issue for most applications.
All other objects in Xalan-C++ are not thread-safe. Each thread must have its own instance of each object.
See the ThreadSafe sample program for more information.
14. What can I do to speed up transformations?
To maximize performance, here are some suggestions for you to keep in mind as you set up your applications:
- Use a compiled stylesheet when you expect to use the stylesheet more than once.
- Set up your stylesheets to function efficiently.
- Don't use "//" (descendant axes) patterns near the root of a large document.
- Use xsl:key elements and the key() function as an efficient way to retrieve node sets.
- Where possible, use pattern matching rather than xsl:if or xsl:when statements.
- xsl:for-each is fast because it does not require pattern matching.
- Keep in mind that xsl:sort prevents incremental processing.
- When you create variables, <xsl:variable name="fooElem" select="foo"/> is usually faster than
<xsl:variable name="fooElem"><xsl:value-of-select="foo"/></xsl:variable>.
- Be careful using the last() function.
- The use of index predicates within match patterns can be expensive.
15. Stylesheet validation
An XSL stylesheet is an XML document, so it can have a DOCTYPE and be subject to validation, but you probably will have to write a custom DTD for the purpose.
The XSLT Recommendation includes a DTD Fragment for XSL Stylesheets with some indications of what you need to do to create a complete DTD for a given stylesheet. Keep in mind that stylesheets can include literal result elements and produce output that is not valid XML.
You can use the xsl:stylesheet doctype defined in xsl-html40s.dtd for stylesheets that generate HTML.
16. What does the XalanDOMException HIERARCHY_REQUEST_ERR mean?
It means that an attempt was made to add a node to a DOM that would create an invalid structure. For example, text nodes are not allowed as children of the document node.
This is a common error when attempting to transform to DOM. Source documents and stylesheets that might produce valid serialized XML might not produce value DOM. The usual suspect is text nodes being generated before the document element is generated.
If you think you have seen this error because of a bug in Xalan-C++'s source tree implementation, please post a bug report on Bugzilla, and attach a minimal source document and stylesheet that produce the problem to the bug report.
17. Submitting Patches
Your contributions are much appreciated! You can e-mail your patches to Xalan Development Mailing List or raise an issue on the Jira issue tracking system.
The Xalan projects use Jira as the issue tracking system. Any significant bug or feature request is posted to this system. You must subscribe to the system in order to submit patches and raise issues.
- Subscribe to Jira at: https://issues.apache.org/jira
- Browse the issues at: https://issues.apache.org/jira/browse/XALANC
Issues posted to the project on Jira at XALANC are automatically posted to the Xalan Development Mailing List
Our mailing lists are moderated. You should subscribe to the mailing list in order to post a message, otherwise message delivery requires manual intervention or may be dropped.
18. Transformation Output Methods
The output parameter of XalanTransformer::transform() is an XSLTResultTarget which has many constructors.
Output to a file:
- The easiest way is to use a null-terminated string containing the file name to create an XSLTResultTarget. Or, use an instance of std::ofstream. The command line executables, and many of the sample applications use file names, so take a look through the source code for more information.
Output to an in-memory buffer:
- Use an instance of std::ostrstream or std::ostringstream to create an XSLTResultTarget. See the StreamTransform sample for more information.
Input to another transformation:
- Any of the previous output targets could be used as the input to another transformation, but the FormatterToSourceTree is probably the best for efficiency reasons. See the source code for the TestXSLT command line program for more information.
19. Problems Using Sun's Forte/Workshop Compiler with code containing std::istrstream
There is a bug in Sun's C++ standard library implementation for the Forte/Workshop compiler. The short answer is that you need to get a patch. The bugzilla subsystem for Xalan issue tracking is no longer available. The ticket recorded here included a patch.
The issue is resolved if you use the SunStudio platform for your code development. The Solaris SunStudio is now available from Oracle.
20. Modifying an instance of XalanDocument
No, you aren't going crazy. Xalan's default source tree is read-only for efficiency. If you need a DOM that supports modifications, use the Xerces DOM instead. See the TransformToXercesDOM sample for more information.
21. Changing Where Error Output is Sent
By default, XalanTransformer creates a XalanTransformerProblemListener (a subclass of ProblemListener) that writes output to std::cerr. To change this you can:
- Redirect std::cerr from the command line.
- Call XalanTranformer::setWarningStream with a different std::ostream before calling XalanTransformer::transform.
- Instantiate your own XalanTransformerProblemListener with a different output stream and call XalanTransformer::setProblemListener() before calling XalanTransformer::transform().
- Subclass some ProblemListener type and do custom handling of errors (you still then need to tell XalanTransformer instances to use your ProblemListener.)
In most case you probably want to do one of the first two.
22. Programmatic Error Information
Create a custom ErrorHandler (a Xerces-C++ class) and call XalanTransformer::setErrorHandler before parsing any sources.
23. String Transcoding
See the static method XalanDOMString::transcode, or the functions TranscodeToLocalCodePage in the API documentation. However, you should be very careful when transcoding Unicode characters to the local code page, because not all Unicode characters can be represented.
24. Error Code/Exception Summary
There isn't, but we're working on it.
25. Extension Functions
Did you declare the namespace on the xsl:stylesheet or xsl:transform element? It should look like this:
<xsl:stylesheet version="1.0" xmlns:xalan="http://xml.apache.org/xalan"> ...rest of stylesheet
If you did and you still have problems, you might want to ask the mailing list.
26. Outputting results to a file on Windows 95/98
Well, you can, but it doesn't always work. Neither Windows 95 or 98 are supported or tested. There have been reports of problems on it, especially regarding Unicode support. See this post.
27. Using format-number and ICU
Did you build with ICU support? See Using the International Components for Unicode (ICU).
28. Perl wrapper for Xalan-C++?
There is no Apache Perl wrapper, however Edwin Pratomo has written a wrapper for Xalan-C++ version 1.4 that can be found on CPAN
29. Missing LocalMsgIndex.hpp file
The LocalMsgIndex.hpp file is not shipped with the distributions because this file is generated during compile time. This file is created at the start of the build process and customized for the locale and message set you are using.
On Windows, the LocalMsgIndex.hpp header is generated by the Localization project. By building any project that has a dependency on the Localization project, will trigger the Message Localization component to built and the LocalMsgIndex.hpp to be generated.
On Windows binary distributions, the LocalMsgIndex.hpp file is a member of the "Include
" directory.
On Unix binary distributions, the LocalMsgIndex.hpp file is a member of the "include/xalanc/PlatformSupport
" directory.