We're updating the issue view to help you get more done. 

DublinCore Catalog XML parsing is slow

Description

This appears to be a regression from 2.x.

The XML parse operation in DublinCoreXmlFormat.readImpl(InputSource in) appears to take about 4ms to run, of which 2ms is:

1 2 3 final SAXParserFactory factory = SAXParserFactory.newInstance(); factory.setValidating(false); factory.setNamespaceAware(true);

and another 2ms for:

1 factory.newSAXParser().parse(in, this);

This is problematic when a lot of XML catalogs are parsed for the same request, as for (parsing XML catalogs for a large number of series).

It seems this was much faster in 2.x, although it's not clear why.

Possibly strategies to improve this include re-using the SAXParserFactory rather than creating a new one for each call. Some discussion here:

https://www.ibm.com/developerworks/library/x-perfap2/index.html

Best is to avoid storing commonly-used attributes only inside XML blobs.

Steps to reproduce

None

Status

Assignee

Karen Dolan

Reporter

Stephen Marquard

Criticality

None

Tags (folksonomy)

None

Components

Affects versions

3.3

Priority

Major