Geoscience Australia Geomet/Geocat Catalog

Introduction

As part of the GA/CSIRO eResearch project, Geomet/Geocat metadata will be harvested into RDA. The Geomet/Geocat records will be stored in Geonetwork and harvested into RDA in the same way as the AuScope Catalog.

Workflow

  1. Geomet/Geocat metadata are extracted into XML files in ANZLIC format. GA has developed an XML extractor for this purpose.
  2. GA Geonetwork imports the ANZLIC records. GA currently does this manually and is working on automating the process.
  3. The ANDS harvester harvests RIF-CS records into RDA. A RIF-CS converter (rif.xsl) is available (see GN 2.6.3 patches).
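
Step 3 relies on OAI-PMH, so the harvest amounts to a series of ListRecords requests against the Geonetwork endpoint. The Python sketch below only builds such a request URL. The host name, the /srv/en/oaipmh path and the "rif" metadata prefix are assumptions for illustration and should be checked against the deployed instance; only the verb, metadataPrefix and resumptionToken parameters come from the OAI-PMH protocol itself.

```python
from urllib.parse import urlencode


def build_list_records_url(base_url, metadata_prefix="rif", resumption_token=None):
    """Build an OAI-PMH ListRecords request URL.

    base_url and the default metadata_prefix are assumptions, not values
    taken from the GA deployment.
    """
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument: it must not
        # be combined with metadataPrefix when fetching the next page.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


if __name__ == "__main__":
    # Hypothetical endpoint, used for illustration only.
    print(build_list_records_url("http://localhost:8080/geonetwork/srv/en/oaipmh"))
```

A harvester would fetch the first URL, read the resumptionToken from the response if one is present, and repeat with that token until none is returned.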

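Geocat to ANZLIC mapping

See the attached spreadsheet GEOCAT_ANZLIC_Mandatory_Elements_v0.99.xls.

Geomet to ANZLIC mapping

See the attached spreadsheet GEOMET_ANZLIC_Mandatory_Elements_v1.1.xls.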

Connect to external data source

GA Geonetwork and its underlying database are not deployed on the same server; the database sits behind a firewall. GA has a solution that connects Geonetwork to a Tomcat-managed data source rather than a Geonetwork-managed Jeeves data source. The solution was developed for GN 2.2 and was never merged into the GN trunk. OAI-PMH support, which the RIF-CS harvest relies on, has only been mature since GN 2.6.2. The external data source code has therefore been ported from GN 2.2 to GN 2.6.3 and to the GN trunk (see GN 2.6.3 patches and GN trunk patch).

Source code of GA Geonetwork 2.2:

http://apacsrv4.arrc.csiro.au/test/geonetwork_2.2/trunk

The external data source code was committed in revision 255; refer to the attached Jira note (jira_notes.htm).

Note: The patch for trunk hasn't been submitted to the Geonetwork developers.

To configure an external data source for Geonetwork:
  1. Add a resource entry in config.xml
      <resource enabled="true" mode="server">
         <name>main-db</name>
         <provider>jeeves.resources.dbms.DataSourceProvider</provider>
         <config>
            <jndiName>jdbc/GeonetDS</jndiName>
            <vendor>mysql</vendor>
         </config>
      </resource>
  2. Create a context file for Geonetwork (e.g. tomcat/conf/Catalina/localhost/geonetwork.xml)
<?xml version="1.0" encoding="UTF-8"?>
<Context
    docBase="/geonetwork"
    privileged="true"
    reloadable="true">
  <Resource
    auth="Container"
    name="jdbc/GeonetDS"
    type="javax.sql.DataSource"
    password="password"
    driverClassName="com.mysql.jdbc.Driver"
    maxIdle="5"
    maxWait="10000"
    username="geonetwork"
    url="jdbc:mysql://localhost:3306/geonetwork"
    validationQuery="select 1"
    logAbandoned="true"
    maxActive="15"
    testWhileIdle="true"
    timeBetweenEvictionRunsMillis="1800000"
    numTestsPerEvictionRun="5"/>
</Context>
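
The two files must agree on the JNDI name: the jndiName in the config.xml resource entry has to match the name attribute of a Resource in the Tomcat context file. The sketch below is a small, hypothetical consistency check written in Python with the standard library. It is not part of Geonetwork or the GA patch, and the sample strings are trimmed copies of the snippets above.

```python
import xml.etree.ElementTree as ET


def jndi_names_match(config_xml, context_xml):
    """Return True if the <jndiName> in the Geonetwork resource entry matches
    the name attribute of a <Resource> in the Tomcat context file."""
    config_root = ET.fromstring(config_xml)
    context_root = ET.fromstring(context_xml)
    jndi_name = config_root.findtext(".//jndiName")
    resource_names = {res.get("name") for res in context_root.iter("Resource")}
    return jndi_name is not None and jndi_name.strip() in resource_names


# Trimmed copies of the two configuration snippets, for illustration.
CONFIG = """
<resource enabled="true" mode="server">
  <name>main-db</name>
  <provider>jeeves.resources.dbms.DataSourceProvider</provider>
  <config>
    <jndiName>jdbc/GeonetDS</jndiName>
    <vendor>mysql</vendor>
  </config>
</resource>
"""

CONTEXT = """
<Context docBase="/geonetwork">
  <Resource name="jdbc/GeonetDS" type="javax.sql.DataSource"/>
</Context>
"""

if __name__ == "__main__":
    print(jndi_names_match(CONFIG, CONTEXT))
```

A mismatch between the two names is a likely cause of a failed data source lookup at startup, so a check like this can be run before redeploying.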

Geonetwork Version

The stable 2.6.3 release is used in this activity.

The extracted XML metadata comply with the ANZLIC profile and are validated by the XML extractor. Geonetwork 2.6.3 does not include the ANZLIC profile; it is used only to host the ANZLIC metadata, not to create or validate ANZLIC records.
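
Because Geonetwork does not validate the records here, a light sanity check before import can catch files that are truncated or of the wrong type. The Python sketch below assumes the extracted ANZLIC records use the ISO 19139 encoding, with gmd:MD_Metadata as the root element. It checks only well-formedness and the root element; it is not a substitute for the profile validation done by the XML extractor.

```python
import xml.etree.ElementTree as ET

# ISO 19139 "gmd" namespace (assumed to be the encoding of the ANZLIC records).
GMD_NS = "http://www.isotc211.org/2005/gmd"


def looks_like_iso19139(xml_text):
    """Return True if xml_text is well-formed XML whose root element is
    gmd:MD_Metadata. This does not validate against the ANZLIC profile."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Not well-formed, e.g. a truncated export.
        return False
    return root.tag == "{%s}MD_Metadata" % GMD_NS


if __name__ == "__main__":
    sample = '<gmd:MD_Metadata xmlns:gmd="http://www.isotc211.org/2005/gmd"/>'
    print(looks_like_iso19139(sample))
```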

Progress Notes as of 2011-06-28

  1. The external data source patch for Geonetwork 2.6.3 is complete and tested. The patch for trunk is yet to be submitted to the Geonetwork developers (ask Rob).

  2. GA is in the process of setting up Geonetwork on a new server.

  3. Geomet/Geocat records are in Sandbox. GA is to update the copyright information as requested by ANDS.

  4. Once all the outstanding tasks are done, harvest Geomet/Geocat RIF-CS records into Sandbox and then RDA.

Topic attachments
  GEOCAT_ANZLIC_Mandatory_Elements_v0.99.xls (2759.0 K, 16 Jun 2011 - 09:33, XiangtanLin): Geocat to ANZLIC mapping
  GEOMET_ANZLIC_Mandatory_Elements_v1.1.xls (2780.0 K, 16 Jun 2011 - 09:33, XiangtanLin): Geomet to ANZLIC mapping
  jira_notes.htm (61.0 K, 28 Jun 2011 - 12:06, XiangtanLin): Jira notes on GA external data source
Topic revision: r6 - 06 Aug 2012, RiniAngreani