EPrints Technical Mailing List Archive

Message: #02742


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

[EP-tech] Re: harvester (question)


Aaah thanks !

I believe this information will interest many french universities..

jm

Le 11/03/2014 12:40, Sebastien Francois a écrit :
Jean-Marie,

I've just created the "french" github repository > https://github.com/eprints/eprints-fr/

... and added the STAR importer plug-in > https://github.com/eprints/eprints-fr/tree/master/star/

I'll eventually add the HAL exporter (covering the latest API > http://api-preprod.archives-ouvertes.fr/docs) and other goodies...


Seb.

On 03/03/14 12:17, Jean-Marie Le Bechec wrote:
yes Seb!

I am interested by your importer! And yes, why not a github project for french tools...

jm


Le 03/03/2014 11:29, Sebastien Francois a écrit :
Hi JM,

I have an importer for TEF (for STAR, via OAI), d'ya want to try that?

Perhaps we could put some French related tools under a common github project?

Seb.


On 03/03/14 08:18, Jean-Marie Le Bechec wrote:
hi Seb,

I need to harvest an OAI server in a format other than Dublin Core (TEF format). I can not get specific metadata with the same name.

For example :
...
<tef:thesisAdmin>
                    <tef:auteur>
                      <tef:nom>nom1</tef:nom>
...

and
...
<tef:directeurThese>
                      <tef:nom>nom2</tef:nom>
                      <tef:prenom>Carine</tef:prenom>
                      <tef:autoriteInterne>MADS_DIRECTEUR_DE_THESE_1</tef:autoriteInterne>
                      <tef:autoriteExterne autoriteSource="Sudoc">073367826</tef:autoriteExterne>
                    </tef:directeurThese>
                    <tef:directeurThese>
                      <tef:nom>nom3</tef:nom>
                      <tef:prenom>Louise</tef:prenom>
                      <tef:autoriteInterne>MADS_DIRECTEUR_DE_THESE_2</tef:autoriteInterne>
                      <tef:autoriteExterne autoriteSource="Sudoc">035036672</tef:autoriteExterne>
                    </tef:directeurThese>
...
in the same record !

I need to extract all this data.

I tried things like :

my $nom;
foreach my $node ($xml->findnodes( "//auteur/nom/*" ))
       {
               $nom = $node->textContent;     
       }

but it does not work (no result)

any idea ?


Thanks !

Jean-Marie

-- 

***********************************************
Jean Marie Le Bechec
Service Commun de la Documentation
Responsable ingenierie documentaire
&
Direction du Systeme d'Information
Referent Etudes

Institut National Polytechnique de Toulouse
6 allee Emile Monso - bp 34038 -
31029 Toulouse cedex 4
Tel : 05 34 32 31 16
Mail : lebechec@inp-toulouse.fr
*********************************************** 


*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/



*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/

-- 

***********************************************
Jean Marie Le Bechec
Service Commun de la Documentation
Responsable ingenierie documentaire
&
Direction du Systeme d'Information
Referent Etudes

Institut National Polytechnique de Toulouse
6 allee Emile Monso - bp 34038 -
31029 Toulouse cedex 4
Tel : 05 34 32 31 16
Mail : lebechec@inp-toulouse.fr
*********************************************** 


*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/



*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/

-- 

***********************************************
Jean Marie Le Bechec
Service Commun de la Documentation
Responsable ingenierie documentaire
&
Direction du Systeme d'Information
Referent Etudes

Institut National Polytechnique de Toulouse
6 allee Emile Monso - bp 34038 -
31029 Toulouse cedex 4
Tel : 05 34 32 31 16
Mail : lebechec@inp-toulouse.fr
***********************************************