Claudio Atzori
|
3d58f95522
|
[stats update] properly invalidating impala metadata
|
2021-04-15 15:03:05 +02:00 |
miconis
|
f64e57c112
|
refactoring of the id generation, sparkcreatemergerels collects entities to create root id after a join
|
2021-04-15 10:59:24 +02:00 |
miconis
|
176a5e493d
|
Merge branch 'stable_ids' of code-repo.d4science.org:D-Net/dnet-hadoop into stable_ids
|
2021-04-14 18:06:34 +02:00 |
miconis
|
3525a8f504
|
id generation of representative record moved to the SparkCreateMergeRel job
|
2021-04-14 18:06:07 +02:00 |
Sandro La Bruzzo
|
3f77bfceb0
|
fixed test failure on jenkins
|
2021-04-14 10:03:01 +02:00 |
Claudio Atzori
|
3125cef545
|
code formatting
|
2021-04-14 09:11:54 +02:00 |
Sandro La Bruzzo
|
44a0064df6
|
Merge remote-tracking branch 'origin/stable_ids' into stable_ids
|
2021-04-13 17:48:12 +02:00 |
Sandro La Bruzzo
|
479abd10cb
|
Added to the ORCID workflow a method that extracts ORCID data directly from the dump generated by Enrico
|
2021-04-13 17:47:43 +02:00 |
Claudio Atzori
|
710cd1e8f2
|
Merge pull request 'add xslt, personname cleaner' (#104) from andreas.czerniak/BrStableId_dnet-hadoop:stable_ids into stable_ids
Reviewed-on: D-Net/dnet-hadoop#104
LGTM
|
2021-04-13 14:43:05 +02:00 |
Claudio Atzori
|
d1ca025b0b
|
[cleaning] removing authors without fullname or providing the 'deactivated' keyword. Removing test test titles
|
2021-04-13 14:32:41 +02:00 |
miconis
|
1542196a33
|
bug fix: starting node of duplicate scan wf changed
|
2021-04-13 10:15:43 +02:00 |
miconis
|
369ed1cd8a
|
bug fix: lookupurl parameter added to dedup record job
|
2021-04-13 09:08:05 +02:00 |
Andreas Czerniak
|
3b694074ff
|
add xslt, personname cleaner
|
2021-04-13 07:04:27 +02:00 |
Claudio Atzori
|
511c0521e5
|
[dedup] avoiding NPEs handling OpenOrg relations
|
2021-04-12 17:45:11 +02:00 |
miconis
|
d442e25cbc
|
bug fix: ids in self mergerels are not marked deletedbyinference=true
|
2021-04-12 15:56:22 +02:00 |
miconis
|
11b22b2d23
|
bug fix in the query, it now exports only relations with non-hidden organizations
|
2021-04-08 11:51:47 +02:00 |
miconis
|
0857100fb8
|
implementation of the tests for the openorgs integration in the openaire provision
|
2021-04-07 18:42:16 +02:00 |
miconis
|
bf685d849f
|
addition of pids in the query for the export of openorgs for the provision, addition of ec_fields in the openorgs model
|
2021-04-07 14:27:43 +02:00 |
miconis
|
eaaefb8b4c
|
implementation of the procedure to reuse content of different dbs when creating the raw graph
|
2021-04-06 14:35:51 +02:00 |
miconis
|
c39c82dfe9
|
modification of the jobs for the integration of openorgs in the provision; dedup records are no longer created by merging but simply by taking the results of the openorgs portal
|
2021-04-06 14:31:00 +02:00 |
Claudio Atzori
|
1e7e5180fa
|
[Graph model] updated definition of ExternalReference: added alternateLabel, removed description (#6503)
|
2021-04-02 12:32:12 +02:00 |
Claudio Atzori
|
e686b8de8d
|
[ORCID-no-doi] integrating PR#98 D-Net/dnet-hadoop#98
|
2021-04-01 17:11:03 +02:00 |
Claudio Atzori
|
ee34cc51c3
|
[ORCID-no-doi] integrating PR#98 D-Net/dnet-hadoop#98
|
2021-04-01 17:07:49 +02:00 |
Claudio Atzori
|
70e49ed53c
|
[OpenOrgsWf] trivial refactoring
|
2021-04-01 10:30:51 +02:00 |
Claudio Atzori
|
7941d7be29
|
WIP: using common definitions from ModelConstants
|
2021-03-31 18:33:57 +02:00 |
Claudio Atzori
|
879e8cc7ef
|
WIP: using common definitions from ModelConstants
|
2021-03-31 17:12:01 +02:00 |
Claudio Atzori
|
72ce741ea6
|
WIP: using common definitions from ModelConstants
|
2021-03-31 17:07:13 +02:00 |
Sandro La Bruzzo
|
616d2ecce2
|
split the workflow collecting datacite into two workflows.
Released on beta
|
2021-03-31 15:45:58 +02:00 |
Claudio Atzori
|
9237d55d7f
|
[OpenOrgsWf] cleanup
|
2021-03-29 17:40:34 +02:00 |
Claudio Atzori
|
7f4e9479ec
|
[OpenOrgsWf] graph construction wf: allow skipping the import openorgs node (importOpenorgs true|false)
|
2021-03-29 16:59:16 +02:00 |
miconis
|
2709d08fc2
|
Merge branch 'stable_ids' into openorgswf
|
2021-03-29 16:39:07 +02:00 |
miconis
|
f446580e9f
|
code refactoring (useless classes and wf removed), implementation of the test for the openorgs dedup
|
2021-03-29 16:10:46 +02:00 |
Claudio Atzori
|
a0837ac357
|
[Stats update] integrating PR#100 for testing D-Net/dnet-hadoop#100
|
2021-03-29 15:59:58 +02:00 |
miconis
|
2355cc4e9b
|
minor changes and bug fix
|
2021-03-29 10:07:12 +02:00 |
Sandro La Bruzzo
|
1dfda3624e
|
improved workflow importing datacite
|
2021-03-26 13:56:29 +01:00 |
Claudio Atzori
|
827e7e37db
|
[Cleaning] drop instance.alternateIdentifier elements when they are available among instance.pid
|
2021-03-25 11:07:59 +01:00 |
miconis
|
28c1cdd132
|
merged stable_ids into openorgswf
|
2021-03-25 10:44:49 +01:00 |
miconis
|
5dfb66b0fa
|
minor changes
|
2021-03-25 10:29:34 +01:00 |
miconis
|
348b0ef921
|
bug fix, implementation of the workflow for the creation of raw_organizations (openorgs dedup), addition of the pid lists to the openorgs postgres db
|
2021-03-24 15:51:27 +01:00 |
Claudio Atzori
|
751125fdf9
|
[Actionmanager] zero function considers empty entity.id as well as rel.source/rel.target
|
2021-03-23 17:34:32 +01:00 |
Claudio Atzori
|
1e423fdc07
|
[Actionmanager] remove invalid records from the input graph before groupGraphTableByIdAndMerge
|
2021-03-23 13:39:24 +01:00 |
Claudio Atzori
|
e5ebb500cf
|
fixed pom versions; included missing workflow modules in dhp-workflows/pom.xml
|
2021-03-23 12:13:53 +01:00 |
Claudio Atzori
|
b75ad76f79
|
Merge branch 'stable_ids' of https://code-repo.d4science.org/D-Net/dnet-hadoop into stable_ids
|
2021-03-23 09:59:12 +01:00 |
Claudio Atzori
|
8db248aa13
|
avoiding error on jenkins compilations: java.net.BindException: Cannot assign requested address: Service 'sparkDriver' failed after 16 retries (on a random free port)!
|
2021-03-23 09:56:34 +01:00 |
Sandro La Bruzzo
|
625e4c29c4
|
added model constants
|
2021-03-23 09:39:56 +01:00 |
Claudio Atzori
|
b4febed138
|
updated mapping tests as a consequence of the special treatment reserved to Handle PIDs
|
2021-03-23 09:37:48 +01:00 |
Claudio Atzori
|
431cbe9955
|
handle missing instance.pid during bulk cleaning
|
2021-03-23 09:28:58 +01:00 |
Sandro La Bruzzo
|
c392936b97
|
fixed error on best access right
|
2021-03-23 09:23:22 +01:00 |
Sandro La Bruzzo
|
c73072079d
|
fix conflicts
|
2021-03-22 16:36:31 +01:00 |
Sandro La Bruzzo
|
098914dcff
|
fix wrong relation with source null
|
2021-03-22 11:35:02 +01:00 |