Miriam Baglioni
|
8905a39bf3
|
mergin with branch beta
|
2021-12-02 13:17:29 +01:00 |
Miriam Baglioni
|
87eedad898
|
-
|
2021-12-02 13:17:19 +01:00 |
Claudio Atzori
|
3b19821f3c
|
added stats computation on the graph hive DB tables
|
2021-12-02 10:44:10 +01:00 |
Claudio Atzori
|
cfa4560769
|
minor: fixed hive action name
|
2021-12-02 10:43:36 +01:00 |
Claudio Atzori
|
d85af6fc25
|
[cleaning wf] fixed OAF record navigation, a mapping defined on a container object would have prevented the natvigation to continue on its properties
|
2021-12-01 15:49:15 +01:00 |
Claudio Atzori
|
4fe7888817
|
code formatting
|
2021-12-01 15:48:15 +01:00 |
Claudio Atzori
|
01e5e0142a
|
added test to verify the relation inverse lookup operation
|
2021-12-01 09:46:26 +01:00 |
Claudio Atzori
|
0df9574a6f
|
Merge pull request '[stats wf] Added sprint 3&4 of indicators' (#166) from antonis.lempesis/dnet-hadoop:beta into beta
Reviewed-on: #166
|
2021-11-29 10:40:26 +01:00 |
Claudio Atzori
|
014e872ae1
|
[resolution wf] added optional parameter to skip the entity resolution
|
2021-11-26 15:38:56 +01:00 |
Claudio Atzori
|
5c6d328537
|
code formatting
|
2021-11-26 15:38:16 +01:00 |
dimitrispie
|
09fc2afdca
|
Added indi_funder_country_collab
Kept only indi_pub_has_cc_licence
|
2021-11-26 16:13:10 +02:00 |
Antonis Lempesis
|
0b4163ee0b
|
added sprint3,4, removed 2, chaos
|
2021-11-26 15:58:01 +02:00 |
dimitrispie
|
29f69f2f89
|
Sprint 4
|
2021-11-26 15:22:04 +02:00 |
Miriam Baglioni
|
ac07ed8251
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-11-25 12:32:58 +01:00 |
Miriam Baglioni
|
5fd0e610bf
|
[DOIBOOST Process] fix filtering to filter results with non null id
|
2021-11-25 12:10:45 +01:00 |
Sandro La Bruzzo
|
feea154e89
|
remove working dir after test
|
2021-11-25 11:02:38 +01:00 |
Sandro La Bruzzo
|
028a8acad8
|
add test resources
|
2021-11-25 10:54:47 +01:00 |
Sandro La Bruzzo
|
2164a2a889
|
Datacite: Code Refactor generated a general SparkApplication Scala where all the spark scala have to inherit
Commented a little the Datacite transformation code
|
2021-11-25 10:54:13 +01:00 |
Sandro La Bruzzo
|
a7cf277d98
|
Datacite: Removed HostedBy Patch as described on ticket #7219, Now all the records will have hosted by Unknown Repository
|
2021-11-22 16:03:17 +01:00 |
Sandro La Bruzzo
|
483d3039d1
|
entity resolution: added distcpt of missing entities in graph materialization
|
2021-11-22 15:55:24 +01:00 |
Sandro La Bruzzo
|
93fe8ce8b2
|
entity resolution: fix test
|
2021-11-22 15:50:43 +01:00 |
Sandro La Bruzzo
|
35e20b0647
|
updated resolution wf:
- generate a new version of the graph
- changed merge from union to join
|
2021-11-22 11:48:55 +01:00 |
Miriam Baglioni
|
fdb75b180e
|
[Cleaning] added couple of tests for DOIBOOST publications
|
2021-11-21 16:35:22 +01:00 |
Miriam Baglioni
|
0506fa2654
|
[Graph Dump] changed to mirror the changes in the model
|
2021-11-19 15:56:25 +01:00 |
Sandro La Bruzzo
|
3426451d3f
|
Merge remote-tracking branch 'origin/beta' into beta
|
2021-11-19 14:49:04 +01:00 |
Sandro La Bruzzo
|
4542a2338b
|
updated site configuration to deploy on website
|
2021-11-19 13:44:08 +01:00 |
Claudio Atzori
|
e5a2c596b2
|
Merge branch 'beta' into preserve_openorg_parent_child_relations
|
2021-11-19 11:35:46 +01:00 |
Claudio Atzori
|
f4538f3c4c
|
cleanup
|
2021-11-19 11:33:10 +01:00 |
Claudio Atzori
|
2b46b87f56
|
fixed filtering criteria applied in SparkCopyRelationsNoOpenorgs to keep the parent/child relations from OpenOrgs
|
2021-11-19 11:30:29 +01:00 |
Miriam Baglioni
|
9fae872181
|
[Graph Dump] changed to mirror the changes in the model
|
2021-11-19 11:25:50 +01:00 |
Sandro La Bruzzo
|
fc03c99805
|
fixed javadocs url after deploying site
|
2021-11-19 10:46:33 +01:00 |
Sandro La Bruzzo
|
0c0d561bc4
|
added public class into tests to create correct javadoc
|
2021-11-19 09:54:22 +01:00 |
Claudio Atzori
|
62fa61f3cf
|
merge from beta
|
2021-11-19 09:23:42 +01:00 |
Claudio Atzori
|
bd9a43cefd
|
Revert to 4094f2bb9a
|
2021-11-19 09:20:43 +01:00 |
Claudio Atzori
|
3974fa7dc1
|
Merge branch 'beta' into affiliationPropagation
|
2021-11-18 18:06:26 +01:00 |
Claudio Atzori
|
a24b9f8268
|
[dedup] trivial refactoring
|
2021-11-18 17:12:02 +01:00 |
Claudio Atzori
|
c0750fb17c
|
avoid non necessary count operations over large spark datasets
|
2021-11-18 17:11:31 +01:00 |
Claudio Atzori
|
bb5dca7979
|
cleanup
|
2021-11-18 17:10:46 +01:00 |
Miriam Baglioni
|
793b5a8e5f
|
Aggiornare 'dhp-workflows/dhp-graph-mapper/src/main/java/eu/dnetlib/dhp/oa/graph/dump/ResultMapper.java'
Removing the dump of Measure at the level of the result. We decided not to map it
|
2021-11-18 14:49:38 +01:00 |
Miriam Baglioni
|
5dc5792722
|
[Graph Dump] Change test resource to mirror the movement of the measure element
|
2021-11-18 14:39:12 +01:00 |
Miriam Baglioni
|
0136a8c266
|
[Graph Dump] Change test to mirror that measure is at the level of the isntance
|
2021-11-18 14:38:33 +01:00 |
Miriam Baglioni
|
1b79c0ee79
|
mergin with branch beta
|
2021-11-18 11:01:00 +01:00 |
Antonis Lempesis
|
cb3adb90f4
|
Merge branch 'beta' into beta
|
2021-11-17 14:33:45 +01:00 |
Antonis Lempesis
|
c283406829
|
added Universidad Polytecnica de Madrid
|
2021-11-17 15:33:00 +02:00 |
Claudio Atzori
|
e0395719d7
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-11-17 14:17:27 +01:00 |
Claudio Atzori
|
82a4e4efae
|
[cleaning wf] fixed methodology to rule out invalid result titles, based on https://support.openaire.eu/issues/7206
|
2021-11-17 14:17:22 +01:00 |
Miriam Baglioni
|
6d4a1c57ee
|
[Resolve Entities] Change test dataset to mirror the modification in the creation of the map between the pids and the unresolved
|
2021-11-17 12:41:52 +01:00 |
Sandro La Bruzzo
|
9c82d670b8
|
make class public in order to create javadoc
|
2021-11-17 12:31:02 +01:00 |
Sandro La Bruzzo
|
1f5ee116ed
|
code refactor, created and moved scala code on the correct maven folder under src/main/scala and src/test/scala
fixed test
|
2021-11-17 12:23:52 +01:00 |
Sandro La Bruzzo
|
2fd9ceac13
|
code refactor, created and moved scala code on the correct maven folder under src/main/scala and src/test/scala
|
2021-11-17 11:35:22 +01:00 |
Sandro La Bruzzo
|
2506d7a679
|
Merge branch 'mvn_site_documentation' of code-repo.d4science.org:D-Net/dnet-hadoop into mvn_site_documentation
|
2021-11-17 11:07:24 +01:00 |
Sandro La Bruzzo
|
cded363b55
|
code refactor, created and moved scala code on the correct maven folder under src/main/scala and src/test/scala
|
2021-11-17 11:06:35 +01:00 |
Miriam Baglioni
|
4094f2bb9a
|
added integration md file
|
2021-11-17 10:04:52 +01:00 |
Miriam Baglioni
|
ec8b0219ff
|
[Documentation] Added first page for Integration via unresolved entities generation
|
2021-11-16 17:41:34 +01:00 |
Miriam Baglioni
|
2bbece2ca5
|
mergin with branch beta
|
2021-11-16 16:35:40 +01:00 |
Sandro La Bruzzo
|
2d67020c59
|
added dhp-enrichment maven site template
|
2021-11-16 16:01:08 +01:00 |
Miriam Baglioni
|
28ea532ece
|
[Affilaition Propagation] moved the selection of graph relation as a preparation step
|
2021-11-16 15:24:19 +01:00 |
Sandro La Bruzzo
|
18c1d70ef4
|
Merge branch 'beta' of code-repo.d4science.org:D-Net/dnet-hadoop into mvn_site_documentation
|
2021-11-16 15:16:49 +01:00 |
Sandro La Bruzzo
|
a1cafaf2e3
|
added mvn site for dnet-hadoop project
|
2021-11-16 15:16:28 +01:00 |
Miriam Baglioni
|
7c96e3fd46
|
removed not useful dir
|
2021-11-16 13:57:26 +01:00 |
Miriam Baglioni
|
c7c0c3187b
|
[AFFILIATION PROPAGATION] Applied some SonarLint suggestions
|
2021-11-16 13:56:32 +01:00 |
Miriam Baglioni
|
c6a9f0a1a8
|
mergin with branch beta
|
2021-11-16 12:04:40 +01:00 |
Miriam Baglioni
|
99d86134f5
|
[Graph Dump] changed the dump since the measures have been moded at the level of the instance
|
2021-11-16 12:04:21 +01:00 |
Claudio Atzori
|
0a727d325d
|
[dedup] increased number of partitions in the consistency phase
|
2021-11-16 08:43:41 +01:00 |
Claudio Atzori
|
bafa2990f3
|
code formatting
|
2021-11-15 17:07:16 +01:00 |
Claudio Atzori
|
668ac25224
|
[graph resolution] using existing argument parser file name
|
2021-11-15 17:02:45 +01:00 |
Claudio Atzori
|
7d0a03f607
|
[graph resolution] minor
|
2021-11-15 14:45:54 +01:00 |
Claudio Atzori
|
941a50a2fc
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-11-15 14:42:49 +01:00 |
Claudio Atzori
|
7c804acda8
|
[graph resolution] minor
|
2021-11-15 14:42:43 +01:00 |
Sandro La Bruzzo
|
efa09057db
|
Merge branch 'beta' of code-repo.d4science.org:D-Net/dnet-hadoop into beta
|
2021-11-15 14:32:09 +01:00 |
Sandro La Bruzzo
|
48923e46a1
|
added documentation to Pubmed Class and also added mvn site for dhp-aggregations
|
2021-11-15 14:32:01 +01:00 |
Claudio Atzori
|
d2c787d416
|
[graph resolution] fixed sequence of the workflow steps
|
2021-11-15 14:31:15 +01:00 |
Claudio Atzori
|
975b10b711
|
[actionmanager] increased spark.sql.shuffle.partitions to 5000
|
2021-11-15 12:31:45 +01:00 |
Miriam Baglioni
|
4ec88c718c
|
merge with beta - resolved conflict in pom
|
2021-11-15 10:52:16 +01:00 |
Miriam Baglioni
|
6f1a434e90
|
[Bypass Action Set] Fixed test to consider the new identifier utils
|
2021-11-15 09:59:23 +01:00 |
Miriam Baglioni
|
157d33ebf9
|
[Bypass Action Set] Refactoring
|
2021-11-15 09:58:48 +01:00 |
Miriam Baglioni
|
6595135a1a
|
[Dump Schemas] changed the schema of the dumped result according to the modifications in the bestAccessRight type
|
2021-11-12 11:45:38 +01:00 |
Miriam Baglioni
|
43cae4ad88
|
Merge branch 'dump' of https://code-repo.d4science.org/D-Net/dnet-hadoop into dump
|
2021-11-12 11:36:54 +01:00 |
Miriam Baglioni
|
b3f9370125
|
merge with beta - resolved conflict in pom
|
2021-11-12 11:25:26 +01:00 |
Miriam Baglioni
|
92d0e18b55
|
[Bypass Action Set] used constant DOI instead of "doi"
|
2021-11-12 10:56:58 +01:00 |
Miriam Baglioni
|
881113743f
|
[Bypass Action Set] refactoring
|
2021-11-12 10:55:50 +01:00 |
Miriam Baglioni
|
47ccb53c4f
|
[Bypass Action Set] modification for comment #157 (comment)
|
2021-11-12 10:54:09 +01:00 |
Miriam Baglioni
|
ffb0ce1d59
|
merge with beta - resolved conflict in pom
|
2021-11-12 10:19:59 +01:00 |
Miriam Baglioni
|
716021546e
|
[Bypass Action Set] minor fix
|
2021-11-12 10:18:01 +01:00 |
Sandro La Bruzzo
|
3469cc2b1d
|
Merge branch 'beta' of code-repo.d4science.org:D-Net/dnet-hadoop into beta
|
2021-11-12 09:56:52 +01:00 |
Sandro La Bruzzo
|
a7763d2492
|
removed alternate identifier in resolutionMap
|
2021-11-12 09:56:45 +01:00 |
Miriam Baglioni
|
b8bdabfae9
|
[Graph DUmp] removed OpenAccessRoute from test in best access right
|
2021-11-11 16:16:48 +01:00 |
Miriam Baglioni
|
e5498052e8
|
[Graph DUmp] removed OpenAccessRoute from test in best access right
|
2021-11-11 16:14:10 +01:00 |
Miriam Baglioni
|
935062edec
|
[Bypass Action Set] creation of unresolved entities
|
2021-11-11 16:11:25 +01:00 |
Antonis Lempesis
|
26f086dd64
|
removed the too restrctive clause. will discuss again
|
2021-11-11 12:57:19 +02:00 |
Claudio Atzori
|
148289150f
|
Merge branch 'beta' into doiboost_url
|
2021-11-11 10:40:19 +01:00 |
Sandro La Bruzzo
|
2ca0a436ad
|
added SparkResolveEntities node to the oozie wf
|
2021-11-11 10:25:42 +01:00 |
Sandro La Bruzzo
|
9cb195314f
|
implemented and tested resolution of entities
|
2021-11-11 10:17:40 +01:00 |
Miriam Baglioni
|
6d3c4c4abe
|
mergin with branch beta
|
2021-11-11 08:59:53 +01:00 |
Miriam Baglioni
|
8cc50ecee0
|
[Graph Dump] changed AccessRight with BestAccessRight in the dump and modified the dependency to the schema to the SNAPSHOT
|
2021-11-11 08:59:20 +01:00 |
Miriam Baglioni
|
88b73f4f49
|
mergin with branch beta
|
2021-11-10 17:00:52 +01:00 |
Miriam Baglioni
|
c371b23077
|
-
|
2021-11-10 17:00:37 +01:00 |
Miriam Baglioni
|
9e214ce0eb
|
[BypassAS] addition of OC relations
|
2021-11-09 12:07:19 +01:00 |
Sandro La Bruzzo
|
6477a40670
|
implement filter of openCitation
|
2021-11-09 11:27:12 +01:00 |
Miriam Baglioni
|
6f7ca539c6
|
[BypassAS] update of results for bipFinder and FOS
|
2021-11-09 11:25:41 +01:00 |