Claudio Atzori
|
2b46b87f56
|
fixed filtering criteria applied in SparkCopyRelationsNoOpenorgs to keep the parent/child relations from OpenOrgs
|
2021-11-19 11:30:29 +01:00 |
Claudio Atzori
|
a24b9f8268
|
[dedup] trivial refactoring
|
2021-11-18 17:12:02 +01:00 |
Claudio Atzori
|
c0750fb17c
|
avoid non necessary count operations over large spark datasets
|
2021-11-18 17:11:31 +01:00 |
Claudio Atzori
|
bb5dca7979
|
cleanup
|
2021-11-18 17:10:46 +01:00 |
Miriam Baglioni
|
793b5a8e5f
|
Aggiornare 'dhp-workflows/dhp-graph-mapper/src/main/java/eu/dnetlib/dhp/oa/graph/dump/ResultMapper.java'
Removing the dump of Measure at the level of the result. We decided not to map it
|
2021-11-18 14:49:38 +01:00 |
Claudio Atzori
|
10a32f287f
|
Merge pull request '[stats wf] RIs, affiliations, parquet' (#156) from antonis.lempesis/dnet-hadoop:beta into beta
Reviewed-on: D-Net/dnet-hadoop#156
|
2021-11-17 15:02:25 +01:00 |
Antonis Lempesis
|
cb3adb90f4
|
Merge branch 'beta' into beta
|
2021-11-17 14:33:45 +01:00 |
Antonis Lempesis
|
c283406829
|
added Universidad Polytecnica de Madrid
|
2021-11-17 15:33:00 +02:00 |
Claudio Atzori
|
e0395719d7
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-11-17 14:17:27 +01:00 |
Claudio Atzori
|
82a4e4efae
|
[cleaning wf] fixed methodology to rule out invalid result titles, based on https://support.openaire.eu/issues/7206
|
2021-11-17 14:17:22 +01:00 |
Miriam Baglioni
|
6d4a1c57ee
|
[Resolve Entities] Change test dataset to mirror the modification in the creation of the map between the pids and the unresolved
|
2021-11-17 12:41:52 +01:00 |
Claudio Atzori
|
49f897ef29
|
[cleaning wf] fixed regex used to spot garbage in result titles; adjusted threshold for filtering titles
|
2021-11-16 15:24:23 +01:00 |
Claudio Atzori
|
0a727d325d
|
[dedup] increased number of partitions in the consistency phase
|
2021-11-16 08:43:41 +01:00 |
Claudio Atzori
|
bafa2990f3
|
code formatting
|
2021-11-15 17:07:16 +01:00 |
Claudio Atzori
|
668ac25224
|
[graph resolution] using existing argument parser file name
|
2021-11-15 17:02:45 +01:00 |
Claudio Atzori
|
7d0a03f607
|
[graph resolution] minor
|
2021-11-15 14:45:54 +01:00 |
Claudio Atzori
|
941a50a2fc
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-11-15 14:42:49 +01:00 |
Claudio Atzori
|
7c804acda8
|
[graph resolution] minor
|
2021-11-15 14:42:43 +01:00 |
Sandro La Bruzzo
|
efa09057db
|
Merge branch 'beta' of code-repo.d4science.org:D-Net/dnet-hadoop into beta
|
2021-11-15 14:32:09 +01:00 |
Sandro La Bruzzo
|
48923e46a1
|
added documentation to Pubmed Class and also added mvn site for dhp-aggregations
|
2021-11-15 14:32:01 +01:00 |
Claudio Atzori
|
d2c787d416
|
[graph resolution] fixed sequence of the workflow steps
|
2021-11-15 14:31:15 +01:00 |
Claudio Atzori
|
975b10b711
|
[actionmanager] increased spark.sql.shuffle.partitions to 5000
|
2021-11-15 12:31:45 +01:00 |
Claudio Atzori
|
1ecceea788
|
Merge pull request 'Open Citations' (#158) from openCitations into beta
Reviewed-on: D-Net/dnet-hadoop#158
|
2021-11-15 10:59:19 +01:00 |
Miriam Baglioni
|
4ec88c718c
|
merge with beta - resolved conflict in pom
|
2021-11-15 10:52:16 +01:00 |
Miriam Baglioni
|
6f1a434e90
|
[Bypass Action Set] Fixed test to consider the new identifier utils
|
2021-11-15 09:59:23 +01:00 |
Miriam Baglioni
|
157d33ebf9
|
[Bypass Action Set] Refactoring
|
2021-11-15 09:58:48 +01:00 |
Claudio Atzori
|
7b81607035
|
Merge pull request 'PR: Bypass Action Set' (#157) from bypass_acstionset into beta
Reviewed-on: D-Net/dnet-hadoop#157
|
2021-11-12 12:01:05 +01:00 |
Miriam Baglioni
|
92d0e18b55
|
[Bypass Action Set] used constant DOI instead of "doi"
|
2021-11-12 10:56:58 +01:00 |
Miriam Baglioni
|
881113743f
|
[Bypass Action Set] refactoring
|
2021-11-12 10:55:50 +01:00 |
Miriam Baglioni
|
47ccb53c4f
|
[Bypass Action Set] modification for comment D-Net/dnet-hadoop#157 (comment)
|
2021-11-12 10:54:09 +01:00 |
Miriam Baglioni
|
ffb0ce1d59
|
merge with beta - resolved conflict in pom
|
2021-11-12 10:19:59 +01:00 |
Miriam Baglioni
|
716021546e
|
[Bypass Action Set] minor fix
|
2021-11-12 10:18:01 +01:00 |
Claudio Atzori
|
1f2a3d1af0
|
depending on dhp-schemas:2.8.22 (release)
|
2021-11-12 10:15:11 +01:00 |
Sandro La Bruzzo
|
3469cc2b1d
|
Merge branch 'beta' of code-repo.d4science.org:D-Net/dnet-hadoop into beta
|
2021-11-12 09:56:52 +01:00 |
Sandro La Bruzzo
|
a7763d2492
|
removed alternate identifier in resolutionMap
|
2021-11-12 09:56:45 +01:00 |
Miriam Baglioni
|
935062edec
|
[Bypass Action Set] creation of unresolved entities
|
2021-11-11 16:11:25 +01:00 |
Antonis Lempesis
|
26f086dd64
|
removed the too restrctive clause. will discuss again
|
2021-11-11 12:57:19 +02:00 |
Claudio Atzori
|
8bdca3413f
|
Merge pull request 'DOIBoost Mapping: change the creation of the instance in the DOIBoost result' (#155) from doiboost_url into beta
Reviewed-on: D-Net/dnet-hadoop#155
|
2021-11-11 10:40:32 +01:00 |
Claudio Atzori
|
148289150f
|
Merge branch 'beta' into doiboost_url
|
2021-11-11 10:40:19 +01:00 |
Sandro La Bruzzo
|
2ca0a436ad
|
added SparkResolveEntities node to the oozie wf
|
2021-11-11 10:25:42 +01:00 |
Sandro La Bruzzo
|
9cb195314f
|
implemented and tested resolution of entities
|
2021-11-11 10:17:40 +01:00 |
Miriam Baglioni
|
6d3c4c4abe
|
mergin with branch beta
|
2021-11-11 08:59:53 +01:00 |
Miriam Baglioni
|
c371b23077
|
-
|
2021-11-10 17:00:37 +01:00 |
Miriam Baglioni
|
9e214ce0eb
|
[BypassAS] addition of OC relations
|
2021-11-09 12:07:19 +01:00 |
Sandro La Bruzzo
|
6477a40670
|
implement filter of openCitation
|
2021-11-09 11:27:12 +01:00 |
Miriam Baglioni
|
6f7ca539c6
|
[BypassAS] update of results for bipFinder and FOS
|
2021-11-09 11:25:41 +01:00 |
Miriam Baglioni
|
a7d50c499b
|
[BypassAS] prepare FOS subject, test and model for FOS and BipFinder scores
|
2021-11-08 16:44:19 +01:00 |
Antonis Lempesis
|
91354c6068
|
- fetching all context related results
- storing tables as parquet
|
2021-11-08 15:15:46 +02:00 |
Miriam Baglioni
|
df7ee77c7a
|
[DOIBoost Mapping] removed not needed comments
|
2021-11-04 16:24:07 +01:00 |
Miriam Baglioni
|
de63d29b6f
|
[DOIBoost Mapping] Fix to avoid to produce results with null as identifier (probably due to the filtering function in the factory for the creation of the id)
|
2021-11-04 16:16:40 +01:00 |