Claudio Atzori
|
e0395719d7
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-11-17 14:17:27 +01:00 |
Claudio Atzori
|
82a4e4efae
|
[cleaning wf] fixed methodology to rule out invalid result titles, based on https://support.openaire.eu/issues/7206
|
2021-11-17 14:17:22 +01:00 |
Miriam Baglioni
|
6d4a1c57ee
|
[Resolve Entities] Change test dataset to mirror the modification in the creation of the map between the pids and the unresolved
|
2021-11-17 12:41:52 +01:00 |
Claudio Atzori
|
49f897ef29
|
[cleaning wf] fixed regex used to spot garbage in result titles; adjusted threshold for filtering titles
|
2021-11-16 15:24:23 +01:00 |
Claudio Atzori
|
0a727d325d
|
[dedup] increased number of partitions in the consistency phase
|
2021-11-16 08:43:41 +01:00 |
Claudio Atzori
|
bafa2990f3
|
code formatting
|
2021-11-15 17:07:16 +01:00 |
Claudio Atzori
|
668ac25224
|
[graph resolution] using existing argument parser file name
|
2021-11-15 17:02:45 +01:00 |
Claudio Atzori
|
7d0a03f607
|
[graph resolution] minor
|
2021-11-15 14:45:54 +01:00 |
Claudio Atzori
|
941a50a2fc
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-11-15 14:42:49 +01:00 |
Claudio Atzori
|
7c804acda8
|
[graph resolution] minor
|
2021-11-15 14:42:43 +01:00 |
Sandro La Bruzzo
|
efa09057db
|
Merge branch 'beta' of code-repo.d4science.org:D-Net/dnet-hadoop into beta
|
2021-11-15 14:32:09 +01:00 |
Sandro La Bruzzo
|
48923e46a1
|
added documentation to Pubmed Class and also added mvn site for dhp-aggregations
|
2021-11-15 14:32:01 +01:00 |
Claudio Atzori
|
d2c787d416
|
[graph resolution] fixed sequence of the workflow steps
|
2021-11-15 14:31:15 +01:00 |
Claudio Atzori
|
975b10b711
|
[actionmanager] increased spark.sql.shuffle.partitions to 5000
|
2021-11-15 12:31:45 +01:00 |
Claudio Atzori
|
1ecceea788
|
Merge pull request 'Open Citations' (#158) from openCitations into beta
Reviewed-on: D-Net/dnet-hadoop#158
|
2021-11-15 10:59:19 +01:00 |
Miriam Baglioni
|
4ec88c718c
|
merge with beta - resolved conflict in pom
|
2021-11-15 10:52:16 +01:00 |
Miriam Baglioni
|
6f1a434e90
|
[Bypass Action Set] Fixed test to consider the new identifier utils
|
2021-11-15 09:59:23 +01:00 |
Miriam Baglioni
|
157d33ebf9
|
[Bypass Action Set] Refactoring
|
2021-11-15 09:58:48 +01:00 |
Claudio Atzori
|
7b81607035
|
Merge pull request 'PR: Bypass Action Set' (#157) from bypass_acstionset into beta
Reviewed-on: D-Net/dnet-hadoop#157
|
2021-11-12 12:01:05 +01:00 |
Miriam Baglioni
|
92d0e18b55
|
[Bypass Action Set] used constant DOI instead of "doi"
|
2021-11-12 10:56:58 +01:00 |
Miriam Baglioni
|
881113743f
|
[Bypass Action Set] refactoring
|
2021-11-12 10:55:50 +01:00 |
Miriam Baglioni
|
47ccb53c4f
|
[Bypass Action Set] modification for comment D-Net/dnet-hadoop#157 (comment)
|
2021-11-12 10:54:09 +01:00 |
Miriam Baglioni
|
ffb0ce1d59
|
merge with beta - resolved conflict in pom
|
2021-11-12 10:19:59 +01:00 |
Miriam Baglioni
|
716021546e
|
[Bypass Action Set] minor fix
|
2021-11-12 10:18:01 +01:00 |
Claudio Atzori
|
1f2a3d1af0
|
depending on dhp-schemas:2.8.22 (release)
|
2021-11-12 10:15:11 +01:00 |
Sandro La Bruzzo
|
3469cc2b1d
|
Merge branch 'beta' of code-repo.d4science.org:D-Net/dnet-hadoop into beta
|
2021-11-12 09:56:52 +01:00 |
Sandro La Bruzzo
|
a7763d2492
|
removed alternate identifier in resolutionMap
|
2021-11-12 09:56:45 +01:00 |
Miriam Baglioni
|
935062edec
|
[Bypass Action Set] creation of unresolved entities
|
2021-11-11 16:11:25 +01:00 |
Antonis Lempesis
|
26f086dd64
|
removed the too restrictive clause. will discuss again
|
2021-11-11 12:57:19 +02:00 |
Claudio Atzori
|
8bdca3413f
|
Merge pull request 'DOIBoost Mapping: change the creation of the instance in the DOIBoost result' (#155) from doiboost_url into beta
Reviewed-on: D-Net/dnet-hadoop#155
|
2021-11-11 10:40:32 +01:00 |
Claudio Atzori
|
148289150f
|
Merge branch 'beta' into doiboost_url
|
2021-11-11 10:40:19 +01:00 |
Sandro La Bruzzo
|
2ca0a436ad
|
added SparkResolveEntities node to the oozie wf
|
2021-11-11 10:25:42 +01:00 |
Sandro La Bruzzo
|
9cb195314f
|
implemented and tested resolution of entities
|
2021-11-11 10:17:40 +01:00 |
Miriam Baglioni
|
6d3c4c4abe
|
merging with branch beta
|
2021-11-11 08:59:53 +01:00 |
Miriam Baglioni
|
c371b23077
|
-
|
2021-11-10 17:00:37 +01:00 |
Miriam Baglioni
|
9e214ce0eb
|
[BypassAS] addition of OC relations
|
2021-11-09 12:07:19 +01:00 |
Sandro La Bruzzo
|
6477a40670
|
implement filter of openCitation
|
2021-11-09 11:27:12 +01:00 |
Miriam Baglioni
|
6f7ca539c6
|
[BypassAS] update of results for bipFinder and FOS
|
2021-11-09 11:25:41 +01:00 |
Miriam Baglioni
|
a7d50c499b
|
[BypassAS] prepare FOS subject, test and model for FOS and BipFinder scores
|
2021-11-08 16:44:19 +01:00 |
Antonis Lempesis
|
91354c6068
|
- fetching all context related results
- storing tables as parquet
|
2021-11-08 15:15:46 +02:00 |
Miriam Baglioni
|
df7ee77c7a
|
[DOIBoost Mapping] removed not needed comments
|
2021-11-04 16:24:07 +01:00 |
Miriam Baglioni
|
de63d29b6f
|
[DOIBoost Mapping] Fix to avoid producing results with null as identifier (probably due to the filtering function in the factory for the creation of the id)
|
2021-11-04 16:16:40 +01:00 |
Miriam Baglioni
|
d50057b2d9
|
[DOIBoost Mapping] changed the way the url for the instance is created: we use the Crossref guidelines https://doi.org/doi
|
2021-11-03 16:59:37 +01:00 |
Miriam Baglioni
|
edf55395e9
|
added test resource
|
2021-11-03 16:49:30 +01:00 |
Miriam Baglioni
|
d97ea82a29
|
[DOIBoost Mapping] Added test to verify the instance created for Crossref will have just the url related to the doi
|
2021-11-03 16:45:15 +01:00 |
Miriam Baglioni
|
96769b4481
|
[DOIBoost - Mapping] Changed the logic that brought into the instance urls that should not be there: the url of the doi in the json is reachable from the root (json/"URL"); other urls were added from the links element. The mapping from the link element has now been removed
|
2021-11-03 16:43:36 +01:00 |
Miriam Baglioni
|
683fe093cf
|
[DOIBoost - Mapping] Remove the addition of the instance to the MAG publication record
|
2021-11-03 15:51:26 +01:00 |
Miriam Baglioni
|
b2bb8d9d79
|
[DOIBoost - Mapping] selecting the url from Crossref containing the doi
|
2021-11-03 15:44:57 +01:00 |
Miriam Baglioni
|
779318961c
|
[DOIBoost - Mapping] removed the url from crossref containing the api.elsevier.com... string in the url
|
2021-11-03 14:38:52 +01:00 |
Miriam Baglioni
|
2480e590d1
|
[DOIBoost - Mapping] changed the type to which dissertation from Crossref is mapped: from 006 Doctoral thesis to 0044 Thesis, since a dissertation could be either a doctoral or a master thesis
|
2021-11-03 14:25:23 +01:00 |