Sandro La Bruzzo
|
477cb10715
|
Merge remote-tracking branch 'origin/beta' into beta
|
2021-09-27 16:57:23 +02:00 |
Sandro La Bruzzo
|
be79d74e3d
|
Fixed DoiBoost generation to point to correct organization in affiliation relation
|
2021-09-27 16:57:04 +02:00 |
Claudio Atzori
|
474117c2e8
|
Merge branch 'beta' into dedup_whitelist
|
2021-09-27 16:41:25 +02:00 |
Miriam Baglioni
|
476a4708d6
|
merging with branch beta
|
2021-09-27 16:02:32 +02:00 |
Miriam Baglioni
|
5ec69889db
|
OpenCitations: creation of AS from OC
|
2021-09-27 16:02:06 +02:00 |
Claudio Atzori
|
a53acfbc06
|
Merge pull request '[stats] updates in the mapping, indicators, wf' (#145) from antonis.lempesis/dnet-hadoop:beta into beta
Reviewed-on: #145
|
2021-09-27 15:59:54 +02:00 |
Alessia Bardi
|
b924276e18
|
tests to generate records for the EOSC-Future demo with the EOSC Jupyter Notebook subject
|
2021-09-24 17:11:56 +02:00 |
Antonis Lempesis
|
a1e1cf32d7
|
fixed an impala error
|
2021-09-24 12:57:24 +03:00 |
Antonis Lempesis
|
f358cabb2b
|
fixed typo
|
2021-09-22 21:50:37 +03:00 |
Miriam Baglioni
|
eedf7c3310
|
merging with branch beta
|
2021-09-22 15:18:34 +02:00 |
Miriam Baglioni
|
f2118d771a
|
first steps in the implementation of the integration of opencitations
|
2021-09-22 15:18:05 +02:00 |
Claudio Atzori
|
7fa60e166e
|
Merge branch 'beta' into dedup_whitelist
|
2021-09-22 11:31:18 +02:00 |
Antonis Lempesis
|
421d55265d
|
created hive action for observatory queries
|
2021-09-21 03:07:58 +03:00 |
Enrico Ottonello
|
92a63f78fe
|
multiple download attempts handling if a connection to orcid server fails
|
2021-09-20 18:25:00 +02:00 |
Enrico Ottonello
|
0c74f5667e
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-09-20 18:12:31 +02:00 |
miconis
|
853333bdde
|
implementation of the whitelist for similarity relations
|
2021-09-20 16:21:47 +02:00 |
Antonis Lempesis
|
8b681dcf1b
|
attempt to make the observatory wf run in hive
|
2021-09-18 00:35:14 +03:00 |
Antonis Lempesis
|
2943287d10
|
fixed the definition of cc_licence, part II
|
2021-09-16 15:59:06 +03:00 |
Antonis Lempesis
|
dd2329849f
|
fixed the definition of cc_licence
|
2021-09-16 13:50:34 +03:00 |
Claudio Atzori
|
09c2eb7f62
|
Merge branch 'beta' into clean_relations
|
2021-09-16 11:09:47 +02:00 |
Miriam Baglioni
|
e9ccdf853f
|
related to #132
|
2021-09-15 18:44:54 +02:00 |
Claudio Atzori
|
12766bf5f2
|
Merge branch 'beta' into clean_relations
|
2021-09-15 17:18:15 +02:00 |
Claudio Atzori
|
663b1556d7
|
manually integrating PR#140 #140
|
2021-09-15 16:40:25 +02:00 |
Claudio Atzori
|
ebf53a1616
|
added cleaning for relation fields: subRelType & relClass according to dedicated vocabs
|
2021-09-15 16:10:37 +02:00 |
Enrico Ottonello
|
8b804e7fe1
|
removed unused imports
|
2021-09-14 17:30:52 +02:00 |
Enrico Ottonello
|
aefa36c54b
|
other task executions go ahead if UnknownHostException happens on a single task
|
2021-09-14 17:26:15 +02:00 |
Antonis Lempesis
|
de9bf3a161
|
added cc_licences and abstracts in observatory db
|
2021-09-14 01:29:08 +03:00 |
Antonis Lempesis
|
9b1936701c
|
fixed yet another typo
|
2021-09-13 21:07:44 +03:00 |
Antonis Lempesis
|
8fc89ae822
|
moved context table creation before indicators
|
2021-09-13 14:33:23 +03:00 |
Antonis Lempesis
|
461bf90ca6
|
fixed the gold_oa definition
|
2021-09-13 11:10:30 +03:00 |
Antonis Lempesis
|
43852bac0e
|
creating other::other concept for all contexts
|
2021-09-13 01:36:41 +03:00 |
Antonis Lempesis
|
f13cca7e83
|
moved dependencies of indicators before them...
|
2021-09-08 23:07:58 +03:00 |
Antonis Lempesis
|
c6ada217a1
|
fixed typo
|
2021-09-08 22:34:59 +03:00 |
Antonis Lempesis
|
1250ae197f
|
using new indicators for the definition of peerreviewed, gold, and green
|
2021-09-08 14:08:43 +03:00 |
Antonis Lempesis
|
ccee451dde
|
added indicators of sprint 2 in monitor db
|
2021-09-07 23:17:13 +03:00 |
Sandro La Bruzzo
|
aed29156c7
|
changed behavior of the transformation job so that it doesn't fail at the first error
|
2021-09-07 19:05:46 +02:00 |
Sandro La Bruzzo
|
3c6fc2096c
|
fixed bug in the OAI iterator that skipped cleaned records
|
2021-09-07 10:46:26 +02:00 |
Sandro La Bruzzo
|
d4dadf6d77
|
reduced max number of PID in Relatedentity
|
2021-09-02 14:21:24 +02:00 |
Sandro La Bruzzo
|
9f8a80deb7
|
fixed wrong import of unresolved relation in openaire
|
2021-09-01 14:16:27 +02:00 |
Alessia Bardi
|
3762b17f7b
|
added VERSION and PART relationship and re-ordered according to my personal and obviously possibly biased ordering
|
2021-08-31 20:20:05 +02:00 |
Sandro La Bruzzo
|
e8b3cb9147
|
Implemented method to download delta updates in EBI Links
|
2021-08-30 09:32:45 +02:00 |
Alessia Bardi
|
ccf4103a25
|
keep the original url if the decoder fails for any reason
|
2021-08-25 10:07:58 +02:00 |
Sandro La Bruzzo
|
45898c71ac
|
fixed wrong doi in pubmed
|
2021-08-24 15:20:04 +02:00 |
Alessia Bardi
|
00a28c0080
|
originalId was renamed to acronym
|
2021-08-23 15:02:21 +02:00 |
Alessia Bardi
|
f19b04d41b
|
code formatting after mvn compile
|
2021-08-23 14:33:39 +02:00 |
Alessia Bardi
|
931f430129
|
Merge branch 'beta' into datasource_model_eosc_beta
|
2021-08-23 11:57:21 +02:00 |
Alessia Bardi
|
4c1474e693
|
Dealing with #6859#note-2: we have to decode URLs to avoid & and other chars encoded because of the original XML representation of data
|
2021-08-20 17:03:30 +02:00 |
Miriam Baglioni
|
5f8ccbc365
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-08-20 11:13:47 +02:00 |
Miriam Baglioni
|
882abb40e4
|
CrossrefDump -
|
2021-08-20 11:12:53 +02:00 |
Miriam Baglioni
|
45c62609af
|
CrossrefDump - modified because parameter file was moved
|
2021-08-20 11:12:31 +02:00 |
Miriam Baglioni
|
35880c0e7b
|
CrossrefDump - changed the wf to be able to resume from one of the steps
|
2021-08-20 11:11:35 +02:00 |
Miriam Baglioni
|
f3b6c392c1
|
CrossrefDump - moving parameter file under folder crossref_dump_reader
|
2021-08-20 11:10:58 +02:00 |
Miriam Baglioni
|
65822400ce
|
CrossrefDump - added new parameter file that was missing
|
2021-08-20 11:10:35 +02:00 |
Alessia Bardi
|
a053e1513c
|
different funders in blacklist from BETA and PROD aggregator
|
2021-08-19 11:32:27 +02:00 |
Alessia Bardi
|
812bd54c57
|
different funders in blacklist from BETA and PROD aggregator
|
2021-08-19 11:30:14 +02:00 |
Miriam Baglioni
|
a65d3caaea
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-08-19 10:29:10 +02:00 |
Miriam Baglioni
|
e5cf11d088
|
change open access route to result matching hbm to gold
|
2021-08-19 10:29:04 +02:00 |
Claudio Atzori
|
7c0c67bdd6
|
added mock pom
|
2021-08-13 17:45:53 +02:00 |
Claudio Atzori
|
82086f3422
|
fixed directory name
|
2021-08-13 17:42:14 +02:00 |
Claudio Atzori
|
bc7068106c
|
added crossref download oozie workflow
|
2021-08-13 17:19:44 +02:00 |
Claudio Atzori
|
2c0a05f11a
|
manually merged PR#139
|
2021-08-13 17:15:53 +02:00 |
Claudio Atzori
|
d43667d857
|
Merge pull request 'Automatic download of Crossref' (#138) from crossref_dw_wf into beta
Reviewed-on: #138
|
2021-08-13 17:10:10 +02:00 |
Miriam Baglioni
|
5856ca8a7b
|
merging with branch beta - resolved conflicts
|
2021-08-13 16:45:45 +02:00 |
Miriam Baglioni
|
6fec71e8d2
|
removed from the wf name the specifics of the infra we are running the wf on
|
2021-08-13 16:39:02 +02:00 |
Miriam Baglioni
|
ed7e28490a
|
change in sh
|
2021-08-13 16:19:01 +02:00 |
Claudio Atzori
|
7743d0f919
|
consolidated dnet wf profiles into the same submodule
|
2021-08-13 16:14:54 +02:00 |
Miriam Baglioni
|
6eb7508995
|
merging with branch beta
|
2021-08-13 16:07:04 +02:00 |
Claudio Atzori
|
f74adc4752
|
added DownloadCSV2 as alternative implementation of the same download procedure
|
2021-08-13 15:52:15 +02:00 |
Claudio Atzori
|
5f0903d50d
|
fixed CSV downloader & tests
|
2021-08-13 14:17:54 +02:00 |
Claudio Atzori
|
17cefe6a97
|
[HBM] removed stale replace option
|
2021-08-13 12:43:59 +02:00 |
Claudio Atzori
|
7ee2757fcd
|
fixed DownloadCSV parameters spec; workflow patching the hostedby replaces the graph content (publication, datasource) rather than creating a copy
|
2021-08-13 12:41:01 +02:00 |
Claudio Atzori
|
c3ad4ab701
|
minor fixes
|
2021-08-13 12:23:15 +02:00 |
Claudio Atzori
|
baed5e3337
|
test classes moved in specific components
|
2021-08-13 12:14:47 +02:00 |
Claudio Atzori
|
3359f73fcf
|
cleanup & best practices
|
2021-08-13 12:00:42 +02:00 |
Miriam Baglioni
|
f4ec81c92c
|
merging with branch beta
|
2021-08-13 10:31:35 +02:00 |
Miriam Baglioni
|
dc8b05b39e
|
Hosted By Map - changed the association with the datasource id for the hostedby element: there is no longer any need to compute it. With the new HBM it is already the id in the graph
|
2021-08-13 10:18:25 +02:00 |
Miriam Baglioni
|
32fd75691f
|
refactoring
|
2021-08-13 10:15:42 +02:00 |
Miriam Baglioni
|
01db1f8bc4
|
GetCSV refactoring - removed not needed import
|
2021-08-13 10:14:17 +02:00 |
Miriam Baglioni
|
964a46ca21
|
GetCSV refactoring - modified due to movement of classes
|
2021-08-13 10:11:18 +02:00 |
Miriam Baglioni
|
eaf077fc34
|
GetCSV refactoring - removed not needed dependency
|
2021-08-13 10:08:58 +02:00 |
Miriam Baglioni
|
5f674efb0c
|
moved dependency version in external pom
|
2021-08-13 10:07:53 +02:00 |
Miriam Baglioni
|
5cd5714530
|
GetCSV refactoring - added ignore annotation for fields not in input csv
|
2021-08-13 10:06:49 +02:00 |
Miriam Baglioni
|
ed183d878e
|
GetCSV refactoring - modified test classes due to change in the model of projects and programme
|
2021-08-13 09:28:51 +02:00 |
Miriam Baglioni
|
8769dd8eef
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:20:56 +02:00 |
Miriam Baglioni
|
6b9e1bf2e3
|
GetCSV refactoring - removing not needed dependency
|
2021-08-12 18:17:50 +02:00 |
Miriam Baglioni
|
d57b2bb927
|
GetCSV refactoring - removing not needed dependency
|
2021-08-12 18:12:51 +02:00 |
Miriam Baglioni
|
9da74b544a
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:12:15 +02:00 |
Miriam Baglioni
|
ab8abd61bb
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:11:07 +02:00 |
Miriam Baglioni
|
335a824e34
|
GetCSV refactoring - fixed issue
|
2021-08-12 18:10:10 +02:00 |
Miriam Baglioni
|
f0845e9865
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:04:58 +02:00 |
Miriam Baglioni
|
7a789423aa
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:04:27 +02:00 |
Miriam Baglioni
|
e9fc3ef3bc
|
GetCSV refactoring - changed to use the new class to get and write the csv file
|
2021-08-12 18:03:41 +02:00 |
Miriam Baglioni
|
4317211a2b
|
GetCSV refactoring - refactoring due to movement
|
2021-08-12 18:03:14 +02:00 |
Miriam Baglioni
|
b62cd656a7
|
GetCSV refactoring - changed the model to store only the information needed
|
2021-08-12 18:01:10 +02:00 |
Miriam Baglioni
|
d36e925277
|
GetCSV refactoring - moved under model package
|
2021-08-12 18:00:21 +02:00 |
Miriam Baglioni
|
6e84b3951f
|
GetCSV refactoring - moving classes to dhp-common that have dependency with GetCSV class (that was located in graph-mapper)
|
2021-08-12 17:57:41 +02:00 |
Claudio Atzori
|
9587d4aee8
|
Merge branch 'beta' into hostedbymap
|
2021-08-12 17:04:30 +02:00 |
Claudio Atzori
|
86d940044c
|
added test to verify bad records from FWF-E-Book-Library
|
2021-08-12 11:32:56 +02:00 |
Claudio Atzori
|
8cdce59e0e
|
[graph raw] let the mapping exceptions propagate
|
2021-08-12 11:32:26 +02:00 |
Miriam Baglioni
|
08dd2b2102
|
moving the dependency version to the external pom file
|
2021-08-11 18:09:41 +02:00 |
Miriam Baglioni
|
ac417ca798
|
removed not needed test resource
|
2021-08-11 17:50:33 +02:00 |
Miriam Baglioni
|
e33daaeee8
|
reverting
|
2021-08-11 17:46:19 +02:00 |
Miriam Baglioni
|
785db1d5b2
|
refactoring
|
2021-08-11 17:44:07 +02:00 |
Miriam Baglioni
|
95e5482bbb
|
removing not needed dependency
|
2021-08-11 17:42:26 +02:00 |
Miriam Baglioni
|
b966329833
|
reverting
|
2021-08-11 17:37:00 +02:00 |
Miriam Baglioni
|
8ad7c71417
|
reverting
|
2021-08-11 17:36:12 +02:00 |
Miriam Baglioni
|
0e1a6bec20
|
reverting
|
2021-08-11 17:32:29 +02:00 |
Miriam Baglioni
|
c6a2a780a9
|
reverting
|
2021-08-11 17:30:17 +02:00 |
Miriam Baglioni
|
b6b58bba28
|
reverting
|
2021-08-11 17:25:37 +02:00 |
Miriam Baglioni
|
804589eb30
|
reverting
|
2021-08-11 17:23:35 +02:00 |
Miriam Baglioni
|
d688749ad9
|
reverting
|
2021-08-11 17:22:28 +02:00 |
Miriam Baglioni
|
524c06e028
|
reverting
|
2021-08-11 17:20:30 +02:00 |
Miriam Baglioni
|
7aa3260729
|
reverting
|
2021-08-11 17:18:45 +02:00 |
Miriam Baglioni
|
55fc500d8d
|
reverting
|
2021-08-11 17:17:48 +02:00 |
Miriam Baglioni
|
8229632839
|
adding assertions to the mapping of the unibi part of gold list
|
2021-08-11 16:36:01 +02:00 |
Miriam Baglioni
|
b1c6140ebf
|
removed all comments in Italian
|
2021-08-11 16:23:33 +02:00 |
Miriam Baglioni
|
52c18c2697
|
removed not needed test class. The functionality has been moved
|
2021-08-11 16:16:55 +02:00 |
Miriam Baglioni
|
8da3a25cf6
|
merging with branch beta
|
2021-08-11 15:55:34 +02:00 |
Claudio Atzori
|
9f4db73f30
|
updated/fixed unit tests
|
2021-08-11 15:02:51 +02:00 |
Claudio Atzori
|
61d811ba53
|
suggestions from intellij
|
2021-08-11 12:18:20 +02:00 |
Claudio Atzori
|
2ee21da43b
|
suggestions from SonarLint
|
2021-08-11 12:13:22 +02:00 |
Miriam Baglioni
|
b954fe9ba8
|
merging with branch beta
|
2021-08-11 10:12:46 +02:00 |
Miriam Baglioni
|
b688567db5
|
hostedbymap - modified part of test to check the bestaccessright changed
|
2021-08-11 10:12:10 +02:00 |
Miriam Baglioni
|
9731a6144a
|
hostedbymap - in case the journal is open access the access may be changed also for the best access right in the result
|
2021-08-10 17:49:45 +02:00 |
Miriam Baglioni
|
a90bac3bc9
|
Graph Dump - added method to test class to verify addition of validation date in projects for community result
|
2021-08-09 16:36:54 +02:00 |
Miriam Baglioni
|
bd0d7bfba7
|
Graph Dump - added resources for testing addition of validation date in project for communityresult
|
2021-08-09 16:36:17 +02:00 |
Miriam Baglioni
|
8daaa32e90
|
Graph Dump - added resources for testing
|
2021-08-09 15:46:29 +02:00 |
Miriam Baglioni
|
bc9e3a06ba
|
Graph Dump - extended the test class
|
2021-08-09 15:46:06 +02:00 |
Miriam Baglioni
|
2efa5abda5
|
refactoring
|
2021-08-09 12:28:36 +02:00 |
Claudio Atzori
|
577f3b1ac8
|
added dnet workflows responsible for the graph construction, enrichment, provision
|
2021-08-09 11:53:58 +02:00 |
Miriam Baglioni
|
da20fceaf7
|
removed all the part related to the crossref dump download since it is done in a separate workflow
|
2021-08-09 11:53:45 +02:00 |
Claudio Atzori
|
964f97ed4d
|
cleanup
|
2021-08-09 11:53:06 +02:00 |
Miriam Baglioni
|
54a6cbb244
|
CrossrefDump - put token among the parameters
|
2021-08-09 11:41:10 +02:00 |
Miriam Baglioni
|
b7079804cb
|
CrossrefDump - put token among the parameters
|
2021-08-09 11:34:35 +02:00 |
Miriam Baglioni
|
a5f82f442b
|
Merge branch 'beta' into doiboost_wf
|
2021-08-09 11:17:51 +02:00 |
Miriam Baglioni
|
b6dcf89d22
|
merging with branch beta
|
2021-08-09 11:14:43 +02:00 |
Miriam Baglioni
|
eff499af9f
|
added new tests and changed the test example
|
2021-08-09 11:12:30 +02:00 |
Miriam Baglioni
|
5d70f842eb
|
merging with branch beta
|
2021-08-06 18:57:09 +02:00 |
Miriam Baglioni
|
c3931557e3
|
extended the logic of the dump to consider the validation date in the relation (also in the dumped result for communities and funders at the level of the project), the extension on the instance for the APC, the pid, the alternate identifiers, and the extension of the AccessRight to store the OpenAccessRoute. Added new resource for testing and extended the old class to verify the new dump. Also fixed an issue in the relation dump: only relations whose source and target are entities in the graph are dumped. The same holds for references to projects
|
2021-08-06 18:56:18 +02:00 |
Claudio Atzori
|
66f398fe6f
|
Merge pull request '[stats] fixed a typo' (#133) from antonis.lempesis/dnet-hadoop:beta into beta
Reviewed-on: #133
|
2021-08-06 14:29:57 +02:00 |
Miriam Baglioni
|
6bd1eca7e0
|
merge branch with beta
|
2021-08-05 15:23:32 +02:00 |
Miriam Baglioni
|
73dc082927
|
added new dumped fields (openaccessroute, pid and alternate identifier at the level of the instance) and the bipFinder measure at the level of the result
|
2021-08-05 15:20:50 +02:00 |
Miriam Baglioni
|
ee13da9258
|
merge branch with master
|
2021-08-05 11:34:20 +02:00 |
Miriam Baglioni
|
bd096f5170
|
removed not needed param file
|
2021-08-05 10:55:43 +02:00 |
Miriam Baglioni
|
5faeefbda8
|
added script to download the dump, changed the workflow input parameters
|
2021-08-05 10:54:03 +02:00 |
Miriam Baglioni
|
1965e4eece
|
new workflow for downloading the dump of crossref and unpack it
|
2021-08-04 18:29:03 +02:00 |
Claudio Atzori
|
83c04e5d28
|
mapping test for dataset records adapted to reflect the delegated pid authority (zenodo)
|
2021-08-04 10:37:57 +02:00 |
Miriam Baglioni
|
b4eb026c8b
|
merging with branch beta
|
2021-08-04 10:21:37 +02:00 |
Miriam Baglioni
|
c7b71647c6
|
Hosted By Map - modification of the resource for testing the presence of only one entry per datasource id
|
2021-08-04 10:20:02 +02:00 |
Miriam Baglioni
|
eb8c3f8594
|
Hosted By Map - test modified because of the application of the new aggregator on datasources
|
2021-08-04 10:19:17 +02:00 |