Claudio Atzori
|
391aa1373b
|
added unit test
|
2022-01-19 17:13:21 +01:00 |
Claudio Atzori
|
62f135262e
|
code formatting
|
2022-01-19 12:30:52 +01:00 |
Claudio Atzori
|
44a937f4ed
|
factored out entity grouping implementation, extended to consider results from delegated authorities rather than identical records from other sources
|
2022-01-19 12:24:52 +01:00 |
Miriam Baglioni
|
42e8f76778
|
[GraphCleaning] change the return value in the filtering function to avoid losing the APC entities
|
2022-01-13 16:06:43 +01:00 |
Claudio Atzori
|
4f212652ca
|
scalafmt: code formatting
|
2022-01-11 16:57:48 +01:00 |
Miriam Baglioni
|
be0acccf42
|
Merge branch 'beta' into dump
|
2021-12-22 12:39:57 +01:00 |
Sandro La Bruzzo
|
3920d68992
|
Fixed workflow generation of delta in datacite
|
2021-12-21 11:41:49 +01:00 |
Sandro La Bruzzo
|
b881ee5ef8
|
[scholexplorer]
- implemented generation of Scholix for the delta update of Datacite
|
2021-12-15 11:25:32 +01:00 |
Miriam Baglioni
|
56409d1281
|
[Dump] resolved conflicts with beta and merging
|
2021-12-14 15:03:45 +01:00 |
Miriam Baglioni
|
a3592b463a
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-12-14 14:58:26 +01:00 |
Claudio Atzori
|
aff3ddc8d2
|
added cleaning for the format field, removing carriage return and tab characters
|
2021-12-14 11:41:46 +01:00 |
Miriam Baglioni
|
936578aaf1
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-12-13 15:01:47 +01:00 |
Claudio Atzori
|
41c70c607d
|
cleaning workflow assigns the proper default instance type when a value could not be cleaned using the vocabularies
|
2021-12-09 16:44:28 +01:00 |
Claudio Atzori
|
e6e177dda0
|
vocabulary-based cleaning also considers the term label when looking up a synonym
|
2021-12-09 13:57:53 +01:00 |
Miriam Baglioni
|
b113586207
|
resolved conflicts
|
2021-12-07 10:16:14 +01:00 |
Sandro La Bruzzo
|
5d51b3dd4a
|
Merge pull request 'scala_refactor' (#169) from scala_refactor into beta
Reviewed-on: #169
|
2021-12-06 15:33:44 +01:00 |
Miriam Baglioni
|
96a7d46278
|
[Graph Dump] fixed tests
|
2021-12-06 15:06:32 +01:00 |
Sandro La Bruzzo
|
81bf604059
|
[scala-refactor] Module dhp-common:
Moved all scala source into src/main/scala and src/test/scala
|
2021-12-06 11:29:24 +01:00 |
Claudio Atzori
|
9132727793
|
fixed date cleaning test
|
2021-12-06 10:54:05 +01:00 |
Claudio Atzori
|
863a2f9db3
|
avoid filtering OAF records defined as invisible = true
|
2021-12-03 09:08:12 +01:00 |
Miriam Baglioni
|
8905a39bf3
|
merging with branch beta
|
2021-12-02 13:17:29 +01:00 |
Sandro La Bruzzo
|
1e1f5e4fe0
|
minor fix
|
2021-11-25 13:03:17 +01:00 |
Sandro La Bruzzo
|
2164a2a889
|
Datacite: code refactor, introduced a general Scala SparkApplication from which all the Spark Scala applications have to inherit
Added a few comments to the Datacite transformation code
|
2021-11-25 10:54:13 +01:00 |
Sandro La Bruzzo
|
4542a2338b
|
updated site configuration to deploy on website
|
2021-11-19 13:44:08 +01:00 |
Miriam Baglioni
|
9fae872181
|
[Graph Dump] changed to mirror the changes in the model
|
2021-11-19 11:25:50 +01:00 |
Claudio Atzori
|
62fa61f3cf
|
merge from beta
|
2021-11-19 09:23:42 +01:00 |
Claudio Atzori
|
bd9a43cefd
|
Revert to 4094f2bb9a
|
2021-11-19 09:20:43 +01:00 |
Claudio Atzori
|
82a4e4efae
|
[cleaning wf] fixed methodology to rule out invalid result titles, based on https://support.openaire.eu/issues/7206
|
2021-11-17 14:17:22 +01:00 |
Sandro La Bruzzo
|
60ae874dcb
|
Merge branch 'beta' of code-repo.d4science.org:D-Net/dnet-hadoop into mvn_site_documentation
|
2021-11-17 11:08:34 +01:00 |
Claudio Atzori
|
49f897ef29
|
[cleaning wf] fixed regex used to spot garbage in result titles; adjusted threshold for filtering titles
|
2021-11-16 15:24:23 +01:00 |
Sandro La Bruzzo
|
a1cafaf2e3
|
added mvn site for dnet-hadoop project
|
2021-11-16 15:16:28 +01:00 |
Sandro La Bruzzo
|
aafdffa6b3
|
resolved conflict
|
2021-10-26 09:45:46 +02:00 |
Sandro La Bruzzo
|
034304b33a
|
conflict resolved on merge
|
2021-10-26 09:40:47 +02:00 |
Claudio Atzori
|
6b34ba737e
|
minor
|
2021-10-21 14:16:18 +02:00 |
Sandro La Bruzzo
|
ae4e99a471
|
Adapted the PID resolution workflow to work within the OpenAIRE data workflow
- Added relations in both directions on all Scholexplorer datasources
|
2021-10-20 17:12:16 +02:00 |
Miriam Baglioni
|
c8321ad31a
|
merge with branch beta
|
2021-10-01 12:59:08 +02:00 |
Claudio Atzori
|
663b1556d7
|
manually integrating PR #140
|
2021-09-15 16:40:25 +02:00 |
Claudio Atzori
|
baed5e3337
|
test classes moved into specific components
|
2021-08-13 12:14:47 +02:00 |
Claudio Atzori
|
3359f73fcf
|
cleanup & best practices
|
2021-08-13 12:00:42 +02:00 |
Miriam Baglioni
|
58f241f4a2
|
GetCSV refactoring - changed due to change of input resource
|
2021-08-13 10:04:44 +02:00 |
Miriam Baglioni
|
f3d575f749
|
GetCSV refactoring - changed due to changes in input resource
|
2021-08-13 10:03:57 +02:00 |
Miriam Baglioni
|
a5f6edfa6c
|
GetCSV refactoring - changed to mirror the original model class
|
2021-08-13 09:30:03 +02:00 |
Miriam Baglioni
|
7402daf51a
|
GetCSV refactoring - added dependency to open-csv lib
|
2021-08-12 17:59:19 +02:00 |
Miriam Baglioni
|
733bcaecf6
|
GetCSV refactoring - added test class (all the tests are disabled since they refer to a remote resource)
|
2021-08-12 17:58:52 +02:00 |
Miriam Baglioni
|
bfe8f5335c
|
GetCSV refactoring - copied model classes in test path
|
2021-08-12 17:58:14 +02:00 |
Miriam Baglioni
|
6e84b3951f
|
GetCSV refactoring - moving to dhp-common the classes that have a dependency with the GetCSV class (which was located in graph-mapper)
|
2021-08-12 17:57:41 +02:00 |
Miriam Baglioni
|
9650eea497
|
reverting
|
2021-08-11 17:45:48 +02:00 |
Miriam Baglioni
|
cc3d72df0e
|
removing unneeded dependency
|
2021-08-11 17:42:01 +02:00 |
Miriam Baglioni
|
f9b6b45d85
|
reverting
|
2021-08-11 17:04:48 +02:00 |
Miriam Baglioni
|
8da3a25cf6
|
merging with branch beta
|
2021-08-11 15:55:34 +02:00 |