Commit Graph

127 Commits (master)

Author SHA1 Message Date
Michele Artini e96527325d saved a query for openaire production database 4 years ago
Claudio Atzori ed76521d9b removed stale test resources, will be re-added later on 4 years ago
Claudio Atzori 0f364605ff removed stale tests, need to reimplemente them anyway 4 years ago
Claudio Atzori 6a288625e5 fixed workflow outgoing node 4 years ago
Claudio Atzori 1b18fd4d54 sync with master branch 4 years ago
Sandro La Bruzzo 4f04759738 Merge branch 'master' of code-repo.d4science.org:D-Net/dnet-hadoop 4 years ago
Sandro La Bruzzo 76ee85141a added oozie job for DNET migration and implemented Spark job for extracting entities 4 years ago
Claudio Atzori c460e2d281 Aggiornare 'dhp-workflows/docs/oozie-installer.markdown' 4 years ago
Michele Artini 176c5606bd aligned with origin/master, aligned model and mapping 4 years ago
Claudio Atzori 56d1810a66 working procedure for records indexing using Spark, via lib com.lucidworks.spark:spark-solr 4 years ago
Claudio Atzori 1ee1baa8c0 Merge branch 'master' into provision_indexing 4 years ago
Claudio Atzori a3d0b57b25 [maven-release-plugin] prepare for next development iteration 4 years ago
Claudio Atzori 6ed9a15bc8 [maven-release-plugin] prepare release dhp-1.1.5 4 years ago
Claudio Atzori 49e648f7c3 bumped version 4 years ago
Claudio Atzori f9fae97e09 test json files aligned with the latest model changes 4 years ago
Claudio Atzori 1fee6e2b7e implemented XML records construction and serialization, indexing WIP 4 years ago
Michele Artini 80cb52593f bug fixing 4 years ago
Michele Artini cdea0dae75 bug fixing 4 years ago
Michele Artini 69336195d3 simplifications 4 years ago
Michele Artini 06c2fd6df9 bug fixing 4 years ago
Michele Artini 5fc09b179c bug fixing 4 years ago
Michele Artini 95740767e0 Ready for tests 4 years ago
Michele Artini 181e8498d4 ... 4 years ago
Michele Artini bb1533a07e partial commit 4 years ago
Michele Artini fbb0fc140b partial implementation of migration 4 years ago
Claudio Atzori 7ba0f44d05 WIP 4 years ago
Claudio Atzori 49ef2f4eb1 removed input parameter specification, SparkXmlRecordBuilderJob doesn't need hive 4 years ago
Claudio Atzori b5e1e2e5b2 reintegrated changes from fcbc4ccd70 4 years ago
Claudio Atzori 7bacd6812e Merge branch 'provision_indexing' of https://code-repo.d4science.org/D-Net/dnet-hadoop into HEAD
 Conflicts:
	dhp-workflows/dhp-graph-provision/src/main/java/eu/dnetlib/dhp/graph/GraphJoiner.java
	dhp-workflows/dhp-graph-provision/src/main/java/eu/dnetlib/dhp/graph/MappingUtils.java
	dhp-workflows/dhp-graph-provision/src/main/java/eu/dnetlib/dhp/graph/RelatedEntity.java
	dhp-workflows/dhp-graph-provision/src/main/java/eu/dnetlib/dhp/graph/SparkXmlRecordBuilderJob.java
4 years ago
Claudio Atzori b2691a3b0a save adjacency list as JoinedEntity 4 years ago
Claudio Atzori 8c2aff99b0 joining entities using T x R x S, WIP: last representation based on LinkedEntity type 4 years ago
Claudio Atzori fcbc4ccd70 a bit of docs doesn't hurt 4 years ago
Claudio Atzori a55f5fecc6 joining entities using T x R x S method with groupByKey, WIP: making target objects (T) have lower memory footprint 4 years ago
Michele Artini 6bfe2dc96e partial implementation 4 years ago
Claudio Atzori 799929c1e3 joining entities using T x R x S method with groupByKey 4 years ago
Michele Artini f6eccdde33 partial implementation 4 years ago
Michele Artini cd114f1c3b partial update 4 years ago
Michele Artini b35c59eb42 partial implementation of entities from db 4 years ago
Michele Artini 81f82b5d34 partial implementation of applications to migrate entities 4 years ago
Claudio Atzori 1cd6899480 merged from master 4 years ago
Claudio Atzori 97c239ee0d WIP: trying to find a way to build the records for the index 4 years ago
miconis 4955be0197 Merge branch 'master' of code-repo.d4science.org:D-Net/dnet-hadoop 4 years ago
miconis f61adfc2bb minor changes 4 years ago
miconis 9bdcb02179 minor changes and update of the configuration for publications 4 years ago
Michele Artini f7b9a7a9af entity migration (partial implementation) 4 years ago
Michele Artini 7229fecbcf fix warnings in poms 4 years ago
Sandro La Bruzzo dd21db7036 fixed stuff 4 years ago
Claudio Atzori 7ba586d2e5 oozie workflow aimed to build the adjacency lists representation of the graph, needed to build the records to be indexed 4 years ago
Sandro La Bruzzo 76efcde4fd using new branch decisionTreeDedup 4 years ago
Sandro La Bruzzo b4392f9f43 implemented DedupRecord factory for missing entities 4 years ago