1
0
Fork 0

Commit Graph

  • 254eb46809 [maven-release-plugin] prepare release dnet-dedup-3.0.14 Claudio Atzori 2019-09-25 10:39:39 +0200
  • 74c6462b49 updated translation map and some tests Claudio Atzori 2019-09-25 10:15:13 +0200
  • aed81e4cfa translation map updated miconis 2019-09-25 09:53:06 +0200
  • afd2b398d5 optimize imports miconis 2019-08-09 15:42:41 +0200
  • d71dae5fd2 implementation of the conditions in tree nodes. get rid of the conditions part of the configuration miconis 2019-08-09 15:41:49 +0200
  • a5c5d2f01b implementation of the decision tree. It takes place of the distance algos, necessaryConditions and sufficientConditions are still there. The model contains only path, type and name of the field. ignoreMissing is still in the model because it is used by the conditions. miconis 2019-08-09 10:08:34 +0200
  • f2136e1024 code refactoring: useless module removed miconis 2019-08-07 15:16:59 +0200
  • 8c867101ef addition of a fixSpecial function to address the problem with special character in organization names, addition of new terms in translation maps miconis 2019-08-06 17:06:05 +0200
  • 4502b44337 addition of the BlockUtils class for meta-blocking, implementation of a new local test with edge filtering example miconis 2019-08-06 12:09:34 +0200
  • cffb712a99 Merge branch 'master' of https://github.com/dnet-team/dnet-dedup miconis 2019-07-19 17:10:53 +0200
  • a85576c27e restyling of the JaroWinklerNormalizedName comparator, now it is optimized. Addition of some translations in the translation maps, addition of a clustering based on keywords in organizations legalnames miconis 2019-07-19 17:10:29 +0200
  • 6cb846331a [maven-release-plugin] prepare for next development iteration Claudio Atzori 2019-07-08 11:12:52 +0200
  • c04d2232c2 [maven-release-plugin] prepare release dnet-dedup-3.0.13 Claudio Atzori 2019-07-08 11:12:45 +0200
  • fb5e38db26 Merge branch 'master' of https://github.com/dnet-team/dnet-dedup miconis 2019-07-08 11:02:29 +0200
  • 3c6f8d1e44 bug fixing in the keywordsclustering class miconis 2019-07-08 11:01:49 +0200
  • a69022617d [maven-release-plugin] prepare for next development iteration Claudio Atzori 2019-07-08 10:11:24 +0200
  • c6baeb93d4 [maven-release-plugin] prepare release dnet-dedup-3.0.12 Claudio Atzori 2019-07-08 10:11:17 +0200
  • f5de20a508 [maven-release-plugin] rollback the release of dnet-dedup-3.0.12 miconis 2019-07-08 10:00:48 +0200
  • ba50aa8654 [maven-release-plugin] prepare for next development iteration miconis 2019-07-08 09:48:10 +0200
  • 7065110a21 [maven-release-plugin] prepare release dnet-dedup-3.0.12 miconis 2019-07-08 09:48:03 +0200
  • 15bec5e876 addition of doi normalization in PidMatch comparator, addition of keywordsclustering (clustering based on terms in the translation maps for the organizations), minor changes miconis 2019-07-08 09:44:02 +0200
  • 2dcffb965f [maven-release-plugin] prepare for next development iteration Claudio Atzori 2019-06-19 10:02:39 +0200
  • 85126c59f7 [maven-release-plugin] prepare release dnet-dedup-3.0.11 Claudio Atzori 2019-06-19 10:02:32 +0200
  • 15d7b584f3 optimized classpath resolvers Claudio Atzori 2019-06-19 10:01:35 +0200
  • ff4956def9 [maven-release-plugin] prepare for next development iteration Claudio Atzori 2019-06-18 14:46:34 +0200
  • eb5ce312a3 [maven-release-plugin] prepare release dnet-dedup-3.0.10 Claudio Atzori 2019-06-18 14:46:27 +0200
  • f2bc665403 avoid to divide by zero: in case of missing values, return undefined response Claudio Atzori 2019-06-18 14:45:15 +0200
  • e3f86b92c8 cleanup Claudio Atzori 2019-06-18 14:44:42 +0200
  • 54e4d0af04 exact match condition gives undefined if a field is missing, ignoremissing semantics changed: now performs the comparison in any case if =true, if false gives -1 in case of missing miconis 2019-06-18 14:05:31 +0200
  • e8db8f2abb implementation of the integration test, addition of document blocks to group entities after clustering miconis 2019-05-21 16:38:26 +0200
  • 53ec9bccca changed the implemetation of RabitMQ Comunication Sandro La Bruzzo 2019-04-16 12:28:01 +0200
  • 403c13eebf Implemented message manager, Fixed bug on collection worker, implemented Collecion and Transform spark job Sandro La Bruzzo 2019-04-11 15:39:29 +0200
  • 58c4e1f725 cleaned properties enricoottonello 2019-04-09 12:17:03 +0200
  • 9294851a6c implemented comunication layer using rabbitMq between oozie node and Dnet Sandro La Bruzzo 2019-04-05 12:19:25 +0200
  • 3f4ba71bbd resolved conflicts Sandro La Bruzzo 2019-04-03 16:12:57 +0200
  • ded6aef5e1 moved collector worker Sandro La Bruzzo 2019-04-03 16:05:16 +0200
  • 2f79eb930a added apidescriptor enricoottonello 2019-04-03 16:03:44 +0200
  • c2ecbf5572 moved collector worker Sandro La Bruzzo 2019-04-03 16:03:36 +0200
  • f7a3bdf3f8 [maven-release-plugin] prepare for next development iteration Claudio Atzori 2019-04-03 12:35:00 +0200
  • 98c179c8fb [maven-release-plugin] prepare release dnet-dedup-3.0.9 Claudio Atzori 2019-04-03 12:34:52 +0200
  • 3e61a90c8f [maven-release-plugin] rollback the release of dnet-dedup-3.0.9 miconis 2019-04-03 12:27:28 +0200
  • 15fb9eb883 [maven-release-plugin] prepare for next development iteration miconis 2019-04-03 12:26:05 +0200
  • a1ff4daa7f [maven-release-plugin] prepare release dnet-dedup-3.0.9 miconis 2019-04-03 12:25:56 +0200
  • 1d29bae47c branch cities merged into master miconis 2019-04-03 12:22:33 +0200
  • b316467608 added common module enricoottonello 2019-04-03 10:53:54 +0200
  • 7e7018c51f addition of a sparktester test, implementation of 2 different classes for testing in dnet-dedup-test module, addition of new terms in the vocabulary and change in the implementation of the JaroWinklerNormalizedName comparator miconis 2019-04-03 09:40:14 +0200
  • 1a1ec7da8e Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop.git Michele Artini 2019-04-01 10:40:59 +0200
  • f8c8b6669f ui Michele Artini 2019-04-01 10:40:09 +0200
  • 4bd5a9beee minor changes miconis 2019-03-26 15:48:21 +0100
  • 0b503949de Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop Sandro La Bruzzo 2019-03-25 15:19:08 +0100
  • b2cd76831e added common cli Sandro La Bruzzo 2019-03-25 15:18:48 +0100
  • 12c65eab4c implemented command line Sandro La Bruzzo 2019-03-25 15:18:31 +0100
  • b1200b6f46 ui Michele Artini 2019-03-25 11:54:18 +0100
  • 6f813fbc8e ui Michele Artini 2019-03-22 15:15:57 +0100
  • e2f1013b3d logs and ui Michele Artini 2019-03-22 11:41:00 +0100
  • 405f418dfc fix a conflict Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop Michele Artini 2019-03-22 10:24:42 +0100
  • 55aa7e1f26 new info Michele Artini 2019-03-22 10:16:47 +0100
  • 3b5f291df2 added angular client Enrico Ottonello 2019-03-22 10:13:56 +0100
  • 639483090a added swagger comments to input parameters Enrico Ottonello 2019-03-22 10:11:46 +0100
  • 662448e584 update of the comparator for legalnames of organizations Michele De Bonis 2019-03-21 14:27:27 +0100
  • 042d477878 added versions support Enrico Ottonello 2019-03-20 15:36:23 +0100
  • 859957d0fd Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop Sandro La Bruzzo 2019-03-20 09:49:53 +0100
  • 46bc3e1f53 added view on mdstore and transaction tables Enrico Ottonello 2019-03-19 13:34:40 +0100
  • cefcb38e62 added module dhp-applications Enrico Ottonello 2019-03-18 16:02:23 +0100
  • ed484bf24e Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop.git enrico 2019-03-18 15:38:04 +0100
  • 126d89ed38 first implementation of dhp-mdstore-manager-app Enrico Ottonello 2019-03-18 15:30:07 +0100
  • 2af1d61d43 test Michele De Bonis 2019-03-18 11:36:10 +0100
  • 6156562893 Added test Sandro La Bruzzo 2019-03-18 10:47:28 +0100
  • 49d8cc716e added oozie wf Sandro La Bruzzo 2019-03-18 10:46:07 +0100
  • c0da3da4c4 addedd common modules for CollectionBuilder in spark Sandro La Bruzzo 2019-03-18 10:45:30 +0100
  • e67d9ee1a9 added first implementation of dnet-workflows Sandro La Bruzzo 2019-03-18 10:44:35 +0100
  • 37b84b6afa Added description of the module Sandro La Bruzzo 2019-03-13 14:53:35 +0100
  • 1eb0281b38 refactored structure of the project luosolo 2019-03-13 14:43:20 +0100
  • c10770cd3e added module that allows to collect data into HDFS luosolo 2019-03-12 15:40:55 +0100
  • f2394fcd9f [maven-release-plugin] prepare for next development iteration Claudio Atzori 2019-02-18 09:09:14 +0100
  • 722431dde1 [maven-release-plugin] prepare release dnet-dedup-3.0.8 Claudio Atzori 2019-02-18 09:09:07 +0100
  • 470c4b0f20 default configuration includes configurationId Claudio Atzori 2019-02-18 09:07:23 +0100
  • ccb7e83196 [maven-release-plugin] prepare for next development iteration Claudio Atzori 2019-02-17 12:56:19 +0100
  • 7d8e62d4cc [maven-release-plugin] prepare release dnet-dedup-3.0.7 Claudio Atzori 2019-02-17 12:56:11 +0100
  • 968cd47436 replace existing attributes when loading default configuration Claudio Atzori 2019-02-17 12:48:25 +0100
  • 0735f3a822 implementation of the test classes and minor changes Michele De Bonis 2019-02-08 12:56:47 +0100
  • 7a8d28991f implementation of the decision tree for the deduplication of the authors, implementation of multiple comparators to be used in a tree node and definition of the proto for person entity Michele De Bonis 2018-12-20 09:54:41 +0100
  • 39613dbbd6 implementation of the decisional tree, addition of the dnet-openaire-data-protos module, definition of the person proto, blockprocessor and paceconfig modified with addition of support for the tree processing Michele De Bonis 2018-12-12 16:30:03 +0100
  • f1c68d8ba3 apply limits (length, size) to pace Fields Claudio Atzori 2018-11-20 10:51:38 +0100
  • c5979ffe18 [maven-release-plugin] prepare for next development iteration Claudio Atzori 2018-11-19 17:41:45 +0100
  • 9869dff1d2 [maven-release-plugin] prepare release dnet-dedup-3.0.6 Claudio Atzori 2018-11-19 17:41:37 +0100
  • c2d4cb3ba6 added new properties to FieldDef (size, length) to limit the information mapped onto each MapDocument Claudio Atzori 2018-11-19 17:37:57 +0100
  • 394fcafd41 [maven-release-plugin] prepare for next development iteration Claudio Atzori 2018-11-17 09:13:16 +0100
  • 397554130c [maven-release-plugin] prepare release dnet-dedup-3.0.5 Claudio Atzori 2018-11-17 09:13:09 +0100
  • 0dfb2ea600 added distance function fot software titles Claudio Atzori 2018-11-17 09:11:38 +0100
  • 3d4372ced9 addition of cities check Michele De Bonis 2018-11-16 16:11:03 +0100
  • 55a9b4f501 [maven-release-plugin] prepare for next development iteration Claudio Atzori 2018-11-16 09:18:00 +0100
  • 35ab630493 [maven-release-plugin] prepare release dnet-dedup-3.0.4 Claudio Atzori 2018-11-16 09:17:53 +0100
  • 399e4bc80f default (empty) configuration should be aligned with the updated model Claudio Atzori 2018-11-15 16:52:56 +0100
  • 59bab8dba4 less verbose logging Claudio Atzori 2018-11-13 09:07:45 +0100
  • 478ad72cb8 propagate exceptions in case of serialization errors, removed configuration pretty printing, removed unused class ScoredResult Claudio Atzori 2018-11-12 15:52:18 +0100
  • f7616c7a8a [maven-release-plugin] prepare for next development iteration Claudio Atzori 2018-11-12 14:23:36 +0100
  • df4b871c8b [maven-release-plugin] prepare release dnet-dedup-3.0.3 Claudio Atzori 2018-11-12 14:23:29 +0100
  • 72a9b3139e Merge branch 'master' of https://github.com/dnet-team/dnet-dedup Michele De Bonis 2018-11-12 14:11:26 +0100
  • b5062f5429 configuration file updated, addition of condition on domain Michele De Bonis 2018-11-12 14:11:15 +0100