harvester-d4science

Commit Graph

Author	SHA1	Message	Date
amercader	39ad78d90a	[#59 ] Ignore auth in the CKAN harvester	2013-08-15 14:37:12 +01:00
Stefan Oderbolz	f26baf6c09	Hide both the label and the number of datasets when 'status' is not available	2013-08-15 13:25:16 +02:00
amercader	1c36b33aaf	[#59 ] Ignore auth when using site_user	2013-08-14 12:28:27 +01:00
amercader	ffea49ca62	[#56 ] Update parameters on source create command Add missing title and owner_org fields, remove deprecated user_id and publisher_id	2013-08-14 11:54:51 +01:00
amercader	3494727d3f	[#56 ] Increase max params number	2013-08-14 11:43:32 +01:00
amercader	8e33262026	[#56 ] Fix syntax error and wrong type	2013-08-14 11:31:23 +01:00
Stefan Oderbolz	4dfd091aec	Make the /harvest page more robust if source.status is not set This prevents exceptions from appearing in the log from Jinja: [error] [client 1.2.3.4] Error - <class 'jinja2.exceptions.UndefinedError'>: 'dict object' has no attribute 'status'	2013-08-14 11:52:11 +02:00
Stefan Oderbolz	7ae9d6e208	Made print method more robust against KeyErrors This is especially needed if you create a new harvest source which does not have all the optional arguments. Before, this led to a KeyError after the creation of the source. Now this simply outputs 'None'.	2013-08-05 23:50:30 +02:00
Stefan Oderbolz	1249564be5	Add additional name argument when creating new harvest source	2013-08-05 23:46:21 +02:00
Stefan Oderbolz	ade5f83e38	Change key of data_dict from 'type' to 'source_type'	2013-08-05 23:07:25 +02:00
amercader	cb745c3c3e	Avoid importing unnecessary functions from the harvest logic	2013-08-05 18:39:44 +01:00
Vitor Baptista	70e53a7833	Fix bug where source was being treated as an object, when it's a dict	2013-07-29 07:06:58 -03:00
amercader	cc3f3d3426	[#50 ] Fix objects deletion on gather exceptions	2013-07-05 13:29:11 +01:00
amercader	e2696b98bb	[#50 ] Save all dates as UTC in the database At some point we may want to transform these to local time at the dictization level. We will need a library like dateutil to handle it properly though.	2013-07-04 14:59:27 +01:00
kindly	c2283e3fdb	only migrate harvest sources which are active	2013-06-28 02:32:45 +01:00
kindly	a42991b8c9	fix so that non sysadmins can edit harvest sources of organizations they are admins or editors of.	2013-06-27 12:16:11 +01:00
kindly	6540726c47	use correct limit for paging harvest listing	2013-06-26 11:14:38 +01:00
amercader	584c340583	Merge branch '42-remove-non-string-extras'	2013-06-03 10:33:59 +01:00
Sean Hammond	01df3a1db4	[#42 ] Dump non-string extras with json Convert any non-string extra values to strings using json.dumps(), instead of just deleting them.	2013-05-31 20:35:06 +02:00
amercader	3a31db59b6	[#36 ] Move validation code to validate_config This ensures it is checked whenever the source is edited or created.	2013-05-31 17:23:40 +01:00
amercader	a6a0196a4e	Merge branch 'api-version-fix' of git://github.com/fraunhoferfokus/ckanext-harvest into fraunhoferfokus-api-version-fix	2013-05-31 17:15:43 +01:00
Sean Hammond	85a013f2c9	[#42 ] Remove non-string extras from packages Remove extras whose values are not strings (e.g. dicts, lists..) from packages before attempting to create or update the packages on the target site. In CKAN 1 it was possible for the values of extras to be other types, but in CKAN 2 they must be strings, so when harvesting from a CKAN 1 site into a CKAN 2 site SQLAlchemy would crash when trying to create packages with non-string extras. The fix in this commit is to simply remove any non-string extras from the harvested package. (Alternatively, we could try to convert them to a string using JSON.) Fixes #42.	2013-05-31 15:43:42 +02:00
amercader	361abcfc07	[#17 ] Fix bug with remote groups handling If neither 'only_local' or 'create' are used the remote groups property needs to be removed, otherwise it causes an exception when the group is not found.	2013-05-30 18:06:15 +01:00
Konrad Reiche	87cae31c75	Fix api_version check in the group importer code I have forgotten to update one check for the api_version 1 in the code responsible for the remote group import feature. This commit fixes that. Signed-off-by: Konrad Reiche <konrad.reiche@fokus.fraunhofer.de>	2013-05-27 13:36:56 +02:00
Konrad Reiche	c858b9fe9f	Add exception handling for the API version parsing I have added try-except clauses in order to prevent the process from crashing if a non-parsable integer is used for the api_version option. Signed-off-by: Konrad Reiche <konrad.reiche@fokus.fraunhofer.de>	2013-05-27 13:12:05 +02:00
Konrad Reiche	05094090af	Change type of the API version to integer The CKAN logic uses integers when dealing with the API version, e.g. making checks which API version is in use. Currently, the harvester uses strings to identify the API version. Instead of dealing with type conversion the harvester could use integers directly. This commit fixes okfn/ckanext-harvest#36. When the API version is parsed from the configuration it is passed through the int() function. This way the harvesting will still work even if a harvest source was configured with a string API version which makes this commit backward compatible. Signed-off-by: Konrad Reiche <konrad.reiche@fokus.fraunhofer.de>	2013-05-27 12:51:48 +02:00
amercader	ff7287d4b4	[#30 ] Remove lxml dependency	2013-05-24 18:12:02 +01:00
amercader	3d2867ca04	[#17 ] Remove ckanclient dependency as it is not used	2013-05-24 17:55:37 +01:00
amercader	f1d11c1307	[#17 ] Import remote groups in CKAN harvester This is a cleaner commit of the great work done by @platzhirsch implementing remote groups import on the CKAN harvester.	2013-05-24 16:55:05 +01:00
amercader	1792180e4f	Better harvest source dataset migration Current implementation only checked for the first source to exist and didn't allow to rerun the migration for other sources if there was an error. With the new one, all non existing sources are migrated each time.	2013-05-24 14:49:55 +01:00
amercader	1d54edfdaa	Fix bug in source datasets migration Wrong dataset type was causing the default package schema to be used, thus failing when providing an id.	2013-05-24 14:25:05 +01:00
amercader	751409ab7d	[#34 ] Integrate clear command with delete source When deleting a source, if clear_source equals true in the context, harvest_source_clear will be called. Default is false. The UI shows a select with the two options.	2013-05-20 14:30:22 +01:00
amercader	6d5d0fbaae	Add hover helper text to refresh and clear buttons	2013-05-20 12:09:14 +01:00
Tom Rees	edfc49719b	Use page_heading helper consistently with the main CKAN templates.	2013-05-17 16:12:57 +01:00
amercader	d0bc52f2d8	[#34 ] Fix typo in warning message	2013-05-16 18:07:32 +01:00
amercader	b9e2613458	[#34 ] Allow all authorized users for a source to clear it	2013-05-16 17:57:59 +01:00
amercader	71349e658b	[#34 ] Expose harvest source clear button	2013-05-16 17:51:48 +01:00
amercader	7b652542e7	[#34 ] Fix harvest_source_clear action Some typos in the SQL statements, and also the source needs to be reindexed to update the status with the counts.	2013-05-16 17:33:39 +01:00
amercader	1efd7ab4cd	Ignore remote orgs in CKAN harvester If #17 progresses we can do something similar for them, although it may be more complicated because of authorization issues.	2013-05-16 17:30:54 +01:00
kindly	1714e55110	simplify harvest_clear queries so they do not lock on big db	2013-04-30 13:59:23 +01:00
kindly	a2b8ab1994	make harvest source clear not create table	2013-04-30 12:40:46 +01:00
amercader	9041f3f3ad	Changes in Redis consumer to make tests work	2013-04-22 18:08:19 +01:00
amercader	70dfee1a36	Update queue tests	2013-04-22 17:56:11 +01:00
kindly	dcfd201cdd	[#32 ] redis queue support	2013-04-21 17:04:57 +01:00
kindly	0ce59a29b6	delete instead of update harvest objects when error	2013-04-12 12:32:33 +01:00
kindly	7d7657f94a	make gather phase as finished if there is an error	2013-04-12 10:35:08 +01:00
kindly	bd761498f0	make sure config dict is not jsonified if it contains an error	2013-04-08 18:52:36 +01:00
amercader	eaebeb4e6e	Merge branch 'release-v2.0' of github.com:okfn/ckanext-harvest into release-v2.0	2013-04-08 13:25:33 +01:00
amercader	5414b6c08d	Merge branch '29-new-idataset-form' into release-v2.0	2013-04-08 13:23:41 +01:00
joetsoi	66ff773f99	remove previous commit import	2013-03-29 12:47:14 +00:00
joetsoi	3ac065f0f0	fix package_schema import	2013-03-29 01:17:24 +00:00
joetsoi	cb8b808274	sanity check that harvest source id matches harvest dataset id remove author_email, license_id, maintainer_email, maintainer and author from package_dict, these were not actually necessary	2013-03-29 00:59:20 +00:00
amercader	99bd17401c	Handle wrong JSON in harvest_source_extra_validator	2013-03-28 16:19:16 +00:00
kindly	a9b8be8f01	harvest source index clear	2013-03-28 15:36:44 +00:00
amercader	95ebb5bbf3	[#29 ] Remove check_data_dict ✨	2013-03-28 15:01:21 +00:00
amercader	fbc8ecde97	[#29 ] Fix some imports on actions and plugin	2013-03-28 15:00:44 +00:00
kindly	c754479014	#29 make new idatasets form work with harvest source form	2013-03-25 17:38:07 +00:00
joetsoi	548d3c1c2a	fix validation issue on db upgrade	2013-03-25 12:02:07 +00:00
kindly	b5a697ec87	Merge branch 'release-v2.0' of https://github.com/okfn/ckanext-harvest into release-v2.0	2013-03-25 11:58:31 +00:00
kindly	0b5c3c608a	catch and raise gather exception, acking the message	2013-03-25 11:57:57 +00:00
amercader	438ba672e2	Merge branch 'release-v2.0' of github.com:okfn/ckanext-harvest into release-v2.0	2013-03-25 11:44:37 +00:00
kindly	845c9927a8	add harvest source clear	2013-03-25 11:39:00 +00:00
joetsoi	d518b6709a	[#27 ] fix package_list_for_source for unowned data sources	2013-03-21 15:59:22 +00:00
amercader	7bff041568	[#25 ] Further tweaks on helpers texts	2013-03-21 13:47:23 +00:00
John Martin	4d0dd9a4d3	[#25 ] Small copy tweak to confirmation dialog	2013-03-21 12:14:33 +00:00
John Martin	78bde974b9	[#25 ] Adds confirmation dialog to reharvest button	2013-03-21 10:56:39 +00:00
John Martin	3197162fe6	[#25 ] Changed 'Refresh' to 'Reharvest' on button	2013-03-21 10:36:12 +00:00
kindly	a7583a7b8b	Merge branch 'release-v2.0' of https://github.com/okfn/ckanext-harvest into release-v2.0	2013-03-21 02:32:11 +00:00
kindly	b676fb02e1	only get out harvest items in interface and when indexing	2013-03-21 02:31:34 +00:00
amercader	15c44d9aa8	Merge branch '23-harvest-form-cleanup' into release-v2.0	2013-03-20 17:03:41 +00:00
John Martin	4ba298fe58	[#23 ] Make labels a little wider on harvest new form	2013-03-20 14:07:03 +00:00
amercader	02e90767f4	Fix source listing in organization page It needed update after #515 in ckan core	2013-03-20 13:01:23 +00:00
John Martin	86355fb9db	[#23 ] Form cleanup after core bootstrap upgrade	2013-03-20 10:44:24 +00:00
kindly	634a0bbd30	return instead of continue	2013-03-19 01:21:20 +00:00
kindly	3adf38105e	re-add code from old branch separating the fetch and import logic	2013-03-19 01:16:43 +00:00
amercader	c2a6bd14eb	Add auth function for harvest_source_show_status	2013-03-18 16:48:27 +00:00
amercader	c76b7d95f3	Only count public datasets on the source status This is more in line with what is done on the orgs/groups pages	2013-03-18 16:41:01 +00:00
amercader	cb80ac784e	Add logic to show private datasets to authorized users	2013-03-18 16:29:29 +00:00
amercader	341331ac53	Merge branch 'release-v2.0' of github.com:okfn/ckanext-harvest into release-v2.0	2013-03-14 17:33:21 +00:00
amercader	d77f16aba9	[#21 ] Improve gather stage error handling See issue for full details. Basically we don't want to catch any exception at the queue.py level, as they prevent debugging. Harvesters should deal with them and return a list of ids or an empty list if no objects need to be fetched. Also improved the debug messages.	2013-03-14 17:31:07 +00:00
John Martin	b30cc54427	Fix for add harvest source button within org	2013-03-14 14:45:54 +00:00
amercader	91f18bffab	Fix pagination on org sources listing	2013-03-14 11:44:38 +00:00
amercader	8cac0977aa	Fix import on org sources listing	2013-03-14 11:44:22 +00:00
amercader	cd6c1b56a8	[#18 ] Get package dict on after_delete to check type No need for #615 in core then	2013-03-13 17:31:39 +00:00
amercader	1b11b00946	[#18 ] Fix wrong logic for setting the source active field	2013-03-13 13:19:43 +00:00
kindly	cb5e06119e	Merge branch 'release-v2.0' of https://github.com/okfn/ckanext-harvest into release-v2.0	2013-03-12 23:31:58 +00:00
kindly	06355ee6c4	Make IFacets work for harvest source related searches	2013-03-12 23:31:06 +00:00
amercader	fab5b81c2c	Pass context to functions handling harvest sources	2013-03-12 17:30:31 +00:00
amercader	5e50a5c9ad	[#8 ] Update how state is handled for source objects	2013-03-12 15:35:49 +00:00
amercader	2ee3f33f51	[#18 ] Allow reactivation of sources Due to #607 in CKAN core, once a source was deleted you could not reactivate it again. As a workaround, if the source is deleted the Delete button is not shown and the state select is, so you can set it to 'active'. Also fixed wrong redirect after deletion.	2013-03-12 14:06:54 +00:00
amercader	23d1d5742c	[#18 ] Update delete harvest source functionality The harvest_source_delete logic function proxies to package delete, which will delete the harvest source dataset. The harvest plugin then hooks to the after_delete extension point in order to inactivate the actual HarvestSource object and abort any pending jobs. Also added the Delete button to the harvest source form.	2013-03-12 13:14:07 +00:00
amercader	c957fdf17c	Merge branch '14-template-tweaks' into release-v2.0	2013-03-08 14:49:43 +00:00
amercader	ecceff48ed	[#14 ] Use source.organization again after fix in `949bb6f`	2013-03-08 14:48:49 +00:00
amercader	949bb6fe6a	[#16 ] Add organization to source dict	2013-03-08 14:47:11 +00:00
John Martin	f25ef19985	[#14 ] Fix for org breadcrumbs on sources	2013-03-08 12:48:11 +00:00
John Martin	2a53e4a2e4	[#14 ] Couple of minor template tweaks	2013-03-08 12:38:41 +00:00
joetsoi	7257258ca4	mark new harvest objects as current When a new harvest_object for a new package was being created, it was immediately being marked as false, as all objects were marked as false, including the new object just created and newly marked as current=true. Fix so that old HarvestObjects are only marked as current=False when updating an existing package.	2013-03-07 20:27:27 +00:00
John Martin	14e51ec587	Fix for removed snippet from ckan core	2013-03-07 11:52:59 +00:00
amercader	2ee27164c3	[#13 ] Remove or deprecate unused code Mostly in controllers, dictization and plugin, either related to the old templates pre-dataset type or old authorization.	2013-03-06 16:54:33 +00:00
amercader	6c02c87f8d	[#13 ] Set routes to /harvest Mostly painless as we (most of the time) were using DATASET_TYPE_NAME. All old routes now point to the correct place in the new interface.	2013-03-06 16:33:46 +00:00

504 Commits