-
release-1.9.0
Release 1.9.0. Important changes: - Move scheduled cleanup from `check_arcce_monitor` to `check_arcce_clean`. As a consequence users may want to run `check_arcce_clean` at least once per every hour. - Keep better track of the work done by `check_arcce_monitor` and `check_arcce_clean`. Try to bail out within a slotted time to avoid unclean exits and spurious Nagios alerts. Instead report on the progress and raise a warning or critical alerts if it seems like the probes are not keeping up. - Clean up jobs from the local ARC job file which cannot be queried and which do not correspond to any active job. - The operation of `check_arcce_monitor` can now be restricted to jobs for a certain CE and/or those having been created with a certain `--job-tag`. This should allow running one monitor service per CE and/or job type, though it has not been tested under Nagios. Minor fixes: - Remove execute bit and shebang from `arcce_igtf.py`. - In `check_archostcert` consider certs unusable 2 days before expiration. - Various timeouts have been adjusted or added. - Fix repeated renaming of output directories from `arcget`. - Make random adjustments to rescheduled job times and process rescheduled and monitoring work in random order. - Remove deprecated `--use-jobs-xml` option from `check_arcce_clean` and some other ancient stuff which nobody will notice. - Avoid warning due to unparsed 12 hex-digit lines from `arcstat`. - Various minor fixes.
-
release-1.9.0rc1
Release candidate 1.9.0rc1. Important changes: - Move scheduled cleanup from `check_arcce_monitor` to `check_arcce_clean`. As a consequence users may want to run `check_arcce_clean` at least once per every hour. - Keep better track of the work done by `check_arcce_monitor` and `check_arcce_clean`. Try to bail out within a slotted time to avoid unclean exits and spurious Nagios alerts. Instead report on the progress and raise a warning or critical alerts if it seems like the probes are not keeping up. - Clean up jobs from the local ARC job file which cannot be queried and which do not correspond to any active job. - The operation of `check_arcce_monitor` can now be restricted to jobs for a certain CE and/or those having been created with a certain `--job-tag`. This should allow running one monitor service per CE and/or job type, though it has not been tested under Nagios. Minor fixes: - Remove execute bit and shebang from `arcce_igtf.py`. - In `check_archostcert` consider certs unusable 2 days before expiration. - Various timeouts have been adjusted or added. - Fix repeated renaming of output directories from `arcget`. - Make random adjustments to rescheduled job times and process rescheduled and monitoring work in random order. - Remove deprecated `--use-jobs-xml` option from `check_arcce_clean` and some other ancient stuff which nobody will notice. - Avoid warning due to unparsed 12 hex-digit lines from `arcstat`. - Various minor fixes.
-
release-0.9.0rc1
Release candidate 0.9.0rc1 Important changes: - Move scheduled cleanup from `check_arcce_monitor` to `check_arcce_clean`. As a consequence users may want to run `check_arcce_clean` at least once per every hour. - Keep better track of the work done by `check_arcce_monitor` and `check_arcce_clean`. Try to bail out within a slotted time to avoid unclean exits and spurious Nagios alerts. Instead report on the progress and raise a warning or critical alerts if it seems like the probes are not keeping up. - Clean up jobs from the local ARC job file which cannot be queried and which do not correspond to any active job. - The operation of `check_arcce_monitor` can now be restricted to jobs for a certain CE and/or those having been created with a certain `--job-tag`. This should allow running one monitor service per CE and/or job type, though it has not been tested under Nagios. Minor fixes: - Remove execute bit and shebang from `arcce_igtf.py`. - In `check_archostcert` consider certs unusable 2 days before expiration. - Various timeouts have been adjusted or added. - Fix repeated renaming of output directories from `arcget`. - Make random adjustments to rescheduled job times and process rescheduled and monitoring work in random order. - Remove deprecated `--use-jobs-xml` option from `check_arcce_clean` and some other ancient stuff which nobody will notice. - Avoid warning due to unparsed 12 hex-digit lines from `arcstat`. - Various minor fixes.
-
release-1.8.4
Release 1.8.4. - Work around blocking semantic when using old Sqlite bindings and increase lock timeout to 10 s also when using new bindings.
-
release-1.8.3
Release 1.8.3. - Handle missing SHA1 in IGTF CA certificate check.
-
release-1.8.2
Release 1.8.2. - Delay some IGTF warnings which were raised before the publication day. - Fix parsing of arcstat output for GLUE2-enabled CEs, using "Status" instead of "Specific status". - Fix /tmp file leak when arccp fails, and improve related logging. - Add --min-proxy-lifetime option and inclease default to avoid expiration before the job is run. - Fix missed passive state update when arcget output dir was absent. + Add option to monitor state progress. + Support multiple LDAP attribute lookup and use it in the EGI setup to add GlueSAPath as a fallback for GlueVOInfoPath for SRM checks.
-
release-1.8.2rc2
Release candidate 1.8.2rc2. - Fix state monitoring. - Fix a missed Nagios critical state for passive termination service.
-
release-1.8.2rc1
Release candidate 1.8.2rc1. - Delay some IGTF warnings which were raised before the publication day. - Fix parsing of arcstat output for GLUE2-enabled CEs, using "Status" instead of "Specific status". - Fix /tmp file leak when arccp fails, and improve related logging. - Add --min-proxy-lifetime option and inclease default to avoid expiration before the job is run. + Add option to monitor state progress. + Support multiple LDAP attribute lookup and use it in the EGI setup to add GlueSAPath as a fallback for GlueVOInfoPath for SRM checks.
-
release-1.8.1
Release candidate 1.8.1rc1. - Fix parsing of arcstat output. - Support posting job submission errors to passive services (GGUS-104724).
-
release-1.8.1rc1
Release candidate 1.8.1rc1. - Fix parsing of arcstat output. - Support posting job submission errors to passive services (GGUS-104724).
-
release-1.8.0
Release 1.8.0 - Let check_arcce_monitor discard jobs after a given number of attempts to fetch it. - Let check_arcce_monitor clear jobs, avoiding issues with simultaneous file access by check_arcce_submit. - Improve the heuristics for when it's ok to clear a missing job. - Tolerate transient disappearance of jobs from job arcstat output. This also fixes a missed cleanup. - Re-try cleaning each job and staged file up to 12 times distributed in geometrically progressing intervals over a few months. - The job-fetching code is mostly rewritten, avoiding redundant fetch attempts and making the code more clear. - Ignore non-zero exit from arcget if the output does not indicate an error. - Replicate of test files uploaded via LFC when multiple SEs are used. - Support fail-over URIs for LDAP-derived variables. - Add an option to archive all job files when a job fails. - Implicitly compute interpolated variables in the configuration file. This obsoletes the "variables" setting. - For the random_line substitution: Fix exclude and add include. - Use Genshi templates for generating job descriptions, allowing people to modify job descriptions. - Add support for gmlog. This fixes GGUS-100995.