Elasticsearch timeouts on bulk processing
When I upload a larger example file, some calculations fail in the indexing task with this error: ConnectionTimeout caused by - ReadTimeoutError(HTTPConnectionPool(host='elastic', port=9200): Read timed out. (read timeout=10)).
Kibana shows this error and a warning for the same upload/calc:
Traceback (most recent call last):
  File "/app/nomad/processing/tasks.py", line 196, in parse_task
    parser_backend.write_json(out, pretty=True)
  File "/usr/local/lib/python3.6/contextlib.py", line 88, in __exit__
    next(self.gen)
  File "/app/nomad/files.py", line 300, in write_archive_json
    metadata=metadata)
  File "/usr/local/lib/python3.6/site-packages/minio/api.py", line 784, in put_object
    sse=sse)
  File "/usr/local/lib/python3.6/site-packages/minio/api.py", line 1479, in _do_put_object
    content_sha256=sha256_hex
  File "/usr/local/lib/python3.6/site-packages/minio/api.py", line 1822, in _url_open
    object_name).get_exception()
minio.error.ResponseError: ResponseError: code: XAmzContentSHA256Mismatch, message: The provided 'x-amz-content-sha256' header does not match what was computed., bucket_name: None, object_name: None, request_id: 154F9CF08EDEC866, host_id: 3L137, region:
This obviously also comes from the parse task, but via the celery.redirected logger.
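
One possible mitigation for the read timeout in the indexing task would be to retry the indexing call with exponential backoff instead of failing on the first timeout. The following is only a minimal sketch, not the existing code: `retry_on_timeout` and its parameters are hypothetical names, and the exception type to catch is an assumption (with the Elasticsearch client it would presumably be its ConnectionTimeout class rather than the built-in TimeoutError used here).

```python
import time


def retry_on_timeout(operation, retries=3, base_delay=0.5,
                     exceptions=(TimeoutError,), sleep=time.sleep):
    """Call operation() and retry with exponential backoff on a timeout.

    Hypothetical helper: `exceptions` should list whatever timeout
    exception the indexing client actually raises (an assumption here).
    """
    for attempt in range(retries + 1):
        try:
            return operation()
        except exceptions:
            if attempt == retries:
                # Out of retries: let the original error propagate.
                raise
            # Wait base_delay, then 2x, 4x, ... before the next attempt.
            sleep(base_delay * 2 ** attempt)
```

The indexing call would then be wrapped, e.g. `retry_on_timeout(lambda: index_calc(calc))`, where `index_calc` stands in for whatever function the task uses to write to Elasticsearch. This only papers over slow responses; if the timeouts come from an overloaded Elasticsearch, a longer client timeout or smaller bulk requests may be the better fix.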