bleurt
bleurt copied to clipboard
WMT metric shared dataset download error
We cannot access the current download URL for the wmt17, wmt18 datasets. When I run this command,
python -m bleurt.wmt.db_builder -target_language="en" -rating_years="2017"
It gives an error
INFO:tensorflow:Downloading newstest2017-segment-level-human from http://computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz
I0824 12:51:08.780502 140389933356864 downloaders.py:139] Downloading newstest2017-segment-level-human from http://computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz
Downloading data from http://computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz
Traceback (most recent call last):
File "/opt/conda/lib/python3.8/site-packages/keras/utils/data_utils.py", line 274, in get_file
urlretrieve(origin, fpath, dl_progress)
File "/opt/conda/lib/python3.8/site-packages/keras/utils/data_utils.py", line 82, in urlretrieve
response = urlopen(url, data)
File "/opt/conda/lib/python3.8/urllib/request.py", line 222, in urlopen
return opener.open(url, data, timeout)
File "/opt/conda/lib/python3.8/urllib/request.py", line 531, in open
response = meth(req, response)
File "/opt/conda/lib/python3.8/urllib/request.py", line 640, in http_response
response = self.parent.error(
File "/opt/conda/lib/python3.8/urllib/request.py", line 569, in error
return self._call_chain(*args)
File "/opt/conda/lib/python3.8/urllib/request.py", line 502, in _call_chain
result = func(*args)
File "/opt/conda/lib/python3.8/urllib/request.py", line 649, in http_error_default
raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 503: Service Unavailable
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/opt/conda/lib/python3.8/runpy.py", line 194, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/opt/conda/lib/python3.8/runpy.py", line 87, in _run_code
exec(code, run_globals)
File "/data/private/metric_shared/bleurt/bleurt/wmt/db_builder.py", line 273, in <module>
app.run(main)
File "/opt/conda/lib/python3.8/site-packages/tensorflow/python/platform/app.py", line 40, in run
_run(main=main, argv=argv, flags_parser=_parse_flags_tolerate_undef)
File "/opt/conda/lib/python3.8/site-packages/absl/app.py", line 312, in run
_run_main(main, args)
File "/opt/conda/lib/python3.8/site-packages/absl/app.py", line 258, in _run_main
sys.exit(main(argv))
File "/data/private/metric_shared/bleurt/bleurt/wmt/db_builder.py", line 262, in main
create_wmt_dataset(FLAGS.target_file, FLAGS.rating_years,
File "/data/private/metric_shared/bleurt/bleurt/wmt/db_builder.py", line 100, in create_wmt_dataset
importer.fetch_files()
File "/data/private/metric_shared/bleurt/bleurt/wmt/downloaders.py", line 305, in fetch_files
super(Importer17, self).fetch_files()
File "/data/private/metric_shared/bleurt/bleurt/wmt/downloaders.py", line 140, in fetch_files
_ = tf.keras.utils.get_file(
File "/opt/conda/lib/python3.8/site-packages/keras/utils/data_utils.py", line 276, in get_file
raise Exception(error_msg.format(origin, e.code, e.msg))
Exception: URL fetch failure on http://computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz: 503 -- Service Unavailable
We have to change the URL to the HTTPS version, for example,
https://www.computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz