WMT metric shared dataset download error

Open juheeuu opened this issue 4 years ago • 0 comments

The current download URLs for the wmt17 and wmt18 datasets can no longer be accessed. When I run this command,

python -m bleurt.wmt.db_builder -target_language="en" -rating_years="2017"

it fails with the following error:

INFO:tensorflow:Downloading newstest2017-segment-level-human from http://computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz
I0824 12:51:08.780502 140389933356864 downloaders.py:139] Downloading newstest2017-segment-level-human from http://computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz
Downloading data from http://computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz
Traceback (most recent call last):
  File "/opt/conda/lib/python3.8/site-packages/keras/utils/data_utils.py", line 274, in get_file
    urlretrieve(origin, fpath, dl_progress)
  File "/opt/conda/lib/python3.8/site-packages/keras/utils/data_utils.py", line 82, in urlretrieve
    response = urlopen(url, data)
  File "/opt/conda/lib/python3.8/urllib/request.py", line 222, in urlopen
    return opener.open(url, data, timeout)
  File "/opt/conda/lib/python3.8/urllib/request.py", line 531, in open
    response = meth(req, response)
  File "/opt/conda/lib/python3.8/urllib/request.py", line 640, in http_response
    response = self.parent.error(
  File "/opt/conda/lib/python3.8/urllib/request.py", line 569, in error
    return self._call_chain(*args)
  File "/opt/conda/lib/python3.8/urllib/request.py", line 502, in _call_chain
    result = func(*args)
  File "/opt/conda/lib/python3.8/urllib/request.py", line 649, in http_error_default
    raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 503: Service Unavailable

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/opt/conda/lib/python3.8/runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/opt/conda/lib/python3.8/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/data/private/metric_shared/bleurt/bleurt/wmt/db_builder.py", line 273, in <module>
    app.run(main)
  File "/opt/conda/lib/python3.8/site-packages/tensorflow/python/platform/app.py", line 40, in run
    _run(main=main, argv=argv, flags_parser=_parse_flags_tolerate_undef)
  File "/opt/conda/lib/python3.8/site-packages/absl/app.py", line 312, in run
    _run_main(main, args)
  File "/opt/conda/lib/python3.8/site-packages/absl/app.py", line 258, in _run_main
    sys.exit(main(argv))
  File "/data/private/metric_shared/bleurt/bleurt/wmt/db_builder.py", line 262, in main
    create_wmt_dataset(FLAGS.target_file, FLAGS.rating_years,
  File "/data/private/metric_shared/bleurt/bleurt/wmt/db_builder.py", line 100, in create_wmt_dataset
    importer.fetch_files()
  File "/data/private/metric_shared/bleurt/bleurt/wmt/downloaders.py", line 305, in fetch_files
    super(Importer17, self).fetch_files()
  File "/data/private/metric_shared/bleurt/bleurt/wmt/downloaders.py", line 140, in fetch_files
    _ = tf.keras.utils.get_file(
  File "/opt/conda/lib/python3.8/site-packages/keras/utils/data_utils.py", line 276, in get_file
    raise Exception(error_msg.format(origin, e.code, e.msg))
Exception: URL fetch failure on http://computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz: 503 -- Service Unavailable

The download URLs need to be changed to their HTTPS versions (note that the host also gains a www. prefix), for example: https://www.computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz
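To illustrate the change, here is a minimal sketch of the rewrite applied to the failing URL from the traceback. The helper name `to_https_url` and its `add_www` parameter are hypothetical and not part of bleurt; the real fix would presumably be to edit the URLs wherever bleurt/wmt/downloaders.py defines them. The sketch only builds the new URL and does not download anything.

```python
from urllib.parse import urlsplit, urlunsplit


def to_https_url(url: str, add_www: bool = True) -> str:
    """Return `url` with an https scheme and, optionally, a www. host prefix.

    Hypothetical helper for illustration only; it is not part of bleurt.
    """
    parts = urlsplit(url)
    host = parts.netloc
    # The working URL quoted in this issue uses the www. host, so add it if missing.
    if add_www and not host.startswith("www."):
        host = "www." + host
    return urlunsplit(("https", host, parts.path, parts.query, parts.fragment))


# URL taken from the traceback above.
old_url = "http://computing.dcu.ie/~ygraham/newstest2017-segment-level-human.tar.gz"
new_url = to_https_url(old_url)
print(new_url)
```

The resulting `new_url` could then be passed as the `origin` argument of `tf.keras.utils.get_file`, which is the call that fails in the traceback. Whether the other WMT years need the same change is something I have not checked beyond what is described above.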

juheeuu, Aug 24 '21 04:08