Loading TCTracks with `from_ibtracs_netcdf` does not properly close the filestream #920

spjuhel · 2024-07-16T14:43:06Z

Describe the bug
When reading the IBTrACS.ALL.v04r00.nc file with from_ibtracs_netcdf() the file is opened with xr.open_dataset(ibtracs_path) which opens a stream to the file that is not closed afterwards. I had problems with this on euler when trying to access the file at later stages with the following error:

OSError: [Errno -101] NetCDF: HDF error: '/cluster/work/climate/sjuhel/climada/data/IBTrACS.ALL.v04r00.nc'

To Reproduce
I haven't been able to reproduce the issue on a local computer. This is possibly related to NetCDF too.

Expected behavior
The file is properly closed once the data is loaded.

See also: pydata/xarray#2887

Climada Version: 4.1.1

System Information (please complete the following information):

Operating system and version: Ubuntu 22.04 (euler)
Python version: 3.10.13 (Custom venv while climada is not yet installed on the cluster, possibly part of the problems)

Additional context
I don't think this is a critical problem, but the following would probably be a better way to handle the opening of the file:

From:

climada_python/climada/hazard/tc_tracks.py

Line 469 in 8f89ce1

ibtracs_ds = xr.open_dataset(ibtracs_path)

to:

with xr.open_dataset(ibtracs_path) as ds:
    ibtracks_ds = ds.load()

The text was updated successfully, but these errors were encountered:

peanutfun · 2024-07-17T09:00:55Z

Indeed, a context manager is the appropriate way to open files with xarray. However, notice that ds.load() will load all data into memory, which is not the default for opening a dataset and might be an issue for very large files.

It looks to me like the line in question can simply be replaced by the context manager. All following lines of the function must then be indented.

spjuhel · 2024-09-26T10:10:13Z

So I noticed there are actually other places where dataset are opened without using a context manager and will make a PR to address all this occurrence instead of just this one.

spjuhel added bug enhancement labels Jul 16, 2024

peanutfun added the accepting pull request Contribute by raising a pull request to resolve this issue! label Jul 17, 2024

spjuhel mentioned this issue Sep 26, 2024

Use context manager for xarray dataset file opening #953

Merged

13 tasks

spjuhel self-assigned this Sep 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loading TCTracks with `from_ibtracs_netcdf` does not properly close the filestream #920

Loading TCTracks with `from_ibtracs_netcdf` does not properly close the filestream #920

spjuhel commented Jul 16, 2024

peanutfun commented Jul 17, 2024

spjuhel commented Sep 26, 2024

Loading TCTracks with from_ibtracs_netcdf does not properly close the filestream #920

Loading TCTracks with from_ibtracs_netcdf does not properly close the filestream #920

Comments

spjuhel commented Jul 16, 2024

peanutfun commented Jul 17, 2024

spjuhel commented Sep 26, 2024

Loading TCTracks with `from_ibtracs_netcdf` does not properly close the filestream #920

Loading TCTracks with `from_ibtracs_netcdf` does not properly close the filestream #920