This release adds an enhancement and compatibility changes with upstream libraries. Thanks to @raphaelquast, @droumis and @hoxbro.
Enhancements:
- Add fail-fast for datasets outside the visible extent (#1345)
Compatibility:
- Compatibility with cudf 2024.06 (#1344)
- Compatibility with geopandas 1.0 and dask-geopandas 0.4.0 (#1347)
Maintenance:
- Update docs.yaml (#1346)
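As a rough illustration of the fail-fast idea in #1345, the sketch below tests whether a dataset's bounding box overlaps the visible extent before any per-row work is done. The function names and the tuple layout are hypothetical and are not taken from Datashader's internals.

```python
def overlaps_extent(data_bounds, x_range, y_range):
    """Return True if a dataset's bounding box intersects the visible extent.

    data_bounds is (xmin, ymin, xmax, ymax); x_range and y_range are
    (low, high) pairs. This layout is an assumption made for the sketch.
    """
    xmin, ymin, xmax, ymax = data_bounds
    return not (xmax < x_range[0] or xmin > x_range[1]
                or ymax < y_range[0] or ymin > y_range[1])


def count_visible(points, data_bounds, x_range, y_range):
    """Count points inside the extent, skipping the scan when nothing can match."""
    if not overlaps_extent(data_bounds, x_range, y_range):
        return 0  # fail fast: no point can fall inside the canvas
    return sum(1 for x, y in points
               if x_range[0] <= x <= x_range[1] and y_range[0] <= y <= y_range[1])
```

The check only pays off when the bounds are already known, for example from cached metadata, so that rejecting a dataset costs a handful of comparisons.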
This release adds compatibility with Numpy 2.0, along with other improvements and bugfixes. Thanks to @hoxbro for his contributions.
Bugfixes:
- Remove artifact from Polygon rendering (#1329)
Compatibility:
- Test dev releases of `numpy` 2.0 and `numba` 0.60.0 (#1332)
- Improve compatibility with `dask-expr` (#1335)
- Add gpu marker for test and test both classic and `dask-expr` `Dask.DataFrame`'s (#1341)
Documentation:
Maintenance:
- Update list of maintainers (#1336)
- Parallelize the test suite and fix a test polluted bug (#1338)
- Update test workflow (#1340)
This release brings compatibility with new releases of upstream packages. Thanks to first-time contributor @alexander-beedie, and the regular contributors @philippjfr, @ianthomas23, @maximlt, and @hoxbro.
Enhancements:
Compatibility:
- Python 3.12 support (#1317)
- Basic `dask_expr` support (#1317)
- Numpy 2.0 support (#1306)
- Remove redundant py2 helper code (#1316)
Maintenance:
- Replace Google Analytics with GoatCounter (#1309)
- Docs: ignore numpydoc validation checks (#1310)
- Fix test suite (#1314)
- General maintenance (#1320)
Datashader 0.16.0 is a significant release adding support for rendering GeoPandas GeoDataFrames directly rather than having to convert them to SpatialPandas first. Support for GeoPandas geometry types in Datashader `Canvas` functions is as follows:
- `Canvas.line`: `LineString`, `MultiLineString`, `MultiPolygon`, `Polygon`
- `Canvas.points`: `MultiPoint`, `Point`
- `Canvas.polygons`: `MultiPolygon`, `Polygon`
There is also support in `Canvas.line` for a new data type which is a 2D `xarray.DataArray` (within an `xarray.Dataset`) containing the coordinates of multiple lines that share the same `x` coordinates.
The DataShape package is now vendored in Datashader as it has not been maintained for a number of years and is not accepting updates.
Thanks to new contributor @J08ny and regular contributors @Hoxbro and @ianthomas23.
Enhancements:
- Support rendering of GeoPandas GeoDataFrames as lines, points and polygons (#1285, #1293, #1297)
- Implement lines using 2D xarray with common x coordinates (#1282)
General code improvements:
- Add debug logging to compiler module (#1280)
- Vendor DataShape (#1284)
- Don't use `object` as base class (#1286)
- Fix typos using `codespell` (#1288)
- Fix `float16` being a floating type. (#1290)
- Simplify line `_internal_build_extend` (#1294)
Improvements to CI:
- Update to latest `holoviz_tasks` (#1281)
- Update `codecov` configuration (#1292)
- Add `pre-commit` (#1295, #1296)
Compatibility:
- Support Pandas 2.1 (#1276, #1287)
- Replace `np.NaN` with `np.nan` (#1289)
- Drop support for Python 3.8 (#1291)
This release adds antialiased line support for inspection reductions such as `max_n` and `where`, including within categorical `by` reductions. It also improves support for `summary` reductions and adds CUDA implementations of `std` and `var` reductions.
Thanks to regular contributors @Hoxbro, @ianthomas23, @maximlt and @thuydotm.
Enhancements:
- Antialiasing line support for inspection reductions:
- Pre-compile antialias stage 2 combination (#1258)
- Antialiased min and max row index reductions (#1259)
- CPU `shift_and_insert` function (#1260)
- Refactor of CUDA `*_n` reductions (#1261)
- Support antialiased lines in `*_n` reductions (#1262)
- Replace accumulate with copy on first call to antialiased stage 2 combine (#1264)
- Separate where `combine_cpu` functions by ndim (#1265)
- Antialiased line support for `where` reductions (#1269)
- Improved support for `summary` reductions:
- CUDA support for `std` and `var` reductions (#1267)
General code improvements:
- Remove pyarrow pin (#1248)
Improvements to CI:
Improvements to documentation:
- Update readme to include Python 3.11 (#1249)
- Correct links to pandas docs (#1250)
- Remove twitter from index page (#1253)
- Create FUNDING.yml (#1263)
This release contains an important bug fix to ensure that categorical column order is maintained across dask partitions. It also adds support for categorical inspection reductions such as `by(max_n)`. The only missing functionality for inspection reductions is now antialiased lines, which is planned for the next release.
Thanks to contributors @ianthomas23, @maximlt and @philippjfr.
Bug fixes:
- Fix single category reductions (#1231)
- Ensure categorical column order is the same across dask partitions (#1239)
Enhancements:
- Categorical inspection reductions:
- General code improvements:
Improvements to CI:
Improvements to documentation:
This release provides significant improvements for inspection reductions by adding new `first_n`, `last_n`, `max_n` and `min_n` reductions, and providing Dask and CUDA support for all existing and new inspection reductions including `where`. It also provides support for Numba 0.57, NumPy 1.24 and Python 3.11, and drops support for Python 3.7.
Thanks to first-time contributors @danigm and @Jap8nted, and also regulars @Hoxbro, @philippjfr and @ianthomas23.
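To make the inspection reductions concrete, here is a minimal pure-Python sketch of what a `max_n`-style and a `where`-style reduction compute for the rows that land in a single pixel. The helper names and the sample rows are hypothetical; the library's real implementations are compiled and operate on whole aggregate arrays.

```python
import heapq


def max_n(values, n):
    """Return the n largest values in descending order."""
    return heapq.nlargest(n, values)


def where_max(rows, selector, lookup):
    """Return the `lookup` field of the row whose `selector` field is largest."""
    best = max(rows, key=lambda row: row[selector])
    return best[lookup]


# Rows that all fell into the same pixel (made-up data for illustration).
pixel_rows = [
    {"value": 3.0, "name": "a"},
    {"value": 9.0, "name": "b"},
    {"value": 5.0, "name": "c"},
]
top_two = max_n([row["value"] for row in pixel_rows], 2)
name_of_max = where_max(pixel_rows, "value", "name")
```

The point of a `where`-style reduction is that the pixel reports a value from a different column than the one used to select the row, which is what makes it useful for inspecting the underlying data.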
Enhancements:
- Inspection reductions:
- Reduction append functions return index not boolean (#1180)
- `first_n`, `last_n`, `max_n` and `min_n` reductions (#1184)
- Add `cuda` argument to `_build_combine` (#1194)
- Support `max_n` and `min_n` reductions on GPU (#1196)
- Use fast cuda mutex available in numba 0.57 (#1212)
- Dask support for `first`, `last`, `first_n` and `last_n` reductions (#1214)
- Wrap use of cuda mutex in `where` reductions (#1217)
- Cuda and cuda-with-dask support for inspection reductions (#1219)
- x and y range attributes on returned aggregations (#1198)
- Make `datashader.composite` imports lazy for faster import time (#1222)
- Improvements to CI:
- Cancel concurrent test workflows (#1208)
- Improvements to documentation:
Bug fixes:
- Fix conversion from `cupy` in categorical `rescale_discrete_levels` (#1179)
- Validate canvas `width`, `height` (#1183)
- Support antialiasing in pipeline API (#1213)
Compatibility:
This release adds a new `where` reduction that provides improved inspection capabilities and adds support for colormaps that are tuples of hex values. There are also various bug fixes and compatibility improvements.
Thanks to @ianthomas23, @maximlt and @Hoxbro.
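For readers unfamiliar with colormaps given as tuples of hex values, the following sketch shows one way such a tuple could be turned into RGB colors and sampled by linear interpolation. Both helper functions are hypothetical and only illustrate the idea; they are not the code path used by `tf.shade`.

```python
def hex_to_rgb(color):
    """Convert '#rrggbb' to an (r, g, b) tuple of ints."""
    color = color.lstrip("#")
    return tuple(int(color[i:i + 2], 16) for i in (0, 2, 4))


def interpolate_colormap(cmap, t):
    """Linearly interpolate a tuple of hex colors at position t in [0, 1]."""
    colors = [hex_to_rgb(c) for c in cmap]
    if len(colors) == 1:
        return colors[0]
    t = min(max(t, 0.0), 1.0)          # clamp to the valid range
    scaled = t * (len(colors) - 1)      # position measured in color stops
    low = min(int(scaled), len(colors) - 2)
    frac = scaled - low
    return tuple(round(a + (b - a) * frac)
                 for a, b in zip(colors[low], colors[low + 1]))
```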
Enhancements:
- New `where` reduction to provide improved inspection functionality:
- Support colormaps that are tuples of hex values (#1173)
- Add governance docs (#1165)
- Improve documentation build system (#1170, #1171)
- Improvements to CI:
Bug fixes:
- Validate calculated log canvas range (#1154)
- Better validate `canvas.line()` coordinate lengths (#1160)
- Return early in `eq_hist()` if all data masked out (#1168)
Compatibility:
- Follow recommended `numba` best practice.
- Update dependencies:
- Pip `pyarrow` in tests dependencies (#1174)
This release fixes a bug related to spatial indexing of `spatialpandas.GeoDataFrames`, and introduces enhancements to antialiased lines, benchmarking and GPU support.
Thanks to first-time contributors @eriknw and @raybellwaves, and also @ianthomas23 and @maximlt.
Enhancements:
- Improvements to antialiased lines:
- New benchmark framework:
- Improvements to GPU support:
- Cupy implementation of eq_hist (#1129)
- Improvements to documentation:
- Fix markdown syntax for link (#1119)
- DOC: add text link to https://examples.pyviz.org/datashader_dashboard (#1123)
- Improvements to dependency management (#1111, #1116)
- Improvements to CI (#1132, #1135, #1136, #1137, #1143)
Bug fixes:
- Ensure spatial index `_sindex` is retained on dataframe copy (#1122)
This is a bug fix release to fix an important divide by zero bug in antialiased lines, along with improvements to documentation and handling of dependencies.
Thanks to @ianthomas23 and @adamjhawley.
Enhancements:
- Improvements to documentation:
- Improvements to handling of dependencies:
Bug fixes:
- Fix antialiased line divide by zero bug (#1099)
This release provides a number of important bug fixes and small enhancements from Ian Thomas along with infrastructure improvements from Maxime Liquet and new reductions from @tselea.
Enhancements:
- Improvements to antialiased lines:
- Improvements to `rescale_discrete_levels` for `how='eq_hist'`:
- Implementation of first and last reduction (#1093) for data types other than raster.
Bug fixes:
- Do not snap trimesh vertices to pixel grid (#1092)
- Correctly orient (y, x) arrays for xarray (#1095)
- Infrastructure/build fixes (#1080, #1089, #1096)
This release has been nearly a year in the making, with major new contributions from Ian Thomas, Thuy Do Thi Minh, Simon Høxbro Hansen, Maxime Liquet, and James Bednar, and additional support from Andrii Oriekhov, Philipp Rudiger, and Ajay Thorve.
Enhancements:
- Full support for antialiased lines of specified width (#1048, #1072). Previous antialiasing support was limited to single-pixel lines and certain floating-point reduction functions. Now supports arbitrary widths and arbitrary reduction functions, making antialiasing fully supported. Performance ranges from 1.3x to 14x slower than the simplest zero-width implementation; see benchmarks.
- Fixed an issue with visibility on zoomed-in points plots and on overlapping line plots that was first reported in 2017, with a new option `rescale_discrete_levels` for `how='eq_hist'` (#1055)
- Added a categorical color_key for 2D (unstacked) aggregates (#1020), for producing plots where each pixel has at most one category value
- Improved docs:
- A brand new polygons guide (#1071)
- A new guide to 3D aggregations using `by`, now documenting using `categorizer` objects to do 3D numerical binning (#1071)
- Moved documentation for spreading to its own section so it can be presented at the right pipeline stage (was mixed up with colormapping before) (#1071)
- Added rescale_discrete_levels example (#1071)
- Other misc doc cleanup (#1035, #1037, #1058, #1074, #1077)
Bugfixes:
- Fixed details of the raster coordinate calculations to match other primitives, making it simpler to overlay separately rendered results (#959, #1046)
- Various fixes and extensions for cupy/CUDA, e.g. to use cuda for category_binning, spread, and dynspread, including cupy.interp where appropriate (#1015, #1016, #1044, #1050, #1060)
- Infrastructure/build/ecosystem fixes (#1022, #1025, #1027, #1036, #1045, #1049, #1050, #1057, #1061, #1062, #1063, #1064)
Compatibility:
- `Canvas.line()` option `antialias=True` is now deprecated; use `line_width=1` (or another nonzero value) instead. (#1048)
- Removed long-deprecated `bokeh_ext.py` (#1059)
- Dropped support for Python 2.7 (actually already dropped from the tests in Datashader 0.12) and 3.6 (no longer supported by many downstream libraries like rioxarray, but several of them are not properly declaring that restriction, making 3.6 much more difficult to support). (#1033)
- Now tested on Python 3.7, 3.8, 3.9, and 3.10. (#1033)
Thanks to Jim Bednar, Nezar Abdennur, Philipp Rudiger, and Jean-Luc Stevens.
Enhancements:
- Defined new `dynspread` metric based on counting the fraction of non-empty pixels that have non-empty pixels within a given radius. The resulting `dynspread` behavior is much more intuitive than the old behavior, which counted already-spread pixels as if they were neighbors (#1001)
- Added `ds.count()` as the default reduction for `ds.by` (#1004)
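The metric described above can be sketched in plain Python as follows. This is only an illustration of the stated definition: the function name is hypothetical, the grid is a list of rows with 0 meaning empty, and the use of Euclidean distance is an assumption that the release note does not confirm.

```python
def neighbor_fraction(grid, radius):
    """Fraction of non-empty pixels with another non-empty pixel within `radius`.

    `grid` is a list of rows; a falsy value marks an empty pixel.
    Euclidean distance is assumed for this sketch.
    """
    filled = [(r, c) for r, row in enumerate(grid)
              for c, value in enumerate(row) if value]
    if not filled:
        return 0.0
    limit = radius * radius
    with_neighbor = 0
    for r, c in filled:
        for r2, c2 in filled:
            if (r, c) != (r2, c2) and (r - r2) ** 2 + (c - c2) ** 2 <= limit:
                with_neighbor += 1
                break  # one neighbor is enough for this pixel
    return with_neighbor / len(filled)
```

A spreading routine built on such a metric would presumably keep increasing the spread radius while this fraction stays below a threshold. The pairwise loop here is quadratic in the number of filled pixels and is meant for clarity, not speed.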
Bugfixes:
- Fixed array-bounds reading error in `dynspread` (#1001)
- Fix `color_key` argument for `dsshow` (#986)
- Added Matplotlib output to the 3_Interactivity getting started page. (#1009)
- Misc docs fixes (#1007)
- Fix nan assignment to integer array in RaggedArray (#1008)
Compatibility:
- Any usage of `dynspread` with datatypes other than points should be replaced with `spread()`, which will do what was probably intended by the original `dynspread` call, i.e. to make isolated lines and shapes visible. Strictly speaking, dynspread could still be useful for other glyph types if that glyph is contained entirely in a pixel, e.g. if a polygon or line segment is located within the pixel bounds, but that seems unlikely.
- Dynspread may need to have the threshold or max_px arguments updated to achieve the same spreading as in previous releases, though the new behavior is normally going to be more useful than the old.
Major release with new features that should really be considered part of the upcoming 0.13 release; please treat all the new features as experimental in this release due to it being officially a minor release (unintentionally).
Massive thanks to these contributors for substantial new functionality:
- Nezar Abdennur (nvictus), Trevor Manz, and Thomas Caswell for their contributions to the new `dsshow()` support for using Datashader as a Matplotlib Artist, providing seamless interactive Matplotlib+Datashader plots.
- Oleg Smirnov for `category_modulo` and `category_binning` for `by()`, making categorical plots vastly more powerful.
- Jean-Luc Stevens for `spread` and `dynspread` support for numerical aggregate arrays and not just RGB images, allowing isolated datapoints to be made visible while still supporting hover, colorbars, and other plot features that depend on the numeric aggregate values.
- Valentin Haenel for the initial anti-aliased line drawing support (still experimental).
Thanks to Jim Bednar, Philipp Rudiger, Peter Roelants, Thuy Do Thi Minh, Chris Ball, and Jean-Luc Stevens for maintenance and other contributions.
New features:
- Expanded (and transposed) performance guide table (#961)
- Add `category_modulo` and `category_binning` for grouping numerical values into categories using by() (#927)
- Support spreading for numerical (non-RGB) aggregate arrays (#771, #954)
- Xiaolin Wu anti-aliased line drawing, enabled by adding `antialias=True` to the `Canvas.line()` method call. Experimental; currently restricted to `sum` and `max` reductions and only supporting a single-pixel line width. (#916)
- Improve Dask performance issue using a tree reduction (#926)
Bugfixes:
- Fix for xarray 0.17 raster files, supporting various nodata conventions (#991)
- Fix RaggedArray tests to keep up with Pandas test suite changes (#982, #993)
- Fix out-of-bounds error on Points aggregation (#981)
- Fix CUDA issues (#973)
- Fix Xarray handling (#971)
- Disable the interactivity warning on the homepage (#983)
Compatibility:
- Drop deprecated modules `ds.geo` (moved to `xarray_spatial`) and `ds.spatial` (moved to `SpatialPandas`) (#955)
No release notes produced.
This release is primarily a compatibility release for newer versions of Rapids cuDF and Numba versions along with a small number of bug fixes. With contributions from @jonmmease, @stuartarchibald, @AjayThorve, @kebowen730, @jbednar and @philippjfr.
- Fixes support for cuDF 0.13 and Numba 0.48 (#933)
- Fixes for cuDF support on Numba>=0.51 (#934, #947)
- Fixes tile generation using aggregators with output of boolean dtype (#949)
- Fixes for CI and build infrastructure (#935, #948, #951)
- Updates to docstrings (b1349e3, #950)
This release includes major contributions from @maihde (generalizing `count_cat` to `by`, span for colorize), @jonmmease (Dask quadmesh support), @philippjfr and @jbednar (count_cat/by/colorize/docs/bugfixes), and Barry Bragg, Jr. (TMS tileset speedups).
New features (see `getting_started/2_Pipeline.ipynb` for examples):
- New `by()` categorical aggregator, extending `count_cat` to work with other reduction functions, no longer just `count`. Allows binning of aggregates separately per category value, so that you can compare how that aggregate is affected by category value. (#875, #902, #904, #906). See example in the holoviews docs.
- Support for negative and zero values in `tf.shade` for categorical aggregates. (#896, #909, #910, #908)
- Support for `span` in _colorize(). (#875, #910)
- Support for Dask-based quadmesh rendering for rectilinear and curvilinear mesh types (#885, #913)
- Support for GPU-based raster mesh rendering via `Canvas.quadmesh` (#872)
- Faster TMS tileset generation (#886)
- Expanded performance guide (#868)
Bugfixes:
Compatibility (breaking changes and deprecations):
- To allow negative-valued aggregates, count_cat now weights categories according to how far they are from the minimum aggregate value observed, while previously they were referenced to zero. Previous behavior can be restored by passing `color_baseline=0` to `count_cat` or `by`.
- `count_cat` is now deprecated and removed from the docs; use `by(..., count())` instead.
- Result of a `count()` aggregation is now `uint32` not `int32` to distinguish counts from other aggregation types (#910).
- tf.shade now only treats zero values as missing for `count` aggregates (`uint`); zero is otherwise a valid value distinct from NaN (#910).
- `alpha` is now respected as the upper end of the alpha range for both _colorize() and _interpolate() in tf.shade; previously only _interpolate respected it.
- Added new nansum_missing utility for working with Numpy>1.9, where nansum no longer returns NaN for all-NaN values.
- ds.geo and ds.spatial modules are now deprecated; their contents have moved to xarray_spatial and spatialpandas, respectively. (#894)
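The behavior that a `nansum_missing`-style utility restores can be sketched in a few lines: ignore NaN while summing, but report NaN when there was nothing valid to sum. The version below works on a flat list and is only an illustration; the real utility operates on arrays, and its exact signature is not given in these notes.

```python
import math


def nansum_missing(values):
    """Sum values ignoring NaN, but return NaN when every value is NaN.

    Treating an empty input as missing is an assumption of this sketch.
    """
    valid = [v for v in values if not math.isnan(v)]
    if not valid:
        return math.nan  # nothing valid: report missing rather than 0
    return sum(valid)
```

The distinction matters for shading, where a pixel with no data should stay transparent instead of being drawn as a genuine zero.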
Download and install: https://datashader.org/getting_started
This release includes major contributions from @jonmmease (polygon rendering, spatialpandas), along with contributions from @philippjfr and @brendancol (bugfixes), and @jbednar (docs, warnings, and import times).
New features:
- Polygon (and points and lines) rendering for spatialpandas extension arrays (#826, #853)
- Quadmesh GPU support (#861)
- Much faster import times (#863)
- New table in docs listing glyphs supported for each data library (#864, #867)
- Support for remote Parquet filesystems (#818, #866)
Bugfixes and compatibility:
- Misc bugfixes and improvements (#844, #860, #866)
- Fix warnings and deprecations in tests (#859)
- Fix Canvas.raster (padding, mode buffers, etc. #862)
Download and install: https://datashader.org/getting_started
This release includes major contributions from @jonmmease (GPU support), along with contributions from @brendancol (viewshed speedups), @jbednar (docs), and @jsignell (examples, maintenance, website).
New features:
- Support for CUDA GPU dataframes (cudf and dask_cudf) (#794, #793, #821, #841, #842)
- Documented new quadmesh support (renaming user guide section 5_Rasters to 5_Grids to reflect the more-general grid support) (#805)
Bugfixes and compatibility:
- Avoid double-counting line segments that fit entirely into a single rendered pixel (#839)
- Improved geospatial toolbox, including 75X speedups to viewshed algorithm (#811, #824, #844)
This release includes major contributions from @jonmmease (quadmesh and filled-area support), @brendancol (geospatial toolbox, tile previewer), @philippjfr (distributed regridding, dask performance), and @jsignell (examples, maintenance, website).
New features:
- Native quadmesh (`canvas.quadmesh()`) support (for rectilinear and curvilinear grids -- 3X faster than approximating with a trimesh; #779)
- Filled area (`canvas.area()`) support (#734)
- Expanded geospatial toolbox, with support for:
- Distributed raster regridding with Dask (#762)
- Improved dask performance (#798, #801)
- `tile_previewer` utility function (simple Bokeh-based plotting of local tile sources for debugging; #761)
Bugfixes and compatibility:
- Compatibility with latest Numba, Intake, Pandas, and Xarray (#763, #768, #791)
- Improved datetime support (#803)
- Simplified docs (now built on Travis, and no longer requiring GeoViews) and examples (now on examples.pyviz.org)
- Skip rendering of empty tiles (#760)
- Improved performance for point, area, and line glyphs (#780)
- `InteractiveImage` and `Pipeline` are now deprecated; removed from examples (#751)
This release includes major contributions from @jonmmease (ragged array extension, SpatialPointsFrame, row-oriented line storage, dask trimesh support), @jsignell (maintenance, website), and @jbednar (Panel-based dashboard).
New features:
- Simplified Panel based dashboard using new Param features; now only 48 lines with fewer new concepts (#707)
- Added pandas ExtensionArray and Dask support for storing homogeneous ragged arrays (#687)
- Added SpatialPointsFrame and updated census, osm-1billion, and osm examples to use it (#702, #706, #708)
- Expanded 8_Geography.ipynb to document other geo-related functions
- Added Dask support for trimesh rendering, though computing the mesh initially still requires vertices and simplices to fit into memory (#696)
- Add zero-copy rendering of row-oriented line coordinates, using a new axis argument (#694)
Bugfixes and compatibility:
- Added lnglat_to_meters to geo module; new code should import it from there (#708)
This release includes major contributions from @jonmmease (fixing several long-standing bugs), @jlstevens (updating all example notebooks to use current syntax, #685), @jbednar, @philippjfr, and @jsignell (Panel-based dashboard), and @brendancol (geo utilities).
New features:
- Replaced outdated 536-line Bokeh dashboard.py with 71-line Panel+HoloViews dashboard (#676)
- Allow aggregating xarray objects (in addition to Pandas and Dask DataFrames) (#675)
- Create WMTS tiles from Datashader data (#636)
- Added various geographic utility functions (ndvi, slope, aspect, hillshade, mean, bump map, Perlin noise) (#661)
- Made OpenSky data public (#691)
Bugfixes and compatibility:
- Fix array bounds error on line glyph (#683)
- Fixed the span argument to tf.shade (#680)
- Fixed composite.add (for use in spreading) to clip colors rather than overflow (#689)
- Fixed gerrymandering shape file (#688)
- Updated to match Bokeh (#656), Dask (#681, #667), Pandas/Numpy (#697)
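The difference between clipping and overflowing when adding colors, as in the `composite.add` fix above, can be shown with 8-bit channel values. The helpers below are hypothetical and work on plain integers; they only illustrate why saturating arithmetic is the safer choice for compositing.

```python
def add_clipped(a, b, max_value=255):
    """Add two 8-bit channel values, saturating at max_value."""
    return min(a + b, max_value)


def add_wrapping(a, b, modulus=256):
    """Add two 8-bit channel values with wraparound, as unsigned overflow would."""
    return (a + b) % modulus


def add_pixels(p, q):
    """Channel-wise saturating add of two RGBA tuples."""
    return tuple(add_clipped(x, y) for x, y in zip(p, q))
```

With wraparound, adding two bright values such as 200 and 100 gives 44, a dark value, whereas clipping gives 255.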
Minor, mostly bugfix, release with some speed improvements.
New features:
Bugfixes and compatibility:
- Silenced inappropriate warnings (#631)
- Fixed various other bugs, including #644
- Added handling for zero data and zero range (#612, #648)
Minor compatibility release.
- Supports dask >= 0.18.
- Updated installation and usage instructions
Minor bugfix release.
- Now available to install using pip (`pip install datashader`) or conda defaults (`conda install datashader`)
- InteractiveImage is now deprecated; please use the Datashader support in HoloViews instead.
- Updated installation and example instructions to use new `datashader` command.
- Made package building automatic, to allow more frequent releases
- Ensured transparent (not black) image is returned when there is no data to plot (thanks to Nick Xie)
- Simplified getting-started example (thanks to David Jones)
- Various fixes and compatibility updates to examples
Major release with extensive support for triangular meshes and changes to the raster API.
New features:
- Trimesh support: Rendering of irregular triangular meshes using `Canvas.trimesh()` (see user guide) (#525, #552)
- Added a new website at datashader.org, with new Getting Started pages and an extensive User Guide, with about 50% new material not previously in example notebooks. Built entirely from Jupyter notebooks, which can be run in the `examples/` directory. Website is now complete except for sections on points (see the nyc_taxi example in the meantime).
- `Canvas.raster()` now accepts xarray Dataset types, not just DataArrays, with the specific DataArray selectable from the Dataset using the `column=` argument of a supplied aggregation function.
- `tf.Images()` now displays anything with an HTML representation, to allow laying out Pandas dataframes alongside datashader output.
Bugfixes and compatibility:
- Changed Raster API to match other glyph types:
- Now accepts a reduction function via an `agg=` argument like `Canvas.line()`, `Canvas.points()`, etc. The previous `downsample_method` is still accepted for this release, but is now deprecated.
- `upsample_method` is now `interpolate`, accepting `linear=True` or `linear=False`; the previous spelling is now deprecated.
- The `layer=` argument previously accepted a 1-based integer index, which was confusing given the standard Python 0-based indexing elsewhere. Changed to accept an xarray coordinate, which can be a 1-based index if that's what is defined on the array, but also works with arbitrary floating-point coordinates (e.g. for a depth parameter in an image stack).
- Now auto-ranges in x and y when not given explicit ranges, instead of raising an error.
- Fixed various bugs, including one generating incorrect output in `Canvas.raster(agg='mode')`
Minor compatibility release to track changes in external packages.
- Updated imports for bokeh 0.12.11 (fixes #535), though there are issues in 0.12.11 itself and so 0.12.12 should be used instead (to be released shortly).
- Pinned pillow version on Windows (fixes #534).
Apart from the new website, this is a minor release primarily to catch up with changes in external libraries.
New features:
- Reorganized examples directory as the basis for a completely new website at https://bokeh.github.io/datashader-docs (#516).
- Added tf.Images() class to format multiple labeled Datashader images as a table in a Jupyter notebook, now used extensively in the new website.
- Added utility function `dataframe_from_multiple_sequences(x_values, y_values)` to convert large numbers of sequences stored as 2D numpy arrays to a NaN-separated pandas dataframe that can be displayed efficiently (see new example in tseries.ipynb) (#512).
- Improved streaming support (#520).
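The NaN-separated layout mentioned in the list above can be illustrated without pandas or numpy. The helper below is a hypothetical stand-in, not the utility itself: it flattens several equal-length x and y sequences into two long lists, with a NaN after each sequence so that a line renderer knows where one line ends and the next begins.

```python
import math


def nan_separated(x_values, y_values):
    """Flatten several (x, y) sequences into two lists, NaN between sequences."""
    xs, ys = [], []
    for seq_x, seq_y in zip(x_values, y_values):
        if len(seq_x) != len(seq_y):
            raise ValueError("x and y sequences must have equal length")
        xs.extend(seq_x)
        ys.extend(seq_y)
        xs.append(math.nan)  # separator: the line is broken at this row
        ys.append(math.nan)
    return xs, ys


xs, ys = nan_separated([[0, 1], [0, 1]], [[5, 6], [7, 8]])
```

Storing all lines in one pair of columns is what lets a single aggregation pass draw a very large number of separate curves.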
Bugfixes and compatibility:
- Added support for Dask 0.15 and 0.16 and pandas 0.21 (#523, #529) and declared minimum required Numba version.
- Improved and fixed issues with various example notebooks, primarily to update for changes in dependencies.
- Changes in network graph support: ignore id field by default to avoid surprising dependence on column name, rename directly_connect_edges to connect_edges for accuracy and conciseness.
Release with bugfixes, changes to match external libraries, and some new features.
Backwards compatibility:
- Minor changes to network graph API, e.g. to ignore weights by default in forcelayout2 (#488)
- Fix upper-bound bin error for auto-ranged data (#459). Previously, points falling on the upper bound of the plotted area were excluded from the plot, which was consistent with the behavior for individual grid cells, but which was confusing and misleading for the outer boundaries. Points falling on the very outermost boundaries are now folded into the final grid cell, which should be the least surprising behavior.
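The boundary rule described above can be sketched as a small binning function. The function name and signature are hypothetical; the sketch assumes equal-width bins, keeps interior cell boundaries upper-exclusive, and folds a value exactly on the outer upper bound into the final cell.

```python
def bin_index(value, low, high, nbins):
    """Return the bin for `value` in [low, high], or None if outside.

    Interior boundaries are upper-exclusive, but a value equal to `high`
    is folded into the last bin instead of being dropped.
    """
    if value < low or value > high:
        return None
    if value == high:
        return nbins - 1
    index = int((value - low) / (high - low) * nbins)
    return min(index, nbins - 1)  # guard against rounding at the top edge
```

Without the special case, the maximum of an auto-ranged dataset would always fall just outside the last cell and disappear from the plot.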
New or updated examples (.ipynb files in examples/):
- streaming-aggregation.ipynb: Illustrates combining incoming streams of data for display (also see holoviews streaming).
- landsat.ipynb: simplified using HoloViews; now includes plots of full spectrum for each point via hovering.
- Updated and simplified census-hv-dask (now called census-congressional), census-hv, packet_capture_graph.
New features and improvements
- Updated Bokeh support to work with new bokeh 0.12.10 release (#505)
- More options for network/graph plotting (configurable column names, control over weights usage; #488, #494)
- For line plots (time series, trajectory, network graphs), switch line-clipping algorithm from Cohen-Sutherland to Liang-Barsky. Performance gains for random lines range from 50% to 75% for a million lines. (#495)
- Added `tf.Images` class to format a list of images as an HTML table (#492)
- Faster resampling/regridding operations (#486)
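For reference, a textbook version of Liang-Barsky clipping is sketched below. It is a plain-Python illustration of the algorithm named in the list above, with a hypothetical function name and argument order, and is not the compiled implementation used by the library.

```python
def liang_barsky(x0, y0, x1, y1, xmin, ymin, xmax, ymax):
    """Clip a segment to a rectangle; return the clipped segment or None."""
    dx, dy = x1 - x0, y1 - y0
    t0, t1 = 0.0, 1.0
    # Each (p, q) pair tests one boundary: left, right, bottom, top.
    for p, q in ((-dx, x0 - xmin), (dx, xmax - x0),
                 (-dy, y0 - ymin), (dy, ymax - y0)):
        if p == 0:
            if q < 0:
                return None  # parallel to this boundary and outside it
            continue
        t = q / p
        if p < 0:
            if t > t1:
                return None
            t0 = max(t0, t)  # segment enters the rectangle here
        else:
            if t < t0:
                return None
            t1 = min(t1, t)  # segment leaves the rectangle here
    return (x0 + t0 * dx, y0 + t0 * dy, x0 + t1 * dx, y0 + t1 * dy)
```

The algorithm works on the parametric form of the segment and narrows a single interval of the parameter, so it needs at most four divisions per segment and can reject a segment as soon as the interval becomes empty.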
Known issues:
- examples/dashboard has not yet been updated to match other libraries, and is thus missing functionality like hovering and legends.
- A full website with documentation has been started but is not yet ready for deployment.
Minor bugfix release, primarily updating example notebooks to match API changes in external packages.
Backwards compatibility:
- Made edge bundling retain edge order, to allow indexing, and absolute coordinates, to allow overlaying on external data.
- Updated examples to show that xarray now requires dimension names to match before doing arithmetic or comparisons between arrays.
Known issues:
- If you use Jupyter notebook 5.0 (earlier or later versions should be ok), you will need to override a setting that prevents visualizations from appearing, e.g.:
jupyter notebook --NotebookApp.iopub_data_rate_limit=100000000 census.ipynb &
- The dashboard needs to be rewritten entirely to match current Bokeh and HoloViews releases, so that hover and legend support can be restored.
New release of features that may still be in progress, but are already usable:
- Added graph/network plotting support (still may be in flux) (#385, #390, #398, #408, #415, #418, #436)
- Improved raster regridding based on gridtools and xarray (still may be in flux); no longer depends on rasterio and scikit-image (#383, #389, #423)
- Significantly improved performance for dataframes with categorical fields
New examples (.ipynb files in examples/):
- osm-1billion: 1-billion-point OSM example, for in-core processing on a 16GB laptop.
- edge_bundling: Plotting graphs using "edgehammer" bundling of edges to show structure.
- packet_capture_graph: Laying out and visualizing network packets as a graph.
Backwards compatibility:
- Remove deprecated interpolate and colorize functions
- Made raster processing consistently use bin centers to match xarray conventions (requires recent fixes to xarray; only available on a custom channel for now) (#422)
- Fixed various limitations and quirks for NaN values
- Made alpha scaling respect `min_alpha` consistently (#371)
Known issues:
- If you use Jupyter notebook 5.0 (earlier or later versions should be ok), you will need to override a setting that prevents visualizations from appearing, e.g.:
jupyter notebook --NotebookApp.iopub_data_rate_limit=100000000 census.ipynb &
- The dashboard needs updating to match current Bokeh releases; most parts other than hover and legends should be functional, but it needs a rewrite to use currently recommended approaches.
Major release with extensive optimizations and new plotting-library support, incorporating 9 months of development from 5 main contributors:
- Extensive optimizations for speed and memory usage, providing at least 5X improvements in speed (using the latest Numba versions) and 2X improvements in peak memory requirements.
- Added HoloViews support for flexible, composable, dynamic plotting, making it simple to switch between datashaded and non-datashaded versions of a Bokeh or Matplotlib plot.
- Added examples/environment.yml to make it easy to install dependencies needed to run the examples.
- Updated examples to use the now-recommended supported and fast Apache Parquet file format
- Added support for variable alpha for non-categorical aggregates, by specifying a single color rather than a list or colormap (#345)
- Added datashader.utils.lnglat_to_meters utility function for working in Web Mercator coordinates with Bokeh
- Added discussion of why you should be using uniform colormaps, and examples of using uniform colormaps from the new colorcet package
- Numerous bug fixes and updates, mostly in the examples and Bokeh extension
- Updated reference manual and documentation
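For readers who want to see what a longitude/latitude to Web Mercator conversion involves, here is a standalone sketch using the standard spherical Mercator formulas. It reuses the name of the utility from the list above for readability, but it is an independent scalar reimplementation written for illustration, not the library's code.

```python
import math

# Half the width of the Web Mercator world in meters (pi times the earth radius).
ORIGIN_SHIFT = math.pi * 6378137.0


def lnglat_to_meters(longitude, latitude):
    """Project longitude/latitude in degrees to Web Mercator easting/northing."""
    easting = longitude * ORIGIN_SHIFT / 180.0
    angle = (90.0 + latitude) * math.pi / 360.0
    northing = math.log(math.tan(angle)) * ORIGIN_SHIFT / math.pi
    return easting, northing
```

The projection is undefined at the poles, where the tangent term diverges, which is why Web Mercator maps are conventionally cut off at about 85 degrees of latitude.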
New examples (.ipynb files in examples/):
- holoviews_datashader: Using HoloViews to create dynamic Datashader plots easily
- census-hv-dask: Using GeoViews for overlaying shape files, demonstrating gerrymandering by race
- nyc_taxi-paramnb: Using ParamNB to make a simple dashboard
- lidar: Visualizing point clouds
- solar: Visualizing solar radiation data
- Dynamic 1D histogram example (last code cell in examples/nyc_taxi-nongeo.ipynb)
- dashboard: Now includes opensky example (`python dashboard/dashboard.py -c dashboard/opensky.yml`)
Backwards compatibility:
- To improve consistency with Numpy and Python data structures and eliminate issues with an empty column and row at the edge of the aggregated raster, the provided xrange,yrange bounds are now treated as upper exclusive. Results will thus differ between 0.5.0 and earlier versions. See #259 for discussion.
Known issues:
- If you use Jupyter notebook 5.0 (earlier or later versions should be ok), you will need to override a setting that prevents visualizations from appearing, e.g.:
jupyter notebook --NotebookApp.iopub_data_rate_limit=100000000 census.ipynb &
- Legend and hover support is currently disabled for the dashboard, due to ongoing development of a simpler approach.
Minor bugfix release to support Bokeh 0.12.1, with some API and defaults changes.
- Added examples() function to obtain the notebooks and other examples corresponding to the installed datashader version; see examples/README.md.
- Updated dashboard example to match changes in Bokeh
- Added default color cycle with distinguishable colors for shading categorical data; now tf.shade(agg) with no other arguments should give a usable plot for both categorical and non-categorical data.
Backwards compatibility:
- Replaced confusing tf.interpolate() and tf.colorize() functions with a single shading function tf.shade(). The previous names are still supported, but give deprecation warnings. Calls to the previous functions using keyword arguments can simply be renamed to use tf.shade as all the same keywords are accepted, but calls to colorize that used a positional argument for e.g. the color_key will now need to use a keyword when calling shade()
- Increased default threshold for tf.dynspread() to improve visibility of sparse dots
- Increased default min_alpha for tf.shade() (formerly tf.colorize()) to avoid undersaturation
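The positional-argument caveat in the first item above can be illustrated with stand-in functions. The shade and colorize functions below are hypothetical stubs that only echo their arguments; their signatures, including the assumption that cmap comes before color_key, were chosen for this sketch and are not copied from Datashader.

```python
import warnings


def shade(agg, cmap=None, color_key=None):
    """Hypothetical stand-in for tf.shade(); just records its arguments."""
    return {"agg": agg, "cmap": cmap, "color_key": color_key}


def colorize(agg, color_key=None):
    """Hypothetical stand-in for the deprecated name: warn, then forward."""
    warnings.warn(
        "colorize is deprecated; use shade instead",
        DeprecationWarning,
        stacklevel=2,
    )
    return shade(agg, color_key=color_key)


key = {"a": "red", "b": "blue"}

# Old style: the color key is passed positionally to colorize.
with warnings.catch_warnings():
    warnings.simplefilter("ignore", DeprecationWarning)
    old = colorize("agg", key)

# A naive rename binds the same value to cmap instead of color_key...
naive = shade("agg", key)

# ...so the migrated call passes it by keyword.
migrated = shade("agg", color_key=key)
```

Calls that already used keywords need only the function name changed.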
Known issues:
- For Bokeh 0.12.1, some notebooks will give warnings for Bokeh plots when used with Jupyter's "Run All" command. Bokeh 0.12.2 will fix this problem when it is released, but for now you can either downgrade to 0.12.0 or use single-cell execution.
- There are some Bokeh compatibility issues with the dashboard example that are still being investigated and may require a new Bokeh or datashader release in this series.
Minor bugfix release to support Bokeh 0.12:
- Fixed InteractiveImage zooming to work with Bokeh 0.12.
- Added more responsive event throttling for DynamicImage; throttle parameter no longer needed and is now deprecated
- Fixed datashader-download-data command
- Improved non-geo Taxi example
- Temporarily disabled dashboard legends; will re-enable in future release
The major feature of this release is support of raster data via Canvas.raster. To use this feature, you must install the optional dependencies via conda install rasterio scikit-image. Rasterio relies on gdal, whose conda package has some known bugs, including a missing dependency that requires a separate conda install krb5. InteractiveImage in this release requires bokeh 0.11.1 or earlier, and will not work with bokeh 0.12.
- PR #160 #187 Improved example notebooks and dashboard
- PR #186 #184 #178 Added datashader-download-data CLI command for grabbing example datasets
- PR #176 #177 Changed census example data to use HDF5 format (slower but more portable)
- PR #156 #173 #174 Added Landsat8 and race/ethnicity vs. elevation example notebooks
- PR #172 #159 #157 #149 Added support for images using Canvas.raster (requires rasterio and scikit-image).
- PR #169 Added legends notebook demonstrating create_categorical_legend and create_ramp_legend
- PR #162 Added notebook example for datashader.bokeh_ext.HoverLayer
- PR #152 Added alpha arg to tf.interpolate
- PR #151 #150, etc. Small bugfixes
- PR #146 #145 #144 #143 Added streaming example
- Added hold decorator to utils, summarize_aggregate_values helper function
- Added FAQ to docs
Backwards compatibility:
- Removed memoize_method
- Renamed datashader.callbacks --> datashader.bokeh_ext
- Renamed examples/plotting_problems.ipynb --> examples/plotting_pitfalls.ipynb
A major release with significant new functionality and some small backwards-incompatible changes.
New features:
- PR #124, census: New census notebook example, showing how to work with categorical data.
- PR #79, tseries, trajectory: Added line glyph and .any() reduction, used in new time series and trajectory notebook examples.
- PR #76, #77, #131: Updated all of the other notebooks in examples/, including nyc_taxi.
- PR #100, #125: Improved dashboard example: added categorical data support, census and osm datasets, legend and hover support, better performance, out of core option, and more
- PR #109, #111: Added full colormap support via a new cmap argument to interpolate and colorize; supports color ranges as lists, plus Bokeh palettes and matplotlib colormaps
- PR #98: Added set_background to make it easier to work with images whose background color differs from the default white of notebooks
- PR #119, #121: Added eq_hist option for how in interpolate, performing histogram equalization on the data to reveal structure at every intensity level
- PR #80, #83, #128: Greatly improved InteractiveImage performance and responsiveness
- PR #74, #123: Added operators for spreading pixels (to make individual datapoints visible, as circles, squares, or arbitrary mask shapes) and compositing (for simple and flexible composition of images)
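The idea behind the eq_hist option can be sketched with a rank-based simplification: each value is replaced by its position in the empirical cumulative distribution, so intensities are spread evenly however skewed the data is. The function eq_hist_sketch below is a hypothetical illustration over a plain list; it does not reproduce how Datashader computes equalization on aggregate arrays.

```python
from bisect import bisect_right


def eq_hist_sketch(values):
    """Map each value to its empirical CDF position in (0, 1]."""
    ordered = sorted(values)
    n = len(ordered)
    return [bisect_right(ordered, v) / n for v in values]


# Heavily skewed data, e.g. per-pixel counts with one very dense pixel.
counts = [1, 2, 3, 1000]

# Linear scaling squeezes the three small values together near zero.
linear = [c / max(counts) for c in counts]

# Equalization spaces all four values evenly across the intensity range.
equalized = eq_hist_sketch(counts)
```

With linear scaling the first three pixels would be nearly indistinguishable, while the equalized values give each of them a distinct intensity.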
Backwards compatibility:
- The low and high color options to interpolate and colorize are now deprecated and will be removed in the next release; use cmap=[low,high] instead.
- The transfer function merge has been removed to avoid confusion. stack and others can be used instead, depending on the use case.
- The default how for interpolate and colorize is now eq_hist, to reveal the structure automatically regardless of distribution.
- Pipeline now has a default dynspread step, to make isolated points visible when zooming in, and the default sizes have changed.
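To show why a spreading step makes isolated points visible, here is a minimal sketch of square spreading on a binary grid. The helper spread_square is hypothetical and much simpler than the library's spreading operators, which also support other mask shapes; how dynspread chooses the amount of spreading is not modeled here.

```python
def spread_square(grid, px=1):
    """Return a copy of a 2D 0/1 grid with each set pixel grown into a square.

    Each set pixel covers a square of side 2 * px + 1, clipped at the edges.
    """
    rows, cols = len(grid), len(grid[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if grid[r][c]:
                for rr in range(max(0, r - px), min(rows, r + px + 1)):
                    for cc in range(max(0, c - px), min(cols, c + px + 1)):
                        out[rr][cc] = 1
    return out


# A single isolated point in the middle of a 5x5 image...
image = [[0] * 5 for _ in range(5)]
image[2][2] = 1

# ...becomes a 3x3 block after spreading by one pixel.
spread = spread_square(image, px=1)
```

A one-pixel dot is easy to miss on a high-resolution display; the spread block is not.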
Initial public release.