Releases: Bears-R-Us/arkouda
Releases · Bears-R-Us/arkouda
Release Notes v2022.09.30
Release Notes 2022-09-30
Major updates:
- Issue #1814 - Changes behavior of
search_intervals
andinterval_lookup
to have closed upper bounds - Issue #1810 - Adds
get_prefixes
andget_suffixes
toStrings
- Issues #1787, #1778 - Add
GroupBy.std()
,GroupBy.var()
, andGroupBy.median()
- Issue #1808 - Fixes runtime errors in
ak.in1d()
- Issue #1794 - Adds documentation of similarities/differences with
Numpy
&Pandas
- Issues #1781, #1820 and PRs #1813, #1807 - Make Arkouda compatible with Chapel 1.28
- Issues #1709, #1745, #1743, #1767, #1765, #1746, #1761, #1760, #1755, #1763, #1757, #1731 - Update server messages to use JSON arguments
Minor fixes:
- Issue #1742 - Updates symbol table structure
- Issue #1774 - Updates message function definitions
- Issues #1681, #1783 - Move
SegArray.__init__
andSegArray.__getitem__
to server - Issues #1650, #1751, #1747 - Add chapel version to
get_config
and splash message and add connect url to server connection info - Issues #1752, #1749 - Update
README
andCONTRIBUTING.md
Auto-generated release notes
- Closes #1709 -
FindSegmentsMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1714 - add getChplVersion/chplVersion to ServerConfig and chplVersion splash message by @hokiegeek2 in #1741
- Closes #1681 -
SegArray.__init__
to Server by @Ethan-DeBandi99 in #1739 - Closes #1751 - Include Chapel Version in
get_config
by @Ethan-DeBandi99 in #1753 - Closes #1745 -
MetricsMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1750 - 1747 add connect url to server connection info by @hokiegeek2 in #1758
- Closes #1752: Update
README
about by @pierce314159 in #1754 - Closes #1743 -
KExtremeMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1744 - Closes #1767 -
SortMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1768 - Closes #1765 -
SequenceMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1766 - Closes #1746 -
OperatorMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1748 - Closes #1761 -
RegistrationMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1762 - Closes #1760 -
Datetime
support in JSON message args by @Ethan-DeBandi99 in #1769 - Closes #1755 -
RandMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1756 - Closes #1763 -
SegmentedMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1764 - Closes #1757 -
ReductionMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1759 - Closes #1742 - Symbol Table Structure Update by @Ethan-DeBandi99 in #1770
- Closes #1749 - Update
CONTRIBUTING.md
Bug Report by @Ethan-DeBandi99 in #1771 - Closes #1731 - JSON Message Arguments in Chapel by @Ethan-DeBandi99 in #1773
- Closes #1785: Bind
pyzmq<24.0.0
by @pierce314159 in #1786 - Closes #1784: Enable
pyzmq>=24.0.0
by @pierce314159 in #1788 - Closes #1778: Implement
GroupBy.median()
by @pierce314159 in #1780 - Closes #1774 - Message Function Definition Update by @Ethan-DeBandi99 in #1777
- Closes #1781 - Quiet deprecation warnings in prep for Chapel 1.28 by @ronawho in #1782
- Closes #1787: Implement
GroupBy.std()
andGroupBy.var()
by @pierce314159 in #1791 - Closes #1783 -
SegArray.__getitem__
to Server by @Ethan-DeBandi99 in #1790 - Closes #1794 - Documentation of Similarities/Differences with
Numpy
&Pandas
by @Ethan-DeBandi99 in #1797 - Closes #1808 -
ak.in1d()
Runtime Errors by @Ethan-DeBandi99 in #1809 - Closes #1810 add
get_prefixes
andget_suffixes
toStrings
by @reuster986 in #1811 - Raise instantiation limit for 1.26/1.27 by @ronawho in #1813
- Closes #1814 Change interval arguments to closed by @reuster986 in #1815
- Recommend Chapel 1.28.0 and use it for CI testing by @ronawho in #1807
- Resolves #1820: Adjust formal names of function type for callback in 'registerFunction' by @dlongnecke-cray in #1819
Full Changelog: v2022.08.30...v2022.09.30
Release Notes v2022.08.30
Release Notes 2022-08-30
Major updates:
- Issue #1728 - Fixes
assume_sorted
bug innunique
- Issue #1611 - Adds
register
andattach
functionality toDataFrame
- Issue #1376 - Adds
Datetime
andTimedelta
attributes - Issue #1571 - Moves
SegArray
to server - Issue #1725 - Adds
sqrt
andpower
functions for pdarrays - Issues #1684, #1691, #1697, #1699, #1703, #1705, #1711, #1713, #1716, #1718, #1723 - Update server messages to use JSON arguments
Minor fixes:
- Issue #1613 - Enables regex with patterns that match the empty string
- Issue #1676 - Fixes
Strings.to_ndarray
for emptyStrings
- PR #1732 - Adds client mode environment variable
- PR #1720 - Adds logic to call
exit(0)
only when there are2..n ArkoudaServerDaemons
Auto-generated release notes
- Closes #1679 : Change std test to use assertAlmostEqual by @bmcdonald3 in #1680
- Closes #1613: Remove empty string regex workaround by @pierce314159 in #1677
- Closes #1692: Use assertAlmostEqual in stats test by @bmcdonald3 in #1693
- updated README files by @hokiegeek2 in #1701
- Closes #1684 -
ArgSortMsg.chpl
&UniqueMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1686 - Closes #1697,
BroadcastMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1698 - Closes #1703 -
DataFrameIndexingMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1704 - Closes #1705 -
EfuncMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1706 - Closes 1691 - Update
CastMsg.chpl
to use JSON Message Arguments by @Ethan-DeBandi99 in #1696 - Closes #1699 -
ConcatenateMsg.chpl
to JSON Message Arguments by @Ethan-DeBandi99 in #1702 - Closes #1571 - Moves SegArray to Server by @Ethan-DeBandi99 in #1683
- Closes #1676: Fixes
Strings.to_ndarray
for emptyStrings
by @pierce314159 in #1707 - Closes #1376: Add
Datetime
andTimedelta
attributes by @pierce314159 in #1690 - added logic to call exit(0) only when there are 2..n ArkoudaServerDaemons by @hokiegeek2 in #1720
- Closes #1716 -
In1dMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1717 - Closes #1713 -
HDF5Msg.chpl
&ParquetMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1715 - Closes #1723 -
JoinEqWithDTMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1724 - Closes #1718 -
IndexingMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1721 - #1728 Fix nunique by @reuster986 in #1733
- Closes #1611 - DataFrame Register and Attach Functionality by @joshmarshall1 in #1688
- Closes #1725 -
sqrt
andpower
functions on pdarrays by @Ethan-DeBandi99 in #1726 - Closes #1711 -
FlattenMsg.chpl
JSON Message Arguments by @Ethan-DeBandi99 in #1712 - Enable UI and API client mode by @hokiegeek2 in #1732
Full Changelog: v2022.08.15...v2022.08.30
Release Notes v2022.08.15
Release Notes 2022-08-15
Major updates:
- Issues #1618, #1630, #1631, #1632, #1654 - Improve JSON parameter object multiple
dtype
support and list processing, add nested JSON message argument support, and add a dispatch table for message argument generation - Issue #1659 and PRs #1601, #1602, and #1604 - Refactor
arkouda_server
to be a driver ofServerDaemon
s and enables external and metric integration - Issue #1639 - Adds aggregations for first, mode, and unique to Groupby
- Issue #1643 - Adds covariance and correlation methods for pdarray and correlation matrix method for DataFrame
- PR #1635 - Reduces sort bucket exchange overhead
- Issue #1666 - Removes support for Chapel 1.25
Minor fixes:
- Issue #1397 - Replaces current DataFrame display method with server implementation
- Issues #1648 and #1658 - Update docs to address issues with
brew install chapel
and add instructions to update environment - Issue #1642 - Fixes
testAllOperators
double counting mismatches - Issues #1589 and #1672 - Add test to verify randomness for
ak.randint
(disabled for the time being) - PR #1669 - Adds read support for Parquet uint32
Auto-generated release notes
- Closes #1630 - Use
Enum
forObjType
by @Ethan-DeBandi99 in #1647 - Closes #1654 - Fix JSON bug when param does not have
.dtype
by @Ethan-DeBandi99 in #1655 - arkouda_server refactor by @hokiegeek2 in #1601
- Closes #1631 - Message Arg Generation Using Dispatch Table by @Ethan-DeBandi99 in #1652
- Closes #1648 - Environment Update Instructions by @Ethan-DeBandi99 in #1649
- Closes #1642 -
testAllOperators
double counting when mismatch occurs by @Ethan-DeBandi99 in #1656 - Closes #1397 -
_get_head_tail_server()
replace_get_head_tail()
by @Ethan-DeBandi99 in #1653 - Added support for a comma-delimited list of ServerDaemonTypes #1659 by @hokiegeek2 in #1660
- Closes #1589 - Adds test to verify randomness for
ak.randint
by @joshmarshall1 in #1657 - Closes #1632 - Multiple
dtype
support and improved list processing by @Ethan-DeBandi99 in #1662 - Closes #1666: Remove support for Chapel 1.25 by @bmcdonald3 in #1667
- Add read support for Parquet uint32 by @bmcdonald3 in #1669
- Closes #1643:
Covariance
andCorrelation
by @pierce314159 in #1661 - Closes #1658 - Update
INSTALL.md
to Addressbrew install chapel
issues by @Ethan-DeBandi99 in #1664 - Remove broken TOC link by @bmcdonald3 in #1671
- Reduce sort bucket exchange overhead by @ronawho in #1635
- Closes #1672: Temporarily change test_randint_randomness not to run by @pierce314159 in #1673
- arkouda_server refactor with external integration by @hokiegeek2 in #1602
- Closes #1639: Adds aggregations for first, mode, and unique by @reuster986 in #1638
- Closes #1670: Remove
in-place
fromDataFrame.drop
docstring by @pierce314159 in #1674 - Closes #1618 - Nested JSON Message Argument Support by @Ethan-DeBandi99 in #1675
- arkouda_server_refactor with metrics integration by @hokiegeek2 in #1604
Full Changelog: v2022.07.28...v2022.08.15
Release Notes v2022.07.28
Release Notes 2022-07-28
Major updates:
- Issue #702 - Adds multidimensional object support for HDF5
- Issue #1495 - Adds multi-array support for
in1d
- Issue #1168 - Adds DataFrame Index renaming
- Issue #1620 - Fixes
uint
andfloat
binops - Issue #1607 - Re-enables
MultiIndex
support for mixed dtypes - Issue #1614 - Corrects
DataFrame
Index error on.append()
- Issue #1220 - Fixes modulo bias in
randint
- Issue #1519 - Adds strip method to Strings
- Issue #1598 - Allows for overlapping intervals and adds tiebreak to
search_intervals
- Issue #1605 - Adds
to_list
method to classes which haveto_ndarray
- Issue #1587 - Adds support for JSON formatted argument for server messages
- Issues #1637, #1622, #1645 - Correct floor division on edge cases
Minor fixes:
- Issues #1161 and #1573 - Update
make test-chapel
to run unit tests - Issue #1588 - Fixes bug in
__getitem__
's argument validation - Issue #1583 - Optimizes string indexing
- Issue #848 - Limits length of file lines logged
- Issues #1625 and #1627 - Set sphinx minimum version to 5.1.1
- Issue #1572 - Updates modular build process to initialize modules on command line
- Issue #1600 - Fixes bug where
SegArray.to_ndarray
returnedlist
Auto-generated release notes
- Closes #1161: Update
make test-chapel
to run unit tests by @pierce314159 in #1561 - Closes #1168 - DataFrame Index Renaming by @Ethan-DeBandi99 in #1564
- Closes #848: Limit Length of File Lines Logged by @pierce314159 in #1563
- Closes #1573: Remove python script generation for Chapel unit tests by @bmcdonald3 in #1575
- closes #1577 - 1566 Feedback Updates by @Ethan-DeBandi99 in #1578
- Closes #1495 - Move
in1dmulti
intoarkouda/pdarraysetops.py
by @Ethan-DeBandi99 in #1557 - Closes #1583: Optimize string indexing by removing extra interior domain creations by @bmcdonald3 in #1582
- Closes #1574 - Rename
SegmentedArray
toSegmentedString
by @Ethan-DeBandi99 in #1576 - Closes #702 - Multidimensional Object Support for HDF5 by @Ethan-DeBandi99 in #1552
- Closes #1584 - Use the non-deprecated methods from
util
by @Ethan-DeBandi99 in #1586 - Closes #1220 - Implementing Chapel's Random module to resolve modulo bias by @joshmarshall1 in #1542
- Closes #1580: Update
CONTRIBUTING.md
by @pierce314159 in #1581 - Closes #1519: Add strip method to Strings by @jeichert60 in #1569
- Closes #1594: Add -s to make commands in compopts and remove .SILENT by @bmcdonald3 in #1595
- Closes #1588: Fix
resolve_scalar_dtype(x) == "int64" or "uint64"
bug by @pierce314159 in #1590 - Closes #1591 - add
client_dtype_test
topytest.ini
by @Ethan-DeBandi99 in #1592 - Closes #1598 Interval lookup improvements by @reuster986 in #1599
- Closes #1609 - Intermittent Failures of
test_is_ipv4
Corrected by @Ethan-DeBandi99 in #1610 - Closes #1572: Update modular build process to initialize modules on command line by @bmcdonald3 in #1606
- Closes #1600 -
SegArray.to_ndarray()
returnslist
by @Ethan-DeBandi99 in #1612 - Closes #1627 -
sphinx==5.0.2
by @Ethan-DeBandi99 in #1629 - Fixes #1607: Allow
MultiIndex
to have mixed dtypes by @pierce314159 in #1608 - Closes #1550 - Removing Chapel unit tests from CI to improve build time by @joshmarshall1 in #1628
- Fixes #1620: Fix uint and float binops by @pierce314159 in #1623
- Closes #1531 and #1605: Update tests to use
assertListEqual
and add.to_list
methods by @pierce314159 in #1596 - Closes #1614 - Corrects
DataFrame
Index Error on.append()
by @Ethan-DeBandi99 in #1615 - Closes #1587 - Add JSON format argument support for server messages by @Ethan-DeBandi99 in #1619
- Closes #1637: Create test for floor division edge cases by @pierce314159 in #1640
- Closes #1622 - Value mismatches with numpy in operations test by @Ethan-DeBandi99 in #1633
- Closes #1645 -
floorDivisionHelper
inline proc by @Ethan-DeBandi99 in #1646 - Closes #1625 Setting sphinx>=5.1.1 by @Ethan-DeBandi99 in #1644
Full Changelog: v2022.07.08...v2022.07.28
Release Notes v2022.07.08
Release Notes 2022-07-08
Major updates:
- Issues #1543, #1545, #1546, #1547 - Recommend Chapel
1.27.0
and drop support for Chapel1.24
- Issues #1525 and #1529 - Update to require
numpy>=1.22.2
- Issue #1301 - Updates
CONTRIBUTING.md
to specify latest guidelines for contributing - Issue #1386 - Adds generic
unregister_by_name()
method - Issue #1527 - Adds
is_ipv4()
andis_ipv6()
for ipaddresses - Issue #1548 - Fixes
interval_lookup
bug and enable multi-array input - Issue #1512 - Fixes bug in
Series
methods withMultiIndex
Minor fixes:
- Issue #1535 - Optimizes
lookup
algorithm - Issue #1518 - Adds
to_title
andis_title
methods toStrings
- Issue #1540 - Adds
.objType
and.dtype
to Categorical - Issue #1565 - Adds
Index.is_unique
- Issues #1478 and #1465 - Add typechecking to
Series
andJoin
- Issue #1360 - Updates
ak.save_all()
to utilize append - Issue #1520 - Updates
make
to recompile Arrow - Issue #1511 - Fixes
make install-deps
inconsistencies - Issue #1362 - Fixes truncation of
BitVector
values inSeries
display - Issue #1353 - Makes
size()
an alias for the currentcount()
- Issue #1536 - Switches to using a lowLevelLocalizingSlice for DF indexing message
- Issues #1514 and #1555 - Rename
hash()
tosiphash()
and remove trailing comma from 1 tuple for compatibility with Chapel main
Auto-generated release notes
- Closes #1386 - Generic
unregister_by_name()
method by @joshmarshall1 in #1510 - Closes #1514: Remove trailing comma from 1 tuples in array creation message by @bmcdonald3 in #1513
- 1478 typechecking join by @jeichert60 in #1502
- Closes #1360: Utilize append functionality in
ak.save_all()
by @jeichert60 in #1507 - Closes #1520: Always recompile Arrow when running make or make check-arrow by @bmcdonald3 in #1521
- Closes #1511 -
make install-deps
inconsistencies by @Ethan-DeBandi99 in #1517 - Closes #1512: Bug in
Series
methods withMultiIndex
by @pierce314159 in #1516 - Closes #1465: Add typechecking to
Series
by @jeichert60 in #1477 - Closes #1362 - Truncation of
BitVector
values inSeries
display by @joshmarshall1 in #1524 - Closes #1353: Make
size()
an alias for the currentcount()
by @jeichert60 in #1515 - Closes #1301: Update
CONTRIBUTING.md
by @pierce314159 in #1523 - Closes #1525 - security alert for
numpy<=1.21.6
by @Ethan-DeBandi99 in #1526 - Closes #1529: Quick fix for np > 1.21 by @pierce314159 in #1530
- Closes #1535: More efficient
lookup
algorithm by @reuster986 in #1532 - Closes #1536: Switch to using a lowLevelLocalizingSlice for DF indexing message by @bmcdonald3 in #1537
- Closes #1518: Add
to_title
andis_title
methods toStrings
by @jeichert60 in #1534 - Closes #1527 - Add
is_ipv4()
andis_ipv6()
for ipaddresses by @Ethan-DeBandi99 in #1538 - Closes #1540 - Adds
.objType
and.dtype
to Categorical by @Ethan-DeBandi99 in #1541 - Recommend Chapel 1.26.0 and use it for CI testing by @ronawho in #1544
- Quiet deprecation warnings for Chapel 1.27 by @ronawho in #1553
- Rename
SegString.hash()
toSegString.siphash()
by @ronawho in #1556 - Closes #1546: Recommend Chapel 1.27.0 and use it for CI testing by @ronawho in #1554
- Closes #1548: Fix
interval_lookup
bug and enable multi-array input by @pierce314159 in #1549 - Closes #1565 - Add
Index.is_unique
by @Ethan-DeBandi99 in #1566 - Drop support for Chapel 1.24 by @ronawho in #1560
- Closes #1545: Add portability testing to older chpl versions by @ronawho in #1562
Full Changelog: v2022.06.13...v2022.07.08
Release Notes v2022.06.13
Release Notes 2022-06-13
Major updates:
- Issues #1505 and #1506 - Fix
Series
loss of support forIndex
andmulti-index
- Issue #1491 - Adds
uint64
support tosearch_intervals()
- Issue #1077 - Updates dtypes to better support scalars (8, 16, and 32 bit)
- Issue #612 - Changes minimum supported python version to Python 3.8
- Issue #1470 - Improves
ak.Index
(addsak.Index.values
, improves argument validation, etc) - Issue #1448 - Cleans up
util.py
(movesmost_common()
intoGroupBy
and deprecates functionality duplicated elsewhere in arkouda)
Minor fixes:
- Issue #1486 - Adds Flake8 to CI
- Issues #1480, #1481, and #1479 - Fix deprecation and Pandas cast warnings
- Issues #1499, #1371 and PR #1504 - Update requirements for Pandas and mypy, and swap channel order in conda yaml files
- PR #1490 - Updates additional files to avoid using SysCTypes, CPtr
- PR #1498 - Start timing array transfer again
Auto-generated release notes
- Closes #1480 & #1481 - Deprecation Warnings by @Ethan-DeBandi99 in #1483
- Closes #612 - Upgrade to Python 3.8 by @Ethan-DeBandi99 in #1487
- Closes #1470 -
ak.Index
Updates by @Ethan-DeBandi99 in #1482 - Closes #1486 - Add Flake8 to CI by @Ethan-DeBandi99 in #1488
- Closes #1448 -
util.py
cleanup by @joshmarshall1 in #1484 - Closes #1479 - Pandas Cast Warning by @Ethan-DeBandi99 in #1485
- Start timing array transfer again by @bmcdonald3 in #1498
- Update additional files to avoid using SysCTypes, CPtr by @bradcray in #1490
- Closes #1371 - Setting mypy back to >=0.931 by @joshmarshall1 in #1489
- fix channel order in conda yml file by @mhmerrill in #1504
- Closes #1505 & #1506 by @Ethan-DeBandi99 in #1508
- Closes #1491 -
search_intervals()
not working foruint64
by @Ethan-DeBandi99 in #1494 - Closes #1499 - Update Requirements for Pandas by @Ethan-DeBandi99 in #1500
- 1077 update dtypes to better support scalars by @hokiegeek2 in #1492
Full Changelog: v2022.06.06...v2022.06.13
Release Notes v2022.06.06
Release Notes 2022-06-06
Major updates:
- Issues #1390, #1428, and #1471 - Make
parquet
a dependency - Issues #1412, #1415, #1420, #1429, #1431, #1435, #1439, #1457, #1455, #1459 - Apply
black
andisort
to arkouda's client code, benchmarks, and tests. Ensure reformatting meetsPEP8
usingflake8
- Issue #1350 - Adds
ak.DataFrame.isin
method - Issues #953 and #1450 - Add Import/Export Functionality between arkouda and pandas
- Issue #1167 - Adds
inplace
parameter todrop
,rename
, andreset_index
inDataframe
- Issue #1456 - Enables saving with a
SegArray
column and addsfile_format
todf.save()
- Issues #1122, #1337, #1381, #1382, #1389, #1399- Add
register
andattach
functionality toGroupby
;is_registered
toSeries
,Segarray
, andIndex
; and generic attach support forCategorical
,Segarray
, andSeries
- Issues #1302, #1377, and #1454 - Update setup instructions and add
YAML
files for creating user and developer conda environments - Issue #1452 - Adds support for
argmin
andargmax
onbool
values - Issue #1387 - Fixes bug with relational binops on booleans
- Issue #1393 - Reimplements
assume_sorted
shortcut forGroupBy
andunique
left out of refactor - Issue #1394 - Forces
ak.concatenate
to preserve fancy pdarray types (BitVector
,IPv4
, ...) - Issue #1406 - Fixes bug where
ak.lookup
assumes ordered input - Issue #1401 - Fixes bug in DataFrame
save_table
- Issues #1384 and #1379 - Fix instances of accessing index as as a column, allow making
Series
from aStrings
, and allow indexing aDataframe
with a tuple of column names
Minor fixes:
- Issue #1363 - Adds more robust argument validation to
ak.Series
init - Issues #1404 and #1446 - Improve radix sort performance and testing
- Issues #1398 and #1424 - Work on DataFrame Display Performance Issue
- PR #1444 - Optimizes
domain_intersection()
when both domains are non-strided - Issue #1417 - Limits multi-locale CI testing
- Issue #1426 - Fixes allocated buffer size for akVersion
- Issue #1433 - Sets
mypy==0.931
- PR #1438 - Updates Math module to be compatible with changes to Chapel
- Issue #1437 - Removes
CyclicDist
as an option for pdarrays
Auto-generated release notes
- Closes #1337 - Adding support to generic attach for categorical and segarray by @joshmarshall1 in #1342
- 1379 Fixes
df.col
by @reuster986 in #1380 - Closes #1302 - .yml files for setup of user and developer conda environment by @joshmarshall1 in #1344
- 1377 provide RHEL installation instructions by @hokiegeek2 in #1378
- Fixes #1384: Change
df['index']
todf.index
inDataframe
by @reuster986 in #1383 - #1122 - Add Register and Attach functionality to GroupBy by @joshmarshall1 in #1327
- Fixes #1387: Boolean comparison binops by @pierce314159 in #1388
- Closes #1390: Missing Parquet skip in test by @jeichert60 in #1392
- Closes #1389: Bug fix for categorical._akNAcode with generic attach by @joshmarshall1 in #1391
- Closes #1393: Implement
assume_sorted
shortcut forGroupBy
andunique
by @pierce314159 in #1396 - 1406 Fix bug in ak.lookup by @reuster986 in #1407
- Part of #1381 - Added
SegArray.is_registered
method by @joshmarshall1 in #1400 - Closes #1394: Fancy pdarray type preservation in
ak.concatenate
by @pierce314159 in #1402 - Closes # 1398 - DataFrame Display Server-Side Performance Issue by @Ethan-DeBandi99 in #1409
- Closes #1417: Limit multi-locale CI testing by @ronawho in #1418
- Closes #1399: Removing version check from
ak.util.register
by @joshmarshall1 in #1403 - Closes #1381: Adding
is_registered()
to Series and Index by @joshmarshall1 in #1410 - Closes #1412 - Add .flake to gitignore by @Ethan-DeBandi99 in #1413
- Closes #1401 - DataFrame
save_table
bug by @Ethan-DeBandi99 in #1414 - Closes #1415 -
arkouda/accessor.py
PEP8 formatting by @Ethan-DeBandi99 in #1416 - Closes #1420 - /converter PEP8 Formatting by @Ethan-DeBandi99 in #1421
- Closes # 1167 -
ak.DataFrame
Addinplace
parameter by @Ethan-DeBandi99 in #1423 - Closes #1424 - Update DataFrame Nightly Plot to Plot Seconds by @Ethan-DeBandi99 in #1425
- Fix allocated buffer size for akVersion by @bmcdonald3 in #1427
- Closes #1429 -
pydoc/_ext/ak_sphinx_extensions.py
PEP8 Formatting by @Ethan-DeBandi99 in #1430 - Closes #1431 -
src/ServerModulesGen.py
PEP8 Formatting by @Ethan-DeBandi99 in #1432 - Closes #1433 - Set
mypy==0.931
by @Ethan-DeBandi99 in #1434 - Closes #1435 -
arkouda/infoclass.py
PEP8 Formatting by @Ethan-DeBandi99 in #1436 - Adjust source files to be compatible with changes I made to the Math module by @lydia-duncan in #1438
- Closes #1439 -
util/test
PEP8 Formatting by @Ethan-DeBandi99 in #1440 - Closes #953 - Import/Export Functionality by @Ethan-DeBandi99 in #1408
- Optimize domain_intersection() when both domains are non-strided by @bradcray in #1444
- Make some improvements to the standalone sort test by @ronawho in #1445
- Use swap operator in radix sort instead of assignment operator by @bmcdonald3 in #1419
- Closes #1450 - Import/Export Delete Folder on Startup by @Ethan-DeBandi99 in #1451
- Closes #1363: ak.Series with only values and more robust typechecking by @jeichert60 in #1441
- Closes #1350 - Add
ak.DataFrame.isin
by @Ethan-DeBandi99 in #1449 - Revert "Closes #1350 - Add
ak.DataFrame.isin
" by @pierce314159 in #1462 - Closes #1350
ak.DataFrame.isin
by @Ethan-DeBandi99 in #1463 - Closes #1428- Parquet as Dependency by @Ethan-DeBandi99 in #1453
- Remove CyclicDist as an option for pdarrays by @bmcdonald3 in #1464
- Closes #1457: PEP8 Formatting of
arkouda
directory by @pierce314159 in #1468 - Closes #1454 - Align dependencies by @jeichert60 in #1466
- Closes #1455 pep8 formatting
tests
by @Ethan-DeBandi99 in #1458 - Closes #1459 benchmark pep8 by @Ethan-DeBandi99 in #1461
- Closes #1471 - Update Install Doc by @Ethan-DeBandi99 in #1473
- Closes #1382: Adding
Series
support to generic attach by @joshmarshall1 in #1443 - Closes #1456 -
df.save()
withSegArray
Column by @Ethan-DeBandi99 in #1467 - Closes #1452 - Support for
argmin
andargmax
onbool
values. by @Ethan-DeBandi99 in #1469
New Contributors
- @jeichert60 made their first contribution in #1392
Full Changelog: v2022.05.09...v2022.06.06
Release Notes v2022.05.09
Release Notes 2022-05-09
Major updates:
- Issues #1365 and #1366 - Create nonexistent Parquet and HDF5 files when writing with mode='append'
- Issue #1331 - Align Parquet save/load with HDF5
Minor fixes:
- Issue #1351 - Aligns
ak.Series.shape
with pandas functionality - Issue #1367 - Removes akutil subdirectory since it has been merged into the main arkouda python package
- Issues #1345 and #1369 - Add DataFrame Display benchmark to nightly performance charts
Auto-generated release notes
- Closes #1351 - make ak.Series.shape work as it does in pandas by @Ethan-DeBandi99 in #1359
- Closes #1345 DataFrame Benchmark added to Nightly by @Ethan-DeBandi99 in #1346
- Closes #1331 - Parquet alignment with HDF5 by @Ethan-DeBandi99 in #1354
- Closes #1367: Remove akutil subdirectory by @pierce314159 in #1368
- Closes #1369 - Dataframe Nightly Test Issue by @Ethan-DeBandi99 in #1370
- Create nonexistent parquet files when writing with mode='append' by @bmcdonald3 in #1358
- Create nonexistent HDF5 files when writing with mode='append' by @bmcdonald3 in #1364
Full Changelog: v2022.05.05...v2022.05.09
Release Notes v2022.05.05
Release Notes 2022-05-05
Major updates:
- Issue #1008 - Introduces Multi-dimensional
ArrayView
objects with indexing and slicing - Issue #846 - Enables calling
unique
on lists/tuples of arrays and significantly improves performance ofGroupBy
on a large number of arrays - PR #1313 - Adds functions for setting explicit categories and standardizing categories across multiple
Categorical
s and adds an N/A value toCategorical
- Issue #1280 - Adds multiarray setops support for all
groupable
types - Issue #1086, #1282 - Adds
SegArray
register/attach and a genericak.attach
method - Issues #1273 and #1310 - Incorporate akutil plotting and import akutil updates (
Dataframe
to handle aggregate ops, additionalSeries
functionality, and addition ofAccessor
classes) - Issues #1311, #1317, #1318, and #1339 - Fix bugs in
array
dtype on python lists,uint64
pdarray ops, string comparison with empty string, and Dataframe index with logical indexing - PRs #1288, #1293, #1307, and #1299 and Issues #1303, #1324, #1229, and #1295 - Unify file reading functions into a single
read
function; addfile_format
keyword to read, save, and load functions; add aget_null_indices
function for Parquet files; and improves efficiency of save
Minor fixes:
- Issue #1300 - Adds a candidate DataFrame Display and a benchmark to compare performance with the current method
- Issues #1275, #1199, and PR #1343 - Update Installation documentation for WSL, add dynamic versioning into docs, and add an arkouda talk to README
- Issues #1290 and #1325 - Update documentation for
ak.histogram
and add documentation for segarray setops - Issues #1268, #1236, #1230, and #1286 - Clean up code in strings, dataframes, and groupby tests
- PR #1323 - Adds sanitizer check for C++ compilation in Makefile
- Issue #1315 - Adds better error handling in DataFrame.dtypes
Auto-generated release notes
- Closes #1086, Generic Attach Method by @joshmarshall1 in #1269
- Closes #1268, Strings object name initialization by @joshmarshall1 in #1271
- Closes 1275 - Updates to install guide for WSL by @joshmarshall1 in #1278
- Closes #1236 - Cleanup of arange unnecessary arguments by @joshmarshall1 in #1281
- Closes #1230 - Updated
mode.lower() in 'append' / 'truncate'
by @joshmarshall1 in #1289 - Closes #1290 Add segarray setops to documentation. by @Ethan-DeBandi99 in #1291
- Closes #1286:
test_broadcast_*
issue ingroupby_test
by @pierce314159 in #1292 - Unify various file reading functions into a single read function by @bmcdonald3 in #1288
- Add a file_format keyword argument to save by @bmcdonald3 in #1293
- Closes #1280 -
groupable
support for multiarray setops by @Ethan-DeBandi99 in #1294 - Closes #1229 and #1295 - Updating save checks in parquet/hdf5 msg files to be more efficient by @joshmarshall1 in #1296
- Closes #1273 akutil.plotting to arkouda by @Ethan-DeBandi99 in #1285
- Add a get_null_indices function for Parquet files by @bmcdonald3 in #1299
- Resolves #1311:
ak.array
dtype bug for python lists by @pierce314159 in #1312 - Update strings object to match new save function convention by @bmcdonald3 in #1307
- Closes #1315 make DataFrame.dtypes handle missing cases better by @Ethan-DeBandi99 in #1316
- Resolves #1317 and #1318: Fix
uint64
pdarray ops andString != ''
bug by @pierce314159 in #1321 - Add sanitizer check for C++ compilation in Makefile by @bmcdonald3 in #1323
- Closes #1325 - update documentation for ak.histogram to reflect that bin edges are now returned by @Ethan-DeBandi99 in #1326
- Closes #1303 save pdarray parquet by @Ethan-DeBandi99 in #1309
- Closes #1282 SegArray Register/Attach by @Ethan-DeBandi99 in #1320
- Closes #1324 -
load_all()
usefile_format
by @Ethan-DeBandi99 in #1329 - Initial Multi-dim ArrayView by @pierce314159 in #1142
- Categorical updates by @reuster986 in #1313
- #1199 - Dynamic versioning for quickstart documentation by @joshmarshall1 in #1332
- Closes #1300 - Improve Performance of DataFrame Display by @Ethan-DeBandi99 in #1334
- Closes #1339 - Dataframe index set incorrectly on logical indexing by @Ethan-DeBandi99 in #1340
- Update README.md by @mhmerrill in #1343
- Closes #1310 - Import akutil updates by @Ethan-DeBandi99 in #1341
- 846 refactor unique to handle groupby by @reuster986 in #1330
New Contributors
- @joshmarshall1 made their first contribution in #1269
Full Changelog: v2022.04.15...v2022.05.05
Release Notes v2022.04.15
Release Notes 2022-04-15
Major updates:
- Issue #1218 - Extends pdarray setops to work on multiple pdarrays
- Issue #1197 - Adds Segarray setops functionality
- Issue #1234 - Removes
ls_hdf
in favor of a genericls
which automatically handles hdf5 or parquet files - Issue #1265 - Adds
to_upper/to_lower
andis_upper/is_lower
functionality to Strings - PRs #1222, #1227, #1233 - Add support for writing string Parquet files and an append mode for Parquet file writing
- Issue #1272 - Changes
ak.histogram
to behave likeakutil.hist
- Issue #1256 - Moves
akutil.join
functionality into arkouda - Issues #1133, #1210 and, #1279 - Fix uint64 indexing and broadcasting errors
- Issue #1260 - Enables GroupBy with one of more boolean pdarrays
- Issue #1240 - Updates client dtype classes to be uint compatible and enables
ak.ip_address
to accept python lists - Issue #1154 - Adds
dtype
parameter toak.array
Minor fixes:
- Issue #1174 - Fixes randint range bug
- Issue #1190 - Updates
ak.Dataframe
to use the Index object - Issues #1039 and #1241 - Improves performance for String methods (peel, stick, and substring search)
- Issue #1251 and PR #1243 - Update README install instructions and adds documentation for
--saveUsedModules
flag - PRs #1219 and #1231 - Improves performance of parquet read and append
Auto-generated release notes
- Make regex calls compatible with chapel 1.26 by @ronawho in #1216
- Closes #1133 and #1210: Support indexing with uint64 by @pierce314159 in #1217
- Optimize calculation of Parquet column byte sizes for string reads by @bmcdonald3 in #1219
- Closes #1174 ak.randint() fails for ranges greater than 2**63 by @Ethan-DeBandi99 in #1214
- Closes #1154: dtype parameter for
ak.array
by @pierce314159 in #1223 - Closes #1209 fix deprecation warning by @Ethan-DeBandi99 in #1226
- Add initial support for writing string Parquet files by @bmcdonald3 in #1222
- Add append mode for Parquet file writing by @bmcdonald3 in #1227
- Bulk append values in Parquet and switch some places to int64 by @bmcdonald3 in #1231
- Closes #1232: Add Parquet string appending by @bmcdonald3 in #1233
- Unify HDF5 and Parquet ls calls into a single function by @bmcdonald3 in #1224
- Closes #1190 Index for Indexing DataFrame by @Ethan-DeBandi99 in #1235
- Fix append Parquet test when running with more than 10 locales by @bmcdonald3 in #1237
- Add documentation for saveUsedModules flag by @bmcdonald3 in #1243
- Closes #1039 Update peel/stick to use aggregation by @Ethan-DeBandi99 in #1228
- Closes #1218 extend pdarray setops to work for multiple arrays by @Ethan-DeBandi99 in #1225
- Revert "Closes #1218 extend pdarray setops to work for multiple arrays" by @mhmerrill in #1246
- Add null checking for strings by @bmcdonald3 in #1244
- Update null test file to be uncompressed by @bmcdonald3 in #1250
- Closes #1251 - Reconcile the README.md TOC sections describing install between Mac, Linux, and Windows by @Ethan-DeBandi99 in #1252
- Closes #1241: Use
computeOnSegments
forsubstringSearch
by @pierce314159 in #1248 - Part of #1254: Adds typechecked to pdarraycreation methods by @pierce314159 in #1255
- Closes #1256 -
join.py
from akutil to arkouda by @Ethan-DeBandi99 in #1258 - Fix server crash when reading string columns from multiple locales by @bmcdonald3 in #1262
- 1240 Uint64 compatibility and other improvements to client dtype classes by @reuster986 in #1264
- Closes #1218 Extend
pdarray
setops to work for multiple arrrays by @Ethan-DeBandi99 in #1266 - Closes #1265: Add
to/is_lower
andto/is_upper
methods toStrings
by @pierce314159 in #1274 - Closes #1260 - GroupBy w/ Booleans by @Ethan-DeBandi99 in #1270
- Closes #1267 - unexpected results with GroupBy() when 2nd element is string array by @Ethan-DeBandi99 in #1277
- Closes #1272 - akutil.hist move to ak.histogram. by @Ethan-DeBandi99 in #1276
- Resolves #1279 - Add
uint64
support forbroadcast
by @pierce314159 in #1283
Full Changelog: v2022.03.15...v2022.04.15