Skip to content

Tags: noloerino/modin

Tags

0.11.1

Toggle 0.11.1's commit message

Verified

This tag was signed with the committer’s verified signature.
devin-petersohn Devin Petersohn
Modin 0.11.1

This release contains significant amounts of improvements to the
maintainability of the code.

Key Features and Updates
------------------------
* Stability and bugfixes
  * Read_feather: cast columns from Index to list (bb618df)
  * Refactor read_csv skiprows parameter processing (6a47229)
  * Fix usage of modin.HDFStore in modin.read_hdf func (15d3fba)
  * Always keep 'by' data in groupby.__getitem__ (c2b399a)
* Pandas API implementations and improvements
  * pandas.read_gbq: remove deprecated parameters (13e0af0)
* Expansion in testing
* Dependencies
  * Unpin boto3 in setup.py (aee31ba)
* Omnisci backend enhancements
  * Fix arrow execution for empty frame. (b9a22cc)

Contributors this release
-------------------------

The following users contributed code to Modin since the last release.

@prutskov
@gshimansky
@ienkovich
@anmyachev
@devin-petersohn

0.11.0

Toggle 0.11.0's commit message

Verified

This tag was signed with the committer’s verified signature.
devin-petersohn Devin Petersohn
Modin 0.11.0

This release contains significant amounts of improvements to the
maintainability of the code and bugfixes. Multiple new additions
were made to the pandas API coverage.

Key Features and Updates
------------------------
* Stability and bugfixes
  * Fix __setitem__ when key is unhashable list (57bcfc1)
  * Fix slice_shift when index has duplicates (cf47333)
* Pandas API implementations and improvements
  * experimental ray implementation of read_pickle, to_pickle (cdf47ac)
  * Add support for Series.str.__getitem__ (7631def)
  * skiprows support added for read_csv (b8098bd)
  * support local variables in query and eval (0a64275)
  * Add storage_options param for read_parquet (da2ad79)
  * Fixed 'value_counts' implementation (ebd07dd)
  * Warn user about heterogeneous data presence during read_csv (15f168c)
  * Add attribute api in modin.pandas (9865616)
* XGBoost enhancements
  * Add async execution support for Modin xgb.predict (ab58612)
  * Fix processing of evals parameter in Modin xgb (9bef861)
* Developer API enhancements
  * Add additional parameters for from_partitions (ca43e8d)
  * add a way to get custom shapes (984e68f)
  * use MODIN_MEMORY to specify memory for dask engine (8a3b105)
* Expansion in testing
* Documentation improvements
* pandas 1.3.3 support (f91ee0a)
* Omnisci backend enhancements
  * add value_counts benchmark for OmniSci backend (600fa26)
  * Get rid of Ray when using OmniSci engine (ac8e2d1)
  * Support columns renaming in arrow execution. (3079783)
  * support logical 'and' and 'or' in filters. (ecaab1b)
  * fix dtypes for OmniSci dataframes. (02a34eb)
  * update Arrow to 3.0 for OmniSci backend (411b322)

Contributors this release
-------------------------

The following users contributed code to Modin since the last release.

@YarShev
@anmyachev
@dchigarev
@vnlitvinov
@gshimansky
@prutskov
@amyskov
@krfricke
@fexolm
@devin-petersohn

0.10.2

Toggle 0.10.2's commit message

Verified

This tag was signed with the committer’s verified signature.
devin-petersohn Devin Petersohn
Modin 0.10.2

This release contains minor bugfixes since 0.10.1. The supported pandas version was
upgraded to the latest pandas release (1.3.2). For a detailed breakdown of the bugs
fixed, please look at the changelog.

Contributors this release
-------------------------

The following users contributed code to Modin since the last release.

@YarShev
@anmyachev
@dchigarev
@vnlitvinov
@ienkovich
@prutskov
@amyskov
@Lozovskii-Aleksandr
@Garra1980
@Rubtsowa
@alexlenail
@devin-petersohn

0.10.1

Toggle 0.10.1's commit message

Verified

This tag was signed with the committer’s verified signature.
devin-petersohn Devin Petersohn
Modin 0.10.1

This release contains minor bugfixes since 0.10.0.

Contributors this release
-------------------------

The following users contributed code to Modin since the last release.

@YarShev
@ckw017
@anmyachev
@dchigarev
@vnlitvinov
@gshimansky
@prutskov
@amyskov
@Rubtsowa
@devin-petersohn

0.10.0

Toggle 0.10.0's commit message

Verified

This tag was signed with the committer’s verified signature.
devin-petersohn Devin Petersohn
Modin 0.10.0

This release contains significant amounts of improvements to the
maintainability of the code. Documentation was added for the low
level code at many levels.

Another key addition this release is the two major interface
additions: Spreadsheet and SQL.

Key Features and Updates
------------------------
* Many Documentation updates
* Stability and bugfixes
* Spreadsheet Interface
* SQL Interface
* Ray 1.4 support
* pandas 1.2.4 support
* Performance improvements
  * Improvements for XGBoost
  * Some groupby calls
  * map operations
* Metadata management improvements
* Improvements to Testing and CI
* pandas API ehancements
  * fillna
* Omnisci backend enhancements

Contributors this release
-------------------------

The following users contributed code to Modin since the last release.

@YarShev
@anmyachev
@dchigarev
@krfricke
@tkeech1
@vnlitvinov
@btseytlin
@gshimansky
@prutskov
@todd-yu
@kvu35
@amyskov
@richardlin047
@igalink
@devin-petersohn

0.9.1

Toggle 0.9.1's commit message

Verified

This tag was signed with the committer’s verified signature.
devin-petersohn Devin Petersohn
Modin 0.9.1 Release Notes

This release contained a number of bugfixes. All users are recommended
to update to the latest version.

Key Bugfixes
------------
* FIX-modin-project#2798: Fix number of partitions for dataframe on a cluster (modin-project#2828)
* FIX-modin-project#2859: Fix metadata calculation on reduce operations (modin-project#2860)
* FIX-modin-project#2857: Correctly handle identical index binary operations (modin-project#2862)
* FIX-modin-project#2869: Fix setting NPartitions via put (modin-project#2870)
* Fix Pickle support for DataFrame and Series (modin-project#2835)

Contributors this release
-------------------------

The following users contributed code to Modin since the last release.

@mGalarnyk
@YarShev
@anmyachev
@gshimansky
@prutskov
@dchigarev
@RehanSD
@devin-petersohn

Thank you!

0.9.0

Toggle 0.9.0's commit message

Verified

This tag was signed with the committer’s verified signature.
devin-petersohn Devin Petersohn
Modin 0.9.0 Release Notes

This release contained >80 commits from 18 contributors. With this
release
we have initial support for a spreadsheet interface and many
improvements
to performance and stability.

New Functionality
-----------------
* Spreadsheet Interface
* XGBoost Support Improvement
* Read multiple CSV files at once with `read_csv_glob`

Key Bugfixes
------------
* Parquet Metadata issue fixed: modin-project#1476

Documentation
-------------
* Documentation enhancements and improvements. More to come soon!

Dependencies
------------
* Support for Pandas 1.2+

Contributors this release
-------------------------

The following users contributed code to Modin since the last release.

@gshimansky
@mzjp2
@kvu35
@tirkarthi
@abykovsk
@noah-kuo
@amyskov
@RehanSD
@williamma12
@alphavector
@richardlin047
@todd-yu
@anmyachev
@dchigarev
@vnlitvinov
@YarShev
@prutskov
@devin-petersohn

0.8.3

Toggle 0.8.3's commit message

Verified

This tag was signed with the committer’s verified signature.
devin-petersohn Devin Petersohn
Modin 0.8.3 release notes

This release contains a number of bugfixes and testing/code quality improvements. See details below for the updates since last release.

Bugfixes + Pandas Concordance (🐛 + 🐼)
----------------------------------------
* FIX-modin-project#2386: add new location for import ray functions (modin-project#2387)
* FIX-modin-project#2380: don't ignore lengths parameter for dask engine (modin-project#2381)
* FIX-modin-project#2390: Fix inserting Series into DataFrame (modin-project#2391)
* FIX-2200: Enable Calcite by default in OmniSci backend (modin-project#2385)
* FEAT-modin-project#2363: fix index name setter in OmniSci backend (modin-project#2379)
* FIX-modin-project#2406: filter dictionary aggregation keys to limit them to keys only present in current partition (modin-project#2407)
* FIX-modin-project#2473: Some configuration values should not be transformed (modin-project#2476)
* FIX-modin-project#2402: Fix read_excel when files come from older windows (modin-project#2403)
* Ensure excel reader closes file if it is passed as path (modin-project#2514)
* FIX-modin-project#2442: fixed Series assignment with different indices (modin-project#2443)
* Fix indices when reading Excel files in parallel (modin-project#2526)
* FIX-modin-project#2527: Use random name for hdf file test, clean file after testing (modin-project#2528)
* FIX-modin-project#2408: Fix read_csv and read_table args when used inside a decora… (modin-project#2486)
* Fix .loc[] assignment for Modin Series (modin-project#2555)
* FIX-modin-project#2482: improved handling non-str 'by' (modin-project#2548)
* Fix loc/iloc assignments when columns are selected (modin-project#2536)
* FIX-modin-project#2559: Ignore files from /proc/ when detecting file leaks (modin-project#2560)
* FIX-modin-project#2566: Ensure `Series.unique` does not return a scalar when there is only one unique value (modin-project#2567)
* FIX-modin-project#2543: fixed handling 'as_index' at groupby dictionary renaming aggregation (modin-project#2592)

New Functionality ✨
--------------------
* FEAT-modin-project#2375: implementation of multi-column groupby aggregation (modin-project#2461)
* FEAT-modin-project#2013: merge_asof that is a little more efficient (modin-project#2510)
* FIX-modin-project#2540: add __iter__ implementation (modin-project#2541)

Code Quality + Testing 💯
-------------------------
* TEST-modin-project#2289: Columns, Index Locations and Names parameters of read_csv (modin-project#2319)
* REFACTOR-modin-project#2397: remove redundant assigment (modin-project#2398)
* FIX-modin-project#2450: fix CI recipe (modin-project#2449)
* FEAT-modin-project#2444: add docker file for nyc on omnisci (modin-project#2445)
* FIX-modin-project#2456: update taxi queries with .copy usage (modin-project#2457)
* FEAT-modin-project#2447: add docker file for census on omnisci (modin-project#2448)
* REFACTOR-modin-project#2467: Convert internal base dataframe objects to ABC (modin-project#2468)
* FIX-modin-project#2459: Updated TeamCity tests image to use Ray as base image (modin-project#2460)
* TEST-modin-project#2488: Increase commitlint message length limit to 88 characters from 70 (modin-project#2489)
* TEST-modin-project#2290: Cover by tests General Parsing Configuration parameters of read_csv (modin-project#2331)
* TEST-modin-project#2291: Cover by tests NA and Missing Data Handling parameters of read_csv (modin-project#2337)
* TEST-modin-project#2294: add iteration parameters for read_csv tests (modin-project#2477)
* FIX-modin-project#2463: Added test with callable functions as aggregate argument (modin-project#2503)
* TEST-modin-project#2296: Error Handling parameters of read_csv (modin-project#2501)
* TEST-modin-project#2295: Cover by tests Quoting, Compression, and File Format parameters of read_csv (modin-project#2495)
* FIX-modin-project#2374: remove extra code; add pandas way to handle duplicate values in reindex func for binary operations (modin-project#2378)
* TEST-modin-project#2297: Cover by tests Internal parameters of read_csv (modin-project#2502)
* TEST-modin-project#2509: Io tests refactoring (modin-project#2523)
* FIX-modin-project#2550: remove decorators usage for asv tested functions (modin-project#2551)

Backend enhancements + Performance 🚀
-------------------------------------
* FIX-modin-project#2453: Remove sorting indices for equal values in `Series.value_counts` (modin-project#2454)
* FIX-modin-project#2169: avoid unnecessary index access in groupby (modin-project#2469)
* FIX-modin-project#2313: improved handling non-numeric types at 'mean' when 'axis=1' (modin-project#2535)
* FEAT-modin-project#2520: add most important operations for asv benchmarks (modin-project#2539)
* FEAT-modin-project#2491: optimized groupby dictionary aggregation (modin-project#2534)
* FEAT-modin-project#2553: add ability to run microbenchmarks for old Modin version (modin-project#2554)

Documentation 📃
----------------
* DOCS-modin-project#2413: Add examples page to documentation (modin-project#2414)
* DOCS-modin-project#2415: Add comparisons section to documentation with stubs (modin-project#2416)
* DOCS-modin-project#2417: add sklearn example (modin-project#2425)
* DOCS-modin-project#2421: Fixes bad link on contributing from architecture.rst (modin-project#2427)
* DOCS-modin-project#2419: Updated CONTRIBUTING.rst (modin-project#2423)
* DOCS-modin-project#2426,DOCS-modin-project#2424: Fixed two issues (modin-project#2431)
* DOCS-modin-project#2420: Changed documentation to numpydoc style (modin-project#2429)
* DOCS-modin-project#2433: Updated README.md with modin_vs_dask.md doc (modin-project#2435)
* DOCS-modin-project#2437: Add documentation contrasting Modin and Dask (modin-project#2441)
* DOCS-modin-project#2439: Add Documentation for Modin vs. pandas (modin-project#2487)
* DOCS-modin-project#2436: Explicit local / single node backend (modin-project#2483)
* DOCS-modin-project#2518: add asv usage topic (modin-project#2549)
* Fix taxi-runner.py cluster example (modin-project#2557)
* DOCS-modin-project#2578: fix simple typo, parition -> partition (modin-project#2573)

Dependencies
------------
* FIX-modin-project#2388: Fixed requirements for omnisci binaries (modin-project#2389)
* FIX-modin-project#2458: fix 'psutil' install (modin-project#2452)
* FEAT-modin-project#2479: integrate asv (modin-project#2484)
* FIX-modin-project#2524: Update pandas version to 1.1.5 (modin-project#2525)
* FIX-modin-project#2498: Fix possible number of partitions for Dask engine (modin-project#2532)
* FEAT-modin-project#2236: Handling of space limited Ray Plasma directories (modin-project#2547)
* Switch to Ray from conda-forge (modin-project#2562)
* FIX-modin-project#2572: fixed arrow version in OmniSci dependencies (modin-project#2571)
* FIX-#0000: pin xlrd<=1.2.0 (modin-project#2594)

Contributors this release
-------------------------

The following users contributed code to Modin since the last release.

@reshamas
@vfdev-5
@mohdkashif93
@abdulelahsm
@ashahba
@raphaelauv
@richardlin047
@timgates42
@ienkovich
@itamarst
@amyskov
@vnlitvinov
@dchigarev
@YarShev
@anmyachev
@gshimansky
@devin-petersohn

0.8.2

Toggle 0.8.2's commit message

Verified

This tag was signed with the committer’s verified signature. The key has expired.
devin-petersohn Devin Petersohn
Modin 0.8.2 release notes

The Modin 0.8.2 release contains a significant amount of code cleanup
and bugfixes. The release contains total of 61 commits closing 59 issues.
The highlights of this release are listed below. For the full release
notes, please run

git log --pretty=oneline 0.8.1.1...0.8.2

Highlighted commits
-------------------

* FIX-modin-project#2369: Update pandas version to 1.1.4 (modin-project#2371)
* FIX-modin-project#2365: Fix `Series.value_counts` when `dropna=False` (modin-project#2366)
* FEAT-modin-project#1844: upgrade pyarrow to 1.0 (modin-project#2347)
* FEAT-modin-project#2271: Add implementation of `groupby.shift` (modin-project#2323)
* DOCS-modin-project#2334: Add tutorials to main repo (modin-project#2335)
* FIX-modin-project#2311: fixed performance bottleneck at reduction operations (modin-project#2314)
* FIX-modin-project#2133 modin-project#2265: Fix binary operations for modin frames in case when partitioning isn't aligned (modin-project#2256)
* FEAT-modin-project#2303: fix OmniSci aggregates and add mean (modin-project#2304)
* FEAT-modin-project#2299: support value_counts in OmniSci backend. (modin-project#2300)
* FEAT-modin-project#2282: support DataFrame.[count|max|min|sum] for OmniSci backend (modin-project#2283)
* FIX-modin-project#1988: Fix indexing over Series via `loc` (modin-project#2262)
* FIX-modin-project#1965: Fix `count` func in case `numeric_only`==True (modin-project#2228)

Contributors this release
-------------------------

The following users contributed code to Modin since the last release.

@kvu35 (First Time contributor) ⭐️
@ienkovich
@prutskov
@amyskov
@vnlitvinov
@dchigarev
@YarShev
@anmyachev
@gshimansky
@devin-petersohn

0.8.1.1

Toggle 0.8.1.1's commit message

Verified

This tag was signed with the committer’s verified signature. The key has expired.
devin-petersohn Devin Petersohn
Modin 0.8.1 release notes

Dependencies
------------
* FIX-modin-project#2113: Ray 1.0 compatibility (modin-project#2114)