Releases: dask-contrib/dask-sql

0.3.7

10 Aug 19:23
0c3a6a1

Features

  • Allow for multiple schemas (#205)
  • AutoML capabilities (#199)
  • Implement the REGR_COUNT SQL operator (#193)
  • ML model improvements: added SHOW MODELS, EXPORT MODEL, and DESCRIBE MODEL (#185, #191)
  • Implement the search and sargs operators (#184)

Bugfixes

  • Fixes for pandas 1.3.0 (#202)
  • Fix test fixture order (#194)
  • Fix a failing build, as ciso8601 is currently not pip-installable (#192)

0.3.6

16 May 17:32
bbfee5f

Bugfixes

  • Casting of literals is done by Calcite, except for strings (#178)

0.3.5

15 May 20:38

Bugfixes

  • Speed up aggregations when there are no aggregates (#174)
  • Register the lower and upper-case version of a function (#177)
  • Fix a regression in the casting logic so that values are cast only when really needed (#176)

0.3.4

13 May 18:23
90569c9

Small feature addons

  • Added correct casting and mod operation (#172)
  • Implement OVER for arbitrary windows (#164)
  • Allow starting a SQL server from a Jupyter notebook (#162)

Bugfixes and Improvements

  • Sort optimizations (#167, #173)
  • Fix the scikit-learn version in the Dockerfile
  • Add test with independent dask cluster (#165)
  • Speed up builds with mamba (#171)
  • Remove version constraints for pandas and dask as the errors were fixed upstream (#170)
  • Fixed the replacement of functions/aggregations and added a test (#169)
  • Added missing version in pom

0.3.3

30 Apr 05:53
e39616f

Small feature addons

  • Allow function reregistration (#161)
  • Upgrade the fugue dependency (#160)
  • Implement a wrapper for the prompt_toolkit session (#159)

0.3.2

13 Apr 20:19

Small feature addons

  • First working (but slow) implementation of OVER (#157)
  • Add a visualize function (#153)
  • IPython/Jupyter Magic (#146)
  • Hive/Databricks from SQL (#145)

Bugfixes and Improvements

  • Improve documentation
  • Better cross joins (#150)
  • Fix a bug which occurs when only filters are present in groupbys (#154)
  • Make testing a bit easier to type
  • Fix a warning on regexes
  • Split out the jupyter notebook integration (#152)
  • Add a pre-commit hook (#149)
  • Limit the dask version until the dask-ml problem is fixed (#147)
  • Turn off Docker image building for PRs
  • Fix integration with dbfs using the newest fsspec version (#140)
  • Show a reasonable traceback on exceptions (#142)
  • Docker image improvements (#137)
  • Support for Float (pandas extension type) and filter with NaNs (#136)

0.3.1

07 Feb 20:04
e5fac1a

Small feature addons

  • Aggregate improvements and SQL compatibility (#134)
  • New call operations (#122)
  • Added notebook with a 'Tour de dask-sql' (#119)

Bugfixes and Improvements

  • Docs improvements (#132)
  • Fix the fugue dependency (#133)
  • Pandas dependency fix (#129)
  • Added missing iris.csv data set
  • Pip installation docs improvement (#128)
  • Correctly sort NULLs (#126)
  • Importlib import (#125)
  • Do not touch already installed dask and pandas version as this may lead to incompatibilities (#123)
  • Average decimal type (#121)
  • Fixing a bug in column container copies (#120)

0.3.0

21 Jan 21:08

Features

  • Allow for an sqlalchemy and a hive cursor input (#90)
  • Allow registering the same function with multiple parameter combinations (#93)
  • Additional datetime functions (#91)
  • Server and CMD CLI script (#94)
  • Split the SQL documentation into subpages and add much more documentation (#107)
  • DROP TABLE and IF NOT EXISTS/REPLACE (#98)
  • SQL Machine Learning Syntax (#108)
  • ANALYZE TABLE (#105)
  • Random sample operators (#115)
  • Read from Intake Catalogs (#113)
  • Adding fugue integration and tests (#116) and fsql (#118)

Bugfixes

  • Keep casing for unquoted identifiers as well (#88)
  • Scalar where clauses (#89)
  • Check for the correct Java path on Windows (#86)
  • Remove # pragma once where it is not needed anymore (#92)
  • Refactor the hive input handling (#95)
  • Limit pandas version (#100)
  • Correctly handle the case where the Java version is undefined (#101)
  • Add datetime[ns, UTC] as understood type (#103)
  • Make sure to treat integers as integers (#109)
  • On ORDER BY queries, show the column names of the SELECT query (#110)
  • Always refer to a function with the name given by the user (#111)
  • Do not fail on empty SQL commands (#114)
  • Fix the random sample test (#117)

0.2.2

28 Nov 19:44

Bugfixes and Improvements

  • Use the new conda GitHub action to prevent a failed build (#85)

0.2.1

19 Nov 19:07
02e2dad

Bugfixes and Improvements

  • Increase speed and parallelism of the limit algorithm and implement descending sorting (#75)
  • Improved the ability to create (materialized) views of queries (#77)
  • Added missing __version__ variable (#79)
  • Improved Docker image (#78)
  • Allow arbitrary return types in SQL server (#76)
  • Bugfix: Added tzlocal dependencies