Let ResultSet.getObject() return Float.NaN in case of numeric NaN #1304

tseylerd · 2018-09-21T16:26:15Z

Postgres numeric column can store NaN value which is not convertable to BigDecimal. getBigDecimal() throws exception in this case. But when we call getObject() to retrieve the data we expect any type of object as return value. So it's convenient to return Float.NaN in this case.

Related DataGrip issue: https://youtrack.jetbrains.com/issue/DBE-5141

codecov-io · 2018-09-21T17:07:43Z

Codecov Report

Merging #1304 into master will increase coverage by 0.02%.
The diff coverage is 80%.

@@             Coverage Diff              @@
##             master    #1304      +/-   ##
============================================
+ Coverage     68.49%   68.52%   +0.02%     
- Complexity     3868     3873       +5     
============================================
  Files           178      178              
  Lines         16184    16185       +1     
  Branches       2639     2639              
============================================
+ Hits          11086    11090       +4     
+ Misses         3865     3864       -1     
+ Partials       1233     1231       -2

vlsi · 2018-09-21T17:12:23Z

Well, technically speaking PostgreSQL has real (~Float), double precision (~Double) and numeric (~???).

@tseylerd , is there any strong reason you think numeric should be converted to Float.NaN rather than Double.NaN ?

tseylerd · 2018-09-21T17:24:50Z

@vlsi No. I think it's not very important. Which one is better? I don't know. But i know that anything is better than the exception. So if you prefer double let's make it double.

davecramer · 2018-09-21T18:39:06Z

@tseylerd hard to say which one is better.
I don't have a preference.

vlsi · 2018-09-21T18:40:34Z

I don't have a preference.

Ok, ThreadLocalRandom.current().nextBoolean() ? Float.NaN : Double.NaN then

davecramer · 2018-09-21T18:42:10Z

Good point, but I don't see a way around this? Dave Cramer

…

On Fri, 21 Sep 2018 at 14:40, Vladimir Sitnikov ***@***.***> wrote: I don't have a preference. Ok, ThreadLocalRandom.current().nextBoolean() ? Float.NaN : Double.NaN then — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#1304 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAYz9l1qUa_4EPKuaX6-DDznxMAuIbmAks5udTKjgaJpZM4W0fN9> .

vlsi · 2018-09-21T18:44:36Z

Let's just stick with Double.NaN for "extra precision".

vlsi · 2018-09-21T19:36:10Z

@tseylerd , could you please clarify expected release date?
E.g. when next DataGrip release is planned?

tseylerd · 2018-09-24T10:41:43Z

@vlsi We plan to release in November. But we can update the driver separately as soon as it comes out.

arahman5 · 2020-05-29T11:46:17Z

Hi guys, wondering if someone can help me with this. I have currently got the 42.2.12 version of the driver and I am still coming across the issue of org.postgresql.util.PSQLException: Bad value for type BigDecimal : NaN. This is happening only when I am selecting a column which has Numeric data type in the Postgres database and that column also contains NaN values. Correct me if I have got this wrong but I understand that this particular issue was targeted at solving this very problem. I would appreciate any ideas on how this can be fixed? Many thanks in advance!

davecramer · 2020-05-29T12:00:25Z

@arahman5 according to the test case in this PR this should not be happening. Can you provide us with a small test case that reproduces the problem ?

arahman5 · 2020-05-29T12:18:08Z

@davecramer yes sure. I load data from a table in a database by using the below code

val DF = spark.read
        .format("jdbc")
        .option("url", "jdbc:postgresql:https://" + args(1) + ":" + args(2) + "/" + args(3))
        .option("dbtable", "tablename")
        .option("user", args(4))
        .option("password", args(5))
        .load()

The schema of this table is as follows where the value column contains decimal values and NaN values.

root
 |-- id: long (nullable = true)
 |-- file_id: long (nullable = true)
 |-- name: string (nullable = true)
 |-- time: timestamp (nullable = true)
 |-- value: decimal(38,18) (nullable = true)

I then execute the below piece of code and the code crashes with always the same error message that I previously mentioned. Calling the show method is supposed to print out the values of the value column in the console but I always get the error message

DF.createOrReplaceTempView("tablename")

val nonull = spark.sql("""select value from tablename
                                     order by time desc
                                     limit 1000""").show()

What is even more strange is that, when I am running the below SQL query in pgadmin4 I get a result containing no NaN value at all which is the correct result. However, when I execute the same exact query in scala then I continue to get the error message, thus implying the query is not getting executed in the same manner when run in scala and the NaN values are not getting filtered.

select value from tablename where NOT value = 'NaN'
order by time desc
limit 1000

arahman5 · 2020-05-29T14:08:06Z

Digging further into the java code, it seems like the issue is coming from the isBinary function to start with which is being called within the getNumeric function code in the script PgResultSet.java.

The isBinary function relies on the format of the field (in my case format of the column value) to be equal to BINARY_FORMAT and my understanding is that only if it is true then the major part of the code dealing with NaN values in a Numeric column will be executed. In the script BINARY_FORMAT has been defined to have a value of 1 whereas for the case I am presenting the format of the Numeric column value is always 0 so its not a binary column but still has NaN values in it.

Why does the column need to be binary in order for the remaining of the code as shown in the image above to run?

davecramer · 2020-05-29T14:12:35Z

@arahman5 this is very hard to read. github allows you to send links to code which is much easier to read

davecramer · 2020-05-29T14:15:27Z

better question is why there is:
private Number getNumeric(int columnIndex, int scale, boolean allowNaN) throws SQLException

and is only ever called with allowNaN false.

fix: return Double.NaN for 'NaN'::numeric

15bc93e

vlsi force-pushed the master branch from 3d25733 to 15bc93e Compare September 21, 2018 17:20

fix: support NaN in simple query mode

13ef203

vlsi added this to the 42.2.6 milestone Sep 21, 2018

vlsi merged commit 265f22b into pgjdbc:master Sep 21, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Let ResultSet.getObject() return Float.NaN in case of numeric NaN #1304

Let ResultSet.getObject() return Float.NaN in case of numeric NaN #1304

tseylerd commented Sep 21, 2018

codecov-io commented Sep 21, 2018

vlsi commented Sep 21, 2018 •

edited

Loading

tseylerd commented Sep 21, 2018

davecramer commented Sep 21, 2018

vlsi commented Sep 21, 2018

davecramer commented Sep 21, 2018 via email

vlsi commented Sep 21, 2018

vlsi commented Sep 21, 2018

tseylerd commented Sep 24, 2018

arahman5 commented May 29, 2020

davecramer commented May 29, 2020

arahman5 commented May 29, 2020 •

edited

Loading

arahman5 commented May 29, 2020

davecramer commented May 29, 2020

davecramer commented May 29, 2020

Let ResultSet.getObject() return Float.NaN in case of numeric NaN #1304

Let ResultSet.getObject() return Float.NaN in case of numeric NaN #1304

Conversation

tseylerd commented Sep 21, 2018

codecov-io commented Sep 21, 2018

Codecov Report

vlsi commented Sep 21, 2018 • edited Loading

tseylerd commented Sep 21, 2018

davecramer commented Sep 21, 2018

vlsi commented Sep 21, 2018

davecramer commented Sep 21, 2018 via email

vlsi commented Sep 21, 2018

vlsi commented Sep 21, 2018

tseylerd commented Sep 24, 2018

arahman5 commented May 29, 2020

davecramer commented May 29, 2020

arahman5 commented May 29, 2020 • edited Loading

arahman5 commented May 29, 2020

davecramer commented May 29, 2020

davecramer commented May 29, 2020

vlsi commented Sep 21, 2018 •

edited

Loading

arahman5 commented May 29, 2020 •

edited

Loading