fix: Make `get_multiple_*` methods pass tests by not halting for one bad input #128

lahdjirayhan · 2022-04-24T08:30:38Z

As laid out by the tests, get_multiple_*_air methods shouldn't halt when there is one bad input. One bad input (which in turn will make backend return error/bad response, which will make an exception raised in our part) shouldn't halt the execution of other, likely good, inputs. This is not convenient. For what it's worth, I've always treated get_multiple_* methods as convenience wrappers for their single get_*_air counterparts (otherwise I could just loop them myself).

The example in my mind is this: you are mass-collecting data for a hundred cities. One city turns out to not have AQI station and subsequently raise exception (you likely won't know this in advance). It will be annoying to have the other 99 cities halted.

I realize that this will make an odd difference between e.g. get_city_air (raises exception on bad response) and get_multiple_cities_air (returns nan row on bad response). I don't have really much to say either supporting or against.

Is this a good or common enough use case? Should I implement things differently? I'm curious to hear more.

Instead of halting/raising exceptions, the get_multiple_* methods now continue to work when encountering bad inputs. This works by catching exceptions raised by its respective get_*_air methods. Why? Because get_multiple_* (I think) is supposed to be a convenient wrapper for multiple inputs at once. It's not really convenient to break the entire execution for one bad input. For consistency, the output dataframe will have exactly one row per one input, where bad inputs get all-nan rows apart from its 'city' or 'lat'-'lon' columns.

Milind220 · 2022-04-24T17:00:32Z

@lahdjirayhan Absolutely! This is something that definitely occurred to me before, but I didn't give it enough thought. The use case that you mentioned of dozens of cities (or more) is certainly one that would be annoying without this.

I think it's a badly needed improvement. Excited that the tests are helping so much! Thanks again for making such a robust set of tests.

Milind220 · 2022-04-24T17:04:03Z

src/ozone/ozone.py

+ self.get_coordinate_air(loc[0], loc[1], df=df, params=params)
+ )
+ except Exception:
+ # NOTE: If we have custom exception we can catch it instead.


Hmm overall I do think we should have custom exceptions.

However, it's a good idea to add custom exceptions for the entirety of Ozone in one go (I'm sure there are other sections that need them). This way we can come up with a consistent set of exceptions for the package.

Milind220 · 2022-04-24T17:04:45Z

tests/test_get_multiple_city_air.py

+ # The third row should have the "nonexistent city name" intact ...
+ assert result.at[2, "city"] == "a definitely nonexistent city"
+
+ # ... and nothing else.


Milind220

Looks good to go!

lahdjirayhan added 3 commits April 24, 2022 02:14

test: Update tests for get_multiple_*

06baf4d

test: Re-record cassettes due to new requests for successful executions

be901b2

Milind220 self-assigned this Apr 24, 2022

Milind220 added the bug Something isn't working label Apr 24, 2022

Milind220 added this to the Make Ozone pass tests pt.1 milestone Apr 24, 2022

Milind220 reviewed Apr 24, 2022

View reviewed changes

Milind220 approved these changes Apr 24, 2022

View reviewed changes

Milind220 merged commit 02fd5fd into Ozon3Org:dev Apr 24, 2022

lahdjirayhan deleted the fix-get-multiple branch April 24, 2022 17:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Make `get_multiple_*` methods pass tests by not halting for one bad input #128

fix: Make `get_multiple_*` methods pass tests by not halting for one bad input #128

lahdjirayhan commented Apr 24, 2022

Milind220 commented Apr 24, 2022

Milind220 Apr 24, 2022

Milind220 Apr 24, 2022

Milind220 left a comment

fix: Make get_multiple_* methods pass tests by not halting for one bad input #128

fix: Make get_multiple_* methods pass tests by not halting for one bad input #128

Conversation

lahdjirayhan commented Apr 24, 2022

Milind220 commented Apr 24, 2022

Milind220 Apr 24, 2022

Choose a reason for hiding this comment

Milind220 Apr 24, 2022

Choose a reason for hiding this comment

Milind220 left a comment

Choose a reason for hiding this comment

fix: Make `get_multiple_*` methods pass tests by not halting for one bad input #128

fix: Make `get_multiple_*` methods pass tests by not halting for one bad input #128