-
Notifications
You must be signed in to change notification settings - Fork 651
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FEAT-#4320: Add connectorx
as an alternative engine for read_sql
#4346
FEAT-#4320: Add connectorx
as an alternative engine for read_sql
#4346
Conversation
Codecov Report
@@ Coverage Diff @@
## master #4346 +/- ##
===========================================
- Coverage 87.23% 60.20% -27.04%
===========================================
Files 219 219
Lines 17927 17945 +18
===========================================
- Hits 15639 10804 -4835
- Misses 2288 7141 +4853
📣 Codecov can now indicate which changes are the most critical in Pull Requests. Learn more |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great! A couple of requests:
- If
ReadSqlEngine
isconnectorx
, butconnectorx
is not installed, we will get some errors. Can we check that connectorx is installed in the logic forread_sql
, the warn the user and resetReadSqlEngine
topandas
? - Please also enable CI tests and add
connectorx
to the test_requires file.
Thanks for the review!
Modified at here, can you take a look?
Can you give me some suggestions on how to enable CI? I modified the original Also I didn't find the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also I didn't find the test_requires file, do you mean the requirements-dev.txt file?
Yes! We changed it some time back and I forgot 😄.
@wangxiaoying We are dealing with some CI environment issues, working on fix in #4364 |
Got it! Please let me know if you need any actions from my side. |
@wangxiaoying I think it's all set, if you can rebase on the master branch, CI should pass! |
8eb1a92
to
d90e77f
Compare
Hi @YarShev , thanks for the suggestions. I tried to test on the capital letter solution but it still has the same issue. I have added two debug messages in the latest commit. As the screenshot below, the value found in SqlDispatcher is I set the |
@wangxiaoying, the issue is that parse method is run in worker processes, which do not see an updated |
@YarShev , I have updated the code following your suggestions. And the issue is resolved after passing the engine configuration to the parser! |
e8b7971
to
1a54f5c
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left some more comments. After those are fixed, I will run CI to see if the tests pass.
6c16413
to
533b252
Compare
@YarShev , I have fixed the comments. Please let me know if you need other actions from my side! |
@wangxiaoying, I've run CI and lint (commit) job failed. To pass it you should format the commit message as follows: git commit --amend in an opened up editor you should put the lines in the following order
after doing that push the changes git push -f |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left a couple of minor suggestions.
0a1f69d
to
ae39c1b
Compare
@YarShev , I've updated the commit message and applied those code suggestions. Can you take a look again? |
Some failures have occurred in CI. I restarted tests to see if it helps. |
@wangxiaoying it looks like a SQL syntax error occurred in CI: https://github.com/modin-project/modin/runs/5988352034?check_suite_focus=true |
@devin-petersohn indeed, the reason is that we remove the clauses such as |
ae39c1b
to
8c1c34a
Compare
@devin-petersohn @YarShev , we have released an alpha version for the bug fix and requirement files are updated. Can you please try a gain? |
I ran tests. Will see how it goes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is one more thing you should do. Please, add a release note including your GH nickname in docs/release_notes/release_notes-0.15.0.rst
. As an example, you can look at the changes of #4386.
8c1c34a
to
f1040f5
Compare
@YarShev Thanks for the example. I have updated the release note. I added to "Performance enhancements" but let me know if you think other sections suit this issue better : ) |
I think it is okay. Please fix conflicts. |
…r `read_sql` Co-authored-by: Devin Petersohn <[email protected]> Co-authored-by: Yaroslav Igoshev <[email protected]> Signed-off-by: wangxiaoying <[email protected]>
f1040f5
to
a645063
Compare
@YarShev Done |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@wangxiaoying, LGTM, thanks! @devin-petersohn, any other comments?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks @wangxiaoying !
What do these changes do?
Add connectorx as an alternative engine for
read_sql
. #4320I tested locally by setting the env var
READ_SQL_ENGINE
toconnectorx
and it works fine. However, when I set the config byReadSqlEngine.put("connectorx")
(like here), the result ofReadSqlEngine
retrieved in thePandasSQLParser
ispandas
. It's very likely that I missed some setup of the config. Can you take a look?flake8 modin/ asv_bench/benchmarks scripts/doc_checker.py
black --check modin/ asv_bench/benchmarks scripts/doc_checker.py
git commit -s
docs/development/architecture.rst
is up-to-date