Improve IP search; check end character #14

spacedingo · 2017-04-05T04:01:40Z

In a LAN with clients 192.x.x.1-255, comparing first character (1) will perform strcmp 254 times for say 192.x.x.14. By checking last character, it would do strcmp 26 times. In the black-hole, less is more : )
Please feel free to change anything and everything - I'm a newbie.

In a LAN with clients 192.x.x.1-255, comparing first character (1) will perform **strcmp** 254 times for say 192.x.x.14. By checking last character, it would do **strcmp** for (max) 26 times. In the black-hole, less is more : ) Please feel free to change anything and everything - I'm a newbie.

Improve IP search; check end character

DL6ER · 2017-04-05T20:18:18Z

The whole idea of comparing the first character was born for the domain comparison. Comparing the very first character can be done extremely fast (direct memory access) and does not require a call to any sophisticated string function. Hence, it can speed up the searching for a suitable domain by up to a factor of 36x.
The situation for which I implemented this was for extreme environments like out testing Pi-hole where the domain struct has > 100,000 entries (wich more than 200 million queries in FTL's memory). With this huge amount of domain entries, searching through the whole list with calling strcmp() on each entry is quite slow. Comparing the very first character is very fast.
This was never thought to be added to the clients list which should always be anyhow much smaller than this number. I only added this for the client search as well so that both functions are similar in nature.

I tested your suggestion in this extreme testing environment (for the domains array, because we have only one client there) and interestingly, it seems like it does not change anything in terms of run time. Further investigations seem to suggest that strcmp() is not much more expensive than strlen() and for domains/clients where both the first and the last letter are equal (which can happen like almost always with domain endings like .com), actually both functions have to be called to identify matches. Accordingly, I saw an increase of run time of nearly 30% for the domain finding algorithm.

Hence, I'm going to reject this pull request, because

the domain searching gets noticeably slower, and
the client searching never needed an algorithmic speed up.

I hope you understand this decision and we look forward to further contributions from you. Rest assured that I know how disappointing it can be if the first contribution to some project gets rejected, but this decision is only motivated by the two points listed above. Although your idea was good in its nature, it revealed to have adverse effects on FTL's performance.

Refactor the list module into a List enum

spacedingo added 2 commits April 5, 2017 07:49

Merge pull request #1 from spacedingo/spacedingo-patch-1

ce51b38

Improve IP search; check end character

DL6ER closed this Apr 5, 2017

Xiofett mentioned this pull request Jan 12, 2018

FTL Crashing after update any time admin console is accessed #198

Closed

3 tasks

DL6ER pushed a commit that referenced this pull request Oct 14, 2019

Merge pull request #14 from pi-hole/feature/refactor-list

30a1e79

Refactor the list module into a List enum

candrews67 mentioned this pull request Jun 17, 2020

FTL 5.0 crashes with segfault #812

Closed

jens1205 mentioned this pull request Dec 21, 2020

FTL crashed #987

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve IP search; check end character #14

Improve IP search; check end character #14

spacedingo commented Apr 5, 2017

DL6ER commented Apr 5, 2017 •

edited

Loading

Improve IP search; check end character #14

Improve IP search; check end character #14

Conversation

spacedingo commented Apr 5, 2017

DL6ER commented Apr 5, 2017 • edited Loading

DL6ER commented Apr 5, 2017 •

edited

Loading