idea #139

xxcriticxx · 2018-02-22T12:56:33Z

can you code something that finds domains on this list that aren't on any other list?

usually this way i find lots of false positives

funilrys · 2018-02-22T13:33:25Z

@mitchellkrogza Give me 24 hours and I'll write a script, unless you have a one-line command (I know how much you love them 😹) for @xxcriticxx 👍

mitchellkrogza · 2018-02-22T13:52:48Z

Great idea, I'm sure I can do a one liner to achieve that, will look in the morning

mitchellkrogza · 2018-02-22T14:44:17Z

I'll test this in the morning:

grep -Fxv -f first-file.txt second-file.txt

or

comm -23 second-file-sorted.txt first-file-sorted.txt

xxcriticxx · 2018-02-23T00:34:50Z

something like Diff Checker but i don't know if it would handle 2 million lines

mitchellkrogza · 2018-02-23T10:55:25Z

The comm command line should do what you need, I ran it against Ultimate Hosts and Badd Boyz and it took all of 5 seconds

xxcriticxx · 2018-02-23T11:40:44Z

@mitchellkrogza can you run it against a few larger lists and post the output here?

funilrys · 2018-02-23T11:48:10Z

I think I can't do better than 5 seconds 😸

mitchellkrogza · 2018-02-23T11:52:25Z

@xxcriticxx send me one list for now to test against Ultimate to find any uncommon entries

mitchellkrogza · 2018-02-23T12:09:01Z

@xxcriticxx @funilrys I just ran Ultimate Hosts against itself to show how quick it is. So we are talking 1,602,081 domains. 2 seconds to complete ... no jokes. You can see result.txt is 0 kb, which means there are no odd ones out 😄 As @funilrys says, I sure do LOVE my one liners and Linux RULES !!!

xxcriticxx · 2018-02-23T20:34:38Z

i will play with this on the weekend

mitchellkrogza · 2018-02-24T06:49:29Z

@xxcriticxx great, please share your findings with us

xxcriticxx · 2018-02-25T12:04:16Z

comm won't work

comm -13 list1.txt list2.txt > result.txt
comm: file 2 is not in sorted order
comm: file 1 is not in sorted order

mitchellkrogza · 2018-02-25T12:34:10Z

comm will work, you need to make sure both lists are sorted first.

sort -u list1.txt -o list1.txt
sort -u list2.txt -o list2.txt
comm -13 list1.txt list2.txt > result.txt

mitchellkrogza · 2018-02-25T12:34:59Z

Unfortunately the lists need to be sorted and also cleaned of stray comments (like # Whatever) and empty lines.
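A minimal sketch of that clean-then-compare workflow, for illustration only. The `clean_list` helper, the sample entries, and the output file names are assumptions that do not come from this thread, and taking the last field of each line assumes hosts-style entries such as `0.0.0.0 domain`. Forcing `LC_ALL=C` on both `sort` and `comm` keeps the two tools agreeing on what "sorted" means.

```shell
#!/bin/sh
# Sketch only: sample data and helper name are hypothetical.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Two tiny sample lists with comments, blank lines, and hosts-style prefixes.
printf '%s\n' '# my list' '0.0.0.0 b.example' '' '0.0.0.0 a.example  # note' '0.0.0.0 c.example' > list1.txt
printf '%s\n' '# other list' '127.0.0.1 c.example' 'a.example' '' 'd.example' > list2.txt

# Drop carriage returns and comments, skip empty lines, keep the last field
# (the domain), then sort uniquely so comm accepts the result.
clean_list() {
  tr -d '\r' < "$1" | sed 's/#.*//' | awk 'NF { print $NF }' | LC_ALL=C sort -u
}

clean_list list1.txt > list1.clean
clean_list list2.txt > list2.clean

# -23 hides column 2 (only in list2) and column 3 (in both): entries only in list1.
LC_ALL=C comm -23 list1.clean list2.clean > only_in_list1.txt
# -13 hides columns 1 and 3: entries only in list2.
LC_ALL=C comm -13 list1.clean list2.clean > only_in_list2.txt
```

Because the cleaned copies are written to separate files, the original lists stay untouched, so sorting them does not change the published files.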

xxcriticxx · 2018-02-25T12:48:51Z

if i sort them they won't be the same, and stripping # will take hours

mitchellkrogza · 2018-02-25T13:16:42Z

There are very easy scripts to remove comments and empty lines and clean a file. Send me your list1.txt and list2.txt to my email (zip them) - [email protected] and I will show you tomorrow, and also show you how quick and easy it is using command line tools

xxcriticxx · 2018-02-25T13:24:49Z

list1 is your list
list2 is stevenblack list

mitchellkrogza · 2018-02-25T13:31:55Z

Will run them against each other tomorrow

xxcriticxx · 2018-02-26T14:17:35Z

@mitchellkrogza send me a nice picture of nature, i am in need of new wallpapers

mitchellkrogza · 2018-02-26T14:22:41Z

What size, and what kind? I have far too much stuff :) Best to check my Facebook Page and drop me an email. https://www.facebook.com/MitchellKrogPhotography

xxcriticxx · 2018-02-26T18:13:08Z

i have 2x Acer G277HL 1920 x 1080

smed79 · 2018-03-01T01:50:26Z

Extract lines in file1 not found in file2

diff --new-line-format="" --unchanged-line-format="" file1 file2 > file3

Extract lines from file2 already found in file1

awk 'NR==FNR{lines[$0]++; next} $0 in lines' file1 file2 > file3
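Negating the membership test in an awk two-pass one-liner gives the opposite set: lines of file2 that do not appear in file1. The sketch below is illustrative, with made-up sample entries; it compares whole lines (`$0`) and needs no sorting, at the cost of holding file1 in memory.

```shell
#!/bin/sh
# Sketch only: sample data is hypothetical.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

printf '%s\n' 'b.example' 'a.example' 'c.example' > file1
printf '%s\n' 'd.example' 'c.example' 'a.example' 'e.example' > file2

# First pass (NR==FNR) records every line of file1 as an array key;
# second pass prints only the file2 lines that were never recorded.
# Caveat: with an empty file1, NR==FNR also holds for file2, so nothing prints.
awk 'NR==FNR { seen[$0]; next } !($0 in seen)' file1 file2 > file3
```

The output keeps the order of file2, which can be convenient when the list should not be re-sorted.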

mitchellkrogza · 2018-03-01T07:22:05Z

Thanks @smed79 👍

funilrys · 2018-03-13T18:51:52Z

With the new repository structure/system, once we have merged all sources, we remove all duplicates before generating the files 😸

xxcriticxx · 2018-03-14T09:49:15Z

The idea was to find something here that doesn’t exist on any other list or lists
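One possible way to express that with the tools already mentioned in this thread, as a rough sketch: merge every other list into one sorted set, then keep the entries that appear only in ours. The `others/` directory, the file names, and the sample entries are assumptions for illustration, and the lists are assumed to be already reduced to one domain per line.

```shell
#!/bin/sh
# Sketch only: directory layout and sample data are hypothetical.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
mkdir others

printf '%s\n' 'a.example' 'b.example' 'c.example' 'd.example' > ours.txt
printf '%s\n' 'a.example' 'x.example' > others/list-a.txt
printf '%s\n' 'c.example' 'a.example' > others/list-b.txt

# sort accepts several input files, so all other lists merge in one step.
LC_ALL=C sort -u ours.txt > ours.sorted
LC_ALL=C sort -u others/*.txt > others.sorted

# Keep column 1 only: domains in our list that no other list contains.
LC_ALL=C comm -23 ours.sorted others.sorted > unique_to_ours.txt
```

Entries that end up in unique_to_ours.txt are the candidates worth checking for false positives.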

funilrys · 2018-03-21T08:49:15Z

Because we are going to work with big lists soon, we decided that it would be great to have a script or system which can help us find the number of domains from a list which are not already part of this repository.

So I invite you to play with https://gist.github.com/funilrys/900abd388b1f3b399a9da69e0e592fef (-h gives you the help) !!!

Have a nice day/night.

Closing.

funilrys self-assigned this Feb 22, 2018

funilrys added the enhancement label Mar 11, 2018

funilrys closed this as completed Mar 21, 2018
