Skip to content

Releases: hrs/docsim

v0.1.7

28 Jun 23:12
@hrs hrs
f020d53
Compare
Choose a tag to compare

Changelog

  • f020d53 Update Go 1.19 -> 1.20
  • 67ac1d5 Ensure stemming doesn't panic on certain inputs
  • ff20037 Fix manpage path in make install task
  • ab22d17 Map terms to int IDs to save memory
  • 03eaeef Fix lintian errors for deb package
  • 1e9e12e Document installing through Homebrew

v0.1.6

27 Jun 23:19
@hrs hrs
0c5e2a3
Compare
Choose a tag to compare

Changelog

  • 0c5e2a3 Build deb, rpm, and homebrew packages on release
  • 40640c9 Add short flags for file, stdin, limit, & verbose
  • a586fed Log parsing files when --verbose
  • 0e833ad Remove --omit-query flag (always omit the query)
  • 76efdbd Extract findParsableFiles func from ParseCorpus
  • 4f93fd8 Yield 0.0 score if query has a zero norm
  • 6d75471 Reword README

v0.1.5

22 Jun 16:02
@hrs hrs
073bed8
Compare
Choose a tag to compare

Changelog

  • 073bed8 Ensure all Unicode characters are parsed correctly
  • 68956e7 Treat curly apostrophes like single quotes
  • aca623b Add a version number badge
  • cfc163f Test on all PRs and all pushes to main
  • 61deb08 If a file's missing, don't panic but show error

v0.1.4

20 Jun 15:26
@hrs hrs
eea93fa
Compare
Choose a tag to compare

Changelog

  • eea93fa Reorganize interface to take default string query
  • b84381c Run tests as part of release workflow

v0.1.3

08 Jun 18:45
@hrs hrs
Compare
Choose a tag to compare

Changelog

  • 21e8fad Extract hasPermittedExtension function
  • 0b4b3b6 Warn: docsim doesn't understand .ignore files
  • c763c34 Add a -version flag
  • 357bef5 Brag about reading from STDIN
  • 7685e70 Explicitly allow certain file extensions
  • bae7ca9 Handle missing mime.types database
  • d9f9410 Set up CI action to run tests
  • 1265de9 Backfill some document parsing tests
  • e5dc8e5 Populate config directly from flags
  • 88c3ea3 Don't bother writing to a tempfile
  • 2be7753 Ensure make build searches subdirectories

v0.1.2

28 May 19:15
@hrs hrs
c7656f9
Compare
Choose a tag to compare

Changelog

  • c7656f9 Move most logic into corpus package
  • 4d61ccd Backfill tests for TF-IDF and norm calculations

v0.1.1

23 May 22:46
@hrs hrs
89f557c
Compare
Choose a tag to compare

Changelog

  • 89f557c Document the --stoplist flag
  • 609ebe7 Add a --stoplist flag for custom stoplists
  • 6d73861 Equip Config with a Stoplist
  • aedb7c2 Add --follow-symlinks flag
  • 6db2d75 Link to release page in README

v0.1.0

23 May 18:22
@hrs hrs
474d74c
Compare
Choose a tag to compare

Changelog

  • 474d74c Add a build release task
  • 97208f8 Write a simple README
  • ed0c97d Don't include symlinks in the corpus
  • a8ae6a2 Split words manually instead of by regexp
  • cf33283 Check that this is a text file before opening
  • 45f20a4 Document use with non-English documents
  • 2b745c5 Replace deprecated use of ioutil with io
  • 71d71f7 Backfill stoplist tests
  • 911c082 Move corpus parsing out of main
  • 85c23ad Backfill similarity tests
  • 9b10bb2 If no query file's provided, read from STDIN
  • 0913d52 Add --no-stemming flag to skip stemming
  • b41825e Add --no-stoplist flag to skip stoplist
  • adb900a Default to searching current directory
  • 1fa579d License with the GPLv3
  • 68c9692 Add a simple Makefile
  • 69dee47 Include a simple manual page
  • 2be26f1 Move code into a lib directory
  • da14b80 Save memory by clearing term freqs after TF-IDF
  • 30caf51 Recursively search directories for files
  • 25f1c88 Add a --verbose flag
  • 750d5f2 Add --omit-target flag to skip target in results
  • f81720b Just print errors to stderr, don't log
  • 08612d3 Only search files that seem to contain text
  • 144d473 Add flags for sort order, limit, showing scores
  • 43be014 Sort results, low-to-high
  • 3cf6192 Display search results more readably
  • 27eff9a Search the corpus with a query document
  • 371c11d Maintain TF-IDF weights for each document
  • 471fcd1 Corpus stores its inverse document frequency
  • 4b7b1ac Documents store their term frequency
  • 5358a2d Maintain a corpus of documents
  • e023141 Track count of term occurrences
  • cc1a73d Memoize stemming results in a local cache
  • 765d3af Stem words with the Porter Stemmer
  • bee8fd0 Don't include words in a standard English stoplist
  • 7959b68 Split tokens but retain contractions
  • c5a08da Instantiate a document containing words
  • e36485e Get target file and search files from args
  • 8cfa20f Hello, docsim.