Using only a few instances for guessing sorting keys #3812

matt-gardner · 2020-02-19T18:13:40Z

Partial fix for #3664 (and maybe enough to close that issue entirely as good enough). This just grabs the first 10 instances and uses those to guess a sorting key.

matt-gardner · 2020-02-19T18:26:29Z

I know this is in a class that you're looking to remove @DeNeutoy, but it's a small change, and the code here will likely still be needed in whatever you're replacing this file with.

matt-gardner · 2020-02-19T18:29:37Z

Also, I accidentally requested a review before checking if the tests were good enough; I was going to see in the coverage report if this needed any more tests. I'm somewhat inclined to leave it, though, as you'll be changing things here, unless there's a particular test you'd find helpful that you'd like me to add.

DeNeutoy · 2020-02-22T00:25:49Z

Closing, as I incorporated this in #3700

This reverts commit 8a08899.

* example for feedback * remove all existing multiprocessing * sneak torch datasets inside DatasetReader * lint * trainer_v2, We Love To See It * datasets have index_with now, not iterators * use iter, custom collate function in allennlp wrapper * we don't even need the data in the trainer anymore * all trainer tests passing * black * make find learning rate work * update test fixtures to new config * get train command tests mostly working * lazily construct samplers, index lazy datasets * update some fixtures * evaluate tests passing * all command tests passing * lint * update model test case, common and module tests passing * fix test interdependence introduced by #3762 * more test interdependence * tests tests tests * remove unnecessary brackets Co-Authored-By: Santiago Castro <[email protected]> * update a chunk of the configs * fix archival test, couple more configs * rm pointless gan test * more tests passing * add current state of from params changes * Revert "add current state of from params changes" This reverts commit ad45659. * updated understanding of Lazy * add discussion of None comparison to Lazy * lint * it's a hard doc life * pull samplers into separate file * more docs updates * fold in #3812 * remove torch dataset * add example to lazy * rename to collate * no kwargs * Revert "fold in #3812" This reverts commit 8a08899. * don't break up dataset * add comment to iterable dataset len * improve docstrings, build dataloader using partial_objects * flake * give dataloader a default implementation * safer default for DataLoader init * more coherent dir structure * update imports * add a test for the BucketBatchSampler * split bucket sampler into own file, tests * PR comments Co-authored-by: Santiago Castro <[email protected]>

matt-gardner added 2 commits February 18, 2020 07:15

Using only a few instances for guessing sorting keys

5a6b1a6

mypy

55df4d3

matt-gardner requested a review from DeNeutoy February 19, 2020 18:25

matt-gardner mentioned this pull request Feb 22, 2020

unintuitive sorting_keys scoping in new token indexers #3664

Closed

DeNeutoy added a commit to DeNeutoy/allennlp that referenced this pull request Feb 22, 2020

fold in allenai#3812

8a08899

DeNeutoy closed this Feb 22, 2020

matt-gardner deleted the sorting_key branch February 22, 2020 15:06

DeNeutoy added a commit to DeNeutoy/allennlp that referenced this pull request Feb 23, 2020

Revert "fold in allenai#3812"

da3b1b4

This reverts commit 8a08899.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using only a few instances for guessing sorting keys #3812

Using only a few instances for guessing sorting keys #3812

matt-gardner commented Feb 19, 2020

matt-gardner commented Feb 19, 2020

matt-gardner commented Feb 19, 2020

DeNeutoy commented Feb 22, 2020

Using only a few instances for guessing sorting keys #3812

Using only a few instances for guessing sorting keys #3812

Conversation

matt-gardner commented Feb 19, 2020

matt-gardner commented Feb 19, 2020

matt-gardner commented Feb 19, 2020

DeNeutoy commented Feb 22, 2020