Add to onboarding reproduction logs #2492

KenWuqianghao · 2024-05-09T04:06:25Z

System setup:

OS: macOS Sonoma 14.4.1
Memory: 16GB
Chip: Apple M1 Pro
Python Version: 3.12.3
Java Version: 21.0.3
Maven: 3.9.6

Suggestion: I think linking try-it instead of getting-started is more suitable as I don't see a getting-started section on the read-me.

lintool · 2024-05-10T01:00:52Z

docs/experiments-msmarco-passage.md

@@ -322,8 +316,8 @@ It turns out that optimizing for MRR@10 and MAP yields the same settings.

 Here's the comparison between the Anserini default and optimized parameters:

-| Setting | MRR@10 | MAP | Recall@1000 |
-|:------------------------------------------------|-------:|-------:|------------:|
+| Setting | MRR@10 | MAP | Recall@1000 |


I think you introduced inconsistencies here? Old table seems fine to me?

Oops, I clicked on the file and my editor did some formatting automatically. I will fix that.

lintool · 2024-05-10T01:02:14Z

docs/start-here.md

@@ -20,8 +20,8 @@ What's the problem we're trying to solve?

 This is the definition I typically give:

-> Given an information need expressed as a query _q_, the text retrieval task is to return a ranked list of _k_ texts {_d<sub>1</sub>_, _d<sub>2</sub>_ ... _d<sub>k</sub>_} from an arbitrarily large but finite collection
-of texts _C_ = {_d<sub>i</sub>_} that maximizes a metric of interest, for example, nDCG, AP, etc.
+> Given an information need expressed as a query _q_, the text retrieval task is to return a ranked list of _k_ texts {_d`<sub>`1`</sub>`_, _d`<sub>`2`</sub>`_ ... _d`<sub>`k`</sub>`_} from an arbitrarily large but finite collection


nah, I think I like original better...

Yeah same here, I didn't mean to change that. My editor did that automatically for some reason. Will fix.

lintool · 2024-05-10T01:02:31Z

thanks for the de-linting. left comments.

lintool · 2024-05-10T10:30:16Z

docs/experiments-msmarco-passage.md

@@ -89,7 +86,7 @@ On the other hand, retrieval needs to be fast, i.e., low latency, high throughpu

 With the data prep above, we can now index the MS MARCO passage collection in `collections/msmarco-passage/collection_jsonl`.

-If you haven't built Anserini already, build it now using the instructions in [anserini#-getting-started](https://github.com/castorini/anserini#-getting-started).
+If you haven't built Anserini already, build it now using the instructions in [anserini#-try-it](https://github.com/castorini/anserini?tab=readme-ov-file#-try-it).


I think "Installation" is the better link?

This reverts commit 02740ac.

This reverts commit 79ef40f.

KenWuqianghao · 2024-05-10T17:39:36Z

@lintool I have made the changes accordingly. Please let me know if anything I didn't expect breaks lol

KenWuqianghao added 2 commits May 8, 2024 23:36

Added reproduction logs

79ef40f

Change the getting started link to try it

02740ac

lintool reviewed May 10, 2024

View reviewed changes

KenWuqianghao added 4 commits May 10, 2024 13:30

Revert "Change the getting started link to try it"

7996e25

This reverts commit 02740ac.

Revert "Added reproduction logs"

0425bdc

This reverts commit 79ef40f.

Added reproduction log

8df36cb

Change the getting started link to installation

e3c1cb0

lintool approved these changes May 10, 2024

View reviewed changes

lintool merged commit 6c6d2d0 into castorini:master May 10, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add to onboarding reproduction logs #2492

Add to onboarding reproduction logs #2492

KenWuqianghao commented May 9, 2024

lintool May 10, 2024

KenWuqianghao May 10, 2024

lintool May 10, 2024

KenWuqianghao May 10, 2024

lintool commented May 10, 2024

lintool May 10, 2024

KenWuqianghao commented May 10, 2024

Add to onboarding reproduction logs #2492

Add to onboarding reproduction logs #2492

Conversation

KenWuqianghao commented May 9, 2024

lintool May 10, 2024

Choose a reason for hiding this comment

KenWuqianghao May 10, 2024

Choose a reason for hiding this comment

lintool May 10, 2024

Choose a reason for hiding this comment

KenWuqianghao May 10, 2024

Choose a reason for hiding this comment

lintool commented May 10, 2024

lintool May 10, 2024

Choose a reason for hiding this comment

KenWuqianghao commented May 10, 2024