Skip to content

Commit

Permalink
Merge pull request #2 from chris-ha458/patch-1
Browse files Browse the repository at this point in the history
Update README.md
  • Loading branch information
soldni committed Jul 21, 2023
2 parents 95a163c + 1237d8d commit 7d9b148
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -52,11 +52,11 @@ Each document in the dataset is a dictionary with the following fields:

- *Knowledge cutoff*: 2023-01-03
- *Number of documents*: 67.56M
- *Number of whitespace-separated tokens*: 47.37M
- *Number of whitespace-separated tokens*: 47.37B

### Processing

Processing differs slightly wether it was derived from the full-text corpus (`s2orc`) or the title and abstract corpus (`s2ag`).
Processing differs slightly whether it was derived from the full-text corpus (`s2orc`) or the title and abstract corpus (`s2ag`).

#### S2ORC-derived documents

Expand Down

0 comments on commit 7d9b148

Please sign in to comment.