{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":386459754,"defaultBranch":"master","name":"deduplicate-text-datasets","ownerLogin":"google-research","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2021-07-16T00:24:25.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/43830688?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1708748154.0","currentOid":""},"activityList":{"items":[{"before":"fcf7432891032537354310b50d48509055e3ee64","after":"4e9888ac3f95dc4f6169867a04c4c19df02dafe3","ref":"refs/heads/master","pushedAt":"2024-05-21T17:47:38.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"carlini","name":"Nicholas Carlini","path":"/carlini","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1269300?s=80&v=4"},"commit":{"message":"Fix issue #45:out of range bug (#46)\n\nAuthor: Shuo Zhang \r\nDate: Mon May 20 00:25:05 2024 -0400\r\nCommitter: Shuo Zhang ","shortMessageHtmlLink":"Fix issue #45:out of range bug (#46)"}},{"before":null,"after":"056a28299bf0b9567dfabd7c25bf12e94887e8ff","ref":"refs/heads/dependabot/pip/tensorflow-2.11.1","pushedAt":"2024-02-24T04:15:54.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"dependabot[bot]","name":null,"path":"/apps/dependabot","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/29110?s=80&v=4"},"commit":{"message":"Bump tensorflow from 2.9.0 to 2.11.1\n\nBumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.9.0 to 2.11.1.\n- [Release notes](https://github.com/tensorflow/tensorflow/releases)\n- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)\n- [Commits](https://github.com/tensorflow/tensorflow/compare/v2.9.0...v2.11.1)\n\n---\nupdated-dependencies:\n- dependency-name: tensorflow\n dependency-type: direct:production\n...\n\nSigned-off-by: dependabot[bot] ","shortMessageHtmlLink":"Bump tensorflow from 2.9.0 to 2.11.1"}},{"before":"31ecaf3237e34b398010f3207f4847fc6746ae4b","after":"fcf7432891032537354310b50d48509055e3ee64","ref":"refs/heads/master","pushedAt":"2024-02-24T04:15:11.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"carlini","name":"Nicholas Carlini","path":"/carlini","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1269300?s=80&v=4"},"commit":{"message":"Merge branch 'master' of github.com:google-research/deduplicate-text-datasets","shortMessageHtmlLink":"Merge branch 'master' of github.com:google-research/deduplicate-text-…"}},{"before":"b64556afc62a968bd73c4fd67f6b185bc54daa40","after":"31ecaf3237e34b398010f3207f4847fc6746ae4b","ref":"refs/heads/master","pushedAt":"2024-02-24T03:39:21.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"carlini","name":"Nicholas Carlini","path":"/carlini","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1269300?s=80&v=4"},"commit":{"message":"Adding possibility to load an HF-dataset (#6)\n\n* HF datasets in load_dataset, some additional arguments\r\n\r\n* structure\r\n\r\n* function HF-datasets integration\r\n\r\n* removed changes to load_dataset.py\r\n\r\n* iterating without loading dataset in memory\r\n\r\n* casting to bytes at writing time, multiprocessing\r\n\r\n* fixed oversight\r\n\r\n* support for folder of on-disk files to load dataset","shortMessageHtmlLink":"Adding possibility to load an HF-dataset (#6)"}},{"before":"ad86c7f65ac626581fe3a4277106309bc6b50c23","after":"b64556afc62a968bd73c4fd67f6b185bc54daa40","ref":"refs/heads/master","pushedAt":"2023-10-27T06:21:40.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"carlini","name":"Nicholas Carlini","path":"/carlini","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1269300?s=80&v=4"},"commit":{"message":"Add new features for quickly finding potential training data","shortMessageHtmlLink":"Add new features for quickly finding potential training data"}}],"hasNextPage":false,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEUCMuLgA","startCursor":null,"endCursor":null}},"title":"Activity · google-research/deduplicate-text-datasets"}