Skip to content

Commit

Permalink
Fix .gitignore
Browse files Browse the repository at this point in the history
  • Loading branch information
EricHallahan committed Jan 7, 2023
1 parent e937fac commit d37e60e
Show file tree
Hide file tree
Showing 2 changed files with 43 additions and 2 deletions.
5 changes: 3 additions & 2 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1,3 +1,4 @@
.vscode
public*
resources*
/public/
resources*
.hugo_build.lock
40 changes: 40 additions & 0 deletions content/research/publications/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@
---
title: "Publications"
lastMod: 2022-04-20T04:00:00Z
layout: page
description: A list of EleutherAI-affiliated publications.
hideMeta: True
aliases: ["/publications"]
weight: 2
summary: An important component of research is sharing work.
---

- **Louis Castricato**\*, **Alexander Havrilla**\*, **Shahbuland Matiana**, **Michael Pieler**, Anbang Ye, Ian Yang, Spencer Frazier, and Mark Riedl. "Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning." _Preprint_, 2022. [[arXiv]](https://arxiv.org/abs/2210.07792)

- **Jason Phang**, **Herbie Bradley**, **Leo Gao**, **Louis Castricato**, and **Stella Biderman**. "EleutherAI: Going Beyond "Open Science" to "Science in the Open"." _Preprint_, 2022. [[arXiv]](https://arxiv.org/abs/2210.06413)

- **Katherine Crowson**\*, **Stella Biderman**\*, Daniel Kornis, **Dashiell Stander**, **Eric Hallahan**, **Louis Castricato**, and Edward Raff. "VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance." _Preprint_, 2022. [[arXiv]](https://arxiv.org/abs/2204.08583)

- **Sid Black**\*, **Stella Biderman**\*, **Eric Hallahan**\*, **Quentin Anthony**, **Leo Gao**, **Laurence Golding**, **Horace He**, **Connor Leahy**, **Kyle McDonell**, **Jason Phang**, **Michael Pieler**, **USVSN Sai Prashanth**, **Shivanshu Purohit**, **Laria Reynolds**, **Jonathan Tow**, **Ben Wang**, and **Samuel Weinbach**. "GPT-NeoX-20B: An Open-Source Autoregressive Language Model." In _Proceedings of the ACL Workshop on Challenges & Perspectives in Creating Large Language Models_, 2022. [[arXiv]](https://arxiv.org/abs/2204.06745) [[OpenReview]](https://openreview.net/forum?id=HL7IhzS8W5)

- **Stella Biderman** and Edward Raff. "Neural Language Models are Effective Plagiarists." _Preprint_, 2022. [[arXiv]](https://arxiv.org/abs/2201.07406)

- **Stella Biderman**, **Kieran Bicheno**, and **Leo Gao**. "Datasheet for the Pile." _Preprint_, 2022. [[arXiv]](https://arxiv.org/abs/2201.07311)

- Victor Sanh\*, Albert Webson\*, Colin Raffel\*, Stephen H. Bach\*, and 37 others (incl. **Stella Biderman** and **Leo Gao**). "Multitask Prompted Training Enables Zero-Shot Task Generalization." In _the Tenth International Conference on Learning Representations (ICLR)_, 2022. [[arXiv]](https://www.arxiv.org/abs/2110.08207) [[Dataset]](https://huggingface.co/datasets/bigscience/P3) [[Model]](https://huggingface.co/bigscience/T0pp)

- **Shahbuland Matiana**\*, **JR Smith**\*, **Ryan Teehan**\*, **Louis Castricato**\*, **Stella Biderman**\*, **Leo Gao**, and **Spencer Frazier**. "Cut the CARP: Fishing for zero-shot story evaluation." _Preprint_, 2021. [[arXiv]](https://arxiv.org/abs/2110.03111) [[GitHub]](https://github.com/EleutherAI/magiCARP)

- **Leo Gao**. "An Empirical Exploration in Quality Filtering of Text Data." _Preprint_, 2021. [[arXiv]](https://arxiv.org/abs/2109.00698)

- **Eric Alcaide**, **Stella Biderman**, Amalio Telenti, and M. Cyrus Maher. "MP-NeRF: A Massively Parallel Method for Accelerating Protein Structure Reconstruction from Internal Coordinates" _Journal of Computational Chemistry_, 2021. [[bioRxiv]](https://www.biorxiv.org/content/10.1101/2021.06.08.446214) [[GitHub]](https://github.com/EleutherAI/mp_nerf)

- **Louis Castricato**\*, **Stella Biderman**\*, Rogelio E. Cardona-Rivera, and David Thue. "Towards a Model-theoretic View of Narratives." _3rd Workshop on Narrative Understanding at NAACL-HLT 2021_, 2021. [[arXiv]](https://arxiv.org/abs/2103.12872)

- Isaac Caswell, Julia Kreutzer, and 50 others (incl. **Stella Biderman**). "Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets." _Transactions of the Association for Computational Linguistics 10, 50-72_. 2022. [[arXiv]](https://arxiv.org/abs/2103.12028)

- **Leo Gao**, **Stella Biderman**, **Sid Black**, **Laurence Golding**, **Travis Hoppe**, **Charles Foster**, **Jason Phang**, **Horace He**, **Anish Thite**, **Noa Nabeshima**, **Shawn Presser**, and **Connor Leahy**. "The Pile: An 800GB Dataset of Diverse Text for Language Modeling." _Preprint_, 2020. [[arXiv]](https://arxiv.org/abs/2101.00027) [[Dataset]](https://pile.eleuther.ai/)

- **Aran Komatsuzaki**. "Current Limitations of Language Models: What You Need is Retrieval." _Preprint_, 2020. [[arXiv]](https://arxiv.org/abs/2009.06857)

**Bold** indicates EleutherAI-affiliated author. \* indicates co-lead authors.

0 comments on commit d37e60e

Please sign in to comment.