XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.

Python 627 109 Updated Jan 4, 2023

google-research-datasets / ToTTo

ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: given a Wikipedia table and a set of highlighted table cells, p…

434 37 Updated May 28, 2024

iai-group / sigir2020-tablesum

Summarizing and Exploring Tabular Data in Conversational Search (SIGIR '20)

9 1 Updated May 25, 2020

tensorflow / ranking

Learning to Rank in TensorFlow

Python 2,736 473 Updated Mar 18, 2024

google / jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 29,757 2,716 Updated Sep 8, 2024

google / trax

Trax — Deep Learning with Clear Code and Speed

Python 8,033 814 Updated Aug 21, 2024

yangliuy / HybridNCM

Code on A Hybrid Retrieval-Generation Neural Conversation Model (CIKM 2019)

Python 22 3 Updated Sep 27, 2019

kenchan0226 / keyphrase-generation-rl

Code for the ACL 19 paper "Neural Keyphrase Generation via Reinforcement Learning with Adaptive Rewards"

Python 107 15 Updated Jul 3, 2020

google-research / google-research

Google Research

Jupyter Notebook 33,784 7,820 Updated Sep 6, 2024

donnemartin / system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 268,890 45,442 Updated Aug 7, 2024

easonnie / semanticRetrievalMRS

This is the repo for the paper "Revealing the Importance of Semantic Retrieval for Machine Reading at Scale".

Python 59 11 Updated Nov 25, 2019

Dod-o / Statistical-Learning-Method_Code

手写实现李航《统计学习方法》书中全部算法

Python 10,956 2,861 Updated Nov 25, 2023

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 9,865 2,233 Updated Sep 7, 2024

jiesutd / NCRFpp

NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.

Python 1,885 447 Updated Jun 30, 2022

RasaHQ / rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 18,597 4,595 Updated Aug 14, 2024

declare-lab / MELD

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

Python 787 200 Updated Mar 10, 2024

sebastianruder / NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,524 3,610 Updated Jul 28, 2024

lancopku / pkuseg-python

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Python 6,503 984 Updated Nov 5, 2022

thunlp / THULAC-Python

An Efficient Lexical Analyzer for Chinese

Python 2,003 336 Updated Jan 31, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dai Zhuyun (戴竹韵) AdeDZY

Achievements

Achievements

Highlights

Block or report AdeDZY

Stars

google-research / dialog-inpainting

google-research / language

Ehesp / Chrome-Extension-Twitter-Bootstrap-3-Template

oaqa / FlexNeuART

facebookresearch / DPR

thunlp / KernelGAT

domiyanyue / HeterogenousComputingBlogs

allenai / scifact

allenai / longformer

Georgetown-IR-Lab / covid-neural-ir

facebookresearch / LAMA

google-research / xtreme