Skip to content

Commit

Permalink
#536 related README shortening. #537 related.
Browse files Browse the repository at this point in the history
  • Loading branch information
nicolay-r committed Dec 30, 2023
1 parent f90520c commit 48a0268
Showing 1 changed file with 0 additions and 18 deletions.
18 changes: 0 additions & 18 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,21 +17,7 @@ This toolkit aims to solve data preparation problems in Relation Extraction rela
* :straight_ruler: distance consideration between relation participants (in `terms` or `sentences`),
* 📑 relations annotations and filtering rules,
* *️⃣ entities formatting or masking, and more.

Using AREkit you may focus on preparation and experiments with your ML-models by shift all the data-preparation part onto toolset of this project for:
[neural-networks](https://github.com/nicolay-r/AREkit/wiki/Sampling-for-Neural-Network),
[language-models](https://github.com/nicolay-r/AREkit/wiki/Sampling-for-BERT),
[ChatGPT](https://github.com/nicolay-r/AREkit/wiki/Sampling-for-ChatGPT).

In order to do so, we provide:
* :file_folder: API for external [collection binding](https://github.com/nicolay-r/AREkit/wiki/Binding-a-Custom-Source) (native support of [BRAT](https://brat.nlplab.org/)-based exported annotations)
*[pipelines](https://github.com/nicolay-r/AREkit/wiki/Pipelines:-Text-Opinion-Annotation) and iterators for handling large-scale collections serialization without out-of-memory issues.
* evaluators which allows you to assess your trained model.

AREkit is a very close to opensource framework [SeqIO](https://github.com/google/seqio) proposed by [Google](https://github.com/google)
for data-preprocessing, evaluation, for sequence models.
While SeqIO dedicated for conversion/pre-processing of datasets of any type,
this project proposes pipelines creation from the very raw or preannotated (BRAT-based) texts, including the solutions for problems mentioned above.

The core functionality includes
(1) API for document presentation with EL (Entity Linking, i.e. Object Synonymy) support
Expand All @@ -44,7 +30,3 @@ for sentence level relations preparation (dubbed as contexts)
```bash
pip install git+https://github.com/nicolay-r/[email protected]
```

## Usage
Please follow the wiki page
[Tutorials List](https://github.com/nicolay-r/AREkit/wiki/Tutorials).

0 comments on commit 48a0268

Please sign in to comment.