LiLT (Language-Independent Layout Transformer) is a nice model, as it lets you plug any pre-trained RoBERTa model into a layout module, giving you a LayoutLM-like model for any language.
To combine LiLT with any pre-trained RoBERTa model from the 🤗 hub, please check out this notebook.
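That notebook handles the actual weight combination; the result is just a regular checkpoint on the hub that you can load like any other model. Below is a minimal sketch (not taken from the notebook) that loads the authors' English checkpoint `SCUT-DLVCLab/lilt-roberta-en-base` and runs a forward pass with dummy words and normalized (0-1000) boxes:

```python
# Minimal sketch: load a combined LiLT + RoBERTa checkpoint and run a forward pass.
# The words/boxes below are dummy values; boxes must be normalized to a 0-1000 scale.
import torch
from transformers import AutoTokenizer, LiltModel

checkpoint = "SCUT-DLVCLab/lilt-roberta-en-base"  # swap in your own combined checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = LiltModel.from_pretrained(checkpoint)

words = ["Invoice", "Number:", "12345"]
boxes = [[48, 84, 156, 98], [160, 84, 254, 98], [260, 84, 330, 98]]  # one box per word

encoding = tokenizer(words, is_split_into_words=True, return_tensors="pt")
# expand the word-level boxes to token-level boxes via the fast tokenizer's word_ids()
token_boxes = [boxes[i] if i is not None else [0, 0, 0, 0] for i in encoding.word_ids()]

outputs = model(**encoding, bbox=torch.tensor([token_boxes]))
print(outputs.last_hidden_state.shape)  # (1, sequence_length, hidden_size)
```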
Next, it can be fine-tuned on custom data as shown in this notebook.
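That notebook uses a 🤗 Datasets dataset together with the Trainer; the core of the preprocessing is aligning word-level boxes and labels to tokens. Here is a minimal sketch of a single training step for token classification (the label set and example are invented, the alignment logic is the part that matters):

```python
# Minimal fine-tuning sketch for token classification; labels and data are invented.
import torch
from transformers import AutoTokenizer, LiltForTokenClassification

checkpoint = "SCUT-DLVCLab/lilt-roberta-en-base"
label_list = ["O", "B-QUESTION", "I-QUESTION", "B-ANSWER", "I-ANSWER"]
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = LiltForTokenClassification.from_pretrained(checkpoint, num_labels=len(label_list))

words = ["Name:", "John", "Doe"]
boxes = [[50, 50, 118, 64], [125, 50, 170, 64], [175, 50, 214, 64]]
word_labels = [1, 3, 4]  # B-QUESTION, B-ANSWER, I-ANSWER

encoding = tokenizer(words, is_split_into_words=True, return_tensors="pt")
token_boxes, token_labels = [], []
for word_id in encoding.word_ids():
    if word_id is None:                     # special tokens
        token_boxes.append([0, 0, 0, 0])
        token_labels.append(-100)           # -100 is ignored by the loss
    else:
        token_boxes.append(boxes[word_id])
        token_labels.append(word_labels[word_id])

outputs = model(
    **encoding,
    bbox=torch.tensor([token_boxes]),
    labels=torch.tensor([token_labels]),
)
outputs.loss.backward()  # plug this into your own training loop or the Trainer
```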
There are 2 other notebooks in which I leverage LayoutLMv3Processor, but note that this is only possible because I fine-tune a checkpoint that uses the same vocabulary as LayoutLMv3. Hence, it's recommended to use this notebook.
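If you're unsure whether that shortcut applies to your own checkpoint, one illustrative way to check is to compare the two vocabularies:

```python
# Illustrative check: the LayoutLMv3Processor shortcut only works when the text
# backbone shares LayoutLMv3's vocabulary. Swap in your own combined checkpoint;
# for a non-English RoBERTa the vocabularies will generally differ, in which case
# use the model's own tokenizer instead.
from transformers import AutoTokenizer

lilt_tokenizer = AutoTokenizer.from_pretrained("SCUT-DLVCLab/lilt-roberta-en-base")
lmv3_tokenizer = AutoTokenizer.from_pretrained("microsoft/layoutlmv3-base")

print(lilt_tokenizer.get_vocab() == lmv3_tokenizer.get_vocab())
```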
Please always use an OCR engine that can recognize segments (groups of words, such as lines), and use the same bounding box for all words that make up a segment. This greatly improves performance.
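For example (with made-up OCR output), if your OCR engine returns line-level segments, every word in a segment simply reuses that segment's box:

```python
# Illustrative only: turning segment-level (line-level) OCR output into the
# word/box lists the model expects, where every word reuses its segment's box.
ocr_segments = [
    {"text": "Invoice Number: 12345", "box": [48, 84, 330, 98]},
    {"text": "Date: 2024-01-01",      "box": [48, 110, 250, 124]},
]

words, boxes = [], []
for segment in ocr_segments:
    for word in segment["text"].split():
        words.append(word)
        boxes.append(segment["box"])  # same segment box for every word in it

print(words)
print(boxes)
```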
See these threads for more info: