AnglE-optimized Text Embeddings

Li, Xianming; Li, Jing

arXiv:2309.12871 (cs)

[Submitted on 22 Sep 2023 (v1), last revised 17 Jul 2024 (this version, v8)]

Abstract: High-quality text embedding is pivotal in improving semantic textual similarity (STS) tasks, which are crucial components in Large Language Model (LLM) applications. However, a common challenge existing text embedding models face is the problem of vanishing gradients, primarily due to their reliance on the cosine function in the optimization objective, which has saturation zones. To address this issue, this paper proposes a novel angle-optimized text embedding model called AnglE. The core idea of AnglE is to introduce angle optimization in a complex space. This novel approach effectively mitigates the adverse effects of the saturation zone in the cosine function, which can impede gradients and hinder optimization. To set up a comprehensive STS evaluation, we experimented on existing short-text STS datasets and a newly collected long-text STS dataset from GitHub Issues. Furthermore, we examine domain-specific STS scenarios with limited labeled data and explore how AnglE works with LLM-annotated data. Extensive experiments were conducted on various tasks including short-text STS, long-text STS, and domain-specific STS tasks. The results show that AnglE outperforms the state-of-the-art (SOTA) STS models that ignore the cosine saturation zone. These findings demonstrate the ability of AnglE to generate high-quality text embeddings and the usefulness of angle optimization in STS.
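
The saturation problem named in the abstract can be illustrated with a toy sketch. The abstract does not spell out the AnglE objective, so the code below is not the authors' implementation; it assumes two unit vectors in a plane separated by an angle theta, and the helper names `cosine_grad_wrt_angle` and `angle_difference` are hypothetical. It shows that the derivative of cos(theta) shrinks toward zero near theta = 0 and theta = pi, while the angle recovered from a complex quotient keeps changing at a constant rate.

```python
import cmath
import math


def cosine_grad_wrt_angle(theta: float) -> float:
    """Derivative of cos(theta) with respect to theta, i.e. -sin(theta)."""
    return -math.sin(theta)


def angle_difference(z: complex, w: complex) -> float:
    """Absolute angle between two nonzero complex numbers, in [0, pi].

    Dividing z by w subtracts their phases, so the phase of the
    quotient is the angle between them.
    """
    return abs(cmath.phase(z / w))


# Compare the two signals at a few angles (toy values, chosen for illustration).
for theta in (0.01, math.pi / 2, math.pi - 0.01):
    z = cmath.rect(1.0, theta)  # unit complex number at angle theta
    w = cmath.rect(1.0, 0.0)    # reference direction
    print(
        f"theta={theta:.3f}  "
        f"|d cos/d theta|={abs(cosine_grad_wrt_angle(theta)):.4f}  "
        f"angle difference={angle_difference(z, w):.4f}"
    )
```

Near the two ends of the range the cosine derivative is close to zero, which is the saturation zone the abstract refers to; the angle difference has no such flat region, which is the intuition behind optimizing angles directly.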

Comments:	Accepted by ACL24 Main Conference
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2309.12871 [cs.CL]
	(or arXiv:2309.12871v8 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.12871

Submission history

From: Xianming Li
[v1] Fri, 22 Sep 2023 13:52:42 UTC (432 KB)
[v2] Thu, 5 Oct 2023 02:53:29 UTC (431 KB)
[v3] Tue, 17 Oct 2023 14:08:53 UTC (432 KB)
[v4] Thu, 19 Oct 2023 11:14:18 UTC (432 KB)
[v5] Tue, 24 Oct 2023 14:59:02 UTC (433 KB)
[v6] Wed, 8 Nov 2023 09:28:00 UTC (432 KB)
[v7] Thu, 16 May 2024 08:21:54 UTC (432 KB)
[v8] Wed, 17 Jul 2024 14:33:21 UTC (433 KB)
