pkg/ottl: add section about design principles #29424

axw · 2023-11-21T01:52:40Z

Description:

Drawing inspiration from https://github.com/bazelbuild/starlark#design-principles and https://github.com/google/cel-spec/blob/master/doc/langdef.md#overview, add a brief section about design principles.

The aim of this is to ensure OTTL is and remains safe for execution of untrusted programs in multi-tenant systems, where tenants can provide their own OTTL programs.

Drawing inspiration from https://github.com/bazelbuild/starlark#design-principles and https://github.com/google/cel-spec/blob/master/doc/langdef.md#overview, add a section about design principles. The aim of this is to ensure OTTL is and remains, by design, safe for execution of untrusted programs in multi-tenant systems where a tenant can provide their own OTTL programs.

axw · 2023-11-21T01:53:47Z

@evan-bradley @TylerHelmuth apologies for the delay, we discussed this at KubeCon. I just came across #29289 and it jogged my memory.

TylerHelmuth · 2023-11-21T20:49:50Z

pkg/ottl/LANGUAGE.md

+- OTTL programs operating in separate contexts cannot influence one another -- an OTTL program may have side-effects only within its own execution [context](#contexts).
+- OTTL programs cannot loop forever, except through the use of non built-in functions.


I really like the idea of documenting design principals, but I am not sure I want to commit to these yet.

OTTL programs operating in separate contexts cannot influence one another -- an OTTL program may have side-effects only within its own execution context.

While the current contexts have restricted cross-cutting between signals, there isn't anything stopping someone from using OTTL with a custom context that does allow this. OTTL is designed to not worry about how the getting and setting is done or what data is being manipulated. I'd prefer to not restrict ourselves with this principal because some day it may be necessary to support more cross-cutting scenarios.

OTTL programs cannot loop forever, except through the use of non built-in functions.

Can you provide more what you mean by this statement? I disagree with the term program as OTTL is not a programming language.

A design principles for OTTL that I do feel confident about documenting are:

Context cannot provide paths that reach "lower" in the OTLP heirarchy. A function in the metric context, for example, can provide access to the DataPointSlice, but not individual datapoints.

OTTL Functions should trend towards erroring when dealing with unexpected results or situations.

@TylerHelmuth thanks for your thoughts!

While the current contexts have restricted cross-cutting between signals, there isn't anything stopping someone from using OTTL with a custom context that does allow this. OTTL is designed to not worry about how the getting and setting is done or what data is being manipulated. I'd prefer to not restrict ourselves with this principal because some day it may be necessary to support more cross-cutting scenarios.

Sorry, I worded this poorly. What I intended was to (roughly) capture this Starlark design principle:

Hermetic execution. Execution cannot access the file system, network, system clock. It is safe to execute untrusted code.

Execution should definitely not be able to access the system or network. I don't see any reason to exclude the system clock, and doing so would would exclude the Now() converter.

In addition to that, what I meant that execution of one OTTL program (sorry, need another word) must not be able to leak into the execution of another.

OTTL programs cannot loop forever, except through the use of non built-in functions.

Can you provide more what you mean by this statement?

I was trying to capture what CEL says:

terminating: CEL programs cannot loop forever;

The point being to guarantee that CEL can be evaluated without taking down the system evaluating it. It doesn't matter so much for single-tenant systems, since you can just say "don't do that." But for multi-tenant systems it will affect others.

It's not quite enough to say that about the grammar -- as long as there are built-in functions, they would be an escape hatch if they're not also covered by this principle. We can't really enforce what a non built-in function does, hence the comment "except through the use of non built-in functions".

I disagree with the term program as OTTL is not a programming language.

Right, I wasn't sure what to call it, and it felt a little wrong even if "program" doesn't necessarily mean "general purpose program". FWIW, I went looking for inspiration in cuelang, and found a few references to "CUE program" at:

https://cuelang.org/docs/references/

https://cuelang.org/docs/references/spec/

https://cuelang.org/docs/integrations/go/

Having said that, it doesn't appear to be defined anywhere formally 🤷‍♂️

Is there an established word to describe a series of OTTL statements?

Hermetic execution. Execution cannot access the file system, network, system clock. It is safe to execute untrusted code.

I'm not sure we want to restrict functions in this way. Although we wouldn't include it in the transform processor, OTTL as a framework allows users to create functions that interact with the network or filesystem. Since function inclusion is per-component and a compile-time decision there is no untrusted code - OTTL does not support dynamic function loading or remote code execution.

In addition to that, what I meant that execution of one OTTL program (sorry, need another word) must not be able to leak into the execution of another.

This I do agree with. OTTL statements should be completely independent.

It's not quite enough to say that about the grammar -- as long as there are built-in functions, they would be an escape hatch if they're not also covered by this principle. We can't really enforce what a non built-in function does, hence the comment "except through the use of non built-in functions".

Ok I see what you're saying. Yes I think it is safe to say we wouldn't create any functions in ottlfuncs that would loop forever.

Is there an established word to describe a series of OTTL statements?

Not officially. I refer to them a statements.

Could we move these design goals to be in the ottlfuncs readme? I think they make a lot of sense for the OTTL standard library, which I would expect to be limited in scope and to have good security guarantees. For the language itself, since OTTL statements run directly in the Go VM we can't make any guarantees for what a user-authored function could do in its implementation.

Could we move these design goals to be in the ottlfuncs readme? I think they make a lot of sense for the OTTL standard library, which I would expect to be limited in scope and to have good security guarantees.

Sounds good, I'll have a go at that.

For the language itself, since OTTL statements run directly in the Go VM we can't make any guarantees for what a user-authored function could do in its implementation.

I agree that we can't make any guarantees about user-defined functions, but I still think it's important to address the language to cover any future changes. For example #29289 has a few options, one of which is to add looping to the language. That's fine because the proposed loop syntax would guarantee termination. (Obviously you know all that, just giving an example to explain my rationale.)

Since function inclusion is per-component and a compile-time decision there is no untrusted code - OTTL does not support dynamic function loading or remote code execution.

Understood. Should we capture that here?

Added a mention of no dynamic loading/evaluation, and moved some bits to ottlfuncs.

pkg/ottl/LANGUAGE.md

pkg/ottl/ottlfuncs/README.md

- Remove bit about dynamic loading, it's not yet decided - Move termination to ottlfuncs - Add a brief explanation of intent behind OTTL - Reword about (~limitless) constraints imposed by Go

evan-bradley

Thanks for clarifying these points in the docs.

pkg/ottl/LANGUAGE.md

pkg/ottl/ottlfuncs/README.md

Co-authored-by: Evan Bradley <[email protected]>

pkg/ottl/LANGUAGE.md

Co-authored-by: Evan Bradley <[email protected]>

**Description:** Drawing inspiration from https://github.com/bazelbuild/starlark#design-principles and https://github.com/google/cel-spec/blob/master/doc/langdef.md#overview, add a brief section about design principles. The aim of this is to ensure OTTL is and remains safe for execution of untrusted programs in multi-tenant systems, where tenants can provide their own OTTL programs. --------- Co-authored-by: Evan Bradley <[email protected]>

axw requested review from TylerHelmuth, bogdandrutu, evan-bradley and a team as code owners November 21, 2023 01:52

github-actions bot added the pkg/ottl label Nov 21, 2023

github-actions bot assigned evan-bradley Nov 21, 2023

Fix link

c58ee35

TylerHelmuth reviewed Nov 21, 2023

View reviewed changes

Clarifications

4f7a2d9

evan-bradley reviewed Dec 1, 2023

View reviewed changes

pkg/ottl/LANGUAGE.md Outdated Show resolved Hide resolved

pkg/ottl/LANGUAGE.md Show resolved Hide resolved

pkg/ottl/ottlfuncs/README.md Show resolved Hide resolved

pkg/ottl/ottlfuncs/README.md Outdated Show resolved Hide resolved

Address review comments

b50deca

- Remove bit about dynamic loading, it's not yet decided - Move termination to ottlfuncs - Add a brief explanation of intent behind OTTL - Reword about (~limitless) constraints imposed by Go

axw requested review from evan-bradley and TylerHelmuth December 4, 2023 08:26

evan-bradley approved these changes Dec 4, 2023

View reviewed changes

pkg/ottl/LANGUAGE.md Outdated Show resolved Hide resolved

pkg/ottl/ottlfuncs/README.md Outdated Show resolved Hide resolved

axw and others added 2 commits December 5, 2023 07:54

Apply suggestions from code review

35e437a

Co-authored-by: Evan Bradley <[email protected]>

Merge branch 'main' into ottl-readme-terminating

84eb4bb

evan-bradley reviewed Dec 5, 2023

View reviewed changes

pkg/ottl/LANGUAGE.md Outdated Show resolved Hide resolved

evan-bradley reviewed Dec 6, 2023

View reviewed changes

pkg/ottl/LANGUAGE.md Outdated Show resolved Hide resolved

Update pkg/ottl/LANGUAGE.md

be2cd7f

Co-authored-by: Evan Bradley <[email protected]>

TylerHelmuth added the Skip Changelog PRs that do not require a CHANGELOG.md entry label Dec 6, 2023

TylerHelmuth approved these changes Dec 6, 2023

View reviewed changes

evan-bradley approved these changes Dec 7, 2023

View reviewed changes

evan-bradley merged commit 7ac560f into open-telemetry:main Dec 7, 2023
87 of 88 checks passed

github-actions bot added this to the next release milestone Dec 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pkg/ottl: add section about design principles #29424

pkg/ottl: add section about design principles #29424

axw commented Nov 21, 2023

axw commented Nov 21, 2023

TylerHelmuth Nov 21, 2023 •

edited

Loading

axw Nov 23, 2023

TylerHelmuth Nov 27, 2023

evan-bradley Nov 27, 2023

axw Nov 28, 2023

axw Nov 28, 2023

axw Nov 28, 2023

evan-bradley left a comment

		- OTTL programs operating in separate contexts cannot influence one another -- an OTTL program may have side-effects only within its own execution [context](#contexts).
		- OTTL programs cannot loop forever, except through the use of non built-in functions.

pkg/ottl: add section about design principles #29424

pkg/ottl: add section about design principles #29424

Conversation

axw commented Nov 21, 2023

axw commented Nov 21, 2023

TylerHelmuth Nov 21, 2023 • edited Loading

Choose a reason for hiding this comment

axw Nov 23, 2023

Choose a reason for hiding this comment

TylerHelmuth Nov 27, 2023

Choose a reason for hiding this comment

evan-bradley Nov 27, 2023

Choose a reason for hiding this comment

axw Nov 28, 2023

Choose a reason for hiding this comment

axw Nov 28, 2023

Choose a reason for hiding this comment

axw Nov 28, 2023

Choose a reason for hiding this comment

evan-bradley left a comment

Choose a reason for hiding this comment

TylerHelmuth Nov 21, 2023 •

edited

Loading